Apriel-H1-15b-Thinker-SFT uses incremental distillation from Apriel-Nemotron-15B-Thinker, selectively replacing less critical attention layers with linear Mamba blocks to reduce computational complexity while preserving reasoning quality.
That's exactly the point (ie just prior to distribution) where a simple sanity check should have been run and the config replacement/update pipeline stopped on failure. When they introduced the 200 entry limit memory optimised feature loader it should have been a no-brainer to insert that sanity check in the config production pipeline.
Come come now. Elon has built Tesla on something closer to $38bn of taxpayer largesse/avoidance according to the Washington Post.0 I doubt many people would fail to create public entities with healthy casino ponzi stock trading if they had umpteen billion $ of govt funny money propping them up. Hell they might even start a rocket business as a sideline...
MS could always refocus themselves as a global company (in the legal rather than marketing-only sense), and move their HQ out of the US, then there could be no Trump tantrums affecting other countries, the worse that could happen would be some sanctions on what would then be their in-country US affiliate, with no ability to affect their other global operations whatsoever. Why haven't they followed this approach? Haven't lost enough customers yet?
MS lives by corporate contracts and there are a lot of very powerful US companies that will roll over if Trump barks - if MS had already fled the US in a legal sense they'd definitely be in a better place but trying to leave during this administration would cause Trump's ire to focus on them and likely cost them an immense amount of money. I don't particularly like MS and both office and windows are declining in quality quickly so I wouldn't be opposed to the move but... nothing would sink that ship faster than losing a bunch of large US contracts as Trump toadies demonstrate their loyalty by bravely switching to alternatives.
Its pretty decent. Decent enough in fact that I can run a Windows 11 ARM install on vmware Fusion on my macbook m4 pro, and it will happily run win arm and x86 binaries (via builtin MS x86 emulation) decently fast and without complaint (we're talking apps, gaming I haven't tried.)
I decided a Macbook pro M4 pro was the right option for me, 48gb/36 accessible to the gpus with very decent tokens/s throughput for (increasingly impressive) offline midrange open LLM inference and huge battery life. Nothing in the windows world to touch it. But a few windows-only bits of software I occasionally use now run very happily in a Windows 11 arm vm on a free VMware fusion 25H2 on my mac, with a 5 dollar win oem license.
> It looks like a process model; isolation between programs with a system for inter-process communication, and running within a single process's memory.
Which is better handled by existing mature and simpler abstractions from the actor model, like Akka.net, or maybe Orleans.
For sure, but the author also proposed "unmanaged spaces" which would run in-process but with no GC. This seems to be the main goal and everything else is definitely better handled with existing solutions.
It does but the GC can still stop-the-world pause threads that aren't touching GC memory. The author's proposed isolation would provide a way to avoid that.
I wanted to reply to you directly. I was truly impressed by your comments. You understood the precise, deep technical problem I was aiming at—the GC's 'stop-the-world' pauses affecting even non-GC threads—even from my admittedly clumsy and brief initial post. Thank you for that.
What I failed to convey properly, however, is that this performance mechanism is just one small consequence of a much larger idea. It's a foundational piece of a complete architectural model I've designed to address the fundamental pains of the current microservice paradigm.
I have now finished a full manifesto that lays out this entire vision. It includes a deep critique of our current ecosystem and presents the philosophy for a new style of programming intended to solve these core issues.
Given the depth of your understanding from that first article, I would be genuinely honored to get your critical feedback on the full proposal. If you're interested, the new post and discussion are here:
https://news.ycombinator.com/item?id=45477324
Thanks again for your incredibly insightful comments.
I'm divided, a lot of people internally and from suppliers depend on them for work. Equally they are majority owned by an indian company (Tata) now that also mandated they use their useless in-group IT outsourcer whose secrity appears to be pretty lax. In the end I'd say the govt should make the condition for a loan that they dilute out much of the equity holders / share out the loss - in order for them to step in and prevent a total loss, and so the taxpayer gets some longer term contingent benefit.
The Landrover side is clearly doing work, even if the Jaguar part was doing poorly before this. There are a lot of suppliers also that end up having to furlough if their main client shuts down. And there are second order effects on local economies when that happens, and therefore on HMRC. And you also don't want to lose engineering skills that might be needed for e.g. military crossgrades/upscales. Ultimately its better to have that continue than not, but I also believe the consequences should be significant on the mismanaging equity holders, esp. Tata, and potentially beneficial for taxpayers longer term, which reduces the moral hazard aspect for a better overall solution.
Two things about backdoor layoffs. Mostly its about who. When its corporate dictat, those most likely to leave are those with other options, ie the best talent. So sure a business might save on severances in aggregate, but it doesn't get to decide on who, but simple statistics show it will be the best who move on. So a demoralised and increasingly mediocre workforce is then faced with a much tougher hiring environment with unfillable positions and the downward cycle continues, destroying customer value and reputation to a far greater degree than any temporary layoff savings. All for what exactly, control? Its the C-suites that should be being marched out the door.
No, quarterly earnings. In this case, retained earnings, but they want to show profits in a situation with high inflation, stagnant employment, and other issues where customers are not as spendy as they once were.
The move of offshoring in many projects, changed my mindest that companies care one second about their talent, at a size like Microsoft is all about replaceable cogs, little ants every doing their own small task.
Why in your opinion do Microsoft or any large org pay software engineers in America or western Europe? India would be cheaper. South America, Phillipines cheaper still. Etc etc etc. Plenty of educated folks speaking English.
I think a large part of it is that they want people physically and culturally close to themselves for projects they care about. Piles of companies have tried outsourcing core development and in my experience whatever minimal home team thats left keeps growing and growing and the oversess guys get pushed to the least desirable jobs. It's almost always a failure in the end, or at least the overseas team ends up being given limited scope, simple tasks while complex work finds it's way back home. I say this having worked with many talented overseas colleagues; I think this is a management level effect, not individual developer
Mostly to play firefighters and many countries in Europe, business people actually rather not deal directly with offshoring teams in English.
Which is what the on site team does, besides firefighting, handling the cultural interface.
For many businesses even if it looks a failure from engineering point of view, as long as it is within the budget, many businesses see it as a success, versus having paid a whole team onsite.
reply