
Is it normal to spend 10 minutes on tuning nowadays? Do we need to spend another 10 minutes every time the code changes?


You mean autotune? I think 10 minutes is pretty normal; torch.compile(mode="max-autotune") can take much longer than that for large models.
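For concreteness, here's a minimal PyTorch sketch of where that time goes (the model and sizes are made up for illustration): with mode="max-autotune", Inductor benchmarks many kernel configurations on the first call, so that call can take minutes while later calls are fast.

    import time
    import torch

    # Toy model, just to have something to compile.
    model = torch.nn.Sequential(
        torch.nn.Linear(1024, 4096),
        torch.nn.ReLU(),
        torch.nn.Linear(4096, 1024),
    )

    # mode="max-autotune" makes Inductor benchmark many kernel
    # configurations, so the *first* call may take minutes on big models.
    compiled = torch.compile(model, mode="max-autotune")
    x = torch.randn(8, 1024)

    t = time.perf_counter()
    compiled(x)  # triggers compilation + autotuning
    print(f"first call: {time.perf_counter() - t:.1f}s")

    t = time.perf_counter()
    compiled(x)  # reuses the tuned kernels
    print(f"second call: {time.perf_counter() - t:.4f}s")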


Add to that, it can be done just once by developers before distribution, for each major hardware target. The configs get saved, and the right one is selected on the client side.
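A sketch of that ship-pretuned-configs idea (the file name, keys, and format here are all invented for illustration): tune once per major GPU, save the winning configs, and have the client just look up its hardware.

    import json
    import torch

    CONFIG_FILE = "tuned_configs.json"  # hypothetical file shipped with the app

    def select_config(device_name: str) -> dict:
        # Pick the pre-tuned config matching the client's hardware,
        # falling back to a generic one.
        with open(CONFIG_FILE) as f:
            configs = json.load(f)
        return configs.get(device_name, configs["generic"])

    name = torch.cuda.get_device_name() if torch.cuda.is_available() else "cpu"
    cfg = select_config(name)  # e.g. {"block_size": 128, "num_warps": 4}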


It's likely that the Swift compiler uses LLVM's lit (https://llvm.org/docs/CommandGuide/lit.html), which is implemented in Python, as the test driver.


Python and lit are used heavily to build and test the compiler, but that is only for building it; you don't need them to download and use the built toolchain. The Python dependency is more about its use in LLDB.


> In the end, programs will want probably to stay conservative and will implement only the core ISA

Unlikely; as pointed out in sibling comments, the core ISA is too limited. What might prevail is profiles, specifically profiles for application processors like RVA22U64 and RVA23U64, of which the latter makes a lot more sense IMHO.


Come on, what was to be understood is 'stick to the core ISA' as much as possible.

I had to clarify the obvious: if a program does not need more than conservative usage of the ISA to run at reasonable speed, no hardcore hardware change should be investigated.

Additionally, the 'adding new machine instructions' fanboys tend to forget about machine instruction fusion (they probably want their names in the extension specifications), which has to be investigated first. And often, in such niche cases, it may not be the CPU to think about, but specialized ASIC blocks and/or FPGAs.


yes, it has been done for at least a decade if not more

> Even more of a wild idea is to pair up two cores and have them work together this way

I don't think that'll be profitable, because...

> When you have a core that would have been idle anyway

...you'll just schedule another process onto it. A modern OS rarely runs short of tasks to run.


The article is easy to follow, but I think the author missed the point: branchless programming (a subset of the better-known constant-time programming) is almost exclusively used in cryptography nowadays. As shown by the benchmarks in the article, modern branch predictors have been able to achieve over 95% if not 99% accuracy for a decade or so.
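For anyone who hasn't seen it, the classic constant-time idiom looks like this (Python just for readability; real constant-time code is written in C or assembly, where the no-branch property can actually be audited):

    MASK32 = 0xFFFFFFFF

    def ct_select(cond: int, a: int, b: int) -> int:
        # Return a if cond == 1 else b, without branching on cond:
        # expand the condition bit into an all-ones/all-zeros mask,
        # then mix the two values with AND/OR.
        mask = (-cond) & MASK32   # 0xFFFFFFFF if cond == 1, 0 if cond == 0
        return (a & mask) | (b & ~mask & MASK32)

    assert ct_select(1, 0xAAAA, 0x5555) == 0xAAAA
    assert ct_select(0, 0xAAAA, 0x5555) == 0x5555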


yes, the short answer is that LLVM uses RegPressureTracker (https://llvm.org/doxygen/classllvm_1_1RegPressureTracker.htm...) to do all those calculations. Slightly longer answer: I should probably be a little more specific. In most cases the Machine Scheduler cares more about the register pressure _delta_ caused by a single instruction, traversing either bottom-up or top-down; that makes it easier to make an estimate while some of the other instructions are not scheduled yet.
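To illustrate the bottom-up delta idea with a toy model (my simplification, not LLVM's actual code; RegPressureTracker also deals with register classes, subregisters, lane masks, etc.):

    # Toy bottom-up register-pressure tracking: walking from the end of
    # the block, a register's last use is where it becomes live, and its
    # def is where its live range ends.

    def pressure_deltas(block):
        # block: list of (defs, uses) tuples of register names, in
        # program order. Returns the pressure delta per instruction.
        live = set()
        deltas = [0] * len(block)
        for i in range(len(block) - 1, -1, -1):
            defs, uses = block[i]
            delta = 0
            for r in defs:
                if r in live:      # def kills a live range: pressure drops
                    live.discard(r)
                    delta -= 1
            for r in uses:
                if r not in live:  # last use (seen bottom-up) opens a range
                    live.add(r)
                    delta += 1
            deltas[i] = delta
        return deltas

    # v2 = v0 + v1; v3 = v2 * v0; use v3
    block = [({"v2"}, {"v0", "v1"}), ({"v3"}, {"v2", "v0"}), (set(), {"v3"})]
    print(pressure_deltas(block))  # [0, 1, 1]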


The scenarios suitable for TBMs are surprisingly limited, so even nowadays many tunnels are still dug the good ol' way, mostly with explosives.


??? explosives? in mud? (well, clay)?!

How tunnels are bored depends on the soil, length, and diameter. Most projects I have seen use TBMs and the New Austrian Tunneling Method; explosives are quite the minority (not many tunnels are in solid rock). Even the Gotthard tunnels were dug with TBMs.


Came to say Apple also did a great job tagging my bois, who are both grey-ish cats, even in pictures where they faced backward. No idea how they did that.


What I found impressive was that Apple Photos, given pictures of my cousins when they were 50 or more years old, was able to identify pictures of them as kids. On the other hand, it could never consistently distinguish between my two older brothers (although to be fair, they were identical twins). It also insists that a beagle I once owned was a cat. I mean, sure, he sometimes slept on his back with his paws in the air like a cat, but he was all dog.


On the other hand, it has no understanding of time. I have thousands of photos of me from the 1970s up through today, and Apple Photos is remarkably good at identifying me in all of them. And yet when my daughter was born it started identifying her, as a baby, as me. You'd think you could build a model to grasp the idea that a photo of a baby taken in 2015 is probably not of me.


I raise you the photo of a photo edge case.

Metadata is 2015, photo is 1960


But you shouldn't optimize for the edge case! (All of my 20k+ photos, dating back to 1905, have correct metadata + GPS).


I would argue that AI is exactly how you should handle these edge cases, though likely with a fine-tuned model.

There would be hints that a photo is from the '60s vs. the 2010s; a human would be able to tell in many cases even without other context.

That is exactly the use case AI is meant to excel at: something that is arguably hard to do algorithmically but is possible for an ML model.
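As a sketch of what such a fine-tuned model could look like in PyTorch (everything here, backbone choice, labels, and data, is a hypothetical illustration, not a claim about what Apple does):

    import torch
    import torchvision

    # Decades we want the model to distinguish.
    DECADES = ["1950s", "1960s", "1970s", "1980s", "1990s", "2000s", "2010s"]

    # Start from an ImageNet-pretrained backbone and replace the head
    # with a decade classifier; visual cues (film grain, color cast,
    # fashion) are the kind of thing such a model can pick up on.
    model = torchvision.models.resnet18(weights="IMAGENET1K_V1")
    model.fc = torch.nn.Linear(model.fc.in_features, len(DECADES))

    optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
    loss_fn = torch.nn.CrossEntropyLoss()

    def train_step(images: torch.Tensor, labels: torch.Tensor) -> float:
        optimizer.zero_grad()
        loss = loss_fn(model(images), labels)
        loss.backward()
        optimizer.step()
        return loss.item()

    # Dummy batch: 16 RGB images at 224x224 with random decade labels.
    print(train_step(torch.randn(16, 3, 224, 224),
                     torch.randint(0, len(DECADES), (16,))))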


I hate to say it, but you are the edge case. Most users are not fixing dates of photos (especially pre-digital scans) or adding GPS data to photos which didn’t originally have it.


I'd suspect there are more people removing metadata from photos that have it than are adding metadata to photos that don't have it.


A photo from 1960 was scanned at <some later date>.
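And that's what the EXIF would show. A quick Pillow sketch (the filename is hypothetical): DateTime tends to reflect the scan or last edit, while DateTimeOriginal, if present at all, is when the picture was actually taken.

    from PIL import Image

    img = Image.open("scan_of_1960_photo.jpg")  # hypothetical file
    exif = img.getexif()

    # Tag 306 = DateTime (e.g. the scan/edit date), in the top-level IFD.
    print("DateTime:", exif.get(306))
    # Tag 36867 = DateTimeOriginal, stored in the nested Exif IFD (0x8769).
    exif_ifd = exif.get_ifd(0x8769)
    print("DateTimeOriginal:", exif_ifd.get(36867))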


I thought the original comment meant "_for_ who doesn't use X or Discord, here is the github mirror link". There's a "for" missing, and thus I think they agree with you.


second this, it didn't show anything when I hovered over North America

