Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Great respect for Ilya, but I don’t see an explicit argument why scaling RL in tons of domains wouldn’t work.


I think that scaling RL for all common domains is already done to death by big labs.


Not sure why they care about his opinion and discard yours.

They’re just as valid and well informed.


doesnt RL by definition not generalize? thats Ilya's entire criticism of the current paradigm




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: