Hacker News

I think there's a big difference between sounding like someone and being someone. The models are indeed excellent at pretending.


I don't think that sama was arguing that ChatGPT actually passed a PhD thesis defense. But arguably, it could make for an interesting benchmark.


Please do not be swayed by, or defend, the words vomited by a snake oil salesman.

Also, what benchmark? How will you design it?


Exactly. This is what the whole RL thing is optimizing for, even if that is not the intent.



