
We were talking about linear improvements, and I have yet to see any.


Check the benchmarks, or make one of your own.


I checked the BLEU score and perplexity of popular models, and both have stagnated since around 2021. As a disclaimer, this was a cursory check and I didn't dive into the details of how individual scores were evaluated.


On which benchmarks? Pretty much every major one shows linear improvement.



