We were talking about linear improvements and I have yet to see it | Hacker News


attemptone 5 months ago | on: My AI skeptic friends are all nuts

We were talking about linear improvements and I have yet to see it

mountainriver 4 months ago

check the benchmarks or make one of your own

attemptone 4 months ago

I checked the BLEU score and perplexity of popular models, and both have stagnated since around 2021. As a disclaimer, this was a cursory check and I didn't dive into the details of how individual scores were evaluated.

mountainriver 4 months ago

on what benchmarks? pretty much every major one is linear improvement
