Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
attemptone
5 months ago
|
parent
|
context
|
favorite
| on:
My AI skeptic friends are all nuts
We were talking about linear improvements and I have yet to see it
mountainriver
4 months ago
[–]
check the benchmarks or make one of your own
attemptone
4 months ago
|
parent
[–]
I checked the BlEU-Score and Perplexity of popular models and both have stagnated around 2021. As a disclaimer this was a cursory check and I didn't dive into the details of how individuals scores were evaluated.
mountainriver
4 months ago
|
root
|
parent
[–]
on what benchmarks? pretty much every major one is linear improvement
Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: