Hacker News
ACCount37
1 day ago
on:
EuroLLM: LLM made in Europe built to support all 2...
It is true. Datasets are somewhat cleaned, but only somewhat. When you have terabytes worth of text, there's only so much cleaning you can do economically.