Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Only if you have access to corporate-level hardware:

"It took us 48 hours to build the suffix array for RedPajama on a single node with 128 CPUs and 1TiB RAM"



It's okayish. Considering 64G to 128G are available for (nerd) high-end consumers you're just off with a factor 5 (if we can squeeze out a little bit more performance).

Thas is pretty astonishing in my opinion.




Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: