
I wouldn't have expected there to be enough text from before 1913 to properly train a model; it seemed like the first successful LLMs needed an internet's worth of text?


This model is more comparable to GPT-2 than to anything we use now.
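
For scale, a back-of-envelope sketch (the corpus figures below are illustrative assumptions, not from the thread): GPT-2 was trained on WebText, roughly 40 GB of text, and a public-domain collection like Project Gutenberg is plausibly in the same order of magnitude.

    # Rough token-count comparison: GPT-2's training set vs. a
    # hypothetical pre-1913 public-domain corpus. All corpus figures
    # are illustrative assumptions, not measured values.

    GB = 1024**3

    # GPT-2 was trained on WebText, roughly 40 GB of text (~8M web pages).
    webtext_bytes = 40 * GB

    # Assumption: ~60k public-domain books at ~0.4 MB of plain text each
    # (hypothetical averages, chosen only for this estimate).
    books = 60_000
    avg_book_bytes = 0.4 * 1024**2
    old_corpus_bytes = books * avg_book_bytes

    # Common rule of thumb: ~4 bytes of English text per token.
    BYTES_PER_TOKEN = 4

    print(f"WebText (GPT-2):  ~{webtext_bytes / GB:.0f} GB, "
          f"~{webtext_bytes / BYTES_PER_TOKEN / 1e9:.0f}B tokens")
    print(f"Pre-1913 corpus:  ~{old_corpus_bytes / GB:.0f} GB, "
          f"~{old_corpus_bytes / BYTES_PER_TOKEN / 1e9:.1f}B tokens")

Under those assumptions the two corpora land within an order of magnitude of each other (tens of GB, billions of tokens), which is why GPT-2-scale training on pre-1913 text is plausible even though it would be far too little data for a modern frontier model.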



