
> Trained on 1.1 Trillion Tokens across 100+ languages

Is the dataset publicly available?
