A brand new era for the RWKV-v5 architecture and linear transformers has arrived, with the strongest multilingual model in open source today
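For readers wondering what "linear transformer" means in practice, here is a minimal sketch, under simplifying assumptions, of the attention-free recurrence underlying RWKV: each token updates a fixed-size running state instead of attending over a growing T x T matrix. This is not the official implementation; it omits v4's per-step "bonus" term and v5's multi-headed matrix-valued state, and the function name and decay parameter `w` are illustrative only.

```python
# Minimal sketch (not the official RWKV kernel) of a linear-time,
# attention-free recurrence: a per-channel exponentially decayed
# weighted average of values, weighted by exp(key).
import numpy as np

def wkv_linear_scan(k, v, w):
    """k, v: (T, C) key/value series; w: (C,) positive decay rates.
    Returns (T, C) outputs in O(T*C) time -- no T x T attention matrix."""
    T, C = k.shape
    num = np.zeros(C)            # running weighted sum of values
    den = np.zeros(C)            # running sum of weights
    out = np.empty((T, C))
    decay = np.exp(-w)           # per-channel exponential decay
    for t in range(T):
        weight = np.exp(k[t])
        num = decay * num + weight * v[t]
        den = decay * den + weight
        out[t] = num / (den + 1e-8)
    return out

# Per-token cost is constant, so generation only needs the (num, den)
# state rather than a key/value cache that grows with context length.
out = wkv_linear_scan(np.random.randn(16, 8), np.random.randn(16, 8),
                      np.abs(np.random.randn(8)))
```

The constant-size state is the design point: inference cost and memory stay flat as the context grows, which is what the "linear" in linear transformer refers to.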
RWKV is the SOTA among non-Transformer architectures.
caw caw! Congrats!
Is v4 the same as the paper? What's v5?
The current paper is v4: https://arxiv.org/abs/2305.13048
The paper for v5 is being written now, ETA 1 month!
The latest iteration of this model, EagleX, is here: https://substack.recursal.ai/p/eaglex-17t-soaring-past-llama-7b
> Trained on 1.1 Trillion Tokens across 100+ languages
Is the dataset publicly available?
I love the work you guys are doing. Wanna collab?