Why is llama-3-8B 8 billion parameters instead of 7?

Published: 16 February 2026
Channel: Chris Hay

3,746

116

llama-3 has ditched its tokenizer and instead opted to use the same tokenizer as gpt-4 (tiktoken, created by OpenAI); it even uses the same first 100K tokens of the vocabulary.

In this video Chris walks through why Meta has switched tokenizers and the implications for model size, the embeddings layer and multilingual tokenization.
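
To make the link between vocabulary size and model size concrete, here is a rough sketch of the embedding arithmetic. The vocabulary sizes (32,000 and 128,256), the hidden size (4,096) and the untied input/output embeddings are assumptions taken from the published model configs, not from this description, and `embedding_params` is a hypothetical helper name.

```python
# Rough embedding-parameter arithmetic. The figures below are assumptions
# taken from the published model configs, not from the video description.

def embedding_params(vocab_size: int, hidden_size: int, tied: bool = False) -> int:
    """Parameters in the input embedding plus the output projection (LM head)."""
    per_matrix = vocab_size * hidden_size
    # With tied weights the two matrices are shared, so count only one.
    return per_matrix if tied else 2 * per_matrix

llama2_7b = embedding_params(vocab_size=32_000, hidden_size=4_096)
llama3_8b = embedding_params(vocab_size=128_256, hidden_size=4_096)

print(f"llama-2-7B embeddings + head: {llama2_7b:,}")
print(f"llama-3-8B embeddings + head: {llama3_8b:,}")
print(f"extra from larger vocabulary: {llama3_8b - llama2_7b:,}")
```

Under these assumptions the larger vocabulary alone accounts for several hundred million extra parameters. Other architecture changes also affect the total, and this sketch does not model them.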

He also runs his tokenizer benchmark and shows how the new tokenizer is more efficient in languages such as Japanese.
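
One simple way to compare tokenizer efficiency across languages is tokens per character. The sketch below is not Chris's benchmark; it is a minimal illustration with a stand-in tokenizer that emits one token per UTF-8 byte, roughly the worst case when a vocabulary has no entries for a script. The names `utf8_byte_encode` and `tokens_per_char` and the sample strings are hypothetical.

```python
from typing import Callable, Dict, List

def utf8_byte_encode(text: str) -> List[int]:
    """Stand-in tokenizer (assumption): one token per UTF-8 byte."""
    return list(text.encode("utf-8"))

def tokens_per_char(encode: Callable[[str], List[int]], text: str) -> float:
    """Lower is better: fewer tokens are needed to cover the same text."""
    return len(encode(text)) / len(text)

# Illustrative sample strings only, not benchmark data.
samples: Dict[str, str] = {
    "english": "hello world",
    "japanese": "こんにちは世界",
}

results = {
    lang: tokens_per_char(utf8_byte_encode, text)
    for lang, text in samples.items()
}

for lang, ratio in results.items():
    print(f"{lang}: {ratio:.2f} tokens per character")
```

Any real tokenizer's encode function that maps a string to a list of token ids could be passed in place of `utf8_byte_encode` to compare vocabularies on the same samples.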

repos
------
https://github.com/chrishayuk/embeddings
https://github.com/chrishayuk/tokeniz...
