Evolution of Transformer Models

by nlp-research

A curated collection tracing the development of transformer architectures from the original paper to modern LLMs

nlp · transformers · llm · deep-learning
28 items · 12.4K followers · Updated 2 days ago

paper

Attention Is All You Need

Vaswani et al. · NeurIPS 2017 · 98,234 citations
model

meta-research/llama-3.1-70b

2.4M downloads · pytorch
paper

BERT: Pre-training of Deep Bidirectional Transformers

Devlin et al. · NAACL 2019 · 76,543 citations
space

Transformer Visualization

125K views · gradio