Arxiv

BERT (2018) Paper Notes

Paper notes covering the core ideas of BERT: the bidirectional Transformer encoder, masked language model, next sentence prediction, and the fine-tuning paradigm.

llm paper-reading transformer bert arxiv

April 18, 2026

Attention Is All You Need (2017) — Paper Notes

A reading note on the Transformer paper — the core ideas, why it mattered, and what to read next.

llm paper-reading transformer arxiv

April 17, 2026