teaching | Eva Koroleva

End-to-end implementation of transformer architectures with production-grade practices.

Notebook	English	Russian
1. Understanding Tokenization: Byte-Pair Encoding
2. Implementing Transformer Architecture
3. Improving Architecture with SoTA Techniques
4. Evaluation Metrics: BLEU, ROUGE, METEOR, WandB
5. Hyperparameter Tuning with WandB Sweeps
6. Complete Transformer: End-to-End Python Pipeline
7. Creating an API and Using Gradio and Streamlit
8. Model Deployment and Monitoring

Notebook	English	Russian
1. BERT: Bidirectional Encoder Representations
2. GPT: Generative Pre-trained Transformers
3. T5: Text-to-Text Transfer Transformer
4. Mixture of Experts: Switch Transformers
5. Vision Transformers: ViT, DeiT
6. Longformer: Efficient Long-Context Attention
7. Multimodal Architectures: CLIP, LLaVA
8. LLM Architectures: Gemma 2, LLaMA, Mistral