Introduction
Current Focus
Recommender Systems
1. Recent Trends
2. Gradient Boosting
3. TF-IDF
4. Cross Encoders
5. SentenceTransformers
6. Collaborative Filtering
7. Evaluation
AB Testing
1. Examples
2. Power Analysis
LLMs
1. Auto Prompt Optimization
2. Fine-tuning
3. Useful Models
4. Encoder vs Decoder
5. Contextualized Recommendations
6. MinGPT
Statistics
1. Conformal Predictions
Miscellaneous
1. Bradley-Terry Model
2. Setting up WSL
3. To Read
4. Packages
5. Skills
6. Hash Collisions
7. IMO 2025 Q6
Identities
1. Sigmoid
2. Statistics
Papers
1. Weinberger 2009 - Hashing for Multitask Learning
2. Rendle 2009 - Bayesian Personalized Ranking
3. Burges 2010 - RankNET to LambdaMART
4. Schroff 2015 - FaceNET
5. Covington 2016 - Deep NNs for Youtube Recs
6. Schnabel 2016 - Recs as Treatments
7. Doersch 2016 - Tutorial on VAEs
8. Bateni 2017 - Affinity Clustering
9. Guo 2017 - DeepFM
10. Van Den Oord 2017 - VQ-VAE
11. Hamilton 2017 - GraphSAGE
12. Ma 2018 - Entire Space Multi-Task Model
13. Kang 2018 - SASRec
14. Reimers 2019 - Sentence-BERT
15. Yi 2019 - LogQ Correction for In Batch Sampling
16. Zhao 2019 - Recommending What to Watch Next
17. Lee 2020 - Large Scale Video Representation Learning
18. He 2020 - LightGCN
19. Lewis 2020 - Retrieval Augmented Generation
20. Wang 2020 - DCNv2
21. Gao 2021 - GradCache
22. Gao 2021 - SimCSE
23. Weng 2021 - Contrastive Representation Learning
24. Li 2021 - TaoBao Embedding-Based Retrieval
25. Zou 2021 - PLM Based Ranking in Baidu Search
26. Dao 2022 - Flash Attention
27. Wei 2022 - CoT Prompting in LLMs
28. Honovich 2022 - Instruction Induction
29. Huang 2022 - LLMs can Self Improve
30. Tunstall 2022 - SetFit
31. Wang 2022 - Self Consistency LLM
32. Lee 2022 - RQ-VAE
33. Tay 2022 - Differentiable Search Index
34. Rafailov 2023 - Direct Preference Optimization
35. Blecher 2023 - Nougat
36. Dong 2023 - MINE Loss
37. Liu 2023 - Meaning Representations from Trajectories
38. Klenitskiy 2023 - BERT4Rec vs SASRec
39. Singh 2023 - Semantic IDs for Recs
40. Yang 2023 - OPRO
41. Rajput 2023 - Generative Retrieval
42. Borisyuk 2024 - GNN at LinkedIn
43. Wang 2024 - LLM for Pinterest Search
44. Solatorio 2024 - GISTEmbed
45. Sanjabi 2025 - 360Brew
46. Zhang 2025 - Qwen3 Embedding
47. Chlon 2025 - LLMs are Bayesian in Expectation
Talks / Articles
1. Yan 2025 - LLM for Recsys
2. Tandon 2025 - Gemini for YouTube
3. Hameed 2025 - 360Brew
4. Han 2025 - GRPO Example
5. Yan 2025 - Semantic IDs
NLP Course
Database Course
1. Lecture 1
2. Lecture 2
3. Lecture 3
RL Course
1. Introduction
2. MDPs
3. Dynamic Programming
4. Model Free Prediction
5. Model Free Control
6. Value Function Approximation
7. Policy Gradient
8. Learning & Planning
PMPP
1. Introduction
2. Data Parallel Computing
3. Multidimensional grids
4. Compute Architecture
5. Memory Architecture
6. Performance Considerations