diff --git a/pages/pages.txt b/pages/pages.txt
index 13ae253aa..bf354d00c 100644
--- a/pages/pages.txt
+++ b/pages/pages.txt
@@ -11,6 +11,39 @@
 Volkova Art 01 of 1:28:48
 Psy - Common Psychology p1 of p54
 https://www.youtube.com/playlist?list=PLt3fgqeygGTVk5khY228EBHujarUgyLfv
+Jay Alammar
+https://github.com/jalammar
+https://jalammar.github.io/
+https://jalammar.github.io/about/
+https://jalammar.github.io/illustrated-gpt2/
+https://jalammar.github.io/illustrated-transformer/
+https://github.com/SocratesClub/machine-learning/tree/master/readings
+https://github.com/zw76859420/ASR_Theory/tree/master/Transformer
+https://github.com/zw76859420/ASR_Theory/blob/master/Transformer/Attention%20Is%20All%20You%20Need.pdf
+https://github.com/zw76859420/ASR_Theory/blob/master/Transformer/The%20Illustrated%20Transformer%20%E2%80%93%20Jay%20Alammar%20%E2%80%93%20Visualizing%20machine%20learning%20one%20concept%20at%20a%20time.pdf
+https://github.com/SocratesClub/machine-learning/tree/master/readings
+https://github.com/SocratesClub/machine-learning/blob/master/readings/The%20Illustrated%20Transformer%20%E2%80%93%20Jay%20Alammar%20%E2%80%93%20Visualizing%20machine%20learning%20one%20concept%20at%20a%20time.html
+https://www.llm-book.com/
+ 44BF8AE4FD6EB0C873190856F9EC6B35
+ 71667727781896F8E475D78604A7E3AC
+JayAlammar - Learn how ChatGPT and DeepSeek models work: How Transformer LLMs Work [Free Course] of 3:36
+ https://www.youtube.com/watch?v=k1ILy23t89E
+ https://github.com/HandsOnLLM/Hands-On-Large-Language-Models
+Maarten Grootendorst
+ https://github.com/MaartenGr
+ https://substack.com/@maartengrootendorst
+ https://newsletter.maartengrootendorst.com/p/a-visual-guide-to-quantization
+ https://newsletter.maartengrootendorst.com/p/a-visual-guide-to-mamba-and-state
+ https://newsletter.maartengrootendorst.com/p/a-visual-guide-to-mixture-of-experts
+https://www.deeplearning.ai/courses/
+ https://www.deeplearning.ai/short-courses/how-transformer-llms-work/
+
+Transformer
+https://research.google/blog/transformer-a-novel-neural-network-architecture-for-language-understanding/
+https://arxiv.org/abs/1706.05137
+JayAlammar - The Narrated Transformer Language Model 0:00 of 29:29
+ https://www.youtube.com/watch?v=-QH8fRhqFHM
+
 proxmox: inst begin