Deep Learning
Understanding Llama2: KV Cache, Grouped Query Attention, Rotary Embedding and More
Grouped Query Attention, Rotary Embedding, KV Cache, Root Mean Square Normalization
nlp, artificial-intelligence, deep-learning
Fine-Tuning Vs RAG in Generative AI
generative-ai-tools, deep-learning, artificial-intelligence
Vanishing Gradient Problem in RNNs
generative-ai-tools, artificial-intelligence, deep-learning
Beginner’s Guide to OpenAI’s GPT-3.5-Turbo Model
From GPT-3 to GPT-3.5-Turbo: Understanding the Latest Upgrades in OpenAI’s Language Model API.
ai, gpt-3, openai
Creating Your Own ChatGPT: A Guide to Fine-Tuning LLMs with LoRA
As AI continues to evolve, chatbots are becoming increasingly sophisticated and human-like in their ability to respond to natural language inputs. One of the most impressive models in this space is…
chatgpt, chatbots, pytorch
Leave One Subject Out Cross Validation for Machine Learning Models
How to implement cross validation in Python from scratch.
data-science, cross-validation, machine-learning
What is Gaussian Noise in Deep Learning? How and Why it is used?
machine-learning, deep-learning, gaussian-noise
Visualizing Attention in Vision Transformers
machine-learning, deep-learning, tensorflow
Knowledge Distillation, aka Teacher-Student Model
With the release of large models in the last few years, from GPT-3 to Megatron, I keep pondering how to experiment and use these models for a specific use case. These models are trained on massive…
deep-learning, machine-learning, python
Understand collate_fn in PyTorch
A beginner-friendly tutorial on how to create batches in PyTorch and how to modify default behavior for our needs.
pytorch, python, deep-learning