PinnedStraightforward yet productive tricks to boost deep learning model trainingHi fellow Deep Learning researchers,Jan 20, 2023Jan 20, 2023
PinnedPyTorch’s Magic with Automatic Mixed PrecisionPytorch library is one of the go-to framework used these days for implementing neural networks or deep learning models. These models have…Jan 25, 20231Jan 25, 20231
Scale baby Scale — Test-time scale!Inference scaling or Test-time scaling is the latest buzzword capturing the interest of everyone working in the high-boiling field of AI…1d ago1d ago
Long-Context Large Language ModelsLanguage modeling has captured the attention of researchers over the years, leading to numerous iterations and modifications of proposed…Jan 22, 2024Jan 22, 2024
Beyond Basics: A Comprehensive Interview Question Bank for DL, NLP, and Diffusion ModelsGone are the days when a basic understanding of machine learning sufficed for tech interviews. Today, employers seek candidates who can…Dec 27, 2023Dec 27, 2023
Published inAI AdvancesThe Symphony of Numbers: Inside the GPU’s Power of 2 ArchitectureIn the heart of the data center, where the pulse of artificial intelligence reverberates, lies a remarkable piece of technology — the…Sep 14, 20231Sep 14, 20231
Long-Range Transformer with Unlimited Length InputPretrained transformers generally have a context window of 512 (e.g. BERT , T5 ) or 1024 tokens (e.g. BART), which are sufficient lengths…Aug 25, 2023Aug 25, 2023
Large Language Models are just another hypeSince the latter part of the previous year, in the midst of a flurry of remarkable technological advancements, ChatGPT has risen as a…Jul 15, 2023Jul 15, 2023
Restrictive Freedom with Open Source LicensingDo you know the concept of a “commons” or a shared resource that is accessible to everyone. In medieval times, villages would have a…Apr 15, 2023Apr 15, 2023
Igniting a creative revolution with Generative ModelsThe creative revolution is about empowering individuals to express themselves in new and unique ways, and promoting diversity and…Mar 28, 2023Mar 28, 2023
Deep Learning — Coding Patterns & PractisesIn the field of Software Development, good coding patterns and practises are something which are religiously followed. They help make…Feb 23, 2023Feb 23, 2023
Deep Learning OptimizersThe choice of optimization algorithm can depend on several factors, including the specific problem you’re trying to solve, the size and…Feb 17, 2023Feb 17, 2023
PyTorch important APIsThe heart of Pytorch DL Framework are its functional APIs which give access to various Neural Layers ranging from linear, convolution to…Jan 15, 2023Jan 15, 2023
Metrics for ML Model in ProductionThe dominant approach to creating ML systems is to collect a dataset of training examples demonstrating correct behaviour for a desired…Jan 13, 2023Jan 13, 2023
Understanding the Attention mechanismLanguage modelling is the task of generating next word given previous words of a sequence such as Autocomplete working in our mobile…Jan 1, 2023Jan 1, 2023
Positional Encoding in TransformersTransformer architecture is famous for a while having precisely designed components in itself such as Encoder-Decoder stack…Dec 28, 20221Dec 28, 20221
Struggles in making error-free decisionsI will delve into the intriguing phenomenon of how the act of repeating false claims can inadvertently boost their credibility.Dec 23, 2022Dec 23, 2022
Interrogating the design choices of Diffusion ArchitectureIf you are reading about generative modelling techniques used these days, then no doubt you would have encountered the term Diffusion…Nov 27, 2022Nov 27, 2022