Model Training
I examined several well-known object detection pipelines and designed one that best suits my needs…
16 min read -
Capturing and reproducing failures in PyTorch training with Lightning
10 min read -
In this latest part of my series, I will share what I have learned on…
8 min read -
DeepSeek has recently made quite a buzz in the AI community, thanks to its impressive…
10 min read -
A deep dive into “Not All Tokens Are What You Need for Pretraining”
7 min read -
Learn the concepts and the practice. How a model behaves in each case.
7 min read -
Teaching is Hard: How to Train Small Models and Outperforming Large Counterparts
Artificial Intelligence
Distilling the knowledge of a large model is complex, but a new method shows incredible…
13 min read -
Boosting Model Accuracy: Techniques I Learned During My Machine Learning Thesis at Spotify (+Code…
Data Science
A tech data scientist’s stack to improve stubborn ML models
14 min read -
A review of the challenges in synchronous distributed training and the best solutions for stragglers and…
11 min read -
Starting from a given dataset, training a machine learning model implies the computation of a…
4 min read