Model Training
-

I examined several well-known object detection pipelines and designed one that best suits my needs…
16 min read -

Capturing and reproducing failures in PyTorch training with Lightning
10 min read -

In this latest part of my series, I will share what I have learned on…
8 min read -

DeepSeek has recently made quite a buzz in the AI community, thanks to its impressive…
10 min read -

A deep dive into “Not All Tokens Are What You Need for Pretraining”
7 min read -

Learn the concepts and the practice. How a model behaves in each case.
7 min read -

Teaching is Hard: How to Train Small Models and Outperforming Large Counterparts
Artificial IntelligenceDistilling the knowledge of a large model is complex but a new method shows incredible…
13 min read -

Boosting Model Accuracy: Techniques I Learned During My Machine Learning Thesis at Spotify (+Code…
Data ScienceA tech data scientist’s stack to improve stubborn ML models
14 min read -

A review of the challenges in Synchronous distributed training and best solutions for stragglers and…
11 min read -

Starting from a given dataset, training a machine learning model implies the computation of a…
4 min read