Large Language Models | Towards Data Science

Kernel Case Study: Flash Attention

Machine Learning

Understanding all versions of flash attention through a triton implementation

Arunjith A

April 3, 2025

16 min read

Build Your Own AI Coding Assistant in JupyterLab with Ollama and Hugging Face

Artificial Intelligence

A step-by-step guide to creating a local coding assistant without sending your data to the…

Parul Pandey

March 24, 2025

8 min read

A cartoon of a head of lettuce dressed as a detective

LettuceDetect: A Hallucination Detection Framework for RAG Applications

Machine Learning

How to capitalize on ModernBERT’s extended context window to build a token-level classifier for hallucination…

Adam Kovacs

March 10, 2025

10 min read

AI-generated image showing agents building a pyramid of knowledge

Overcome Failing Document Ingestion & RAG Strategies with Agentic Knowledge Distillation

Machine Learning

Introducing the pyramid search approach

Tula Masterman

March 5, 2025

17 min read

Generative AI Is Declarative

Artificial Intelligence

And how to order a cheeseburger with an LLM

Michael Herman

March 5, 2025

28 min read

Avoidable and Unavoidable Randomness in GPT-4o

Machine Learning

Exploring the sources of randomness in GPT-4o from the known and controllable to the opaque…

Vincent Vatter

March 3, 2025

14 min read

Unraveling Large Language Model Hallucinations

Machine Learning

Understanding hallucinations as emergent cognitive effects of the training pipeline

Prashal Ruchiranga

February 28, 2025

11 min read

How to Use an LLM-Powered Boilerplate for Building Your Own Node.js API

Large Language Models

For a long time, one of the common ways to start new Node.js projects was…

Uladzimir Yancharuk

February 20, 2025

7 min read

A Comprehensive Guide to LLM Temperature 🔥🌡️

Large Language Models

While building my own LLM-based application, I found many prompt engineering guides, but few equivalent…

Kelsey Wang

February 7, 2025

8 min read

DeepSeek-V3 Explained 1: Multi-head Latent Attention

Deep Learning

Key architecture innovation behind DeepSeek-V2 and DeepSeek-V3 for faster inference

Shirley Li

January 31, 2025

9 min read