Avishek Biswas, Author at Towards Data Science

Sesame Speech Model: How This Viral AI Model Generates Human-Like Speech

A deep dive into residual vector quantizers, conversational speech AI, and talkative transformers.

April 11, 2025

9 min read

NLP

A visual tour of what it takes to build CHAD-level LLM pipelines

October 29, 2024

14 min read

Simplifying the neural nets behind Generative Video Diffusion

September 19, 2024

10 min read

Foundation + Promptable + Interactive + Video. How?

August 6, 2024

12 min read

How do neural networks learn to estimate depth from 2D images?

July 24, 2024

11 min read

A tour through the history of Computer Vision!

June 28, 2024

18 min read