Linklog
A curated collection of links and resources I have found over time.
February 2025
- A Gentle Intro to Running a Local LLM (www.dbreunig.com)
Has lots of great insights on which LLM models to use locally, depending on needs and performance available.
- How to fine-tune open LLMs in 2025 with Hugging Face (www.philschmid.de)
An in-depth example of how to fine-tune an LLM using the Hugging Face ecosystem.
- A Visual Guide to How Diffusion Models Work (towardsdatascience.com)
An interesting dive into what makes diffusion models work. The summary is that diffusion models are models trained on data with noise to find the original data, at various level of noise. They eventually learn the probability distribution of the images in the space of all possible pixel arrangements. You can then iteratively denoise a pure Gaussian noise picture until you generate a new image: this is like sampling the learned probability distribution.
November 2024
- Perspectives on diffusion – Sander Dieleman (sander.ai)
Some interesting thoughts on diffusion models.
- Thoughts on Riemannian metrics and its connection with diffusion/score matching [Part I] (blog.christianperone.com)
An in-depth description of the connections between diffusion models and Riemannian geometry.
September 2024
- dleemiller/WordLlama (github.com)
Natural language processing toolkit optimized for CPU hardware. I haven't tested it yet but it looks really useful for quick clustering, deduplication, similarity search, etc...