With the rapid evolution of large vision-language models (VLMs), a natural question emerges: what’s the best OCR solution available right now? In particular, there’s been buzz around Mistral’s OCR...

What's the Best OCR Model in 2024? A Look at Mistral vs Gemini vs Azure

Mastering Django Timezones: Handling Deprecated Features and Common Pitfalls
Working with dates and times in web applications can be complex. Django’s powerful timezone support provides essential ...
DeepSeek V3, R10, & R1: A Detailed Overview
Introduction: DeepSeek’s Breakthrough Models Based on a video summary of the DeepSeek Math Paper by Vibhu Sapra at Latent Space: Watch the video Presentation overview of DeepSeek’s latest lang...
DeepSeek Math: A Detailed Summary
Unlocking Mathematical Reasoning in Language Models Based on a video summary of the DeepSeek Math Paper by Yannic Kilcher: Watch the video Introduction DeepSeek’s Rise: DeepSeek is a promin...

R1-Zero: When Pure Reinforcement Learning Creates a Mind We Can't Decode
The AI research community is buzzing about DeepSeek-R1-Zero—a model that achieved extraordinary capabilities through pure reinforcement learning (RL), bypassing supervised fine-tuning (SFT). But it...

When (and How) to Hire Your First Salesperson - A Founder's Guide
The Foundation First Before you even think about hiring a salesperson, there’s one critical truth you need to accept: you must establish market validation for your product. Salespeople need somethi...
Supercharging Python Development with a Custom Claude Sonnet Prompt
AI assistants like Claude can be incredibly powerful development aids, but their effectiveness often depends on how well you instruct them. The custom prompt I recommend is: You are an expert in Py...

Building a GPU Home Server for AI
Want to build a GPU home server for running quantized models? Here are some tips and tricks for setting up the server. Components Overview GPUs RTX 3090: Two R...
Notes on Gradient Descent
Intro: Gradient descent is a first-order optimisation algorithm used for finding a local minimum of a real-valued function \(f(x)\), that is, solving \(\min_x f(x)\) with respect to the variable \(x\). Usually the fu...
Code and Coffee Meetup - Notes on LLM tokenizers
What is a Tokenizer? How do tokenizers affect the training of large language models? These are lecture notes from the AI Code and Coffee meetup on 2024-03-06 (https://www.facebook.com/events/1058526405447...
- Mastering Django Timezones: Handling Deprecated Features and Common Pitfalls
- DeepSeek V3, R10, & R1: A Detailed Overview
- Building a GPU Home Server for AI
- Using GPT4 to generate git logs for OpenSource projects in the style of conventional commits via a terminal
- Setting up PyTorch Development for Mac M1/M2 ARM