R&D Software Engineer

Mastering Django Timezones: Handling Deprecated Features and Common Pitfalls

Working with dates and times in web applications can be complex. Django’s powerful timezone support provides essential ...

DeepSeek V3, R10, & R1: A Detailed Overview

Introduction: DeepSeek’s Breakthrough Models. Based on a video summary of the DeepSeek Math Paper by Vibhu Sapra at Latent Space: Watch the video. Presentation overview of DeepSeek’s latest lang...

DeepSeek Math: A Detailed Summary

Unlocking Mathematical Reasoning in Language Models. Based on a video summary of the DeepSeek Math Paper by Yannic Kilcher: Watch the video. Introduction. DeepSeek’s Rise: DeepSeek is a promin...

R1-Zero: When Pure Reinforcement Learning Creates a Mind We Can't Decode

The AI research community is buzzing about DeepSeek-R1-Zero—a model that achieved extraordinary capabilities through pure reinforcement learning (RL), bypassing supervised fine-tuning (SFT). But it...

When (and How) to Hire Your First Salesperson - A Founder's Guide

The Foundation First: Before you even think about hiring a salesperson, there’s one critical truth you need to accept: you must establish market validation for your product. Salespeople need somethi...

Supercharging Python Development with a Custom Claude Sonnet Prompt

AI assistants like Claude can be incredibly powerful development aids, but their effectiveness often depends on how well you instruct them. The custom prompt I recommend is: You are an expert in Py...

Building a GPU Home Server for AI

Want to build a GPU home server for running quantized models? Here are some tips and tricks for setting up the server. Components Overview: GPUs: RTX 3090: Two R...

Notes on Gradient Descent

Intro: Gradient descent is a first-order optimisation algorithm used for finding a local minimum of a real-valued function \(\min_x f(x)\) with respect to the variable \(x\). Usually the fu...

Code and Coffee Meetup - Notes on LLM tokenizers

What is a tokenizer? How do tokenizers affect the training of large language models? These are lecture notes from the AI Code and Coffee meetup on 2024-03-06 (https://www.facebook.com/events/1058526405447...

Deploying Llama2 on A100 GPUs using vLLM

Meta’s Llama2 is a state-of-the-art, open-weight large language model that you can host yourself and use for commercial purposes. Its open-source weights and permissive commercial licensing mean ...