LLM

June 3, 2025

Building Production-Grade AI Data Access: AWS Athena MCP Server

Learn how to build a production-grade AWS Athena MCP server that gives AI agents secure access to your data warehouse.

May 28, 2025

Building RAG Pipelines on AWS: A Practical Guide to Bedrock + Pinecone

A practical guide to building retrieval-augmented generation pipelines on AWS with Bedrock and Pinecone.

May 25, 2025

Claude 4 Sonnet: Advanced Hybrid AI for Coding and Reasoning

Learn how Claude 4 Sonnet delivers near-instant responses and extended thinking with powerful coding and AI agent capabilities.

May 25, 2025

Claude 4 Opus Guide: Advanced AI Coding and Long-Running Tool Use

Explore Claude 4 Opus, Anthropic's advanced AI model for superior coding, extended reasoning, and tool integration. Learn its features and how to implement it effectively.

May 17, 2025

Introduction to Retrieval-Augmented Generation

An overview of RAG, combining language models with external knowledge retrieval.

January 27, 2025

AI Document Classification with GPT-4o-mini: A Zero-Shot RVL-CDIP Experiment

A quick experiment on using GPT-4o-mini for document classification.

November 25, 2024

LLM Evals: Evaluation Techniques for Large Language Models

Learn about Large Language Model (LLM) evaluations, their importance, key metrics, and advanced evaluation techniques. Discover how to build effective evaluation sets, integrate human feedback, and compare LLM models.

October 14, 2024

Converting OpenAI's Swarm Framework to TypeScript Using AI

Read about how I converted OpenAI's Swarm multi-agent educational framework to TypeScript using AI. Explore the conversion process, key insights, and the resulting TypeScript version.

October 14, 2024

Build a Fast LLM Text-to-SQL Engine Without Frameworks

Learn how to create a lightweight LLM text-to-SQL engine in Python. This guide explains generating SQL from natural language queries without relying on heavy frameworks.

October 11, 2024

How I saved an open-source project 90% on their AI agent LLM Costs

Learn how I coordinated code changes across two open-source projects to save an AI agent system 90% on their LLM costs. Discover the challenges, solutions, and the impact of this cost-saving initiative.

September 9, 2024

Building a Vector Database Service with OpenDevin: An LLM Code Agent Experiment

Explore the process of tasking an LLM code agent with building a vector database service using FastAPI. Learn about the challenges, iterations, and final product of this AI-driven development experiment.

August 13, 2024

Implementing Claude Tool Use with Pydantic for Structured AI Responses

This guide demonstrates how to use Claude's tool use capability in conjunction with Pydantic to generate structured AI responses. By combining these technologies, developers can ensure consistent, type-safe output from Claude that integrates seamlessly with Python applications.

August 9, 2024

Building Reliable LLMs for Production: Structured Outputs and Data Validation

Learn how to make large language models (LLMs) behave predictably and provide structured outputs for production systems. Explore techniques for prompt engineering, JSON mode, function calling, and data validation with Pydantic.

August 8, 2024

Building an Gmail Auto Labeler With LLMs: A Step-by-Step Guide

Learn how to build an email classifier using affordable Language Models (LLMs) like GPT-4-mini and Ollama. Automate your inbox management with machine learning.

August 1, 2024

LLM System Design: A Practical Guide

Learn how to build an effective Large Language Model (LLM) system. Explore key components, building blocks, and advanced techniques for designing LLM systems.

July 31, 2024

Llama 3's Synthetic Code Generation: Developing an AI-Powered Coding Environment

Explore the technical details of Llama 3's synthetic data generation approach for code generation tasks. Learn how this self-supervised learning system generates, solves, and learns from diverse coding challenges.

July 18, 2024

Building an LLM Evaluation Tool with Synthetic Data Generation in 20 mins

Learn how to build a flexible LLM evaluation tool with dynamic schema generation, synthetic data creation, and model evaluation using Python and LLMs.

July 11, 2024

Whisper Audio to Text: Transcribing Long Audio Files with Python

Learn how to use Whisper audio to text conversion for long files with our Python script. Overcome file size limits and easily transcribe podcasts, interviews, and lectures. Step-by-step guide with code examples.

January 6, 2024

Using LLMs to Build A Code Generation Dataset

Learn how to use LLMs, gpt-3.5-turbo, to scrape and clean code from the web