Building Production-Grade AI Data Access: AWS Athena MCP Server
Learn how to build a production-grade AWS Athena MCP server that gives AI agents secure access to your data warehouse.
A practical guide to building retrieval-augmented generation pipelines on AWS with Bedrock and Pinecone.
Learn how Claude 4 Sonnet delivers near-instant responses and extended thinking with powerful coding and AI agent capabilities.
Explore Claude 4 Opus, Anthropic's advanced AI model for superior coding, extended reasoning, and tool integration. Learn its features and how to implement it effectively.
An overview of RAG, combining language models with external knowledge retrieval.
A quick experiment on using GPT-4o-mini for document classification.
Learn about Large Language Model (LLM) evaluations, their importance, key metrics, and advanced evaluation techniques. Discover how to build effective evaluation sets, integrate human feedback, and compare LLMs.
Read about how I converted OpenAI's Swarm multi-agent educational framework to TypeScript using AI. Explore the conversion process, key insights, and the resulting TypeScript version.
Learn how to create a lightweight LLM text-to-SQL engine in Python. This guide explains generating SQL from natural language queries without relying on heavy frameworks.
Learn how I coordinated code changes across two open-source projects to cut an AI agent system's LLM costs by 90%. Discover the challenges, solutions, and the impact of this cost-saving initiative.
Explore the process of tasking an LLM code agent with building a vector database service using FastAPI. Learn about the challenges, iterations, and final product of this AI-driven development experiment.
This guide demonstrates how to use Claude's tool use capability in conjunction with Pydantic to generate structured AI responses. By combining these technologies, developers can ensure consistent, type-safe output from Claude that integrates seamlessly with Python applications.
Learn how to make large language models (LLMs) behave predictably and provide structured outputs for production systems. Explore techniques for prompt engineering, JSON mode, function calling, and data validation with Pydantic.
Learn how to build an email classifier using affordable large language models (LLMs) such as GPT-4o-mini or local models served through Ollama. Automate your inbox management with machine learning.
Learn how to build an effective Large Language Model (LLM) system. Explore key components, building blocks, and advanced techniques for designing LLM systems.
Explore the technical details of Llama 3's synthetic data generation approach for code generation tasks. Learn how this self-supervised learning system generates, solves, and learns from diverse coding challenges.
Learn how to build a flexible LLM evaluation tool with dynamic schema generation, synthetic data creation, and model evaluation using Python and LLMs.
Learn how to use Whisper audio-to-text conversion for long files with a Python script. Overcome file size limits and easily transcribe podcasts, interviews, and lectures. Step-by-step guide with code examples.
Learn how to use an LLM (gpt-3.5-turbo) to scrape and clean code from the web.