My Battles With Technology

A Deep Dive into Google's Gemini CLI: Local AI Model Execution

A Deep Dive into Google’s Gemini CLI: Local AI Model Execution Introduction Google’s Gemini CLI is a powerful, open-source AI agent designed to bring Google’s Gemini models directly into the terminal. This tool allows developers to interact with advanced AI models through natural language prompts, enabling a wide range of tasks beyond traditional coding. In this article, we’ll delve into the features, capabilities, and applications of Gemini CLI, exploring its potential to revolutionize developer productivity and democratize AI. ...

Production Passkey Implementation: WebAuthn/FIDO2 Security Analysis and Complete Code

Research Disclaimer This tutorial is based on: W3C WebAuthn Level 3 Specification (October 2024) FIDO2/CTAP2 specification (FIDO Alliance, 2023) @simplewebauthn/server v9.0+ (Node.js library) py_webauthn v2.0+ (Python library) Web Crypto API (W3C standard) NIST SP 800-63B Digital Identity Guidelines All code examples follow documented WebAuthn best practices and are production-ready. Security analysis is based on FIDO Alliance and W3C standards. Examples tested on Chrome 119+, Safari 17+, Firefox 120+, Edge 119+. ...

AI-Powered Code Security: Production Vulnerability Scanning with OpenAI API

⚠️ Update Notice (October 2025) Lambda Inference API Deprecation: This post was originally written for Lambda Labs’ Inference API, which was deprecated on September 25, 2025. All code examples have been updated to use the OpenAI API with GPT-4, which provides similar or superior vulnerability detection capabilities. The core concepts, methodologies, and security patterns remain unchanged. Alternative Providers: The patterns demonstrated here work with any OpenAI-compatible API, including: OpenAI (GPT-4, GPT-4-Turbo) Together AI (various open models) Anthropic (Claude models via different SDK) Azure OpenAI Service (enterprise deployments) Research Disclaimer This tutorial is based on: ...

A Developer’s Guide to Anthropic’s MCP: Integrating AI Models with Data Sources

Introduction “AI models are only as powerful as the data they access.” Anthropic’s Model Context Protocol (MCP) bridges this gap by standardizing how AI systems connect to structured and unstructured data sources—from cloud storage to enterprise databases. Yet, deploying MCP in production requires careful attention to architecture, security, and performance trade-offs. This guide walks through: MCP’s client-server architecture and how it differs from traditional API-based integrations. Step-by-step implementation with Azure Blob Storage (adaptable to PostgreSQL, GitHub, etc.). Security hardening for enterprise deployments (RBAC, encryption, auditing). Performance tuning for large-scale datasets (caching, batching, monitoring). Scope: This is a technical deep dive—assumes familiarity with REST/GraphQL and Python. ...

Optimizing AI Chatbot Energy Costs: Practical Metrics and Workflow Strategies

Optimizing AI Chatbot Energy Costs: Practical Metrics and Workflow Strategies Hook: “Training GPT-3 consumes ~1,300 MWh—but inference at scale can be worse. Here’s how to measure and slash your LLM’s energy footprint.” AI chatbots are electricity hogs. While training large language models (LLMs) like GPT-4 dominates sustainability discussions, inference—the process of generating responses to user queries—can cumulatively surpass training energy costs when deployed at scale. For DevOps teams and developers, unchecked energy use translates to: ...