Code Smarter

From Prompts to Context with GitHub Copilot

Jorge Castillo

In the past few weeks, I've embarked on an exciting journey that reshaped how I use AI in software development. Initially, my focus was on prompt engineering, a widely discussed technique where carefully crafted prompts help AI models produce better code or explanations. Yet, as I spent more time experimenting, I realized something crucial: a prompt alone was rarely enough. The models needed more contextmuch more.

Then came the "aha" moment: I stumbled upon a practice known as Context Engineering. It turns out the methods I had organically developed aligned closely with a structured and widely adopted practice already documented by the AI community.

What Exactly is Context Engineering?

Simply put, Context Engineering is the discipline of strategically selecting, compressing, and structuring information to enhance an AI models comprehension and outputs. Phil Schmid puts it elegantly: providing the "right info, in the right format, at the right time". LangChain further categorizes context engineering into four concrete strategies: write, select, compress, and isolateeach essential in guiding a models reasoning.

This resonated deeply with me. I realized I'd been instinctively applying these principles, though without formalizing them. Discovering this well-defined methodology gave me a roadmap to deepen and refine my practice.

VS Code 1.101: Custom Chat Modes and Tools

The recent release of Visual Studio Code 1.101 took Context Engineering to another level. Now, creating and using custom contexts doesn't involve complicated hacks. You simply place a markdown file describing your chat mode, e.g. for your project, put it into .github/chatmodes/, and VS Code automatically makes it available within GitHub Copilot.

Here's a practical example of a custom chat mode file:

---
description: 'Generate an implementation plan for new features or refactoring existing code.'
tools: ['codebase', 'fetch', 'findTestFiles', 'githubRepo', 'search', 'usages', 'context7', 'sequential-thinking', 'microsoft-docs']
---
# Planning mode instructions
You are in planning mode. Your task is to generate an implementation plan for a new feature or for refactoring existing code.
Don't make any code edits, just generate a plan.

The plan consists of a Markdown document that describes the implementation plan, including the following sections:

* Overview: A brief description of the feature or refactoring task.
* Requirements: A list of requirements for the feature or refactoring task.
* Implementation Steps: A detailed list of steps to implement the feature or refactoring task.
* Testing: A list of tests that need to be implemented to verify the feature or refactoring task.

By selecting this mode, GitHub Copilot immediately generates an implementation plan for new features or refactoring existing code.

Leveraging Community Wisdom: awesome-copilot-chatmodes

Exploring community resources like the awesome-copilot-chatmodes repository was another critical turning point. The repository is a treasure trove of ready-to-use chat modes, prompts and instructions; from rigorous code reviewers to supportive debugging assistants. It provided practical insights into how structured, predefined contexts can significantly enhance AI performance.

Experimenting with these resources, adapting them, and integrating them into my workflow significantly streamlined my tasks.

My Context Stack: MCP Tools I Cant Live Without

Three MCP tools, in particular, transformed my development experience:

Context7: Automatically retrieves the latest documentation and practical code snippets. This ensures GitHub Copilot delivers up-to-date and accurate code suggestions rather than outdated or generalized information.
Sequential Thinking: Reveals the AI's chain of thought step-by-step. This visualization helps me instantly verify whether the AI model fully grasped my instructions, making it easy to refine prompts and instructions in real-time.
Microsoft Docs MCP: Provides authoritative references for .NET and Azure, ensuring Copilot aligns with official Microsoft standards and best practices.

Using these tools not only improved the immediate quality of AI-generated outputs but significantly increased my confidence and understanding of the models reasoning.

The Importance of Planning First, Coding Later

One crucial lesson from this experience was reaffirming the importance of meticulous planning. Before initiating any coding session, I adopted a structured preparation approach:

Clearly outlining project goals, constraints, and success criteria.
Curating and preparing relevant documentation snippets, best practices, and examples.
Establishing preliminary test scaffolds to communicate intent.
Selecting an appropriate, context-specific chat mode.

This structured approach helped ensure GitHub Copilot produced predictable, aligned, and reliable outputs, vastly reducing instances of hallucinations or off-target suggestions.

Real-World Results: Wins and Lessons Learned

The results were remarkable:

Significant time savings: Tasks such as generating FastAPI router boilerplates went from approximately 20 minutes to less than 5 minutes.
Improved reasoning: Sequential Thinking instantly flagged logical oversightssuch as missing rate-limiters or incomplete middleware implementations.
Heightened awareness: Despite its capabilities, Copilot occasionally produced incorrect Python code; it keeps adding new functions to the top, before the imports section, underscoring the need for careful human review.

From this experience, I distilled several best practices:

Iterative prompt refinement: Minor wording adjustments significantly improved outputs.
Testing as a guardrail: Pre-defined tests ensured functionality remained within expected boundaries.
Focused context bundles: Precise, concise contexts proved more effective than broad, general-purpose prompts.
Mentoring mindset: Treat AI like a junior developer, guiding and reviewing its contributions carefully.

Looking Forward: Next Steps in My Context Engineering Journey

Theres still much to explore. On my roadmap:

Crafting more specialized persona modes for testing, security, and performance optimization.
Play with Claude Code.
Developing debug-aware prompts to automatically detect and address errors more efficiently.

If you've explored these avenues or discovered innovative techniques, I would love to exchange ideas and experiences.

Your Turn: Dive into Context Engineering

I encourage you to experiment with Context Engineering today. Start by creating a simple custom chat mode for your next projectit's easier than you think and immensely rewarding. Share your experiences, discoveries, and insights in the comments below. Lets keep learning and improving together!

Happy coding!

Prototyping Smarter with Aspire, Ollama & GitHub Models

Jorge Castillo

Thats exactly what I set out to dotransforming a single-provider AI demo into a flexible, production-like playground that supports local models with Ollama, and remote models with GitHub Models and Azure AI Foundry deployments. Along the way, I experimented with model orchestration, tool support, plugins, and a dynamic provider switcherall while learning a ton and pushing my productivity further.

In this post, Ill walk you through what I built, what worked (and what didnt), and why this turned into one of my most exciting AI automation experiments yet.

🌐 Multi-Provider, One Kernel: Why Settle for Less?

The original AI Developer Workshop provided a solid foundation for working with .NET Aspire and AI agentsbut it was locked to a single model provider.

So I refactored the kernel configuration to support dynamic model switching, allowing the user to toggle between:

Local Ollama-based models (e.g. llama3.2:3b, qwen3:8b)
Remote GitHub-hosted models via OpenAI-compatible endpoints
Azure AI Foundry deployments with enterprise-grade infrastructure

Heres the core logic powering the switch:

switch (currentProvider)
{
    case Provider.Local:
        kernelBuilder.AddOllamaChatCompletion(selectedModel, new Uri(ollamaApirUrl));
        break;
    case Provider.Remote:
        kernelBuilder.AddAzureOpenAIChatCompletion(
            deploymentName: azureModelName,
            endpoint: new Uri(azureEndpoint),
            apiKey: azureKey);
        break;
    case Provider.GitHub:
        kernelBuilder.AddOpenAIChatCompletion(
            modelId: githubModelName,
            endpoint: new Uri(githubEndpoint),
            apiKey: githubKey);
        break;
    default:
        throw new InvalidOperationException("Unknown provider selected");
}

🔄 Why it matters: With this setup, I could A/B test models, validate fallback strategies, and explore performance trade-offs in real-timeno code changes needed.

🧠 Local Models with Tool Support? Tougher Than Expected.

Running LLMs locally has clear advantagesprivacy, cost control, and the joy of offline AI. But finding a model that supports tool usage reliably? That took work.

My path:

llama3.2:3b: Lightweight and quick for single-tool calls, but broke on multi-tool prompts.
🔥 qwen3:8b: Surprisingly capablehandled multi-step toolchains in a single prompt with no issues.
🧩 GitHub Copilots GPT-4o (remote): Zero problems, blazing fast, and great fallback when local wasn't enough.

Lesson learned: Not all local models are agentic ready. Test them like you would test any external service: with real-world tasks and failure conditions.

🧩 Real Plugins, Real Use Cases

To make conversations useful (not just clever), I build a set of simple semantic kernel plugins following the guide presented in the Workshop:

TimePlugin: Answers time/date-related questions.
GeocodingPlugin: Converts place names into lat/lon coordinates.
WeatherPlugin: Fetches current and historical forecasts via OpenWeatherMap.

Plugin registration was frictionless:

kernel.Plugins.AddFromObject(new WeatherPlugin(), "Weather");

🔌 Pro tip: Plugins are the secret sauce for turning LLMs into real-world agents. They give structure, context, and purpose to your AI prompts.

Ollama + Aspire: An Unexpectedly Good Developer UX

I was curious about the new CommunityToolkit.Aspire.Hosting.Ollama packageand Im glad I tried it. In just a few lines, I spun up GPU-backed containers with Open WebUI access and model preloading:

var ollamaService = builder.AddOllama("ollama")
    .WithDataVolume()
    .WithGPUSupport(OllamaGpuVendor.Nvidia)
    .WithOpenWebUI();

ollamaService.AddModel("qwen3", "qwen3:8b");

Using Aspires dashboard, I could visualize dependencies, inspect logs, and even debug calls through GitHub Copilot integration. If youre building local-first LLM services, this stack is a dream.

💻 GitHub Models: Fast Prototyping, No Vendor Lock-in

GitHubs hosted models turned out to be perfect for prototyping. They speak the same API as OpenAI, so swapping backends was painless. Key benefits:

No credit card required for dev/test.
Fully OpenAI-compatible endpoints.
Ideal for CI environments or demos.

🚀 Tip: Dont underestimate hosted OSS modelsthey can save you hours and dollars during the exploration phase.

🛡 Secrets, Logging, and Keeping It Clean

While AI was the star of the show, security and maintainability were never an afterthought:

Used user-secrets and appsettings.Development.json for sensitive values.
Cleaned up .gitignore to avoid config leaks.
Added usage docs and diagrams to README.md.

🧼 Reminder: Productivity without hygiene is a debt trap. Automate config and document your architecture as you go.

🧪 GitHub Copilot: Still My Favourite Pair Programmer

From generating plugins to rewriting chat UX logic, GitHub Copilot was there every step of the way:

💡 Suggested API patterns and integration tricks.
🧠 Helped debug multi-provider setup quirks.
🪄 Wrote most of the boilerplate so I could focus on design.

Even with all the cool tools Ive tested, Copilot remains my daily accelerator.

🔚 Final Thoughts: What Ill Keep (and What Ill Tweak)

🚀 What worked:

Dynamic provider selection made testing effortless.
Aspire + Ollama delivered solid local dev ergonomics.
Qwen3 + tools = big win for offline agentic workflows.

🧱 What needs improvement:

Tool-based prompting is still fragile with some models.
Add support for the chat output.
Implement stream chat support.

🗣 Join Me in the Experiment

If you're building LLM-powered apps, dont stop at the default examples. Push boundaries. Mix providers. Try weird models. Break things and rebuild them better. Thats how we move from toy demos to real innovation.

You can access the repository with the code of this article in https://github.com/jcastillopino/ai-developer-enhanced

Leave a comment or reach outI'd love to hear about your own agentic setups.

Lets keep building smarter. 🧠💻

Building an MCP for Excel

Jorge Castillo

Introduction

These days, everyone seems to be talking about MCPs and agentic workflows in the context of AI-driven software development. Rather than add another technical deep dive to the pile, Ill skip the detailed explanation of what an MCP isif youre curious or need a refresher, feel free to ask me in the comments or check out this resource from the official MCP documentation for more details.

If youve followed my writing, youll know Im passionate about applying artificial intelligence to software development. Ive already shared my thoughts about how AI is transforming our workflowsif you missed those, you can check them out here: Post 0: Launching My Agentic Coding & AIAutomation Journey and LLM Agents Transforming Enterprise Efficiency.

Today, I want to focus on a recent project that took that passion in a new direction: building an MCP for Excel, integrating it with generative AI to automate and supercharge my development process.

Project Background & Motivation

The motivation behind building an MCP for Excel came directly from a real-world need in one of my recent projects. I was tasked with writing and maintaining VBA code within Excel files to automate various time-consuming and error-prone tasks. If youve ever developed in Excel, you know that the environment feels stuck in timethe built-in VBA editor is basic, outdated, and lacks modern development features we take for granted elsewhere.

At the same time, I wanted to leverage the power of artificial intelligence to accelerate development, automate repetitive tasks, and enhance my coding experience. However, for AI tools to be truly helpful, they need a way to see and understand the structure and data within Excel filessomething not easily possible out of the box.

This challenge led me to develop my own MCP server for Excel: a solution that could bridge the gap between modern AI-powered coding assistants and the legacy world of VBA scripting. My goal was simple: make the AI smarter, giving it a tool to fully understand Excel and make my workflow much more efficient.

Choosing the Stack

When it came time to choose the technology stack for this project, C# was the natural choice. Ive been programming in C# since around 2001-2002 when I was a beta-tester of Visual Studio .NET 2002, and while Ive also used TypeScript and Python to build other small MCP servers, I was particularly excited to take advantage of Microsofts recently released C# SDK for MCP development. This project provided the perfect opportunity to combine my deep knowledge of C# with my ongoing passion for artificial intelligence.

A key technical challenge was how to interact with Excel filesreading their structure and contentswithout requiring Excel to be installed on the server. During my research, I discovered ClosedXML, a robust open-source library for working with Excel files in .NET. ClosedXML turned out to be exactly what I needed: it offered comprehensive support for reading Excel sheets, formulas, ranges, etc., all without any dependency on the Excel application itself.

With the C# SDK and ClosedXML in hand, I was able to move forward and start building the tools I needed for the project, confident that I was using a stack that matched both my experience and the requirements of the solution.

Architecture Overview

High-Level Architecture

The Excel MCP Server is a modular, extensible C# application that exposes a set of tools for programmatic inspection and analysis of Excel workbooks via the Model Context Protocol (MCP). It is designed for use by AI agents (e.g., GitHub Copilot Chat) and other automation clients, enabling deep, structured access to Excel file content and metadata without requiring Microsoft Excel to be installed.

Key Architectural Components

Entry Point (Program.cs): Configures dependency injection, logging (Serilog), and registers all available tools with the MCP server. Sets up stdio-based communication for MCP.
Tool Layer (src/Tools/): Each tool (e.g., get_schema, get_formulas, get_sheet_data) is implemented as a class with a clear interface, request/response models, and error handling. Tools are registered with the MCP server and exposed as endpoints.
Service Layer (src/Tools//Services/): Each tool has dedicated services for validation and Excel file operations. Services use ClosedXML for direct file access and manipulation, ensuring cross-platform compatibility and no COM/Interop dependency.
Models Layer (src/Tools//Models/): Defines request, response, and error models for each tool, ensuring consistent and structured data exchange.
Core/Validation Layer (src/Core/Validation/): Shared validation logic for file paths, formats, and worksheet existence, used across multiple tools.
Logging: All operations are logged using Serilog, with structured logging for traceability and diagnostics.
Testing (tests/): Comprehensive unit and integration tests for all tools and services, using xUnit and Moq.

Data Flow

Request: An MCP-compatible client sends a request (e.g., get_schema) with parameters (e.g., file path).
Validation: The tool validates input parameters using its validation service.
Processing: The tool delegates to its service layer to open and analyze the Excel file using ClosedXML.
Response: The tool returns a structured JSON response (or error) to the client.
Logging: All steps are logged for auditing and debugging.

Extensibility

New tools can be added by implementing a tool class, models, and services, then registering the tool in Program.cs.
The architecture is modular, with each tool isolated in its own folder.

Mermaid Diagram

graph TD
    subgraph "Client"
        A["MCP Client e.g. Copilot Chat"]
    end
    subgraph "Server"
        B["Program.cs Entry Point"]
        C["MCP Server"]
        D["Tool Layer"]
        E["Service Layer"]
        F["Models Layer"]
        G["Core/Validation"]
        H["Serilog Logging"]
    end
    subgraph "Excel"
        I["Excel File .xlsx, .xlsm, .xltx, .xltm"]
    end

    A-->|"MCP Request"|C
    C-->|"Tool Registration"|D
    D-->|"Validate Input"|G
    D-->|"Call Service"|E
    E-->|"Read/Analyze"|I
    E-->|"Return Data"|D
    D-->|"Structured Response"|C
    C-->|"MCP Response"|A
    D-->|"Log Events"|H
    E-->|"Log Events"|H
    G-->|"Log Events"|H

Technology Stack

.NET 9 / C#
ClosedXML (Excel file access)
ModelContextProtocol/csharp-sdk (MCP communication)
Serilog (logging)
xUnit, Moq (testing)

Key Design Principles

Stateless per request: Each request is handled independently; Excel files are opened, processed, and closed per operation.
No Excel/COM dependency: All file operations use ClosedXML for portability and reliability.
Structured error handling: All errors are returned as structured JSON with codes and messages.
Modular and extensible: Each tool is self-contained and easily testable.

Tools Created

Here I'm showing the full description, not the final description sent to the MCP Client.

get_formulas

Extracts all formulas from an Excel workbook or a specific worksheet. Returns a list of formulas with cell address, formula string (as written in Excel, without the leading '='), evaluated value (if requested), hidden status, and worksheet name. Supports filtering by sheet and optionally includes formula results. Use this tool to audit, analyze, or document all formulas present in a workbook, including detection of hidden or error formulas.

get_schema

Extracts a complete structural overview of an Excel workbook. Returns a detailed JSON schema listing all worksheet names, their dimensions, all tables, and named ranges present in the file. Use this tool to understand the organization, available data regions, and metadata of any supported Excel file before attempting data extraction or analysis.

get_sheet_data

Retrieves tabular data from a specified worksheet in an Excel file. Returns column headers and a configurable number of data rows (with pagination support). Use this tool to access the actual cell values from a sheet, for previewing, processing, or exporting worksheet data. Requires the sheet name and supports limiting the number of rows returned.

get_range_data

Retrieves data from a specified range in an Excel workbook. Supports both explicit ranges (e.g., A1:B10) and named ranges. Returns a 2D array of cell values, with support for formulas (returns calculated values), merged cells (returns the merged value for all covered cells), and empty cells (as empty strings). Enforces a maximum of 1000 rows x 100 columns per request, with a warning if data is truncated. Use this tool for precise extraction of rectangular or named regions, including tables, headers, or custom ranges. Also supports workbook-level named ranges that may span multiple sheets (returns sheet names in response) and provides clear error handling for ambiguous, missing, or unsupported named ranges (e.g., non-contiguous).

get_sheet_summary

Analyzes a worksheet and returns a summary of its columns, including column names, inferred data types (e.g., Text, Decimal, Date, Boolean, Unknown), and statistics such as count of non-empty cells, distinct values, and presence of formulas. Use this tool to quickly understand the structure and content of a sheet, identify data types, and detect columns with mixed or empty data.

get_table_data

Retrieves structured data from a specified Excel table (ListObject). Returns column headers, data rows with preserved data types, and table metadata including dimensions and location. Supports pagination for large tables. Use this tool to access structured tabular data that has been defined as an Excel table, with automatic type preservation for numbers, dates, and formulas.

list_named_ranges

Lists all visible named ranges in an Excel workbook, including name, absolute reference, scope, and worksheet.

Development Journey

For this project, my main source of information was the official documentation for the C# SDK for MCP servers. I also found it helpful to study examples of other MCPs available in the github.com/modelcontextprotocol/servers repository.

Throughout the early stages, a recurring challenge was describing tools and their parameters so that artificial intelligence could use them effectively. Initial attempts often led to underutilization by the AI until I began directly iterating based on AI feedback, and even created a custom GPT within ChatGPT to refine prompt and tool definitions.

Debugging was another challenge. Without the ability to set breakpoints as in traditional application development, I had to rely on detailed log files to track down issuesa workaround reminiscent of earlier coding days, but effective nonetheless.

GitHub Copilot was invaluable. Whether using it in Chat, Edit, or Agent modes, Copilots suggestions, custom instructions, and prompt guidance made development faster and more enjoyable.

With these tools and methods, I was able to build the entire MCP server in just two daysmuch faster than if I had started from scratch without modern AI-driven tools and iterative development. Keep in mind that this is a 6,175-line code project with 164 tests (Unit and Integration tests).

Excitement of AI Using My Tool

The real wow moment came after I finished developing the MCP server and integrated it into my project for generating VBA scripts for Excel. Seeing artificial intelligence actually leverage the tools I had builtto extract all the necessary details from Excel files, understand the columns, formulas, tables, ranges, and structure, and then generate fully functional VBA scriptswas simply amazing.

It was incredibly exciting to watch the AI use my MCP to seamlessly access and interpret the information from the files I was working on. Suddenly, tasks that used to be tedious and manual became much more efficient. The tool I had created was now empowering the AI to do things it couldnt do before, making my workflow smoother and smarter.

Witnessing this in action was genuinely surprising. I realized that by bridging the gap between Excel and AI, I had given artificial intelligence a new capability to understand and manipulate complex Excel files in ways that made my own life as a developer much easier. Its moments like these that remind me why Im so passionate about combining automation, AI, and practical problem-solving.

Key Takeaways and Lessons Learned

Describe Tools with AI in Mind: One key lesson was the importance of crafting clear, unambiguous tool descriptions and parameter explanations for the AI to use them effectively. Initially, I underestimated how much small changes in wording could affect whether AI agents discovered and leveraged the right tools. Iterative testing and actually asking the AI for advice led to substantial improvements.
Iterate with Feedback Loops: Prompt engineering is an ongoing process. Creating a custom GPT and using AI suggestions as part of my workflow allowed for rapid experimentation and better outcomes.
Debugging Requires Creativity: Traditional debugging tools are often unavailable when working with agentic tools or headless servers. Falling back on structured, verbose logging was essential for troubleshooting issues and understanding execution flows.
AI as a Pair Programmer: Treating AI as a partnerusing GitHub Copilots modes, reviewing suggestions, and organizing promptsincreased productivity and helped bridge the gap between idea and implementation.
Speed Gains Are Real: By combining a familiar stack (C#, .NET, ClosedXML), good open-source libraries, and generative AI tools, I built a robust and extensible MCP server in a fraction of the time a traditional approach would have required.
Learning Never Stops: There are still open questionssuch as how to enable better debugging for MCP servers. If you have tips or tools for this, please share them in the comments!

Closing

If theres one thing I hope you take away from this article, its that creating your own MCP tools can truly expand what artificial intelligence is capable of. By building custom tools that give AI access to data and context it otherwise couldnt reach, you not only solve your own challenges but also push the boundaries of whats possible with automation.

I encourage you to experiment with creating your own MCP tools or agentic workflows, whether its for Excel, other applications, or any scenario where AI could benefit from better access to structured information. Youll be surprised at how much you can streamline your workand how empowering it feels to see AI making use of the capabilities youve built.

If you have any questions, want to share your own experience, or need advice on getting started, feel free to reach out or leave a comment. Im always happy to connect with others exploring the intersection of software development and artificial intelligence.

Lets keep building and learning together!

LLM Agents Transforming Enterprise Efficiency

Jorge Castillo

The digital-transformation wave that defined the last decade is now cresting into something bigger: a shift from automation by rules to automation by understanding. Large Language Models (LLMs) and agentic AI toolssoftware agents that can plan and execute multi-step tasks with minimal human inputare at the heart of this change. Early adopters report that marrying these technologies with existing workflows cuts development or operations effort by 2050 percent and, in some flagship projects, by an order of magnitude.

What Are LLMs and Agentic Tools?

LLMs are neural networks trained on billions of words (and often source-code tokens) that can read and generate natural language with near-human fluency. Because they have learned patterns across vast corpora, they answer questions, summarise documents, draft emails, write SQL queries and even generate production-ready code. Unlike earlier narrow AI models, LLMs excel at generalisation: give the model a clear promptDraft an ESG report summary in 200 wordsand it produces coherent text without a bespoke rules engine.

Agentic tools layer planning and tool-use capabilities on top of LLMs. Instead of a single response, an agent can decompose a goal (migrate our legacy tests) into steps, call external tools (IDEs, CI pipelines, browsers), evaluate results, and iterate until success. Airbnbs migration bot, for instance, read ~3,500 test files, rewrote them for a new framework, ran the suite, and retried failuresfinishing an 18-month manual project in six weeks with 97 percent automation.

How They Are Reshaping Enterprise Operations

Business lever	Practical payoff (evidence)
Task automation	Developers complete routine coding 55 % faster with GitHub Copilot; similar speed-ups appear in report-writing, contract review and marketing copy creation.
Customer service	LLM-backed chatbots resolve Tier-1 queries in seconds, freeing agents for complex cases and raising CSAT scores. Examples mirror the 2030 % efficiency boosts seen in AI-assisted software triage.
Data-driven insights	Agents scan millions of rows, surface anomalies and draft executive briefs in plain Englishcompressing analytics cycles from days to minutes, a pattern echoed in Intuits 23 faster system-integration tasks.
Personalised products	By turning unstructured feedback into feature ideas and tailoring content in real time, firms accelerate time-to-market and lift conversion rates; Microsoft attributes ~30 % of new code to AI, enabling smaller teams to ship more features.
Process optimisation & cost control	Companies see ROIs of roughly $3.70 for every $1 spent on generative-AI tooling, driven by shortened project timelines and reduced bug-fix debt.

Real-World Success Stories

Airbnb Turbo-charging legacy migration
- Challenge: Convert thousands of Enzyme tests to React Testing Library.
- Outcome: LLM-powered agent finished in 6 weeks (13 faster) while preserving 100 % coverage. Engineers intervened on only 3 % of files.
Intuit Context-aware development platform
- Challenge: Speed delivery across 100 M-user fintech suite.
- Outcome: Internal GenOS platform, fine-tuned on proprietary code, cut integration work by 23 and targets a 30 % organisation-wide efficiency lift.
Microsoft Enterprise-scale Copilot rollout
- Scale: Thousands of engineers; up to 30 % of new code written with AI assistance.
- Benefit: Tasks completed 55 % faster in pilot studies; smoother code reviews and higher developer satisfaction.

These cases span travel, finance and technologyillustrating that the advantages are sector-agnostic wherever language, knowledge or software is central to value creation.

Benefits for the Wider Business Landscape

Repetitive work vanishes: Drafting contracts, documenting SOPs or reconciling invoices becomes a one-click operation, releasing staff for higher-order thinking.
Customer experiences upgrade: Advanced chatbots handle multilingual support 24 / 7, using an LLM fine-tuned on brand tone and knowledge bases.
Faster, smarter decisions: Agents can read quarterly reports, extract KPIs and warn finance teams of anomalies before close.
Hyper-personalisation: Marketing engines generate tailored product descriptions or offers on the fly, informed by user behaviour and generated copy.
Cost reduction & resilience: Automating framework upgrades or policy updates slashes technical-debt backlogs and mitigates risk from outdated systems.

Challenges and Responsible Adoption

Risk	Mitigation
Accuracy & hallucinations	Keep a human-in-the-loop; enforce automated tests and reviewsakin to Microsofts internal Copilot guardrails.
Security & privacy	Run code-security scanners on AI output; prefer private instances of Azure OpenAI for sensitive data.
Intellectual-property leakage	Activate reference tracking filters; require attribution checks for large snippets.
Skill degradation	Pair AI adoption with up-skillingdevelopers must explain AI-generated code during PRs, preserving expertise.

A structured 12-month roadmappilots, guidelines, enterprise rollouthelps organisations capture quick wins while embedding governance from day one.

Conclusion

LLMs and agentic AI tools are no longer experimental novelties; they are pragmatic levers that compress months of effort into weeks and routine tasks into minutes. Tangible gains2050 % productivity jumps, multi-million-dollar cost savings, and happier teamshave already been documented across hospitality, fintech and Big Tech.

Yet the real prize extends beyond efficiency. By outsourcing linguistic and procedural grunt work to machines that understand context, enterprises free human talent to focus on creativity, strategy and innovation. Organisations that move deliberatelypiloting use cases, training staff, instituting safeguardswill not just keep pace with the AI revolution; they will set it.

Reference

GitHub (2023). Research: quantifying GitHub Copilots impact on developer productivity and happiness. The GitHub Blog, posted by GitHub research team. Provides results of surveys and a controlled experiment where Copilot users were 55% faster on a coding task and reported higher satisfaction github.blog github.blog.
Microsoft & collaborators (2024). Productivity Assessment of Code Assistants (Study). (Reported via InfoQ: Study Shows AI Coding Assistant Improves Developer Productivity, Sept. 24, 2024 by Anthony Alford). Large-scale RCT with 4,000+ developers at Microsoft, Accenture, and one Fortune 100 firm: Copilot users had a 26% increase in pull requests completed per week on average infoq.com.
Airbnb Engineering (2025). Accelerating Large-Scale Test Migration with LLMs. Airbnb Tech Blog on Medium, Mar. 13, 2025 by Charles Covey-Brandt. Case study of using GPT-4 and automation to migrate 3.5k React test files in 6 weeks (versus ~1.5 years estimated manually), preserving test coverage medium.com nexgencloud.com.
Microsoft Source (2024). Microsoft customers share impact of generative AI. Microsoft News (Source), Nov 19, 2024. Shares stats from Ignite 2024 and IDC research: generative AI usage grew to 75% of enterprises in 2024, and average ROI is $3.70 for every $1 invested in gen AI (reflecting productivity gains) news.microsoft.com.

Post 0: Launching My Agentic Coding & AI‑Automation Journey

Jorge Castillo

Hello, and welcome to Code Smarter! Im Jorge Castillo Pinoa solutions architect and lifelong technologist with more than two decades of handson experience in software design, system modernization, and technical leadership. From containerizing legacy monoliths to architecting cloudnative microservices, Ive always chased smarter, leaner ways to build. Lately, my curiosity has zeroed in on how autonomous AI agentsthink GitHub Copilot, lowcode bots, and selfhealing CI/CD sentinelscan automate everyday workflows, rescue us from repetitive toil, and unlock new creative headspace.

Todays Post 0 is my open invitation to you: join me as I explore the nuts and bolts of agentic coding and practical AIdriven automation, chart what I hope to learn, and show you how to follow along.

Who Am I?

Background I completed the coursework for a Computer Engineering degree at the University of Len but chose not to present my final career project, pivoting instead directly into industry. Earlier, I studied Electrical & Electronic Engineering at the Technological University of Panama. Over my career Ive worn many hatsTech lead, trainer, Delivery Manager and solutions architectguiding projects across Europe and the Americas.
Technical Expertise Ive migrated onprem apps to cloudnative architectures, architected highavailability solutions on Azure (Service Bus, Cosmos DB, Functions, AKS), and championed DevOps with GitHub Actions, Terraform, and lowcode tooling like OutSystems.
AI & Automation My recent focus is on leveraging largelanguagemodel agents to accelerate development. Ive integrated tools such as GitHub Copilot, Windsurf, and custom MCP servers to assist with code generation, automated testing, and selfservice deployments.

Why Agentic Coding & AIDriven Workflow Automation?

Agentic coding is the practice of designing and orchestrating autonomous software agentsprograms that make decisions, collaborate, and execute tasks with minimal human intervention. When fused with AIpowered workflow automation, these agents become force multipliers that can:

Write & Refactor Code Suggest improvements, generate functions, and even spin up whole microservices.
Guard the Pipeline Roll back faulty deployments, resolve merge conflicts, or optimize build times on the fly.
Monitor & Heal Continuously watch performance metrics and autotune infrastructure before incidents occur.
Augment Daily Ops Schedule standups, triage support tickets, and keep stakeholders informed without manual nudges.

The payoff is huge: faster delivery cycles, fewer errors, and more time for engineers to invent and innovate.

What Youll Find Here

HandsOn Tutorials Stepbystep guides to build your own agentsusing OpenAIs APIs, n8n workflows, and VS Code extensions.
Architecture Deep Dives Patterns for agent design, secure communication, observability, and scaling.
Case Studies & POCs Stories from my experience: automated client onboarding, selfhealing microservices, and beyond.
Tool Reviews Realworld evaluations of LangChain, AutoGen, Microsoft Copilot Labs, Cline, Cursor, Roo Code, Github Copilot and other platforms.
Lessons Learned Pitfalls, performance bottlenecks, and the reality of maintaining fleets of cooperating agents in production.

Join the Conversation

Agentic coding and AIdriven workflow automation arent mere buzzwordstheyre reshaping how we craft and ship software. Whether youre curious about a chatops bot for your Slack channel or an AIguided pipeline for continuous improvement, this blog aims to be your guide.

Subscribe to get new posts as they drop.
Comment below with your questions or share your own experiments.
Connect with me on LinkedIn or X lets build smarter code together!

Thanks for stopping by Code Smarter. I cant wait to see what we create.