Skip to Content
  • Home
  • Blog
  • Privacy Policy
  • Terms And conditions
  • Disclaimer
  • About Us
      • Home
      • Blog
      • Privacy Policy
      • Terms And conditions
      • Disclaimer
      • About Us
  • Knowledge Base
  • Building an AI Engineering Stack: Cloudflare's Internal Development Journey
  • Building an AI Engineering Stack: Cloudflare's Internal Development Journey

    4 May 2026 by
    Suraj Barman

    Building an AI Engineering Stack: Definition and Purpose

    The term AI engineering stack refers to a structured framework that integrates artificial intelligence tools and technologies into the software development lifecycle. This stack is designed to streamline workflows, enhance developer productivity, and enable seamless adoption of AI-powered coding assistants. Cloudflare's recent efforts in building their internal AI engineering stack exemplify the strategic implementation of such technology. By leveraging their proprietary infrastructure, the company aimed to enhance its engineering capabilities and scale AI adoption across its workforce.

    Key Objectives Behind Cloudflare's AI Integration

    Cloudflare's initiative to build an AI engineering stack was driven by the need for improved developer velocity and operational efficiency. The project began with the development of MCP servers and an access layer to facilitate AI tooling capabilities. These components served as the backbone for enabling AI agents to perform tasks effectively within the organization. The project also sought to establish a robust framework for code review processes, onboarding practices, and repository management to optimize engineering workflows.

    By unifying their AI tools under a centralized system, Cloudflare aimed to address challenges related to scalability and cost management. This goal was particularly pertinent given the massive number of AI requests and tokens processed monthly. Additionally, the company aimed to redefine standards for code propagation and review, ensuring that the AI tools aligned with organizational objectives.

    Another core objective was to foster collaboration across various teams, creating a unified vision for AI integration. Engineers from diverse departments were brought together to form the iMARS squad, focusing on the development and rollout of agent-based AI tools. This strategic approach ensured that the initiative captured a wide range of expertise and perspectives.

    Development of Key Components

    Central to the AI engineering stack was the construction of MCP servers, which served as the foundational infrastructure for AI tools. These servers were designed with Zero Trust authentication mechanisms to ensure secure access to sensitive data and resources. Additionally, the AI Gateway was developed to manage 20.18 million monthly requests and process a staggering 24.137 billion tokens, highlighting the scale of operations.

    The engineering team also created centralized LLM routing systems for cost tracking and resource allocation. This allowed them to optimize the use of AI tools while maintaining budgetary constraints. Another key innovation was the implementation of BYOK (Bring Your Own Key) protocols, which enhanced data security by allowing engineers to manage encryption keys independently.

    To ensure compatibility and ease of use, the team integrated open-source and third-party coding assistant tools into the MCP-compatible client layer. These tools included OpenCode and Windsurf, which were tailored to meet the specific needs of Cloudflares engineering teams. The combination of these components resulted in a versatile and scalable AI engineering stack.

    Metrics Demonstrating Success

    The success of Cloudflare's AI engineering stack is evident in the metrics achieved over the past 30 days. A total of 3,683 internal users actively utilized AI coding tools, representing 60% of the companys workforce and 93% of its R&D organization. Additionally, 4,795 million AI requests were processed, and 295 teams adopted the agentic AI tools for their projects.

    One of the most striking achievements was the increase in developer velocity. The rolling average of weekly merge requests rose from 5,600 to over 8,700, with the week of March 23 seeing an unprecedented 10,952 merge requests. This surge highlights the tangible impact of AI integration on the companys engineering productivity.

    Furthermore, the architecture enabled the processing of over 24.137 billion tokens through the AI Gateway and 51.83 billion tokens on Workers AI. These figures underscore the scalability and efficiency of the internal AI tools, validating the effectiveness of the engineering stack.

    Challenges and Solutions in Implementation

    Despite its success, the development of Cloudflares AI engineering stack was not without challenges. One significant hurdle was ensuring cross-team collaboration. The formation of the iMARS squad was instrumental in addressing this, as it brought together diverse engineering talents to tackle complex tasks collectively.

    Another challenge was the redefinition of coding standards and review processes across thousands of repositories. To address this, the team implemented automated systems to standardize code review and expedite onboarding. These systems were designed to propagate changes efficiently while maintaining quality and compliance.

    Cost management posed another obstacle, given the high volume of AI requests and tokens processed. The centralized LLM routing and cost tracking systems provided a solution, enabling the team to monitor and control expenditures effectively. This ensured the sustainable scalability of the AI engineering stack.

    Impact on Engineering Practices

    The introduction of the AI engineering stack has significantly transformed Cloudflares engineering practices. Developers now benefit from streamlined workflows, enhanced coding tools, and automated processes for code review and onboarding. These improvements have not only boosted individual productivity but also fostered a culture of innovation within the organization.

    Moreover, the integration of AI tools has enabled engineers to tackle complex problems with greater efficiency. The availability of advanced coding assistants has reduced the time spent on repetitive tasks, allowing developers to focus on strategic initiatives. This shift has led to a marked improvement in the quality and speed of software delivery.

    Finally, the AI engineering stack has set a new benchmark for internal tooling and infrastructure. By aligning the stack with existing Cloudflare products, the company has demonstrated its commitment to leveraging its own solutions for internal growth.

    Future Prospects and Scalability

    Looking ahead, Cloudflare plans to continue refining its AI engineering stack to address emerging challenges and opportunities. The company is exploring ways to expand the capabilities of its AI tools, ensuring they remain relevant in a rapidly evolving technological landscape. This includes integrating new features and optimizing existing components for greater efficiency.

    Another area of focus is the scalability of the stack. With the increasing adoption of AI tools across teams, Cloudflare aims to ensure that its infrastructure can support the growing demand. This may involve enhancing the capacity of MCP servers and AI Gateway systems to handle higher volumes of requests.

    Cloudflare is also committed to fostering a culture of continuous improvement. By leveraging feedback from engineers and monitoring usage patterns, the company can identify areas for enhancement and make data-driven decisions. This approach will ensure the long-term success of the AI engineering stack and its contribution to organizational goals.


    Latest Stories

    Explore fresh ideas and updates from our editorial team.

    See All
    Your Dynamic Snippet will be displayed here... This message is displayed because you did not provide enough options to retrieve its content.

    Copyright © 2026 TechStora. All Rights Reserved.