Unlocking the Power of AI Everywhere: Enhancements in Google Distributed Cloud

The influence of AI in the 2020s has already been transformative. However, several challenges stand in the way of enterprises fully harnessing AI's potential. Data sovereignty, regulatory compliance, and low-latency requirements can restrict the ability to leverage cloud benefits like scalability, cost-efficiency, and cutting-edge innovation.

The Importance of Running AI Everywhere

Google Distributed Cloud (GDC) addresses these challenges by extending Google's AI services to where businesses need them most — whether in their own data centers or at the edge. GDC is designed with AI and data-intensive workloads in mind, providing a robust, fully managed hardware and software solution that can operate connected to Google Cloud or air-gapped from the public internet. This flexibility enables organizations to process and analyze data near its source, ensuring compliance, low latency, and high security.

GDC’s New GenAI Search Packaged Solution

At the forefront of GDC's latest enhancements is the new generative AI search packaged solution, powered by the Gemma 7B model. This solution is poised to revolutionize how businesses retrieve and analyze data both on-premises and at the edge. Announced for preview in Q2 2024, it leverages Google's state-of-the-art open models, including Gemma and Llama, to facilitate advanced data processing capabilities directly where the data resides.

Architecture diagram for the Google Distributed Cloud gen AI search packaged solution
icon/enlarge

The GDC AI search solution empowers organizations to deploy an on-prem conversational search tool that uses natural language processing to find the most relevant information from their data. This can significantly boost employee productivity and knowledge sharing, as search queries and data remain on-prem, ensuring compliance with data sovereignty regulations. The architecture of this solution, as illustrated in the attached diagram, integrates various AI and data processing tools:

  1. Data Ingestion: OCR models, translation models, and speech-to-text models from Vertex AI convert images, audio, and text into English text.
  2. Embedding and Indexing: The text is then embedded into vectors using the E5 Large V2 model and indexed in a vector database (AlloyDB Omni).
  3. Query Processing: When a user submits a query via a chat interface, it is embedded into a vector and searched against the indexed vectors in AlloyDB Omni.
  4. Result Retrieval and Response Generation: The relevant document chunks are retrieved and sent to a large language model (LLM) like Gemma 7B, which generates an AI-driven response.

This solution provides a comprehensive AI stack on-premises, integrating Google-developed models, third-party solutions, and open-source tools. It supports various pre-trained APIs for tasks such as speech-to-text, translation, and optical character recognition, ensuring versatile data handling capabilities.

Security and Platform Enhancements

Deploying AI and other enterprise-grade applications demands a robust, secure platform. GDC places a strong emphasis on security and compliance, enabling organizations to meet stringent regulatory requirements while protecting sensitive workloads. Several new security and platform enhancements have been introduced:

Security Enhancements

  • ISO27001 and SOC2 Compliance: GDC has achieved these certifications, demonstrating its commitment to meeting high standards of regulatory and compliance requirements.
  • Managed Intrusion Detection and Prevention Solution (IDPS): This integrates Palo Alto Networks' threat prevention technology with the GDC architecture, inspecting north-south traffic and providing transparent inline protection for GDC workloads.

Platform Enhancements

  • GDC Sandbox:This managed environment allows customers and partners to build and test services for GDC in a Google Cloud environment, bypassing the need for physical hardware.
  • GDC Racks:The latest-generation racks are optimized for AI and general compute workloads, offering flexibility with network- or storage-optimized nodes.
  • Storage Flexibility:GDC now supports independent growth of storage to accommodate large analytics or AI workloads, with options for block, file, or object storage.
  • Survivability Enhancements:GDC supports disconnected mode for up to seven days and includes offline management features to ensure continuous operation even when disconnected.
  • Apigee Support:Google Cloud's API management solution, Apigee, is now supported on GDC, enabling scalable API management for AI workloads.

GDC Ecosystem with Nvidia

Collaboration with partners is key to deploying AI applications effectively at the edge or on-prem. Google continues to expand the GDC ecosystem, working closely with partners to provide specialized skills and technologies. Notable developments include:

  • Managed GDC Providers: Partners such as Clarence, T-Systems, and WWT now offer GDC as a managed service, providing end-to-end solutions including operation, design, migration, and deployment services.
  • Google Cloud Ready — Distributed Cloud Badge: This new certification helps customers identify independent software vendors (ISVs) whose solutions are optimized for GDC. Partners like Canonical, CIQ, Elastic, MongoDB, Palo Alto Networks, and Starbust have already validated their solutions, with more partners in the pipeline.
  • Nvidia Partnership: Google continues to collaborate with Nvidia to integrate the latest GPUs for AI workloads. The new GDC server features the energy-efficient Nvidia L4 Tensor Core GPU, designed for use cases where high availability is not required, alongside the AI-optimized servers with Nvidia H100 Tensor Core GPUs.

The enhancements to Google Distributed Cloud are poised to revolutionize how businesses deploy and manage AI workloads across diverse environments. By bringing Google's AI capabilities closer to where data is generated, GDC ensures compliance, enhances security, and provides the flexibility needed to innovate at scale. With new generative AI solutions, robust security features, platform enhancements, and a growing ecosystem of partners, GDC empowers organizations to fully leverage the power of AI, wherever they need it.

Reference: Google Cloud Blog | Run AI anywhere with Google Distributed Cloud innovations

Subscribe to Our Newsletters

Grow Your Competitive Edge With Our Insights.