In the rapidly evolving world of AI, staying ahead requires a platform that not only meets the demands of today but anticipates the needs of tomorrow. Enter Google Cloud’s Vertex AI, a fully-managed, unified platform designed to streamline AI development. Leveraging over 150 first-party, open, and third-party models, Vertex AI offers unparalleled customization, monitoring, and deployment capabilities.
Leading companies like ADT, IHG Hotels & Resorts, ING Bank, and Verizon are harnessing the power of Vertex AI to accelerate their innovation cycles and maintain a competitive edge.
At the recent Google I/O ‘24, several groundbreaking updates to Vertex AI were unveiled, promising to push the boundaries of what's possible in AI development. Here’s a closer look at the major announcements:
Vertex AI is also rolling out several new capabilities designed to optimize model performance and reduce costs:
Context Caching: Set to enter public preview next month, context caching allows customers to manage and reuse cached context data effectively. This feature is crucial for applications with long-context requirements, helping to significantly cut down processing costs by leveraging already processed data.
Controlled Generation: Coming to public preview later this month, controlled generation enables customers to define the format and structure of model outputs. This capability is particularly useful when precise output formats like YAML, XML, or custom schemas are needed. JSON output format is already live, providing a starting point for developers to experiment with structured output generation.
Batch API: Available in public preview today, the batch API facilitates the efficient handling of large volumes of non-latency sensitive text prompt requests. This includes tasks such as classification, sentiment analysis, data extraction, and description generation. By allowing multiple prompts to be sent in a single request, the batch API enhances developer workflows and reduces operational costs.
Vertex AI continues to simplify the development and deployment of AI agents with new open-source integrations and tools:
Firebase Genkit: Announced at I/O, Firebase Genkit is an open-source Typescript/JavaScript framework designed to streamline the development, deployment, and monitoring of production-ready AI agents. Integrated with Vertex AI, Firebase developers can now easily utilize powerful Google models like Gemini and Imagen 2, alongside text embeddings, to create sophisticated AI solutions.
LlamaIndex: This integration simplifies the retrieval augmented generation (RAG) process, covering everything from data ingestion and transformation to embedding, indexing, retrieval, and generation. Vertex AI customers can now leverage Google’s advanced models and AI-optimized infrastructure in conjunction with LlamaIndex’s flexible data framework to connect custom data sources to generative models.
Grounding with Google Search: Now generally available, this feature allows AI outputs to be grounded in proprietary databases or designated sources of "enterprise truth." This integration combines Google’s latest foundation models with access to up-to-date, high-quality information from Google Search, significantly enhancing the accuracy and completeness of AI-generated responses.
With these new features and ongoing support for tools like LangChain on Vertex AI, Google Cloud reaffirms its commitment to providing developers with the cutting-edge tools needed to create intelligent, responsive AI agents. Vertex AI’s continued expansion and innovation ensure that organizations can maximize the performance of their generative AI models at scale, accelerating the journey from experimentation to production.
Ready to experience the next wave of AI innovation? Start using Gemini 1.5 Flash on Vertex AI today. Discover how generative AI is transforming businesses by exploring our latest resources, including the ebook "Crossing the Generative AI Tipping Point: From Quick Wins to Sustained Growth," and delve into "101 Real-World Gen AI Use Cases from the World’s Leading Organizations" to see how others are leveraging AI to drive success.
Join the Digital Generation – Lead with Innovation.