Major AI Announcements at Google I/O 2023

Major AI Announcements at Google I/O 2023
May 21, 2024

Google's annual developer conference, I/O, is always filled with exciting announcements and updates across their various products and platforms. This year was no different, with Google unveiling significant advances in artificial intelligence at the keynote. From new state-of-the-art language models to tools for building responsible AI, here are the major AI developments that Google showcased at I/O 2023.


Justin Sullivan/Getty Images


PaLM 2 - An Even More Powerful Language Model

One of the biggest announcements was the introduction of PaLM 2, Google's latest natural language processing model. PaLM, which stands for Pathways Language Model, has been powering products like Google Assistant since its debut last year. PaLM 2 represents a major upgrade, with up to 10x more parameters than the original version.

Google is releasing PaLM 2 in four different sizes - Gecko, Otter, Bison, and Unicorn. The smallest model, Gecko, has been designed for on-device use cases where low latency and privacy are priorities. Otter and Bison are larger versions meant for more complex conversational tasks. And Unicorn, Google's largest model to date, aims to demonstrate state-of-the-art capabilities in language understanding.

PaLM 2 will underpin over 25 Google products and services, including the Google Assistant, Google Search, and Google’s question answering systems. With its increased scale and advanced self-supervised training, PaLM 2 is able to understand and generate human language with far greater nuance. It can translate between over 100 languages, answer follow-up questions in a conversation, and carry out multi-step reasoning. Overall, PaLM 2 represents a major step forward in Google's natural language capabilities.

Introducing Gemini - A Multimodal AI Model

In addition to PaLM 2, Google unveiled a brand new type of AI model called Gemini. Unlike previous models which primarily handle text, Gemini is designed to be multimodal - it can understand, generate and reason across multiple types of data like text, images and structured datasets.

One unique feature of Gemini is its ability to add metadata tags to generated images and videos, clearly identifying them as AI creations. This "watermarking" is aimed at increasing transparency and preventing manipulated media. Gemini also integrates tightly with Google services, allowing it to search image databases, generate images based on text descriptions, and more.

Gemini's multimodal abilities will power the next generation of Google products. The company announced Bard, its conversational AI assistant, will transition to use Gemini, enabling it to understand image prompts. Google Workspace tools will gain new generative features using Gemini as well, like auto-generating organizational charts from descriptions. Overall, Gemini represents Google's vision for building AI systems that can comprehend different types of inputs and media.

Enhancing Google Search with Conversational AI

Google shared developments in enhancing its flagship Search product with more human-like dialogue. Google Search 1.5, as it's called, introduces conversation modes that allow for back-and-forth questioning to explore topics. Search responses can now be more exploratory and conversational rather than just returning a list of links.

To power these capabilities, Google Search now leverages both PaLM 2 and Gemini models. It can understand follow-up questions, give explanations and sources for facts, and continue a coherent dialogue over multiple turns. Google is also launching Search Labs, an experimental platform for users to test new AI-powered search features before they launch more broadly.

The goal is to make Search an intelligent assistant capable of natural conversations to satisfy users' information needs. By applying state-of-the-art language models and multimodal reasoning, Google aims to significantly improve the dialogue experience in Search and help people gain a deeper understanding of topics. These updates represent major strides in building more human-like, conversational search engines.

Advancing AI for Business and Enterprises

Google's AI developments aren't just focused on consumer products either. The company shared significant updates for AI applied to business use cases and enterprises. Google Cloud's Vertex AI platform received expanded access to powerful AI accelerators like tensors processing units (TPUs) and graphics processing units (GPUs).

This will allow enterprises to more easily build and deploy advanced AI models at scale. Vertex AI also gained three new AI services - Imagen for generative image creation, Codex for programming with natural language, and Chirp for speech-to-text translation. These pre-trained models remove the need for businesses to train their own solutions from scratch.

Advancing Responsible and Ethical AI

While Google unveiled impressive advances in AI capabilities, they also emphasized their commitment to building these technologies responsibly. A key theme of the keynote was focusing AI development on tasks that benefit humanity. Google is particularly focused on issues like reducing bias, preventing harms from manipulated media, and providing transparency around model outputs.

To address these challenges, Google announced they are expanding their use of techniques like watermarking generated media from models like Gemini. This adds metadata to identify AI creations and prevent manipulated footage. Google is also launching new "Automated Adversarial Testing" of models, using generative techniques to proactively find potential harms or biases before public release. Experts will then work to improve model robustness.

Perspective, Google's toxicity detection API, is also being applied more broadly as an industry standard. Originally made for publishers, Perspective is now used by other major tech companies to identify harmful or abusive language across applications. At I/O, Google reinforced their commitment to fighting issues like misinformation through responsible technological progress.

Major AI Products and Services on the Horizon

In addition to current updates, Google teased several ambitious AI-powered projects in development. One is a "Universal Translator" capable of dubbing speech from one language to another in real-time, while also syncing lip movements. This has applications for facilitating global communication. However, Google notes the technology is only being worked on with chosen partners to prevent potential misuse for manipulated media like deepfakes.

Google is also advancing their conversational search capabilities with a focus on natural dialogue. Beyond Search 1.5's exploration mode, future versions may be able to understand and answer queries across many continuous back-and-forth exchanges. If successful, this could create an AI system approaching human-level language abilities. Only time will tell if Google can realize this vision while prioritizing safety.

In summary, Google I/O 2023 highlighted the company's leadership in areas like language models, generative capabilities, and applying AI widely across products and business tools. However, they also demonstrated a renewed emphasis on responsible development through transparency, bias mitigation, and focused efforts to curb harms. It will be fascinating to see the interplay between technological progress and safety priorities as these ambitious projects come to fruition in the coming years. The impact on society will depend greatly on navigating those challenges.

MORE FROM JUST THINK AI

MatX: Google Alumni's AI Chip Startup Raises $80M Series A at $300M Valuation

November 23, 2024
MatX: Google Alumni's AI Chip Startup Raises $80M Series A at $300M Valuation
MORE FROM JUST THINK AI

OpenAI's Evidence Deletion: A Bombshell in the AI World

November 20, 2024
OpenAI's Evidence Deletion: A Bombshell in the AI World
MORE FROM JUST THINK AI

OpenAI's Turbulent Beginnings: A Power Struggle That Shaped AI

November 17, 2024
OpenAI's Turbulent Beginnings: A Power Struggle That Shaped AI
Join our newsletter
We will keep you up to date on all the new AI news. No spam we promise
We care about your data in our privacy policy.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.