Google's Gemini Takes on OpenAI
Gemini, developed by Google, is a "multimodal" AI model capable of comprehending and processing diverse formats, such as text, code, audio, image, and video, simultaneously.
Gemini stands out for its multimodal capabilities
It allows to work with different types of information simultaneously, including text, code, audio, image, and video. It will be available in three different sizes: Ultra for complex tasks, Pro for a wide range of tasks, and Nano for on-device tasks.
On December 6, Google parent company Alphabet announced the launch of Gemini, their largest and most advanced AI model to date. This move is seen as a direct challenge to competitors such as OpenAI's GPT-4 and Meta's Llama 2 in the race to lead the nascent AI space.
Gemini is the first AI model to be developed by Alphabet after the merger of their AI research units, DeepMind and Google Brain, into a single division called Google DeepMind. The CEO of DeepMind, Demis Hassabis, leads this new division.
Gemini Ultra is currently being tested by select customers, developers, partners, and safety and responsibility experts, with a broader rollout expected for early next year.
Google plans to incorporate Gemini into all its products, with Bard, a natural language processing system, starting to use a refined version of Gemini Pro for advanced reasoning and understanding. Gemini Nano will power new features in Pixel 8 Pro smartphones and will soon be available in Smart Reply in Gboard.
Transforming AI in Search, Ads, and More
The introduction of Gemini has also improved Google's generative AI search offering, Search Generative Experience (SGE), increasing efficiency and quality. Alphabet intends to integrate Gemini into a range of products and services in the future, including Search, Ads, Chrome, and Duet AI.
Transition to AI far bigger than mobile or web
CEO of Google, Sundar Pichai stated that each technological evolution presents a chance to propel scientific breakthroughs, hasten the pace of human development, and enhance the quality of life.
According to Sundar Pichai, “Each technological shift presents an opportunity to advance scientific discovery, accelerate human progress, and enhance quality of life. He believes that the current transition to AI surpasses the impact of previous shifts like mobile or web technology. Pichai envisions AI as a catalyst for creating opportunities that range from everyday improvements to extraordinary advancements. This technology has the potential to drive innovation, economic progress, and exponential growth in knowledge, learning, creativity, and productivity. Despite the progress made, it is just the beginning of exploring the vast possibilities of AI.”
Google's move to launch Gemini is seen as a direct challenge to OpenAI, which has been at the forefront of AI development. This is part of Google's ongoing efforts to stay competitive in the rapidly growing field of generative AI.
Sources:
CNN - Google launches Gemini, its most-advanced AI model yet, as it competes with OpenAI
Ars Technica - Google launches Gemini—a powerful AI model it says can surpass GPT-4
Google Blog - Introducing Gemini: Google’s most capable AI model yet
Financial Times - Google’s ‘Gemini’ makes mobile breakthrough for generative AI