Google has introduced Gemini, its latest multimodal general AI model, designed to be its most powerful offering yet. Available on platforms such as Bard, developer platforms, and Google Pixel 8 Pro devices, Gemini aims to compete with ChatGPT in the realm of generalized AI (GenAI). The model is available in three sizes: Ultra, Pro, and Nano.
Key Features of Google Gemini
Gemini is not restricted to specific types of information and can process text, code, audio, image, and video. Unlike ChatGPT, Gemini can work with video natively.
Google claims that Gemini Ultra outperforms human experts on 30 out of 32 widely-used academic benchmarks in large language model (LLM) research. It excels in massive multitask language understanding (MMLU) by combining 57 subjects like math, physics, history, law, medicine, and ethics.
Gemini can understand, explain, and generate high-quality code in popular programming languages such as Python, Java, C++, and Go.
Three Sizes for Different Needs
Meant for highly complex tasks, undergoing trust and safety checks, and available for select customers, developers, and partners for early experimentation.
Suited for a wide range of tasks, available on Bard for regular users worldwide and accessible for developers and enterprise customers through the Gemini API.
Designed for on-device tasks, available on Pixel 8 Pro for features like Summarize in the Recorder app and Smart Reply in Gboard. Android developers can also use Gemini Nano via AICore on Pixel 8 Pro devices.
Potential Impact on Google Search
Google plans to integrate Gemini into various products and services, including Search, Ads, Chrome, and Duet AI. Initial experiments in Search show a 40% reduction in latency in the U.S. English language, along with improvements in quality.
Google acknowledges the potential for hallucinations in Gemini and has implemented techniques to improve response accuracy. Safety measures include new protections for Gemini’s multimodal capabilities, comprehensive safety evaluations, and collaboration with external experts to identify and mitigate risks.
Gemini vs. ChatGPT 4
While it’s early to declare superiority, Gemini’s flexibility, native video support, and on-device capabilities provide advantages. Additionally, Gemini is currently free to use, contrasting with ChatGPT 4, which is available only to paid users.
The introduction of Gemini marks Google’s significant foray into the evolving landscape of multimodal AI, emphasizing its potential applications and impact across various domains.