Skip to main content

Google launches Gemini 2.5 Pro, free app access continues

Following the first preview three months ago, Google is ready to launch Gemini 2.5 Pro and make the reasoning model generally available (GA) for consumers and developers.

Gemini 2.5 Pro entered experimental testing in late March (03-25). Four days after debuting for paying subscribers and developers, Google surprisingly made it available for free users. It got a big coding upgrade in May (05-06) just before I/O, and one last update at the start of June (06-05).

With the stable version today, there are “no changes from the 06-05 preview version.”

L-R: Latest, 06-05, 05-06, 03-25 benchmarks

Advertisement - scroll for more content
  • Gemini 2.5 Pro preview
  • Gemini 2.5 Pro coding
  • Gemini 2.5 Pro experimental

With this launch, 2.5 Pro will lose the “preview” label in the Gemini app’s model picker. It follows 2.5 Flash entering GA in the app at I/O last month. Google notes how Pro is for “Reasoning, math & code” prompts, while Flash is for “Fast all-around help.”

Google tells us today that free Gemini app users will continue to have “limited access” to 2.5 Pro, with AI Pro subscribers getting 100 prompts per day, or “expanded access.” Google AI Ultra offers the “highest access.”

2.5 Flash is also now generally available and stable for developers. It’s the same 05-20 model preview from I/O with updated pricing:


  • $0.30 / 1M input tokens (*up from $0.15 input)
  • $2.50 / 1M output tokens (*down from $3.50 output)
  • We removed the thinking vs. non-thinking price difference
  • We kept a single price tier regardless of input token size

Additionally, developers can now preview Gemini 2.5 Flash Lite for “high-volume, latency-sensitive tasks like translation and classification” where cost is a priority. Google touts “lower latency than 2.0 Flash-Lite and 2.0 Flash on a broad sample of prompts.”

Thinking is turned off by default, but can be enabled with set budgets. Available native tools include “Grounding with Google Search, Code Execution, and URL Context in addition to function calling.” There’s also
multimodal input and the 1 million-token context length.

2.5 Flash Lite has all-around higher quality than 2.0 Flash-Lite on coding, math, science, reasoning and multimodal benchmarks.

FTC: We use income earning auto affiliate links. More.

You’re reading 9to5Google — experts who break news about Google and its surrounding ecosystem, day after day. Be sure to check out our homepage for all the latest news, and follow 9to5Google on Twitter, Facebook, and LinkedIn to stay in the loop. Don’t know where to start? Check out our exclusive stories, reviews, how-tos, and subscribe to our YouTube channel

Comments

Author

Avatar for Abner Li Abner Li

Editor-in-chief. Interested in the minutiae of Google and Alphabet. Tips/talk: [email protected]