Join the Gemini API Developer Competition!
Learn more
Release notes
Stay organized with collections
Save and categorize content based on your preferences.
This page documents updates to the Gemini API.
May 23, 2024
Model updates
May 14, 2024
API updates
- Introduced a 2M context window for Gemini 1.5 Pro (waitlist).
- Introduced pay-as-you-go
billing
for Gemini 1.0
Pro, with Gemini 1.5 Pro and Gemini 1.5 Flash billing coming soon.
- Introduced increased rate limits for the upcoming paid tier of Gemini 1.5
Pro.
- Added built-in video support to the
File API
.
- Added plain text support to the
File API
.
- Added support for parallel function calling, which returns more than one
call at a time.
May 10, 2024
Model updates
April 9, 2024
Model updates
- Released
Gemini 1.5 Pro
(
gemini-1.5-pro-latest
) in preview.
- Released a new text embedding model,
text-embeddings-004
, which supports
elastic embedding
sizes under 768.
API updates
- Released the
File API
for temporarily storing
media files for use in prompting.
- Added support for prompting with text, image, and audio data, also
known as
multimodal
prompting. To learn more, see
Prompting with media
.
- Released
System instructions
in
beta.
- Added
Function calling mode
,
which defines the execution behavior for function calling.
- Added support for the
response_mime_type
configuration option, which lets
you request responses in
JSON format
.
March 19, 2024
December 13 2023
4 new models:
- gemini-pro: New text model for a wide variety of tasks. Balances capability
and efficiency.
- gemini-pro-vision: New multimodal model for a wide variety of tasks.
Balances capability and efficiency.
- embedding-001: New embeddings model.
- aqa: A new specially tuned model that is trained to answer questions
using text passages for grounding generated answers.
See
Gemini models
for more details.
API version updates:
- v1: The stable API channel.
- v1beta: Beta channel. This channel has features that may be under
development.
See
the API versions topic
for more details.
API updates
GenerateContent
is a single unified endpoint for chat and text.
- Streaming available through the
StreamGenerateContent
method.
- Multimodal capability: Image is a new supported modality
- New beta features:
- Updated candidate count: Gemini models only return 1 candidate.
- Different Safety Settings and SafetyRating categories. See
safety settings
for more details.
- Tuning models is not yet supported for Gemini models (Work in progress).
Except as otherwise noted, the content of this page is licensed under the
Creative Commons Attribution 4.0 License
, and code samples are licensed under the
Apache 2.0 License
. For details, see the
Google Developers Site Policies
. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2024-05-24 UTC.
[{
"type": "thumb-down",
"id": "missingTheInformationINeed",
"label":"Missing the information I need"
},{
"type": "thumb-down",
"id": "tooComplicatedTooManySteps",
"label":"Too complicated / too many steps"
},{
"type": "thumb-down",
"id": "outOfDate",
"label":"Out of date"
},{
"type": "thumb-down",
"id": "samplesCodeIssue",
"label":"Samples / code issue"
},{
"type": "thumb-down",
"id": "otherDown",
"label":"Other"
}]
[{
"type": "thumb-up",
"id": "easyToUnderstand",
"label":"Easy to understand"
},{
"type": "thumb-up",
"id": "solvedMyProblem",
"label":"Solved my problem"
},{
"type": "thumb-up",
"id": "otherUp",
"label":"Other"
}]