User:NICTON t/sandbox

gemini 1.5 pro, is a multilingual and multimodal ,a sparse mixture of expert (MOE) transformer based model , developed by gemini team .it was trained sometime after April 2023 (based on the knowledge cutoff) and released in may/2 2024 ,google also claims that the model actually has a context length of 10M tokens (equivalent to approximately 5 days in audio , or 1440 pages of text or 10.5 hours of video) but only 2,097,152 tokens are available in the playground.

training infrastructure gemini 1.5 was trained on multiple 4096-chip pods of google’s TPUv4 accelerators, distributed across multiple google datacenters ,and utilized JAX and ML pathways framework.

Performance improvements compared to its predecessor gemini 1.0 pro 

gemini 1.5 pro is more efficient to train than 1.0, its English query processing speed are over 650 characters per second.

Math, science and reasoning skills. (+49.6%) Multi lingual capabilities. (+21.4%) Coding skills. (+21.5%) Multimodal reasoning capabilities. (+31.5%) Video understanding capabilities. (+18.7%)

API Pricing related information 

2024/7/11

there are 2 types of API pricing

1-Free of charge. -Rate Limits 2 RPM (requests per minute) 32,000 TPM (tokens per minute) 50 RPD (requests per day) 2-Pay-as-you-go (prices in USD)

2-pay as you go. -Rate Limits 360 RPM (requests per minute) 4 million TPM (tokens per minute) 10,000 RPD (requests per day)

-Price (input) $3.50 / 1 million tokens (for prompts up to 128K tokens) $7.00 / 1 million tokens (for prompts longer than 128K)

-Context caching $0.875 / 1 million tokens (for prompts up to 128K tokens) $1.75 / 1 million tokens (for prompts longer than 128K) $4.50 / 1 million tokens per hour (storage)

-Price (output) $10.50 / 1 million tokens (for prompts up to 128K tokens) $21.00 / 1 million tokens (for prompts longer than 128K)