You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: skills/vertex-inference/SKILL.md
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -57,10 +57,10 @@ Before running any code, ensure you are authenticated with Application Default C
57
57
58
58
## 2. Gemini Models
59
59
60
-
For Gemini models (e.g., `gemini-1.5-pro`, `gemini-2.0-flash`), the **GenAI SDK** (`google-genai`) is the **PREFERRED** method. The legacy `vertexai` SDK is still supported but GenAI SDK is recommended for new projects.
60
+
For Gemini models (e.g., `gemini-2.5-pro`, `gemini-3-flash-preview`), the **GenAI SDK** (`google-genai`) is the **PREFERRED** method. The legacy `vertexai` SDK is still supported but GenAI SDK is recommended for new projects.
61
61
62
62
> [!IMPORTANT]
63
-
>**Preview Models (including Gemini 2.0)** are often **ONLY** available in the `global` region. Stable models are available in`us-central1` and other regions.
63
+
>**Preview Models (including Gemini 3.1)** are often **ONLY** available in the `global` region. Stable models are available in`us-central1` and other regions.
64
64
65
65
### Choosing the Right SDK
66
66
@@ -240,7 +240,7 @@ See [`scripts/openmaas_vertexai_sdk.py`](scripts/openmaas_vertexai_sdk.py) for t
240
240
***OpenMaaS**: Verify the model is available in your region. See [Model Availability by Location](https://docs.cloud.google.com/vertex-ai/generative-ai/docs/learn/locations#genai-open-models).
241
241
***Gemini**:
242
242
***Source of Truth**: Always check [Gemini Model Locations](https://cloud.google.com/vertex-ai/generative-ai/docs/learn/locations#google-models) for the authoritative list.
243
-
***Preview Models**: All Preview models (e.g., Gemini 2.0, experimental versions) are often **ONLY** available in the `us-central1` or `global` regions.
244
-
***Stable Models**: (e.g., Gemini 1.5 Pro) Available in`us-central1`, `europe-west4`, and many other regions.
243
+
***Preview Models**: All Preview models (e.g., Gemini 3.1, experimental versions) are often **ONLY** available in the `us-central1` or `global` regions.
244
+
***Stable Models**: (e.g., Gemini 2.5 Pro) Available in`us-central1`, `europe-west4`, and many other regions.
245
245
***Important**: If you get a 404/400 error, try switching your client location to `us-central1` or `global`.
246
246
2. **Enable Llama Models**: For **Llama 3.3** and **Llama 4**, you **MUST**enable the model in Model Garden before use. Go to the [Model Garden](https://console.cloud.google.com/vertex-ai/model-garden), search for the model card (e.g., "Llama 3.3 API Service"), and click **Enable**. Only then can you make inference requests.
0 commit comments