You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
-[How It Works Under The Hood](#how-it-works-under-the-hood)
15
+
-[Installation & Usage](#installation--usage)
16
+
-[Example Prompts](#example-prompts)
17
+
-[How to Add a New Skill](#how-to-add-a-new-skill)
18
+
-[Best Practices for Authoring Skills](#best-practices-for-authoring-skills)
19
+
20
+
## 🌟 Top Starter Skills
21
+
22
+
- 🚀 `@vertex-deploy` for deploying Open Models from Model Garden directly to endpoints.
23
+
- 🛠️ `@genai-sdk` for learning how to use the latest Google Gen AI SDK.
24
+
- 🎯 `@vertex-tuning` for fine-tuning models with your own data.
25
+
- ⚡ `@vertex-inference` for executing Generative AI inference.
26
+
27
+
5
28
## Architecture & Routing
6
29
7
30
These skills are designed hierarchically to guide the LLM agent from a broad user intent down to the specific technical implementation steps.
@@ -13,22 +36,14 @@ The entry point for any general Vertex AI task. The `vertex-ai/SKILL.md` file ac
13
36
-**Generating text, chat, or embeddings** → Routes to `vertex-inference`
14
37
-**Fine-tuning a model** → Routes to `vertex-tuning`
15
38
16
-
### Sub-Skills
17
-
1.**`genai-sdk`**
18
-
-**Purpose:** Guides the usage of Gemini API on Google Cloud Vertex AI with the Gen AI SDK across multiple languages (Python, JS/TS, Go, Java, C#).
19
-
-**Capabilities:** Core inference, Live API, function calling, structured output, caching, and batch prediction.
20
-
21
-
2.**`vertex-deploy`**
22
-
-**Purpose:** Instructions and bash scripts for deploying Open Models from Model Garden or custom weights to a dedicated Vertex AI Endpoint.
23
-
-**Capabilities:** Handling `gcloud ai model-garden models deploy`, checking operation status, calculating quota/cost estimates, and undeploying models to save costs.
24
-
25
-
2.**`vertex-inference`**
26
-
-**Purpose:** Code samples and instructions for authenticating and executing Generative AI inference.
27
-
-**Capabilities:** Supports both First-Party (Gemini) using the `google-genai` SDK and Third-Party OpenMaaS (Llama, DeepSeek, Qwen) using the standard OpenAI SDK configured with a Vertex AI endpoint.
39
+
### 🧰 Sub-Skills Directory
28
40
29
-
3.**`vertex-tuning`**
30
-
-**Purpose:** A secondary router specifically for model fine-tuning.
31
-
-**Capabilities:** Directs the agent to specific tuning procedures depending on whether the user wants to tune an Open Model (e.g., Llama) or a First-Party Gemini model.
41
+
| Category / Skill | Purpose | Capabilities |
42
+
| :--- | :--- | :--- |
43
+
|**`genai-sdk`**| Guides the usage of Gemini API with the Gen AI SDK across multiple languages. | Core inference, Live API, function calling, structured output, caching, and batch prediction. |
44
+
|**`vertex-deploy`**| Instructions and bash scripts for deploying Open Models or custom weights to a dedicated Endpoint. |`gcloud ai model-garden models deploy`, checking status, cost estimates, undeploying. |
45
+
|**`vertex-inference`**| Code samples and instructions for authenticating and executing Generative AI inference. | First-Party (Gemini) via `google-genai` and Third-Party OpenMaaS via OpenAI SDK. |
46
+
|**`vertex-tuning`**| A secondary router specifically for model fine-tuning. | Directs the agent to specific tuning procedures for Open Models or First-Party Gemini models. |
0 commit comments