Skill210 repo starsupdated 2d ago

customize

Interactive guided deployment flow for Azure OpenAI models with full customization control. Step-by-step selection of model version, SKU (GlobalStandard/Standard/ProvisionedManaged), capacity, RAI policy (content filter), and advanced options (dynamic quota, priority processing, spillover). USE FOR: custom deployment, customize model deployment, choose version, select SKU, set capacity, configure content filter, RAI policy, deployment options, detailed deployment, advanced deployment, PTU deployment, provisioned throughput. DO NOT USE FOR: quick deployment to optimal region (use preset).

View source Repository: apex

Install in Claude Code

Copy

git clone --depth 1 https://github.com/jonathan-vella/apex /tmp/customize && cp -r /tmp/customize/.archive/_archived_skills/microsoft-foundry/models/deploy-model/customize ~/.claude/skills/customize

Then start a new Claude Code session; the skill loads automatically.

Definition

SKILL.md

# Customize Model Deployment

Interactive guided workflow for deploying Azure OpenAI models with full customization control over version, SKU, capacity, content filtering, and advanced options.

## Quick Reference

| Property           | Description                                                    |
| ------------------ | -------------------------------------------------------------- |
| **Flow**           | Interactive step-by-step guided deployment                     |
| **Customization**  | Version, SKU, Capacity, RAI Policy, Advanced Options           |
| **SKU Support**    | GlobalStandard, Standard, ProvisionedManaged, DataZoneStandard |
| **Best For**       | Precise control over deployment configuration                  |
| **Authentication** | Azure CLI (`az login`)                                         |
| **Tools**          | Azure CLI, MCP tools (optional)                                |

## When to Use This Skill

Use this skill when you need **precise control** over deployment configuration:

- ✅ **Choose specific model version** (not just latest)
- ✅ **Select deployment SKU** (GlobalStandard vs Standard vs PTU)
- ✅ **Set exact capacity** within available range
- ✅ **Configure content filtering** (RAI policy selection)
- ✅ **Enable advanced features** (dynamic quota, priority processing, spillover)
- ✅ **PTU deployments** (Provisioned Throughput Units)

**Alternative:** Use `preset` for quick deployment to the best available region with automatic configuration.

### Comparison: customize vs preset

| Feature               | customize                                                      | preset                                     |
| --------------------- | -------------------------------------------------------------- | ------------------------------------------ |
| **Focus**             | Full customization control                                     | Optimal region selection                   |
| **Version Selection** | User chooses from available                                    | Uses latest automatically                  |
| **SKU Selection**     | User chooses (GlobalStandard/Standard/PTU)                     | GlobalStandard only                        |
| **Capacity**          | User specifies exact value                                     | Auto-calculated (50% of available)         |
| **RAI Policy**        | User selects from options                                      | Default policy only                        |
| **Region**            | Current region first, falls back to all regions if no capacity | Checks capacity across all regions upfront |
| **Use Case**          | Precise deployment requirements                                | Quick deployment to best region            |

## Prerequisites

- Azure subscription with Cognitive Services Contributor or Owner role
- Azure AI Foundry project resource ID (format: `/subscriptions/{sub}/resourceGroups/{rg}/providers/Microsoft.CognitiveServices/accounts/{account}/projects/{project}`)
- Azure CLI installed and authenticated (`az login`)
- Optional: Set `PROJECT_RESOURCE_ID` environment variable

## Workflow Overview

### Complete Flow (14 Phases)

```
1. Verify Authentication
2. Get Project Resource ID
3. Verify Project Exists
4. Get Model Name (if not provided)
5. List Model Versions → User Selects
6. List SKUs for Version → User Selects
7. Get Capacity Range → User Configures
   7b. If no capacity: Cross-Region Fallback → Query all regions → User selects region/project
8. List RAI Policies → User Selects
9. Configure Advanced Options (if applicable)
10. Configure Version Upgrade Policy
11. Generate Deployment Name
12. Review Configuration
13. Execute Deployment & Monitor
```

### Fast Path (Defaults)

If user accepts all defaults (latest version, GlobalStandard SKU, recommended capacity, default RAI policy, standard upgrade policy), deployment completes in ~5 interactions.

---

## Phase Summaries

> ⚠️ **MUST READ:** Before executing any phase, load [references/customize-workflow.md](references/customize-workflow.md) for the full scripts and implementation details. The summaries below describe _what_ each phase does — the reference file contains the _how_ (CLI commands, quota patterns, capacity formulas, cross-region fallback logic).

| Phase                     | Action                                                                             | Key Details                                            |
| ------------------------- | ---------------------------------------------------------------------------------- | ------------------------------------------------------ |
| **1. Verify Auth**        | Check `az account show`; prompt `az login` if needed                               | Verify correct subscription is active                  |
| **2. Get Project ID**     | Read `PROJECT_RESOURCE_ID` env var or prompt user                                  | ARM resource ID format required                        |
| **3. Verify Project**     | Parse resource ID, call `az cognitiveservices account show`                        | Extracts subscription, RG, account, project, region    |
| **4. Get Model**          | List models via `az cognitiveservices account list-models`                         | User selects from available or enters custom name      |
| **5. Select Version**     | Query versions for chosen model                                                    | Recommend latest; user picks from list                 |
| **6. Select SKU**         | Query model catalog + subscription quota, show only deployable SKUs                | ⚠️ Never hardcode SKU lists — always query live data   |
| **7. Configure Capacity** | Query capacity API, validate min/max/step, user enters value                       | Cross-region fallback if no capacity in current region |
| **8. Select RAI Policy**  | Present content filter options                                                     | Default: `Microsoft.DefaultV2`

More from this repository

appinsights-instrumentationSkill

Guidance for instrumenting webapps with Azure Application Insights. Provides telemetry patterns, SDK setup, and configuration references. WHEN: how to instrument app, App Insights SDK, telemetry patterns, what is App Insights, Application Insights guidance, instrumentation examples, APM best practices.

azure-aiSkill

Use for Azure AI: Search, Speech, OpenAI, Document Intelligence. Helps with search, vector/hybrid search, speech-to-text, text-to-speech, transcription, OCR. WHEN: AI Search, query search, vector search, hybrid search, semantic search, speech-to-text, text-to-speech, transcribe, OCR, convert text to speech.

azure-aigatewaySkill

Configure Azure API Management as an AI Gateway for AI models, MCP tools, and agents. WHEN: semantic caching, token limit, content safety, load balancing, AI model governance, MCP rate limiting, jailbreak detection, add Azure OpenAI backend, add AI Foundry model, test AI gateway, LLM policies, configure AI backend, token metrics, AI cost control, convert API to MCP, import OpenAPI to gateway.

azure-diagramsSkill

ROUTING SKILL — delegates to specialized diagram skills. USE FOR: any diagram request when the caller does not know which tool to use. Routes to drawio, python-diagrams, or mermaid based on diagram type.

azure-hosted-copilot-sdkSkill

Build and deploy GitHub Copilot SDK apps to Azure. WHEN: build copilot app, create copilot app, copilot SDK, @github/copilot-sdk, scaffold copilot project, copilot-powered app, deploy copilot app, host on azure, azure model, BYOM, bring your own model, use my own model, azure openai model, DefaultAzureCredential, self-hosted model, copilot SDK service, chat app with copilot, copilot-sdk-service template, azd init copilot, CopilotClient, createSession, sendAndWait, GitHub Models API.

azure-messagingSkill

Troubleshoot and resolve issues with Azure Messaging SDKs for Event Hubs and Service Bus. Covers connection failures, authentication errors, message processing issues, and SDK configuration problems. WHEN: event hub SDK error, service bus SDK issue, messaging connection failure, AMQP error, event processor host issue, message lock lost, send timeout, receiver disconnected, SDK troubleshooting, azure messaging SDK, event hub consumer, service bus queue issue, topic subscription error, enable logging event hub, service bus logging, eventhub python, servicebus java, eventhub javascript, servicebus dotnet, event hub checkpoint, event hub not receiving messages, service bus dead letter.

copilot-customizationSkill

Authoritative reference for VS Code Copilot customization mechanisms: instructions, prompt files, custom agents, agent skills, MCP servers, hooks, and plugins. Use when deciding which customization type to use, creating new .instructions.md/.prompt.md/.agent.md/SKILL.md/mcp.json files from scratch, or debugging why a customization is not loading. DO NOT USE FOR: routine file edits where the format is already known.

count-registrySkill

Provides canonical entity counts from count-manifest.json. Use when agents need to reference how many agents, skills, instructions, or validators exist. Prevents hard-coded counts. WHEN: agent count, skill count, how many agents, how many skills, entity inventory, project statistics.