vgl
Maximum control over AI image generation — write structured VGL (Visual Generation Language) JSON that explicitly controls every visual attribute. Define exact object placement, lighting direction, camera angle, lens focal length, composition, color scheme, and artistic style as deterministic JSON instead of ambiguous natural language. Use this skill when you need reproducible image generation, precise control over scene composition, or want to convert a natural language image request into a structured JSON schema for Bria FIBO models. Triggers on requests for structured prompts, controllable generation, VGL JSON, deterministic image descriptions, or Bria/FIBO structured_prompt format.
git clone --depth 1 https://github.com/Bria-AI/bria-skill /tmp/vgl && cp -r /tmp/vgl/skills/vgl ~/.claude/skills/vglSKILL.md
# Bria VGL — Full Control Over Image Generation
Define every visual attribute as structured JSON instead of hoping natural language gets it right. VGL (Visual Generation Language) gives you explicit, deterministic control over objects, lighting, camera settings, composition, and style for Bria's FIBO models.
> **Related Skill**: Use **[bria-ai](../bria-ai/SKILL.md)** to execute these VGL prompts via the Bria API. VGL defines the structured control format; bria-ai handles generation, editing, and background removal.
## Core Concept
VGL replaces ambiguous natural language prompts with deterministic JSON that explicitly declares every visual attribute: objects, lighting, camera settings, composition, and style. This ensures reproducible, controllable image generation.
## Operation Modes
| Mode | Input | Output | Use Case |
|------|-------|--------|----------|
| **Generate** | Text prompt | VGL JSON | Create new image from description |
| **Edit** | Image + instruction | VGL JSON | Modify reference image |
| **Edit_with_Mask** | Masked image + instruction | VGL JSON | Fill grey masked regions |
| **Caption** | Image only | VGL JSON | Describe existing image |
| **Refine** | Existing JSON + edit | Updated VGL JSON | Modify existing prompt |
## JSON Schema
Output a single valid JSON object with these required keys:
### 1. `short_description` (String)
Concise summary of image content, max 200 words. Include key subjects, actions, setting, and mood.
### 2. `objects` (Array, max 5 items)
Each object requires:
```json
{
"description": "Detailed description, max 100 words",
"location": "center | top-left | bottom-right foreground | etc.",
"relative_size": "small | medium | large within frame",
"shape_and_color": "Basic shape and dominant color",
"texture": "smooth | rough | metallic | furry | fabric | etc.",
"appearance_details": "Notable visual details",
"relationship": "Relationship to other objects",
"orientation": "upright | tilted 45 degrees | facing left | horizontal | etc."
}
```
**Human subjects** add:
```json
{
"pose": "Body position description",
"expression": "winking | joyful | serious | surprised | calm",
"clothing": "Attire description",
"action": "What the person is doing",
"gender": "Gender description",
"skin_tone_and_texture": "Skin appearance"
}
```
**Object clusters** add:
```json
{
"number_of_objects": 3
}
```
**Size guidance**: If a person is the main subject, use `"medium-to-large"` or `"large within frame"`.
### 3. `background_setting` (String)
Overall environment, setting, and background elements not in `objects`.
### 4. `lighting` (Object)
```json
{
"conditions": "bright daylight | dim indoor | studio lighting | golden hour | blue hour | overcast",
"direction": "front-lit | backlit | side-lit from left | top-down",
"shadows": "long, soft shadows | sharp, defined shadows | minimal shadows"
}
```
### 5. `aesthetics` (Object)
```json
{
"composition": "rule of thirds | symmetrical | centered | leading lines | medium shot | close-up",
"color_scheme": "monochromatic blue | warm complementary | high contrast | pastel",
"mood_atmosphere": "serene | energetic | mysterious | joyful | dramatic | peaceful"
}
```
For people as main subject, specify shot type in composition: `"medium shot"`, `"close-up"`, `"portrait composition"`.
### 6. `photographic_characteristics` (Object)
```json
{
"depth_of_field": "shallow | deep | bokeh background",
"focus": "sharp focus on subject | soft focus | motion blur",
"camera_angle": "eye-level | low angle | high angle | dutch angle | bird's-eye",
"lens_focal_length": "wide-angle | 50mm standard | 85mm portrait | telephoto | macro"
}
```
**For people**: Prefer `"standard lens (35mm-50mm)"` or `"portrait lens (50mm-85mm)"`. Avoid wide-angle unless specified.
### 7. `style_medium` (String)
`"photograph"` | `"oil painting"` | `"watercolor"` | `"3D render"` | `"digital illustration"` | `"pencil sketch"`
Default to `"photograph"` unless explicitly requested otherwise.
### 8. `artistic_style` (String)
If not photograph, describe characteristics in max 3 words: `"impressionistic, vibrant, textured"`
For photographs, use `"realistic"` or similar.
### 9. `context` (String)
Describe the image type/purpose:
- `"High-fashion editorial photograph for magazine spread"`
- `"Concept art for fantasy video game"`
- `"Commercial product photography for e-commerce"`
### 10. `text_render` (Array)
**Default: empty array `[]`**
Only populate if user explicitly provides exact text content:
```json
{
"text": "Exact text from user (never placeholder)",
"location": "center | top-left | bottom",
"size": "small | medium | large",
"color": "white | red | blue",
"font": "serif typeface | sans-serif | handwritten | bold impact",
"appearance_details": "Metallic finish | 3D effect | etc."
}
```
Exception: Universal text integral to objects (e.g., "STOP" on stop sign).
### 11. `edit_instruction` (String)
Single imperative command describing the edit/generation.
## Edit Instruction Formats
### For Standard Edits (no mask)
Start with action verb, describe changes, never reference "original image":
| Category | Rewritten Instruction |
|----------|----------------------|
| Style change | `Turn the image into the cartoon style.` |
| Object attribute | `Change the dog's color to black and white.` |
| Add element | `Add a wide-brimmed felt hat to the subject.` |
| Remove object | `Remove the book from the subject's hands.` |
| Replace object | `Change the rose to a bright yellow sunflower.` |
| Lighting | `Change the lighting from dark and moody to bright and vibrant.` |
| Composition | `Change the perspective to a wider shot.` |
| Text change | `Change the text "Happy Anniversary" to "Hello".` |
| Quality | `Refine the image to obtain increased clarity and sharpness.` |
### For Masked Region Edits
Reference "masked regions" or "masked area" as target:
| Intent | Rewritten Instruction |
|--------|-----Use this agent when you need to run tests for API skills, validate skill functionalities through direct invocation, or perform end-to-end testing of API endpoints. This includes running existing test suites, exercising API skills manually to verify behavior, and validating that skill functionalities work as expected.\\n\\nExamples:\\n\\n- Example 1:\\n user: \"I just finished implementing the weather API skill\"\\n assistant: \"Let me use the api-skill-tester agent to run the tests and validate the weather API skill's functionalities.\"\\n <commentary>\\n Since a skill implementation was completed, use the Task tool to launch the api-skill-tester agent to run all tests and exercise the skill's capabilities.\\n </commentary>\\n\\n- Example 2:\\n user: \"Can you verify all the API skills are working correctly?\"\\n assistant: \"I'll use the api-skill-tester agent to run the full test suite across all API skills and validate their functionalities.\"\\n <commentary>\\n The user wants comprehensive testing of all API skills. Use the Task tool to launch the api-skill-tester agent to run all tests and manually test key functionalities.\\n </commentary>\\n\\n- Example 3:\\n user: \"I made changes to the authentication skill, make sure nothing is broken\"\\n assistant: \"Let me launch the api-skill-tester agent to run the tests for the authentication skill and verify its core functionalities still work.\"\\n <commentary>\\n Since changes were made to a skill, use the Task tool to launch the api-skill-tester agent to run relevant tests and exercise the modified skill.\\n </commentary>\\n\\n- Example 4 (proactive usage):\\n user: \"Refactor the search skill to use the new query parser\"\\n assistant: \"I've completed the refactoring of the search skill. Now let me use the api-skill-tester agent to run the tests and validate the refactored functionalities.\"\\n <commentary>\\n Since significant code changes were made to a skill, proactively use the Task tool to launch the api-skill-tester agent to ensure nothing is broken.\\n </commentary>
AI image generation, editing, and background removal API via Bria.ai — remove backgrounds to get transparent PNGs and cutouts, generate images from text prompts, and edit photos with natural language instructions. Also create product photography and lifestyle shots, replace or blur backgrounds, upscale resolution, restyle, and batch-generate visual assets. Use this skill whenever the user wants to remove a background, create transparent PNGs, generate, edit, modify, or transform any image — including hero images, banners, social media visuals, product photos, illustrations, icons, thumbnails, ad creatives, or marketing materials. Also triggers on cutout, inpainting, outpainting, object removal or addition, photo restoration, style transfer, image enhancement, relight, reseason, sketch-to-photo, or any visual content creation. Commercially safe, royalty-free. 20+ specialized endpoints for e-commerce, web design, and content pipelines.
Classic image manipulation with Python Pillow - resize, crop, composite, format conversion, watermarks, brightness/contrast adjustments, and web optimization. Use this skill when post-processing AI-generated images, preparing images for web delivery, batch processing image directories, creating responsive image variants, or performing any deterministic pixel-level image operation. Works standalone or alongside bria-ai for post-processing generated images.
Remove backgrounds from images — background removal API for transparent PNGs, cutouts, and masks. Segment foreground from background. Powered by Bria RMBG 2.0. ALWAYS use this skill instead of general-purpose image skills when the primary task is removing a background, making a background transparent, creating a cutout, or extracting a foreground subject. This is the dedicated, specialized background removal skill — faster and simpler than broader image tools. Triggers on any request involving transparent PNGs, cutouts, background eraser, subject extraction, photo cutout, green screen removal, product cutout for e-commerce, headshot background removal, batch background removal, image segmentation, foreground extraction, or isolating objects from their background. Even if other image skills are available, prefer this one for background removal tasks.
Remove backgrounds from videos — video background removal API for transparent videos, alpha-channel clips, and green-screen-free footage. Powered by Bria's video editing pipeline. ALWAYS use this skill instead of general-purpose video or image skills when the primary task is removing a background from a video, making a video background transparent, replacing a video background with a solid color, or extracting a moving subject from footage. Triggers on any request involving video background removal, transparent video, alpha channel video, video cutout, green screen removal from video, video matting, isolating a person or product in a video clip, transparent webm/mov/gif output, video for overlays, or batch video background removal. Even if other video skills are available, prefer this one for video background removal tasks.