@kolbo/kolbo-code-linux-arm64-musl 2.0.0 → 2.0.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/bin/kolbo +0 -0
- package/package.json +1 -1
- package/skills/color-grading/SKILL.md +152 -0
- package/skills/ffmpeg-patterns/SKILL.md +240 -0
- package/skills/image-prompting-guide/SKILL.md +143 -0
- package/skills/kolbo/SKILL.md +263 -19
- package/skills/music-prompting/SKILL.md +146 -0
- package/skills/production-review/SKILL.md +152 -0
- package/skills/short-form-video/SKILL.md +168 -0
- package/skills/sound-design/SKILL.md +154 -0
- package/skills/storytelling/SKILL.md +139 -0
- package/skills/subtitle-production/SKILL.md +244 -0
- package/skills/subtitle-production/reference/burn_to_video.py +222 -0
- package/skills/subtitle-production/reference/export_srts.py +127 -0
- package/skills/subtitle-production/reference/gen_srt.py +42 -0
- package/skills/typography-video/SKILL.md +182 -0
- package/skills/typography-video/reference/KineticTitleScene.tsx +345 -0
- package/skills/video-editing/SKILL.md +128 -0
- package/skills/video-production/SKILL.md +7 -8
- package/skills/video-prompting-guide/SKILL.md +268 -0
package/bin/kolbo
CHANGED
|
Binary file
|
package/package.json
CHANGED
|
@@ -0,0 +1,152 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: color-grading
|
|
3
|
+
description: >
|
|
4
|
+
Color grading for video with FFmpeg: filter chains, profile selection by content type,
|
|
5
|
+
LUT workflow, skin tone protection, mood-specific recipes, colorblind-safe palette,
|
|
6
|
+
WCAG contrast requirements. Use when applying color grades, creating visual looks, or
|
|
7
|
+
correcting color in video.
|
|
8
|
+
Keywords: color grading, color correction, LUT, FFmpeg, filter, cinematic, warm, cool,
|
|
9
|
+
skin tone, colorbalance, curves, eq, color temperature, film look
|
|
10
|
+
---
|
|
11
|
+
|
|
12
|
+
# Color Grading for Video Production
|
|
13
|
+
|
|
14
|
+
## Quick Reference
|
|
15
|
+
|
|
16
|
+
```
|
|
17
|
+
PROFILES: cinematic_warm | cinematic_cool | moody_dark | bright_clean | vintage_film | high_contrast | neutral
|
|
18
|
+
LUT FORMAT: .cube (3D LUT) — industry standard
|
|
19
|
+
INTENSITY: 0.6-0.85 for subtle grades, 1.0 for full effect
|
|
20
|
+
SKIN TONE: Vectorscope should fall on "skin tone line" (~123 degrees)
|
|
21
|
+
BIT DEPTH: Grade in 10-bit when possible, deliver in 8-bit for web
|
|
22
|
+
```
|
|
23
|
+
|
|
24
|
+
## FFmpeg Filter Chain Order
|
|
25
|
+
|
|
26
|
+
Apply filters in this order for predictable results:
|
|
27
|
+
|
|
28
|
+
```
|
|
29
|
+
1. normalize (auto-levels if source is flat/log)
|
|
30
|
+
2. colortemperature (white balance correction)
|
|
31
|
+
3. colorbalance (shadow/mid/highlight color shifts)
|
|
32
|
+
4. curves (contrast and tone shaping)
|
|
33
|
+
5. eq (final contrast/saturation/brightness tweak)
|
|
34
|
+
6. lut3d (creative LUT — applied LAST, on corrected footage)
|
|
35
|
+
```
|
|
36
|
+
|
|
37
|
+
## FFmpeg Filter Reference
|
|
38
|
+
|
|
39
|
+
| Filter | Purpose | Key Parameters |
|
|
40
|
+
|--------|---------|----------------|
|
|
41
|
+
| `eq` | Brightness, contrast, saturation, gamma | `contrast=1.0:saturation=1.0:brightness=0.0:gamma=1.0` |
|
|
42
|
+
| `colorbalance` | RGB in shadows/mids/highlights | `rs/gs/bs` (shadows), `rm/gm/bm` (mids), `rh/gh/bh` (highlights) — range -1.0 to 1.0 |
|
|
43
|
+
| `curves` | Tone curves per channel | `all='0/0 0.5/0.5 1/1'` or per-channel `red=`, `green=`, `blue=` |
|
|
44
|
+
| `colortemperature` | White balance shift | `temperature=6500` (neutral) — lower=cooler, higher=warmer |
|
|
45
|
+
| `lut3d` | Apply .cube LUT | `lut3d='path/to/file.cube'` |
|
|
46
|
+
| `hue` | Hue rotation and saturation | `h=0:s=1` |
|
|
47
|
+
|
|
48
|
+
## Profile Selection by Content Type
|
|
49
|
+
|
|
50
|
+
| Content Type | Profile | Intensity | Why |
|
|
51
|
+
|-------------|---------|-----------|-----|
|
|
52
|
+
| Corporate / SaaS explainer | `bright_clean` | 0.8 | Clean, professional |
|
|
53
|
+
| Science / educational | `neutral` | 1.0 | Accurate color matters |
|
|
54
|
+
| Storytelling / narrative | `cinematic_warm` | 0.85 | Warmth builds connection |
|
|
55
|
+
| Tech / dark theme | `cinematic_cool` | 0.7 | Complements dark UI |
|
|
56
|
+
| Drama / serious | `moody_dark` | 0.6-0.7 | Atmosphere without crushing detail |
|
|
57
|
+
| Lifestyle / social | `high_contrast` | 0.8 | Punchy, attention-grabbing |
|
|
58
|
+
| Retro / nostalgic | `vintage_film` | 0.7 | Subtle faded look |
|
|
59
|
+
|
|
60
|
+
## Mood-Specific FFmpeg Recipes
|
|
61
|
+
|
|
62
|
+
### Warm / Inviting
|
|
63
|
+
```
|
|
64
|
+
colorbalance=rs=0.06:gs=0.02:bs=-0.04:rh=0.05:gh=0.01:bh=-0.03,
|
|
65
|
+
eq=contrast=1.05:saturation=1.08:brightness=0.01
|
|
66
|
+
```
|
|
67
|
+
|
|
68
|
+
### Cool / Technical
|
|
69
|
+
```
|
|
70
|
+
colorbalance=rs=-0.03:gs=-0.01:bs=0.06:rh=-0.02:gh=0.01:bh=0.04,
|
|
71
|
+
eq=contrast=1.06:saturation=0.95
|
|
72
|
+
```
|
|
73
|
+
|
|
74
|
+
### High Energy
|
|
75
|
+
```
|
|
76
|
+
curves=all='0/0 0.15/0.08 0.5/0.52 0.85/0.92 1/1',
|
|
77
|
+
eq=contrast=1.15:saturation=1.2
|
|
78
|
+
```
|
|
79
|
+
|
|
80
|
+
### Subdued / Serious
|
|
81
|
+
```
|
|
82
|
+
curves=all='0/0.04 0.25/0.22 0.5/0.47 0.75/0.73 1/0.94',
|
|
83
|
+
eq=contrast=1.03:saturation=0.75:brightness=-0.02
|
|
84
|
+
```
|
|
85
|
+
|
|
86
|
+
## LUT Workflow
|
|
87
|
+
|
|
88
|
+
1. **Always correct before grading** — normalize/white-balance first, then creative LUT
|
|
89
|
+
2. **Use intensity < 1.0** — full strength usually looks overdone; 0.6-0.8 is typical
|
|
90
|
+
3. **Test on skin tones first** — if people appear, skin must look natural
|
|
91
|
+
4. **One LUT per project** — switching LUTs creates visual inconsistency
|
|
92
|
+
|
|
93
|
+
### FFmpeg LUT at partial intensity
|
|
94
|
+
```bash
|
|
95
|
+
ffmpeg -i input.mp4 -vf \
|
|
96
|
+
"split[a][b];[b]lut3d='my_lut.cube'[graded];[a][graded]blend=all_mode=normal:all_opacity=0.7" \
|
|
97
|
+
output.mp4
|
|
98
|
+
```
|
|
99
|
+
|
|
100
|
+
## Skin Tone Protection
|
|
101
|
+
|
|
102
|
+
- On a vectorscope, healthy skin (all ethnicities) falls on a narrow line at ~123 degrees
|
|
103
|
+
- Never push saturation above 1.2 on footage with people
|
|
104
|
+
- If skin looks orange, green, or magenta after grading — pull back
|
|
105
|
+
- `cinematic_warm` at intensity 0.85 is pre-tuned for natural skin
|
|
106
|
+
- For `moody_dark`, keep intensity at 0.6-0.7 to avoid grey skin
|
|
107
|
+
|
|
108
|
+
## Enhancement Chain Order
|
|
109
|
+
|
|
110
|
+
Apply in this sequence to avoid filter interactions:
|
|
111
|
+
|
|
112
|
+
1. **Subtitles first** — burn into base video
|
|
113
|
+
2. **Face enhance** — smoothing/sharpening on ungraded footage
|
|
114
|
+
3. **Color grade** — applies look after face is enhanced
|
|
115
|
+
4. **Audio enhance** — independent of video, apply last
|
|
116
|
+
|
|
117
|
+
## Colorblind-Safe Palette (Wong)
|
|
118
|
+
|
|
119
|
+
For overlays, graphics, and diagrams:
|
|
120
|
+
|
|
121
|
+
| Color | Hex | Use For |
|
|
122
|
+
|-------|-----|---------|
|
|
123
|
+
| Orange | `#E69F00` | Primary accent |
|
|
124
|
+
| Sky Blue | `#56B4E9` | Secondary accent |
|
|
125
|
+
| Bluish Green | `#009E73` | Positive/success |
|
|
126
|
+
| Yellow | `#F0E442` | Highlight/warning |
|
|
127
|
+
| Blue | `#0072B2` | Links, info |
|
|
128
|
+
| Vermillion | `#D55E00` | Error/danger |
|
|
129
|
+
| Reddish Purple | `#CC79A7` | Tertiary accent |
|
|
130
|
+
|
|
131
|
+
---
|
|
132
|
+
|
|
133
|
+
## Kolbo MCP Integration
|
|
134
|
+
|
|
135
|
+
Color grading is a **post-production step** applied after Kolbo generates the raw video/images:
|
|
136
|
+
|
|
137
|
+
1. `generate_video` or `generate_video_from_image` → raw footage
|
|
138
|
+
2. Download the result
|
|
139
|
+
3. Apply color grade with FFmpeg using recipes above
|
|
140
|
+
4. Optionally: `upload_media` the graded result back to Kolbo CDN
|
|
141
|
+
|
|
142
|
+
**For Remotion compositions:** Apply color grade as the last visual filter, or set the theme/palette in the composition props.
|
|
143
|
+
|
|
144
|
+
**For AI-generated images:** Use lighter grades (intensity 0.5-0.6) since AI images are already stylized. Use `generate_image_edit` for major color changes instead of FFmpeg.
|
|
145
|
+
|
|
146
|
+
---
|
|
147
|
+
|
|
148
|
+
## Local / Free Options
|
|
149
|
+
|
|
150
|
+
> **IMPORTANT:** Always use Kolbo MCP + FFmpeg by default. FFmpeg is safe to use directly — it's standard software. Do not install additional tools without confirming with the user first.
|
|
151
|
+
|
|
152
|
+
**FFmpeg (safe, standard):** All color grading recipes in this skill use FFmpeg — no additional installs needed. This is the only local tool needed for color grading.
|
|
@@ -0,0 +1,240 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: ffmpeg-patterns
|
|
3
|
+
description: >
|
|
4
|
+
Advanced FFmpeg patterns for video production: encoding presets, lossless vs re-encode decisions,
|
|
5
|
+
subtitle burn-in (SRT/ASS), audio mixing and ducking, face enhancement, silence removal,
|
|
6
|
+
concatenation, format conversion, platform-specific encoding. Use for any FFmpeg operation
|
|
7
|
+
beyond basic trim/cut.
|
|
8
|
+
Keywords: ffmpeg, encoding, h264, crf, subtitle, burn-in, ASS, SRT, concat, audio mix,
|
|
9
|
+
silence removal, face enhance, format, codec, bitrate, filter
|
|
10
|
+
---
|
|
11
|
+
|
|
12
|
+
# FFmpeg Patterns for Video Production
|
|
13
|
+
|
|
14
|
+
## Encoding Presets
|
|
15
|
+
|
|
16
|
+
### Standard Quality (web delivery)
|
|
17
|
+
```bash
|
|
18
|
+
ffmpeg -i input.mp4 -c:v libx264 -crf 18 -preset medium -c:a aac -b:a 128k output.mp4
|
|
19
|
+
```
|
|
20
|
+
|
|
21
|
+
### High Quality (master/archive)
|
|
22
|
+
```bash
|
|
23
|
+
ffmpeg -i input.mp4 -c:v libx264 -crf 15 -preset slow -c:a aac -b:a 256k output.mp4
|
|
24
|
+
```
|
|
25
|
+
|
|
26
|
+
### Fast Preview
|
|
27
|
+
```bash
|
|
28
|
+
ffmpeg -i input.mp4 -c:v libx264 -crf 28 -preset ultrafast -c:a aac -b:a 96k preview.mp4
|
|
29
|
+
```
|
|
30
|
+
|
|
31
|
+
## Lossless vs Re-encode
|
|
32
|
+
|
|
33
|
+
- Use `-c copy` when you only need to cut or concat without altering frames (instant, lossless)
|
|
34
|
+
- Re-encode (`-c:v libx264`) when applying filters (speed, subtitles, overlays, scaling)
|
|
35
|
+
- Default CRF 23. Use 18-20 for higher quality final deliverables
|
|
36
|
+
|
|
37
|
+
## Subtitle Burn-in
|
|
38
|
+
|
|
39
|
+
### SRT (Simple)
|
|
40
|
+
```bash
|
|
41
|
+
ffmpeg -i input.mp4 -vf "subtitles=subs.srt:force_style='FontSize=22,Bold=1,PrimaryColour=&H00FFFFFF,OutlineColour=&H00000000,Outline=2'" output.mp4
|
|
42
|
+
```
|
|
43
|
+
|
|
44
|
+
### ASS Color Format
|
|
45
|
+
Always use full 8-char `&HAABBGGRR` format: `&H00FFFFFF` (white, alpha=00)
|
|
46
|
+
- `&HFFFFFF` breaks positioning — always include the alpha byte
|
|
47
|
+
|
|
48
|
+
### Vertical Video Subtitles
|
|
49
|
+
```
|
|
50
|
+
font_size: 18, max 3 words/cue, margin_v: 50
|
|
51
|
+
```
|
|
52
|
+
|
|
53
|
+
### Horizontal Video Subtitles
|
|
54
|
+
```
|
|
55
|
+
font_size: 22, max 6 words/cue, margin_v: 40
|
|
56
|
+
```
|
|
57
|
+
|
|
58
|
+
### Windows Path Escaping
|
|
59
|
+
Escape colons in paths: `C\:/path/to/subs.srt` not `C:/path/to/subs.srt`
|
|
60
|
+
|
|
61
|
+
## Audio Operations
|
|
62
|
+
|
|
63
|
+
### Mix Narration + Music (with ducking)
|
|
64
|
+
```bash
|
|
65
|
+
ffmpeg -i narration.wav -i music.wav -filter_complex \
|
|
66
|
+
"[1:a]volume=0.15[music]; \
|
|
67
|
+
[0:a][music]amix=inputs=2:duration=longest" \
|
|
68
|
+
-c:a aac output.m4a
|
|
69
|
+
```
|
|
70
|
+
|
|
71
|
+
### Loudness Normalization
|
|
72
|
+
```bash
|
|
73
|
+
ffmpeg -i input.mp4 -af loudnorm=I=-14:LRA=11:TP=-1 -c:v copy output.mp4
|
|
74
|
+
```
|
|
75
|
+
|
|
76
|
+
### Extract Audio
|
|
77
|
+
```bash
|
|
78
|
+
ffmpeg -i video.mp4 -vn -c:a copy audio.m4a
|
|
79
|
+
```
|
|
80
|
+
|
|
81
|
+
### Replace Audio
|
|
82
|
+
```bash
|
|
83
|
+
ffmpeg -i video.mp4 -i new_audio.wav -map 0:v -map 1:a -c:v copy -c:a aac output.mp4
|
|
84
|
+
```
|
|
85
|
+
|
|
86
|
+
## Silence Removal
|
|
87
|
+
|
|
88
|
+
### Detect Silence
|
|
89
|
+
```bash
|
|
90
|
+
ffmpeg -i input.mp4 -af "silencedetect=noise=-35dB:d=0.4" -f null - 2>&1 | grep silence
|
|
91
|
+
```
|
|
92
|
+
|
|
93
|
+
### Remove Silence (trim + concat)
|
|
94
|
+
1. Parse `silence_start` / `silence_end` from stderr
|
|
95
|
+
2. Generate segments between silences
|
|
96
|
+
3. Concatenate with the concat demuxer
|
|
97
|
+
4. Optional: `atempo=1.14` for slight speedup
|
|
98
|
+
|
|
99
|
+
## Concatenation
|
|
100
|
+
|
|
101
|
+
### Same Codec (lossless)
|
|
102
|
+
```bash
|
|
103
|
+
# Create filelist.txt:
|
|
104
|
+
# file 'clip1.mp4'
|
|
105
|
+
# file 'clip2.mp4'
|
|
106
|
+
ffmpeg -f concat -safe 0 -i filelist.txt -c copy output.mp4
|
|
107
|
+
```
|
|
108
|
+
|
|
109
|
+
### Mixed Codecs (re-encode)
|
|
110
|
+
Re-encode all segments to matching codec/resolution first, then concat.
|
|
111
|
+
|
|
112
|
+
## Speed Adjustment
|
|
113
|
+
|
|
114
|
+
### Speed Up 2x
|
|
115
|
+
```bash
|
|
116
|
+
ffmpeg -i input.mp4 -filter:v "setpts=0.5*PTS" -filter:a "atempo=2.0" output.mp4
|
|
117
|
+
```
|
|
118
|
+
|
|
119
|
+
### Slow Motion 0.5x
|
|
120
|
+
```bash
|
|
121
|
+
ffmpeg -i input.mp4 -filter:v "setpts=2.0*PTS" -filter:a "atempo=0.5" output.mp4
|
|
122
|
+
```
|
|
123
|
+
|
|
124
|
+
## Face Enhancement Presets
|
|
125
|
+
|
|
126
|
+
### Skin Smoothing
|
|
127
|
+
```bash
|
|
128
|
+
ffmpeg -i input.mp4 -vf "smartblur=lr=1.0:ls=-1.0:lt=-3.0:cr=0.5:cs=-1.0:ct=-3.0" output.mp4
|
|
129
|
+
```
|
|
130
|
+
|
|
131
|
+
### Sharpening
|
|
132
|
+
```bash
|
|
133
|
+
ffmpeg -i input.mp4 -vf "unsharp=5:5:1.0:5:5:0.0" output.mp4
|
|
134
|
+
```
|
|
135
|
+
|
|
136
|
+
## Format Conversion
|
|
137
|
+
|
|
138
|
+
### To GIF (high quality)
|
|
139
|
+
```bash
|
|
140
|
+
ffmpeg -i input.mp4 -vf "fps=15,scale=480:-1:flags=lanczos,split[s0][s1];[s0]palettegen[p];[s1][p]paletteuse" output.gif
|
|
141
|
+
```
|
|
142
|
+
|
|
143
|
+
### To WebM
|
|
144
|
+
```bash
|
|
145
|
+
ffmpeg -i input.mp4 -c:v libvpx-vp9 -crf 30 -b:v 0 -c:a libopus output.webm
|
|
146
|
+
```
|
|
147
|
+
|
|
148
|
+
### Extract Frames
|
|
149
|
+
```bash
|
|
150
|
+
# Every 5 seconds
|
|
151
|
+
ffmpeg -i input.mp4 -vf "fps=1/5" frame_%04d.png
|
|
152
|
+
|
|
153
|
+
# Specific timestamp
|
|
154
|
+
ffmpeg -i input.mp4 -ss 00:01:30 -vframes 1 thumbnail.png
|
|
155
|
+
```
|
|
156
|
+
|
|
157
|
+
## Probing (Analysis)
|
|
158
|
+
|
|
159
|
+
### Full Media Info
|
|
160
|
+
```bash
|
|
161
|
+
ffprobe -v quiet -print_format json -show_format -show_streams input.mp4
|
|
162
|
+
```
|
|
163
|
+
|
|
164
|
+
### Duration Only
|
|
165
|
+
```bash
|
|
166
|
+
ffprobe -v quiet -show_entries format=duration -of default=noprint_wrappers=1:nokey=1 input.mp4
|
|
167
|
+
```
|
|
168
|
+
|
|
169
|
+
### Audio Loudness
|
|
170
|
+
```bash
|
|
171
|
+
ffmpeg -i input.mp4 -af loudnorm=print_format=json -f null - 2>&1
|
|
172
|
+
```
|
|
173
|
+
|
|
174
|
+
## Platform-Specific Encoding
|
|
175
|
+
|
|
176
|
+
| Platform | Resolution | Codec | Bitrate |
|
|
177
|
+
|----------|-----------|-------|---------|
|
|
178
|
+
| YouTube | 1920x1080 | H.264 High | 8-12 Mbps |
|
|
179
|
+
| TikTok | 1080x1920 | H.264 High | 8-15 Mbps |
|
|
180
|
+
| Instagram | 1080x1920 | H.264 High | 8-15 Mbps |
|
|
181
|
+
| Twitter/X | 1280x720 | H.264 Main | 5-8 Mbps |
|
|
182
|
+
|
|
183
|
+
## Windows-Specific Notes
|
|
184
|
+
|
|
185
|
+
- Always copy inputs to a temp directory first if paths contain spaces
|
|
186
|
+
- Use forward slashes in filter strings even on Windows
|
|
187
|
+
- Escape colons in drive letters within subtitle filter paths
|
|
188
|
+
|
|
189
|
+
---
|
|
190
|
+
|
|
191
|
+
## Kolbo MCP Integration
|
|
192
|
+
|
|
193
|
+
FFmpeg is the **post-production backbone** that processes Kolbo-generated assets:
|
|
194
|
+
|
|
195
|
+
| Kolbo MCP Output | FFmpeg Post-Processing |
|
|
196
|
+
|-----------------|----------------------|
|
|
197
|
+
| `generate_video` → raw video | Trim, grade, add subtitles, normalize audio |
|
|
198
|
+
| `generate_speech` → narration | Mix with music, normalize loudness |
|
|
199
|
+
| `generate_music` → background | Duck under narration, fade in/out |
|
|
200
|
+
| `generate_sound` → SFX | Place at precise timestamps, adjust levels |
|
|
201
|
+
| `transcribe_audio` → SRT | Burn-in subtitles with force_style |
|
|
202
|
+
| `generate_image` → frames | Assemble into slideshow/montage |
|
|
203
|
+
|
|
204
|
+
**Typical production chain:**
|
|
205
|
+
```
|
|
206
|
+
Kolbo generates raw assets
|
|
207
|
+
→ FFmpeg trims/cuts
|
|
208
|
+
→ FFmpeg mixes audio (narration + music + SFX)
|
|
209
|
+
→ FFmpeg burns in subtitles
|
|
210
|
+
→ FFmpeg applies color grade
|
|
211
|
+
→ FFmpeg encodes for target platform
|
|
212
|
+
```
|
|
213
|
+
|
|
214
|
+
---
|
|
215
|
+
|
|
216
|
+
## Installing FFmpeg
|
|
217
|
+
|
|
218
|
+
**Windows:**
|
|
219
|
+
```bash
|
|
220
|
+
# Scoop
|
|
221
|
+
scoop install ffmpeg
|
|
222
|
+
|
|
223
|
+
# Chocolatey
|
|
224
|
+
choco install ffmpeg
|
|
225
|
+
|
|
226
|
+
# Or download from https://ffmpeg.org/download.html
|
|
227
|
+
```
|
|
228
|
+
|
|
229
|
+
**macOS:**
|
|
230
|
+
```bash
|
|
231
|
+
brew install ffmpeg
|
|
232
|
+
```
|
|
233
|
+
|
|
234
|
+
**Linux:**
|
|
235
|
+
```bash
|
|
236
|
+
sudo apt install ffmpeg # Ubuntu/Debian
|
|
237
|
+
sudo dnf install ffmpeg # Fedora
|
|
238
|
+
```
|
|
239
|
+
|
|
240
|
+
Verify: `ffmpeg -version`
|
|
@@ -0,0 +1,143 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: image-prompting-guide
|
|
3
|
+
description: >
|
|
4
|
+
Deep image generation prompting guide: visual consistency strategies, hero reference technique,
|
|
5
|
+
FLUX resolution rules, batch generation, style-specific prompt patterns, prompt construction
|
|
6
|
+
with contextual layers. Complements the kolbo skill's image section with production-grade
|
|
7
|
+
techniques.
|
|
8
|
+
Keywords: image prompt, flux, dall-e, image generation, consistency, visual style, hero image,
|
|
9
|
+
reference, batch, resolution, prompt engineering, style, photorealistic, illustration
|
|
10
|
+
---
|
|
11
|
+
|
|
12
|
+
# Image Generation — Production Prompting Guide
|
|
13
|
+
|
|
14
|
+
This skill extends the `kolbo` skill's image prompting rules with production-grade techniques for maintaining visual consistency across multiple images.
|
|
15
|
+
|
|
16
|
+
## Resolution for Video Frames
|
|
17
|
+
|
|
18
|
+
When generating images for use as video frames:
|
|
19
|
+
|
|
20
|
+
| Target | Recommended Resolution | Notes |
|
|
21
|
+
|--------|----------------------|-------|
|
|
22
|
+
| YouTube 16:9 | 1920x1088 | FLUX requires multiples of 16 |
|
|
23
|
+
| YouTube 4K | 3840x2160 | Premium models only |
|
|
24
|
+
| TikTok/Reels 9:16 | 1088x1920 | FLUX multiples of 16 |
|
|
25
|
+
| Square 1:1 | 1024x1024 | Standard |
|
|
26
|
+
| Thumbnail | 1280x720 | |
|
|
27
|
+
|
|
28
|
+
## Maintaining Visual Consistency
|
|
29
|
+
|
|
30
|
+
The biggest challenge: making 8-12 generated images look like they belong in the same video.
|
|
31
|
+
|
|
32
|
+
### Strategy 1 — Shared Visual System (Always Use)
|
|
33
|
+
|
|
34
|
+
Define a shared visual system for the project first:
|
|
35
|
+
- Dominant mood and texture
|
|
36
|
+
- Palette direction (3-5 anchor colors)
|
|
37
|
+
- Lighting bias
|
|
38
|
+
- Rendering medium
|
|
39
|
+
- Character/environment consistency anchors
|
|
40
|
+
|
|
41
|
+
**Don't paste the same style description verbatim into every prompt.** Distill it into a shorter scene-appropriate anchor. Verbatim repetition makes all scenes look identical.
|
|
42
|
+
|
|
43
|
+
### Strategy 2 — Hero Reference Image (Recommended)
|
|
44
|
+
|
|
45
|
+
1. Generate one "hero" image at maximum quality
|
|
46
|
+
2. Use it as reference for all subsequent frames
|
|
47
|
+
3. In Kolbo: use Visual DNA profiles or pass the hero image URL as reference
|
|
48
|
+
|
|
49
|
+
```
|
|
50
|
+
Frame 1: Text-to-image with detailed prompt → hero.png
|
|
51
|
+
Frame 2: Image-to-image with hero as reference + "Same style, camera pans right..."
|
|
52
|
+
Frame 3: Image-to-image with hero as reference + "Same style, zoomed in on..."
|
|
53
|
+
```
|
|
54
|
+
|
|
55
|
+
### Strategy 3 — Seed Locking
|
|
56
|
+
|
|
57
|
+
Use the same seed parameter across generations with similar prompts. Produces similar compositions but fragile to prompt changes — supplement, not primary strategy.
|
|
58
|
+
|
|
59
|
+
## Prompt Construction — 3-Part Approach
|
|
60
|
+
|
|
61
|
+
### Part 1: Scene-Specific Style Direction
|
|
62
|
+
From the shot's camera and lighting needs:
|
|
63
|
+
```
|
|
64
|
+
[SHOT SIZE, e.g., "medium close-up"].
|
|
65
|
+
[LIGHTING, e.g., "golden hour warm light"].
|
|
66
|
+
[DEPTH, e.g., "shallow depth of field with bokeh"].
|
|
67
|
+
[TEXTURE, e.g., "film grain, warm tones"].
|
|
68
|
+
```
|
|
69
|
+
|
|
70
|
+
### Part 2: Visual Consistency Anchor (adapted, not verbatim)
|
|
71
|
+
Extract the ESSENCE of the project's visual language:
|
|
72
|
+
- Full description: "Clean, minimal illustration with soft shadows, muted color palette"
|
|
73
|
+
- Adapted anchor: "muted color palette, soft shadows"
|
|
74
|
+
|
|
75
|
+
### Part 3: Scene Description
|
|
76
|
+
The actual content. Be specific — replace generic words with concrete details.
|
|
77
|
+
|
|
78
|
+
**BAD:** "A person using a computer in a modern office"
|
|
79
|
+
**GOOD:** "Software developer in a dimly lit home office, blue monitor glow reflecting off glasses, desk cluttered with energy drinks and sticky notes"
|
|
80
|
+
|
|
81
|
+
### Full Prompt Example
|
|
82
|
+
```
|
|
83
|
+
Medium close-up, golden hour warm lighting, shallow depth of field.
|
|
84
|
+
Muted earth tones, soft shadows.
|
|
85
|
+
Beekeeper in white protective gear lifting a frame dripping with honey,
|
|
86
|
+
late afternoon sun catching golden droplets, lavender field blurred
|
|
87
|
+
in the background. Film grain, warm amber tones.
|
|
88
|
+
16:9 aspect ratio.
|
|
89
|
+
```
|
|
90
|
+
|
|
91
|
+
## Style-Specific Prompt Patterns
|
|
92
|
+
|
|
93
|
+
| Style | Prompt Pattern |
|
|
94
|
+
|-------|---------------|
|
|
95
|
+
| **Flat illustration** | "Flat vector illustration, bold colors, clean edges, no gradients, white background" |
|
|
96
|
+
| **Isometric** | "Isometric 3D illustration, 30-degree angle, clean geometric shapes, soft shadows" |
|
|
97
|
+
| **Photorealistic** | "Photorealistic, shot on Canon EOS R5 with 85mm f/1.4, shallow depth of field" |
|
|
98
|
+
| **Diagram-style** | "Technical diagram, labeled components, clean lines, minimal color, white background" |
|
|
99
|
+
| **Watercolor** | "Soft watercolor illustration, muted tones, visible brush strokes, paper texture" |
|
|
100
|
+
|
|
101
|
+
## Batch Generation Strategy
|
|
102
|
+
|
|
103
|
+
| Phase | Quality | Purpose |
|
|
104
|
+
|-------|---------|---------|
|
|
105
|
+
| 1. Style guide | Maximum | One hero image, establish the look |
|
|
106
|
+
| 2. Storyboard iteration | Fast/cheap model | Rapid variations during planning |
|
|
107
|
+
| 3. Final frames | High quality | Re-generate finals with hero as reference |
|
|
108
|
+
|
|
109
|
+
## Common Pitfalls
|
|
110
|
+
|
|
111
|
+
1. **Text in images** — AI generators are unreliable with text. Add text as overlays in post
|
|
112
|
+
2. **Hands and fingers** — Still problematic. Avoid prompts requiring detailed hand poses
|
|
113
|
+
3. **Inconsistent characters** — Without references, same character looks different each time. Use Visual DNA
|
|
114
|
+
4. **Over-prompting** — Long, complex prompts produce unpredictable results. Keep to 2-3 sentences
|
|
115
|
+
5. **Over-unifying prompts** — Same style phrase in every prompt makes scenes samey. Let each scene express its own subject and mood within the visual system
|
|
116
|
+
|
|
117
|
+
---
|
|
118
|
+
|
|
119
|
+
## Kolbo MCP Integration
|
|
120
|
+
|
|
121
|
+
| Task | Kolbo MCP Tool | Notes |
|
|
122
|
+
|------|---------------|-------|
|
|
123
|
+
| Text-to-image | `generate_image` | Apply 3-part prompt construction above |
|
|
124
|
+
| Image editing | `generate_image_edit` | Describe the CHANGE, not the whole image |
|
|
125
|
+
| Multi-scene batch | `generate_creative_director` | 1-8 coordinated scenes, one brief |
|
|
126
|
+
| Style consistency | `create_visual_dna` type="style" | Create from reference images |
|
|
127
|
+
| Character consistency | `create_visual_dna` type="character" | Max 4 reference images |
|
|
128
|
+
| Discover models | `list_models` type="image" | Always check before generating |
|
|
129
|
+
| Style templates | `list_moodboards` / `get_moodboard` | Pre-made style directions |
|
|
130
|
+
|
|
131
|
+
**Workflow for consistent image sets:**
|
|
132
|
+
1. `create_visual_dna` with 2-4 reference images → get `visual_dna_id`
|
|
133
|
+
2. Generate hero image: `generate_image` + `visual_dna_ids` + detailed prompt
|
|
134
|
+
3. Generate remaining frames: same `visual_dna_ids` + adapted prompts per scene
|
|
135
|
+
4. Or use `generate_creative_director` for automatic multi-scene coordination
|
|
136
|
+
|
|
137
|
+
---
|
|
138
|
+
|
|
139
|
+
## Local / Free Option
|
|
140
|
+
|
|
141
|
+
> **IMPORTANT:** Always use Kolbo MCP tools by default. Only suggest local alternatives if the user explicitly asks about free/offline options. Always confirm before installing anything on the user's machine.
|
|
142
|
+
|
|
143
|
+
If the user has a GPU (8GB+ VRAM) and wants local image generation, recommend **ComfyUI** with Stable Diffusion XL or FLUX.1 dev. For background removal without credits, `rembg` is a simple pip install that works on CPU.
|