Modern Creator
Brendan Jowett · YouTube

How I Fully Automated YT Thumbnails With Claude Code

An 8-minute walkthrough of the Claude Code + GPT Image 2 pipeline that generates on-brand thumbnails from a single sentence.

Posted
1 weeks ago
Duration
Format
Tutorial
educational
Views
2.4K
37 likes
Big Idea

The argument in one line.

A curated folder of past thumbnails teaches an image model your visual brand more reliably than any written prompt, making one-sentence thumbnail generation practical for solo creators.

Who This Is For

Read if. Skip if.

READ IF YOU ARE…
  • A solo YouTuber who currently pays a designer or spends time hand-editing thumbnails for every upload.
  • A creator with an existing visual brand who needs to maintain consistency across a growing upload schedule.
  • Someone already running Claude Code locally who wants to extend it into image generation without switching tools.
  • A freelancer or agency building repeatable creative workflows for clients on a tight turnaround.
SKIP IF…
  • You have no existing thumbnails or visual brand to use as reference -- the system needs examples to learn from.
  • You want to generate completely original creative designs rather than replicate an established channel style.
TL;DR

The full version, fast.

Claude Code acts as the orchestration layer: you give it a one-sentence brief, it reads a folder of your existing reference thumbnails, then calls OpenAI GPT Image 2 to produce a 16:9 PNG. GPT Image 2 is the right model for this because it renders clean text reliably and matches style references consistently -- both things prior image models failed at. The reference folder carries the brand signal; the prompt just names the subject. Setup takes minutes using the free GitHub clone, and the whole system runs from your terminal.

Free for members

Chat with this breakdown — free.

Sign in and you get 23 free chat messages on us — ask for the hook, quote a framework, find the exact transcript moment, generate a markdown action plan. Bring your own key when you want unlimited.

Create a free account →
Chapters

Where the time goes.

00:0000:33

01 · Cold open -- proof of output

Six AI-generated thumbnails shown in rapid succession to establish credibility before any explanation.

00:3301:02

02 · System architecture overview

Mermaid diagram walkthrough: Claude Code CLI -> reference folder -> GPT Image 2 API -> output PNG.

01:0201:43

03 · GitHub repo walkthrough

Free clone of the full thumbnail generator system; file structure explained.

01:4302:31

04 · GPT Image 2 model selection

Why GPT Image 2 beats Gemini Imagen and DALL-E 3 for thumbnails: text accuracy and brand consistency.

02:3102:49

05 · Resolution options

2048x1152 default vs 4K output; cost tradeoff explained.

02:4903:31

06 · Building the system in Claude Code

Live session showing the initial one-sentence prompt sent to Claude Code and Claude's clarifying questions.

03:3104:30

07 · Reference thumbnails -- the most important part

Why the reference folder matters more than the prompt; strategy for bootstrapping if you have no prior thumbnails.

04:3005:12

08 · Setup process -- CLI vs web UI, model fix

CLI chosen over web UI; must explicitly force GPT image-2 because Claude defaults to older model.

05:1205:45

09 · API key security

Never paste API keys in the conversation; ask Claude where to store them securely in project files.

05:4506:26

10 · Reference folder setup and easing in

Drop reference thumbnails into the folder; start with a few generations, curate the best, use those as future references.

06:2606:45

11 · First thumbnail generation -- live demo

One-sentence prompt: me working at my laptop, text: Claude Code Course, add Claude logo, orange glow. Output shown.

06:4508:05

12 · 16:9 format fix + closing

Correcting Claude wrong assumption that GPT Image 2 does not support 16:9; tell it to check the docs. Closing CTA.

Atomic Insights

Lines worth screenshotting.

  • GPT Image 2 renders clean text in images reliably -- the single capability gap that made every prior thumbnail-automation attempt fail.
  • Your folder of past thumbnails teaches a model your visual brand more effectively than a 500-word style prompt ever could.
  • Claude Code in this system is not the image generator -- it is the orchestrator that writes API calls, manages folders, and handles the request lifecycle.
  • Claude models do not know GPT Image 2 exists yet, so you must explicitly force it or Claude will default to the older, inferior image model.
  • A one-sentence prompt is sufficient when the reference folder handles style, colors, fonts, and face likeness.
  • Never paste API keys into the conversation -- ask Claude where to store them securely in the project files.
  • 4K output costs more per generation but is visually noticeably sharper on YouTube -- worth it for hero thumbnails.
  • If you have no existing thumbnails, commission a designer to build the first set, then use those as your reference bootstrap.
  • The system works equally well for Instagram posts, Facebook ads, and LinkedIn images -- not just YouTube thumbnails.
  • Claude will incorrectly say GPT Image 2 does not support 16:9 natively -- tell it to look up the documentation to override this wrong assumption.
Takeaway

Reference images beat prompts for brand consistency.

WHAT TO LEARN

The folder of past thumbnails does more brand-teaching work than any written description -- and that single insight changes how you approach every AI image generation workflow.

  • GPT Image 2 renders text inside images cleanly and reliably, solving the single biggest failure mode of earlier image generation models for thumbnail use.
  • A curated set of 10-15 reference thumbnails teaches a model your visual brand -- face, palette, fonts, layout -- faster and more accurately than a detailed text prompt.
  • Claude Code in this workflow is the orchestrator, not the image generator: it manages folders, constructs API calls, and keeps the pipeline repeatable.
  • AI models trained before GPT Image 2 was released will incorrectly recommend older models and deny 16:9 support -- you must override those assumptions explicitly.
  • The one-sentence prompt is deliberately minimal: once reference images carry the style signal, adding more description often degrades consistency rather than improving it.
  • Start with a designer-built brand kit if you have zero existing thumbnails -- the reference bootstrapping ladder works by generating, curating, and recursively improving your reference set over time.
  • The same pipeline extends to Instagram posts, LinkedIn images, and Facebook ads without modification -- the model generalizes from any reference set, not just YouTube thumbnails.
Glossary

Terms worth knowing.

GPT Image 2
OpenAI's image generation model (gpt-image-2) released in 2026, notable for reliable text rendering inside images and strong style consistency when given reference inputs.
Reference folder
A directory of existing thumbnail or image examples that the generation pipeline attaches to every API request as style context, allowing the model to replicate a brand's visual language without a detailed written prompt.
CLI orchestration
Using a command-line tool to coordinate multiple services -- reading a folder, constructing an API request, saving output -- rather than doing each step manually in separate tools.
Resources

Things they pointed at.

01:02linkGitHub -- yt-thumbnail-generator (free clone)
00:00productinflate.agency
Quotables

Lines you could clip.

00:05
100% AI generated, nothing more than a single sentence and I was able to get these perfect thumbnails.
Specific claim + zero setup needed -- lands as a standalone promise.TikTok hook↗ Tweet quote
01:43
GPT image two model is no doubt about it the best model out there when it comes to image generation.
Bold claim with a named model -- shareable as a hot take.IG reel cold open↗ Tweet quote
03:31
These sample thumbnails I would say is the most important part to ensure that the outputs that you are getting are gonna be as consistent and as accurate as possible.
Counterintuitive insight: reference folder beats prompt engineering.newsletter pull-quote↗ Tweet quote
The Script

Word for word.

Read-along

Don't just watch it. Burn it in.

See every word as it's spoken — crank it to 2× and still catch all of it. The same dual-channel trick behind Amazon's Kindle + Audible.

analogy
00:00This thumbnail right here was fully AI generated. So was this one as well. And this one fully AI generated.
00:07This one as well. Perfectly clean text. This one as well.
00:11This thumbnail and this thumbnail as well. 100% AI generated, nothing more than a single sentence and I was able to get these perfect thumbnails fully generated through ClaudeCode. So today, I'm gonna walk through my entire system that I'm currently using to generate all of my YouTube thumbnails through ClaudeCode.
00:26Whether or not you're a YouTuber and you need thumbnails or you're looking to get any kind of consistent designs out of AI, this tutorial is gonna help you out. So for this video, I'm gonna go through my exact Claude code setup in order to be able to do this. And if you do wanna fast track getting this system setup, I have cloned my exact thumbnail generator system and I've put it on GitHub for a completely free clone.
00:44So I'll have the link to this GitHub project within my free school community linked in the description. All you need to do is just copy that link, send it over to ClaudeCode and tell it to set it up for you and then you'll be good to go. But as I always recommend, it's good to have a fundamental understanding as to everything that's going on behind the scenes.
00:58So if you need to make changes or update it or make it better in the future, you know how to do that. So once again, this is the GitHub project with everything included in how this system works. I have told it to create a mermaid diagram as well just to give you a very fundamental understanding as to exactly how this works.
01:11Ultimately, this is an incredibly easy system to get set up and running. Ultimately, what we're using is Claude code at the start here. That's our CLI and that's what we're using to give it some instructions on what to create the thumbnail about.
01:21This will then get sent through OpenAI, of course, using our own OpenAI key. This gets sent into a folder with a bunch of reference thumbnails and images.
01:29I'll cover more about reference images in just a minute when I go through my own ClaudeCode setup. But once it's had a look at those reference images, if it has reference images, it's gonna obviously have a look at them. It's gonna add them into the request into the GPT image two model to then create the new thumbnail.
01:43If you're not yet familiar, OpenAI's GPT image two model is no doubt about it the best model out there when it comes to image generation. Previously, the best model was Google's Nano Banana Pro model, although I would say that GPT image two point o has definitely beat this model. I've been generating a lot of thumbnails both on Nano Banana Pro and on this new model as well, and I would say that this GPT image two model is just way more consistent at following your instructions.
02:05It practically never messes up text as well which is super important when it comes to thumbnails. Once again, I'd say the most important part is that it is incredibly consistent and that's and this is really important where we wanna make sure that all of our thumbnails look very similar to each other for a channel or for a brand.
02:18So once again, recommend using this model. Once it's gone through that model, it's gonna output the thumbnail in a 16 by nine two zero forty eight by one one five two resolution and then this is gonna get saved into our output folder as a PNG which we can then download.
02:31This isn't the highest resolution that OpenAI's GPT image two model can output. Personally, I actually do a little bit of a higher output in the four k range. You don't have to do a four k thumbnail every single time and it can cost a bit more money to do a four k output as well.
02:44But I do find that when you do output it in four k, it can just look a little bit more clearer one on YouTube. So it just depends what you wanna use. Anyways, this GitHub project has all the instructions on exactly how this setup works and exactly how you can get it working.
02:55But now I'm just gonna go through my ClaudeCode setup right here and just go through all of the messages that I sent through to ClaudeCode in order to get this project built out and just highlight some areas that I think you should be paying attention to if you wanna get the best outputs possible. So initially when I wanted to get these thumbnails generated, just said, hey, are you able to build a YouTube thumbnail generator here?
03:11I will provide you with a folder of sample thumbnails that I want you to take reference from. Then please use the OpenAI image two model to generate the new thumbnails and attach these examples to the new generations for it to take reference style from. It's really quite a basic prompt, of course, but ultimately, what we need to use is this image g p two model.
03:27And when you combine this with a really good set of sample thumbnails. So these sample thumbnails I would say is the most important part to ensure that the outputs that you're getting are gonna be as consistent and as accurate and as best as possible for you. I personally wouldn't rely on the image model itself to come up with how the design should look.
03:45At least at this stage, the reason for this is that otherwise you would need a really big prompt to be very specific as to the outputs that you're looking for and if you're not a designer yourself and you don't know exactly what you're looking for, it's probably easier to have a look at some visuals of thumbnails that you do like, provide them as style reference images so that it can just work that much better.
04:02So initially from this, it just gave me a couple of questions as to how I wanted this system to be set up. In my ClaudeCode operating system right here, I just got it to set it up as a separate folder in my YouTube folder. It then asked if I wanted to do this through the CLI, which is just this conversation right here, or create a web user interface to do this where we actually have a kind of like a dashboard.
04:19So you could do the web UI option if you wanted to have a dashboard to better upload stuff into, but to be honest, I don't mind especially for thumbnail generation where it's just a quick note on exactly what you're looking for. It's quite easy to do it just through this conversation. Then third, it asked me about the OpenAI model that I wanted to use.
04:34Because this GPT image two model is so brand new, it's not really been trained into a lot of these Claude models yet, so it doesn't actually know that the GPT image two model has been released and it's kinda recommending us to use the GPT image one model. We definitely don't wanna use the GPT image one model because it is significantly worse than the GPT image two model.
04:51So make sure just to clarify on that and force it to use the GPT image two model. After this, it's gonna ask us for our API key. I would always recommend never pasting your API key into the conversation itself.
05:01What you can do is just ask it to please tell me where to add the keys securely, and it'll give you a set of instructions on where to actually add the key in the folders here so that it's added in securely and you're not gonna be pasting it in plain text. After this, we were basically done. You can see here it's created the folder, it's added the key in, and it has installed everything it's needed to do.
05:18So once that was all set up and ready to go, it created that reference thumbnails folder and just told me to drop the reference thumbnails into that folder. So if I have look on the left hand side into YouTube, into thumbnail generator, you can see references, and in here I have added in a series of thumbnails that I wanted it to take reference from.
05:33I've just opened the folder here in my windows tab and you can see some of the examples that I gave it in order to be able to take reference from. As these thumbnails were also already AI generated before because I already used the system to generate them in the first place. But if you haven't already got any thumbnails or designs to get reference from, one thing that I might recommend doing is actually getting a proper thumbnail designer to build out a set of thumbnails or build out a brand for you before using a system like this.
05:56The reason it's able to replicate this so well is because I've already had such a big database of thumbnails with me in it that have already been created. So if you are yet to create any thumbnails, you probably will have to throw in a reference image of yourself so it knows what you look like as well as some designs and styles of thumbnails that you'd wanna replicate.
06:12Once you have done that, I would sort of recommend easing it in and generating a few thumbnails and finding ones that you specifically like, then using those as the main reference thumbnails to take reference from in the future. And then eventually, you'll be able to have a dataset just like this for it to take reference from.
06:25And so once I had uploaded all these examples, I came back and I said make me a thumbnail of me working at my laptop with the only text of the Claude code course. Then I told it to just add the Claude logo into the thumbnail as well and add an orange glow to the background. And this is what we got back.
06:38We got Claude code course. Text The is all perfect. There's an image of me with the orange glow background, and it's exactly what I was expecting.
06:44Now one thing that I forgot to mention is that there was a thumbnail before this that it tried to generate, and it didn't actually output it in the 16 by nine format. Essentially, 16 by nine is just the proper sort of length and height ratio that YouTube would wanna accept. So pretty much I just told it, can you actually output this in the proper 16 by nine format?
06:59It then went on and then thought that the model, the GPT image two model didn't natively support 16 by nine, which is not true. They do. Obviously, it didn't have that information already.
07:06And so I pretty much just told it, it can do 16 by nine now, but please look up the documentation to find this out. We do have to sort of baby it in to be able to get that information especially once again because it's a new model that doesn't have any information about. And then it was able to output the thumbnail perfectly and that's exactly what you're looking at right here.
07:20So honestly, that is practically it when it comes to thumbnail generation. These models have gotten so good now that we barely need any training material as to how I look or how I want the style references to look like. Because the models are so good, it can just look at such a small dataset and replicate a pretty good thumbnail just like this from not so much.
07:36So once again, works great for YouTube thumbnails but it also works for Instagram posts, Facebook ads, LinkedIn posts, literally anything that you might wanna use images for. This can pretty much crush that. I hope this video was helpful.
07:47If you wanna learn how to run ClawedCode for completely free, check out this video right here where I run through exactly how we're able to swap the models and start using far cheaper and free models without having to use the expensive Claude models that can rack up costs over time. So make sure to watch the video. I'm still pointing at it.
08:01I'm gonna keep pointing at it so you can click on the video. I highly recommend watching
The Hook

The bait, then the rug-pull.

The video opens by showcasing six thumbnails in quick succession -- all AI generated, all with clean readable text -- before revealing the system behind them: Claude Code paired with OpenAI GPT Image 2.

Frameworks

Named ideas worth stealing.

00:33model

Two-model thumbnail pipeline

  1. Claude Code (orchestration + CLI)
  2. Reference folder (brand signal)
  3. GPT Image 2 (rendering)
  4. Output folder (PNG)

Claude Code reads a sentence brief and a folder of reference images, then calls GPT Image 2 to produce a 16:9 thumbnail PNG.

Steal forAny repeatable brand-image workflow: Instagram posts, ad creatives, newsletter headers
05:45list

Reference bootstrapping ladder

  1. Commission a designer to build the first set of branded thumbnails
  2. Feed those as reference images into the generation system
  3. Generate several thumbnails, curate the best-looking ones
  4. Replace your reference set with the best AI outputs
  5. Repeat -- your reference dataset gets stronger over time

A cold-start strategy for creators with no existing visual brand who want to use this system.

Steal forAny creative AI pipeline that needs a style anchor before it can self-improve
CTA Breakdown

How they asked for the click.

VERBAL ASK
07:35next-video
If you wanna learn how to run Claude Code for completely free, check out this video right here.

Creator points directly at an end-screen card for roughly 15 seconds -- unusually persistent but effective for driving watch time to a second video.

FROM THE DESCRIPTION
PRIMARY CTAWhere the creator wants you to go next.
Storyboard

Visual structure at a glance.

proof montage
hookproof montage00:00
architecture
promisearchitecture00:33
github repo
valuegithub repo01:02
live build
valuelive build02:49
output demo
payoffoutput demo06:26
CTA
ctaCTA07:35
Frame Gallery

Visual moments.

Watch next

More from this channel + related breakdowns.

12:29
Brendan Jowett · Tutorial

5 Claude Skills I Can't Live Without (Steal These!)

A twelve-minute tour of the five Claude Code skills Brendan Jowett uses daily — humanizer, architecture diagrams, Remotion video, front-end design, and PDF generation — with one-prompt installs and a peanut-butter brand demo running through the back half.

May 2nd
Chat about this