oh lonely dark man okay I spent the last week trying to break GPT40's image generation prompt after prompt pushing it to its limits getting stuck by rate limits waiting but I've come away with 10 examples that genuinely surprise me some of them were total failures and others turned into paid client work almost instantly which is really cool and a few have completely changed how I think about the creative workflows I have especially for product marketing brand shoots and rapid concepting if you work in art direction videography branding or even niche marketing I'm telling you this isn't hype the tool is here and it's already incredibly useful so in this video I'm going to walk through the exact prompts I used what worked what didn't and where this tool fits into real world creative stacks a quick note before we dive in a lot of people in the comments previously asked whether they actually had access to the GPT40 image model the new one so let's just clear that up first i'll show you where you can find it so when you land in J GBT whether that's pro or not you'll notice the create image button and so this is a sure sign that you've got the new image model everyone does now so I wouldn't worry too much and let's just stick an image so I want to show you another cool thing I've just realized we can do an image of a man in a sad duck costume eating a hot dog on um on the beach in the rain generates final thing you should be seeing as long as you're seeing the getting started and the image generation like this then you're definitely using the new model so let's look at some of the stuff that I've been playing with while this uh dark man gets built here is the stuff I did last time around so the photo shoot a couple of people asked in the comments how I was able to achieve such consistency between the characters and such lifelike imagery but there's the answer was I just asked Chat GBT how to go about it what art direction we should work with and these were the results now I stuck these into a couple of different video generators just to see how they looked and you I'll pop them up on the screen now then I came to this idea of using chatbt to create these kind of branded photo shoots so working with GPT I asked for a series of art direction prompts and then composition prompts for for company so the first one I tried was a electrical contractors company so these the idea is these guys would be on site and it's as if they had paid for a branded shoot they were models in branded clothing dealing with electrical contractor stuff i just used the website text to help me come up with ideas and these were fine they're just slow to build and they don't look that realistic they look fine bit deadeyed really cool compared to what we could get similarly had a quite a better idea here with the and a landscape gardener on the on the coast in England and here I wanted everything to be shot in golden hour i wanted to have men in it in um in these these polo shirts that were branded and I think you could get somewhere with this it still feels a little bit artificial but I think that's more in the art direction I went with to be honest though for these kind of shoots I had much more success still doing this stuff in replicate so within Replicate I can within 90 seconds generate all these images each one of these took at least 90 seconds each one in Replicate um using the Flux Pro Ultra model which I'll show you a second over here we can generate tons of imagery i think this looks much more realistic because I use the RAW output which is over there um and this everyone's in lanyards everyone feels like they're part of the same company it's really consistent we got these three similarish looking dudes throughout now if I need to batch process images we're still not at a place where the GPT40 image generation is fast enough to be viable to for commercial reasons it's fun for hobbies and stuff but I'm not going to be sat there doing that maybe I'd pay someone else to do it but we're waiting for API access and we're waiting for things to speed up third application here is just Mother's Day stuff also had some pictures of my wife and kids and put them into GPT and worked around getting it to deliver Disneyish style imagery without using the big scary dword which scared it uh these came out really cool like Wallace and Grommit style more Pixar style occasionally it would add like a deformed half dog half human which we don't actually have in the family um also even more terrifyingly fourth child which we definitely don't have in the family uh and otherwise it did really well I think in terms of making everyone look pretty and beautiful and uh my wife and family really like them for Mother's Day so this is really cool and opens up personalized messages and cards and stuff for people and onto a whole new level now because unlike previously this stuff comes out and looks so much more like the photos that you started with a good friend of mine I was a bit jealous he had he has a boiler firm and he went away and had a real human graphic designer come up with these really beautiful old school kind of skate style t-shirts for for for his company wouldn't it be funny if I could get this working for you showing you what this would look like on a on a skater in Camden in London and so my prompt was put a British skateboarder in this t-shirt when a low res screenshot in Camden holding his board and we got a couple that came out quite well in terms of just the structure and just whacking that onto a t-shirt and trying its hardest to get it now it got a bit confused it brought over my steady bow colors which is my company cuz I got jealous and I decided I also needed skateboarding t-shirts um I asked for my own range of skateboarding t-shirts and it kind of it got the colors right i think actually ironically the one here that's best is the skateboard ding which I would probably wear uh and then asked for it to take a picture of me and put me in a skate shop wearing the skateboard ding t-shirt and facially it's not right i swear that's not me really cool kind of stuff that just just you know 10 days ago 7 days ago was not possible to just be like hey I want this on a t-shirt i want to see what this looks like over here that mindset that way of thinking about how you can compose images now uh has completely changed which brings me on to structured information and and how powerful that can be with this image generation model user Jack here put together this amazing JSON prompt and it set me off on the whole journey i just don't use JSON enough in my communications with language models jason is is a way of in plain text structuring information uh in a way that language models seem to really like now I'm not an expert at all but he shared this prompt about how to generate very stylized logos that could be pulled out as SVGs very easily because they are well you'll see they're like bold and stylized there's no curves it's all very chunky and using his his scripting I was able to go away and make a whole collection of really fun uh icons everything I could think of from ice cream to wallpaper decorators a luchador penguin eating fries uh a sloth knight uh a mummy horned mummy gorilla an axelottle transformer um builder and then add some shading on so logo stuff has got really cool and this is using this JSON to make sure that the the outputs are very very similar so you could use this for example on a website if you wanted to have a site branded with icons in the navbar or icons for your services all to feel cohesive you could start with a baseline prompt like this and then build it out maybe you do want curves maybe you want something that's specific to your brand this uh this opens up infinite possibilities about logo and and and generate around logo creation which is really cool logo and icon creation this is fun done a bit of work for a client whose social media channels I look after and we were able to very quickly create an April Fool's prank which was that they had not only been in concrete game but they've moved into uh Concrete as well so lovely little Instagram thing and we just fed in the picture of the truck and asked it to be spat out as a as an ice cream truck and it did a really good job so back to Jason here and this was combining a couple of things if I zoom back over so I wanted to combine this idea of a branded photo shoot with art direction and composition and and consistent characters with instead of using plain English and making all these tables and trying to understand image plans and couldn't we just structure that in the same way as Jason which is what I wanted to do so I just asked GPT to help me get there i knew I wanted this idea of like a rugged called the Maverick who using AI and going back and forth came up with this together which was Jason that talks about who the Maverick is we look at his sunglasses his wardrobe the visual tone the art style it's always going to be cinematic shallow depth of field we're going with heavy canvas textures raw denim angel leather we're giving it a essence of the art direction and then on top of that sticking in um more plain English prompts so I wanted to see him walking away from a jet and a bat like a a jet fighter i wanted him chilling out uh in the Penines having a chill with his socks off i should probably be charging you for the feet picks um on a hike there he is back with the jet there he is in Cuba he's over in Berlin having a coffee being super cool he's on the 1980s subway puffing on a on a massive stogy he's on a Soviet abandoned uh roller coaster but the thing here that works is these images the man is the same man throughout much more so than other models which I'll show you right now and I'll go and show you what I've just trying to spit the same thing out using um Flux Flux Pro Ultra in the playground here in in Replicate while it's great I can generate these images instantly none of them are consistent from a from a consistency these are great images i really like I think some of these come out really well but it's not the same dude so if I was working with influencers or trying to sell something with a with a personality behind it while I think you can still get some great imagery from other models I mean this is also a fraction of the price and a fraction of the speed there's three more to generated that I just turned on we get a much better sense of of art direction consistency and character consistency using GPT this is a bit more of a practical application asking it for how is my lighting setup i don't really know what I'm doing i don't know i've been blinded by key light and and this thing oh god and got lights dotted around in the background to try and make it uh pretty but I don't really know what I'm doing and I've asked for advice from other people and they've given me notes but I thought I could take that over to to GPT um as you can see I asked it to try and show me how I could look and it's a bit of a disaster but the lighting advice seemed interesting but then I asked it okay well show me the optimal version of how that I would look and it came up with this which I thought was not good at all i don't want to look like that that's too dark and gloomy um so another one I saw on Twitter here was to feed some normal product shots to GPT and then ask it for some macro shots of that product so that means very very closeup high-res imagery so I fed it in some low res shots of this random Halford's uh track pump which I just came popped into my mind and these got sped out now these are entirely fictional pumps go way above 80 PSI that's not actually usable as a as a pump uh and that looks nice and it's kind of based on this but it's not the same so that one was interesting from a conceptual point of view and I can imagine shops wanting to use this kind of stuff but you would have to get a few iterations out and I probably take high-res images up close with an a camera and then take that those images and ask them to be macros i then did the same thing with uh this Swiss watch style just to see what would happen so again fed it to low res images straight from its website and asked it to spit out some macro shots and a man modeling it standing in the middle of the train tracks and it's done fine the text is readable it's not really not good enough but there is a situation where with enough prompting and enough good images up front from references you might be able to get some really interesting stuff here again stuff like this was just not possible a week ago so still fresh coming up with all these ideas this was borrowed from someone on on on Twitter and uh it does start to make you think of all the other possibilities that you could do with this kind of thinking back over to our sad duck man he's done now cool fine we can be able to generate images like this all the time but now if I said I want a profile shot a side profile shot a shot from behind images but we can take the camera and move it around we could change the lens we could add another subject manipulating in that space is really interesting when it comes to taking this stuff to video generation because we can then tween those two images and and the the video generation can generate the the frames in between from shot one to shot two that's the lot 10 prompts a few total flops and a bunch that I think really show where this tech is going but now I need to shift gears a bit my core focus is still building custom tools and automations bots backend logic front-end tools whatever it takes that solves real problems for my current client base and hopefully more that come on starting with improving their ad conversion rates and going from there so I'm back working inside Lovable sorting out some of the SEO quirks that we've uncovered and largely solved and scaling out tools that are already generating quite serious revenue for me with some crazy potential these aren't hypotheticals but things actually out in the wild delivering results for clients so crazy exciting time at the moment if you want to see more on chat GPT image generation the cool promps automation flows whatever drop it in the comments i can't wait to get API access which will again change everything i'll reply where I can and I'll make more videos but honestly the best thing you can do is just ask the GPT tool itself if you want to create something try it break it see where it bends this stuff only gets more valuable when you start plugging it into real workflows not just poking prompts but automating the whole thing end to end for business owners that have these problems and need them solved and that's where I'm heading so stick around if you want to come with [Music] [Laughter] [Music]