here's all the news from the AI world that I think you'll find interesting from this week now there wasn't a ton of major announcements this week that's because of the 4th of July here in the United States I believe a lot of companies sort of slow down on the announcements and big releases during holiday weeks because of travel and things like that but that doesn't mean there's not a lot to talk about a lot of really cool things still happen this week starting with the fact that gen 3 access was made publicly available now I made a video earlier this week on Sunday before gen 3 was released and then on Monday the day after that video came out gen 3 was made publicly available unfortunately Gen 3 is not available if you're on like a free plan for Runway you do have to be a pro user but if you are a pro user of Runway you can come over to generate videos You'll see a big introducing gen 3 Alpha you can click get started and enter your prompt here I've made a ton of videos with Gen 3 already and again I already made another video about it so I'm not going to go into too much depth in this video about gen 3 cuz that whole video will break down what you can expect however I did attempt to make a 4th of July themed video with the prompt a bald eagle flying in front of an American flag with fireworks in the background and well this is what it gave me um yeah it's not always the best but it is the best textto video generator we have right now compare that to something like Luma AI where I was able to actually give the prompt to an image gen generator upload my image into Luma and I got something that's a little bit more impressive out of it so text a video Gen 3 is the best we've got today image to video Luma still the best in my opinion you can't do image to video yet with Gen 3 11 Labs got a couple Updates this week as well last week we talked about their new reader app this week they've added some famous voices to the reader app like Judy Garland James Dean Bert Reynolds and Sir Lawrence Olivier and before people start freaking out to much they did get permission from the estat they made deals the Estates are getting paid for them to be able to do this so this was all done above board with permission if you want to check out the app yourself search out 11 Labs on your phone make sure that you're downloading the proper version that is created by 11 Labs Inc a lot of people on the App Store are going to try to fool you into thinking their app is the actual 11 Labs app but you want the one that actually says 11 Labs once you're in the app you can actually see some of these new iconic voices that are in there I think it's cool that they have them but personally due to the time that a lot of these people were alive and their voices were available audio quality isn't my favorite for example here's what Judy Garland sounds like once upon a time in a land Far Far Away there was a beautiful young girl who had become lost while traveling Bert Reynolds is probably the best sounding one just because he's a little bit more recent a little more current for the good old American life for the money for the for the glory and for the fun mostly for the money I could see listening to articles and PDFs and stuff with Bert Reynolds voice 11 Labs also released a new voice isolator feature where you can upload any audio with any sort of background noise and it will clean it up and make it sound really good I particularly like this demo that they put online that shows how good it actually works check this out need to remove background noise from your video use our new voice isolator model for crystal clear audio every time I mean it's pretty impressive if you're a fan of sunno and making music with sunno they also released an app this week it pretty much does the exact same thing is the web app it just makes it easier to do it on your phone at the moment I believe it's only available on iOS so if you're on Android it's probably coming soon but at the moment you're kind of out of luck when you search it out in the App Store search out soono once again be careful a lot of people will try to fool you into thinking their app is the official sunno app there's one here that sayso AI by sharp Forks it's not sunno but there are quite a few apps that claim to be sunno so make sure you are grabbing the right one it has this little tiedy psychedelic looking background and the creator of the app is sunno Inc so look at the Creator make sure it says sunno Inc and that's the proper app once you're in sunno it works pretty much like the web version version you've got your library of all the songs that you've created you've got a create button your newest songs pop up towards the top on the web app your newest songs are at the bottom of the list once again I was trying to generate some stuff for a Fourth of July theme this is what it came up [Music] with you get the idea meta released some new research this week around text to 3D images it's called 3D genin and you can apparently input a text prompt and output a fairly decent quality 3D image this could speed up things like game development and creating assets for 3D videos and things like that here's the demo that they posted on X to kind of show off what it's capable of we got a little like breedy pug here made out of metal a dancing robot it looks like they fed this into mixamo or something to get that dance and it looks to be pretty good I mean I can't really say if this looks a lot better than a lot of the text to 3D we've already seen or if it's about on par and we don't have access to it yet they just released the research but once I can play with it I'm definitely going to play with it and see what we can do with it this week this company called kotai I'm not sure if I'm pronouncing that right or not but there are open-source AI research lab they released a new voice model that looks to be trying to go head-to-head with GPT 40's advanced voice and not only is it available right now to try out out they're actually open- sourcing it so that other companies could build off of it and use the underlying Tech to build new tools from it if you want to try it out you can head over to Moshi do chat it says enter your email to join the queue I don't know why it does that CU as soon as you click join Q it just spits you right into it where you can use it hello how can I help you what's your name you're Moshi no you're mhi who made you I was created by Q Tai a nonprofit research lab based in Paris what things are you best at I'm the voice artificial intelligence so I can understand and respond to human speech are you good at math yes I can perform basic math calculations what's 7,532 * 7 7,532 * 7 is 53 2,33 that wasn't exactly what I asked but still pretty impressive thank you now not quite on par with what we saw out of GPT 40 the voice is not very expressive yet still sounds very robotic however the responses you heard were in real time I didn't edit this to make it seem like it was responding to me quicker than it actually was it actually instantly responds sometimes almost talking over you before you finish your sentence and because this is open source this is just the very sort of base level the foundation of what we're going to see from this Tech as soon as other people start using this integrating it with tools like 11 Labs with more realistic voices integrating it with other large language models that are probably smarter than whatever it's powered by right now this is going to actually get a lot better and be a decent alternative to GPT 40 with the advanced voice assistant and speaking of Open Source there's a new open source large language model that was just made available over on hugging face called intern LM 2.5 what's interesting about this one is not only that it's open source but it has a 1 million context window now Gemini from Google Now has a 2 million context window which is probably overkill for most let's be honest 1 million context is probably overkill for most but now we have an Open Source 1 million context window that anybody can build with and it's available right now for anybody to play around with over on hugging face the weights are available right now and most likely either now or pretty soon you'll be able to use it in a tool like LM Studio or Jan or chat RTX it's kind of like a your own model chat system speaking of bring your own model chat system the brave browser just made an update where you can bring your own model into the brave browser so brave has their own AI similar to what you see from Microsoft co-pilot built into the browser they call it Leo Ai and now you can actually use custom models they've already got Mixr Claude and llama but now you can actually add a custom model in if that's something you're interested in complexity also got an update this week with their Pro search the new Pro search has multi-step reasoning so it actually understands when a question requires planning Works through goals step by step and synthesizes in-depth answers with greater efficiency it can also analyze search results and take intelligent actions based on findings it's also much better at math and programming because of the addition of Wolfram Alpha if you're a free user of perplexity you can use PR search five times every four hours and if you're a perplexity Pro member you get nearly unlimited daily access so there's some limits but you're probably not going to find them I feel like open Ai and chat GPT really led the way in making chat Bots and large language models popular but if I'm being totally honest I've become much more of a fan of clad and perplexity I use clad for most things except for when I need to do research perplexity seems to be the best option right now it looks like apple is actually getting a board seat on ai's board it's going to be an observer role so they won't have voting rights but it's super interesting because obviously Microsoft and Apple are like the two largest companies in the world and they're kind of head-to-head competitors and both companies are actually going to have board roles in open AI so pretty fascinating but when it comes to open AI another week another lawsuit the center for investigative reporting is suing open Ai and Microsoft once again claiming copyright infringement open Ai and Microsoft started vacuuming up our stories to make their product more powerful but they never asked for permission or offered compensation unlike other organizations that license our material if you remember recently open AI has been signing a ton of Licensing deals Associated Press Axel Springer Financial Times News Corp VOX media the Atlantic time makes me wonder if some of these lawsuits are just to try to get on the radar of open AI so that it can open discussions to get a licensing deal at the end of the day this is all about money for these companies and I'm sure if open AI made them a licensing offer that they couldn't refuse they would get on board with open AI pretty quickly but that's just speculation Mustafa solman who in my opinion is usually pretty calculated with his words and a very very smart person he wrote the book called The Coming wave which is a great book on sort of where AI is now and where it's headed but he made some statements that even I really really can't get behind with respect to content that is already on the open web the social contract of that content since the '90s has been that it is fair use anyone can copy it recreate with it reproduce with it that has been freear if you like that's been the understanding there's a separate category where a website or a publisher or a news organization had explicitly said do not scrape or crawl me for any other reason than indexing me so that other people can find that content that that's a great area and I think that's going to work its way through the court yeah I I don't know about that if I put up content on the internet on YouTube on my personal blog just because I didn't put like a specific robots. text file telling not to scrape on this I should just accept that these companies are allowed to use this content however they want I don't know I I have a hard time getting behind that I kind of understand what he's saying if you're putting it on the internet everybody can see it anyway so what's the difference between them seeing it there or seeing it here I feel like that's sort of the point he's trying to make but that doesn't completely negate copyright I don't know what are your thoughts do you agree with Mustafa here oh and by the way if you absolutely don't want to be scraped and you don't want your content taken by AI Cloud flare rolled out a solution this week if you're on cloud flare which is a Content delivery Network that sort of lives between like your domain and your hosting and it like caches a lot of the stuff on your website to make your website load faster and I'm not the best person to be explained this but it's a service that sort of helps your site load better helps with security on the site helps with preventing more downtime on your site things like that now they have a switch that you can flip on that makes it so these scrapers that are scraping for AI can't scrape your site anymore and it's available to both free and paid users of cloud flare if you're worried about companies like Microsoft just taking your content and using it however you want Cloud flare could be a good option for you especially since they have free options and speaking of using your content to train AI last week figma had their config conference where they showed off all their AI features this week they put out an official statement to users that says the company needs to train models that better understand Design Concepts and patterns and figma's internal formats and structure through figma content apparently they are going to offer a way to opt out but by default apparently they'll just train on whatever designs and things you make inside of figma figma also got into a little bit of hot water this week when it came out that the weather app that it was design sing for people looked identical to Apple's weather app the CEO of figma did speak up and say that they were using offthe shelf language models for their built-in AI so it wasn't specifically trained on Apple's designs but maybe the off-the-shelf AI they were using was trained on Apple's designs and they've since paused this feature so the ability to actually make this kind of stuff in figma is not available right now while they fix these issues and make sure that it's not going to accidentally create designs that look exactly like other companies designs YouTube rolled out a new feature where if a YouTuber creates content that simulates what you look like or simulates your voice you can actually request to have that taken down now before it actually had to be like stolen content like if somebody took my videos and put them on their YouTube channel I can force them to take it down or if they used my music or my IP I can claim copyright on it and have it taken down well now if somebody just uses an AI version of your voice or an AI version of your face you can request it to get taken down as well Instagram also made a little bit of a tweak a lot of people were having a bit of a fit about the fact that images that used any bit of AI at all like if you used Photoshop it would probably say made with AI on your image and people were going hey this image wasn't made with AI I just did some color correction that maybe might have used a teeny bit of AI or I just erased a little thing in the background and that might have used a little bit of AI that doesn't mean my image was made with AI it just means I did a tiny tweak to the image to make it look a little bit better well they changed their wording so now it no longer says made with AI now it says AI info you can click on it and see what's in the metadata we also got word this week that a new version of grock is coming out grock 2 is apparently arriving in August this comes from Elon himself he was replying to Beth Jos on X who said models training on each other data is like a human centipede effect elon's reply was sadly quite true it takes a lot of work to purge llms from the internet training data grock 2 which comes out in August will be a giant Improvement in this regard there's also a rumor going around that Apple might partner with Google Gemini at WWDC they announced that they were partnering up with open AI to power some of its chat features it's starting to sound like Gemini will also be an option if you don't want to use open AI Tech now this article does say apple could announce a Google Gemini deal this fall so this is all still just rumor and speculation WhatsApp appears to be getting a new feature based on some leaked screenshots it looks to me very much like what Apple showed off in their WWDC keynote where you can upload an image of yourself and it will generate sort of cartoon or alternate versions that look like you and this appears to be a feature that they're going to roll out inside of WhatsApp and I imagine if it's in WhatsApp it's probably also going to be in Instagram and messenger and all the other sweet of tools that meta has out there and since I mentioned meta they're getting some competition with their meta Ray bands which personally I love they're my daily wearers for sunglasses but there is a new company that is trying to make something very similar that looks very similar with little cameras on the side and it will have an llm in fact it's going to use chat GPT 40 for their llm inside of it which if I'm being quite honest is still better than what you get with llama 3 which is what's currently in the glasses that meta has and finally this was super cool I wanted to show this off it's called open television it lets you immersively operate a robot even if you're 3,000 M away just like the movie Avatar so we can see somebody over here wearing an apple Vision Pro moving their hands around they're over at MIT in Boston and the robot itself is over here in UCSD and what he's doing with his hands the robot is actually doing on the other side of the country I thought that was pretty cool I want to dive more into this Tech I'm definitely going to do more more videos on Robotics and things like that in the future heck I'm in San Diego maybe I can get a demo of this in person over at UCSD but I just thought this Tech was really cool wanted to share it as my final little exciting fun thing for this week again Fourth of July week so not a ton of huge major announcements but still plenty to talk about hopefully you feel more looped in on what's going on in the AI world and found some of this stuff cool interesting fascinating disappointing scary whatever all those emotions are val all I know is I love researching it I love turning around and pointing it out to you and I'm so grateful that you check out these videos and allow me to do this because I'm having so much fun and I really really appreciate you tuning in don't forget to check out futur tools. where I share all of the latest AI news and the coolest AI tools that I come across join the free newsletter if you haven't already it's going to be focused on AI news but also more emerging Tech not just AI there's a lot of AI newsletters out there so going to start expanding it into a little bit more of other emerging Tech outside of AI as well you're going to want to see this really cool stuff coming out of that check it out at Future tools. and if you like videos like this you want a weekly breakdown of the AI news you want AI tutorials you want to stay in the loop with emerging Tech and where the future is headed like this video And subscribe to this Channel and I'll make sure more stuff like this shows up in your YouTube feed thank you so much for tuning in once again I really really appreciate you hopefully I'll see you in the next video bye-bye