Transcript for:
AI Voice Apps for Productivity

these AI voice apps for your Mac could potentially save you more time than all your other tools combined to start they let you type four times faster than the average person with very accurate dictation that completely puts apples to shame and if you think about how much time you spend typing on your computer that's already pretty ridiculous I mean just imagine all your emails notes messages and AI chats being done at the speed of thought but it's actually just the beginning of what these apps can do all three come with their own unique set of AI features aimed to supercharge the human voice range from free to pricey beginner to Advanced so there's something here for everybody let's put them head-to-head so by the end of this video you know exactly which one is for you and how to use it kicking us off is whisper flow this one is the new kit on the Block but boy is she coming in hot because it legitimately blew my mind the first time I used it and I chose it first because it's not only really impressive but very simple let's reply to Caitlyn's email here using whisper flow and see just how fast we can do it go ahead and start the timer she asked what is the big deal with whisper flow I'm going to go ahead and start it in handsfree mode hey Caitlyn great question whisper flow is one of the best dictation apps I've ever used I'm G to stop it there actually check this out one it's already formatting it correctly as an email there's an exclamation point which is maybe a little weird but I don't mind it and then it's literally got her name spelled correctly as well because it can see limited text that's on the active application pretty sweet but we're going to keep going whisper flow can do all of the following one it dictates really really fast two it's able to um fix uh filter I mean filler words three it handles punctuation completely on on its own four it adapts to the app that you're in using proper punctuation in emails but keeping things more casual in messages this also helps it get named right the first time five it generates a dictionary of words that are personal to you allowing me to say something like Tai teaches Tech knowing it'll be spelled correctly and six it has whisper and command mode but I'm going to show you that in a second anyway hope this helps let me know if you have questions best Tyler okay this is a very complicated uh thing for AI to get but let's see how it does okay ignore the uh change up here I'm pretty sure that's an apple thing because that doesn't happen anywhere else but we see it dictates really really fast it is automatically getting the punctuation completely correct with the numbering which is really cool it completely fixed this sentence if you remember me messing up a lot when saying it and then it adapts to the app you're in like we saw up here this Caitlyn with a new line As a classic greeting is automatically getting added by flow and as we can see in number five it did get tied teaches Tech correctly though I would have preferred to have a capital T here but either way this is amazing because we all have words that are very personal to us whisper flow just makes this a very personal experience with the different spelling here the dictionary by the way can be accessed in the menu bar by clicking add word to dictionary you can add the obvious ones manually off the bat but Flo also learns them automatically as you use it and correct its output for example I didn't have to tell it how to spell my girlfriend sister or a doctor's name I just went in and corrected the false output and Flo let me know it caught that and won't mess up again now the other two apps do have dictionaries but they work a little bit differently and you do have to add words manually if Flo gets this right and can add words consistently accurately automatically all the time for people then I think it's going to be huge for winning over a big user base but we're going to step it up even more and I'm going to show you these two modes right here whisper mode is exactly how it sounds actually if you find yourself around a bunch of noisy people and are still too lazy to type don't worry at the small cost of looking just a tad bit crazy in front of your peers you can whisper into the microphone and flow will still sort it all out since I'm in Social I just tested this at home with my TV on and it did work very well but it also worked for super whisper and Mac whisper Now command mode is where things get extra juicy and there's two ways to use it one if you highlight your text you can have Flo edited just say hey Flo and then tell it what you want for example hey Flo make this more concise or hey Flo translate to Spanish this could be incredibly useful because you can simply turn on Flo and then stream of conscious reply to let's say a professional email it'll look terrible when it's done but then you just highlight and then say something like hey Flo turn this into a professional email while maintaining as much of the original wording as possible and there we go wild stuff command mode will also give us access to Google search and perplexity and I think more Integrations are on the way and I do not think there is a faster way to Google check this out if I have a question I just have to hold down the function key and say search Google for is a tomato a fruit or a vegetable there we go instant answers it's so simple but so effective once you get in the habit of using it all in all I was very impressed with their dictation it has like that magical like it just just works feel to it and it gets all these little things right that you don't realize are important until you start trying using dictation all the time like I did it's fast enough where I can add even just a few words at the time and know it's going to be faster than typing it lets me add in the middle of sentences without having to worry about punctuation being off it formats and adds punctuation on its own very very well and for each app I ask the team what's in store for the future and for this one they emphasize we're going to be able to have even more and more control over the output for example if you only text your friends in like all lowercase you'll be able to set that up accordingly or you'll be able to record like a 10-minute ramble inside a flow and then have it organize it however you want maybe like for a CEO PDF or something like that this is the kind of feature that the next two apps will have as well and finally and probably most importantly for a ton of you they're coming out with an IOS app so you'll be able to get the same super fast dictation on your phone on the go there's like a free trial if you want to test it out but it's going to be like $12 to $15 a month if you really want to use this thing like you should but if you're a stickler for free don't worry Mac whisper has got your back this along with super whisper introduces a whole new set of features that supercharge your voice in fact I would be doing this app a disservice if I only talked about its free dictation feature this app is a voice to text Powerhouse and a ton of people love it I mean Feast your eyes it's like an all you can eat transcription Buffet it lets you work with open AI free whisper speech to text models to create and work with highquality transcriptions of just about anything video files audio files meetings YouTube videos audio playing from anywhere on your computer you name it and it's all done completely locally so no data leaves your computer being able to transcribe all this audio is an essential tool for Content creators students academics and business professionals it makes you feel like the world is at your fingertips because you can take anything transform it into text and then feed it into AI to learn more about it it's as simple as dragging the file into Mac whisper hitting copy transcript and pasting it into your AI of choice but the pro version which by the way is a one-time buy unlocks quite a bit more you'll be able to batch transcribe multiple files at once use cloud transcriptions which are going to be faster and very accurate for older Macs transcribe YouTube videos straight from their URL and most importantly use your favorite AI service to chat with any transcript now I don't know if you've noticed but most apps charge like $8 to $15 a month just to unlock the AI features but the developer or Mac whisper is a real one instead you can plug in what's called an API key which means you're only going to pay for the AI Services you use and you get to pick which one it is now I realized that might sound too technical or expensive but don't worry it really isn't setting up this API key is essentially as easy as going to the website linked directly within Mac whisper adding a credit card clicking one button to generate the key and then pasting it into Mac whisper now my account with an AI service and Mac whisper are perfectly connected and anytime I use Mac Whispers AI features it will use this API key to send the prompt to chat GPT and the account will get charged accordingly bookmarking this website will let you track and put limits on how much you spend making sure you don't go over but to give you an idea of just how cheap the API can be if you send over the entire sorcerer Stone over to chat gbt and then tell it to make a summary that's like 10% the size of the book it's going to cost you a whopping 2 cents but yeah if it's still confusing and you want me to make a video on setting this up step by step then I would be happy to just let me know in the comments now back to Mac whisper features transcriptions can be saved here on the left for each you can play the file and follow the transcript in real time assign speakers to each line manually which is going to be great if you're in production and ask the transcript questions using a live library of fully customizable AI prompt presets you can also translate into other languages and Export into just about any file type you can imagine now Mac whisper gives us a free version of dictation that you can also supercharge with the pro version it's actually a newer feature that's in beta at the time of this recording but there are three key features that make it better than Apple dictation in my opinion the first point we already established it's that open AI whisper models are going to be way more accurate than Apple's built-in dictation the second is that similar to flow you can create a personal addiction AR of commonly misspelled words and you can even get fancy and add things that are just like impossible to say like links or emails or addresses so for example with this one right here whenever I actually say Instagram link the actual link is going to get added and again while this is a very useful feature it does not add words automatically like flow so keep that in mind but it does have this unique find and replace which allows us to add links so whichever one's good for you and the third reason is the most powerful though it's also paid it's these AI prompts these allow you to run everything you say through an AI prompt before it gets pasted into the computer this is essential for being able to really accurately add like emojis or punctuation because open ai's whisper model does not do it by default which is the one thing that Apple dictation does have over it but you can also get really fancy with these AI prompts for example you can have it automatically translate what you say into Spanish before getting pasted into the app or you can have it Fix Your Grammar and make it sound very powerful or maybe not even transcribe what you said at all and instead if you want to have the AI I answer any question that you say out loud you can have it do that as well the beautiful part about this is it's really up to you and your creativity and ability to prompt and changing the prompt your transcription gets run through is as simple as clicking this little Sparkle icon that pops up when you start dictation for example here if I select just dictate then we're going to be using just the whisper model which is fast but it doesn't do any fancy stuff like spoken punctuation for example if I want to say exclamation point it's literally going to say exclamation point however this is a really easy fix because you can create a PRP that not only lets me dictate any punctuation or Emoji I want just like apple dictation but also I can add in that it automatically cleans up any filler words or mistakes I make while talking out loud with this prompt which by the way I'm including everything in the description below I was able to get surprisingly similar results to flow and to show you just how creative and useful these prompts can get check out this hey Marge example I created it's designed so that I can give AI instructions to do anything I want at any point in the recording as long as I start the instructions with with hey Marge and end with thanks Marge for example I can activate Mac whisper and say something like hey Marge I'm about to think through everything I need to do today but it might be all over the place so I need you to make a concise to-do list when I'm done thanks Marge and then continue to just rant out loud free form thought on what I need to do today and then when I'm done the results not going to be my transcript but instead the to-do list that I asked for so this lets us really just take what we say and then transform it into the digital world and organize it instantly and finally if you've made a library area of prompts that you use for different situations you can set certain prompts to be defaulted based on what app you're on if I'm in mail it'll automatically format it as a professional email or if I'm in messages it'll just use a regular dictation so I can say whatever I want without it getting changed but again if you just want the basic dictation you can always get Mac whisper and try it but if you want the custom AI prompts that really you know give you the razzled Dazzle then you're going to have to pay for the pro version as a one time fee and then you can use the API which is not going to be that expensive once you get everything set up it's an exceptional bang for just make sure you select the cheap model like GPT 40 mini when you're adding your API key though if you need a little extra intelligence you can always upgrade to give it a little extra boost but before whipping out your wallet and clicking my affiliate link please and thank you let's talk about super whisper like whisper flow super whisper is built around dictation it has it at its core the difference however is if flow is like a plug andplay kind of gal then Super whisper is more of a tinker and tune kind of guy in other words if you like customizing your tools you are going to love super whisper it takes those AI prompts that we just talked about with Mac whisper and then gives them a healthy dose of protein and the things you can do are actually pretty mind-blowing but let's start with the basics let's hit the hot key to start recording and whoo whoo that is a that is a big black box don't let him intimidate you though just a little tickle right here in the corner we'll humble him right quick this is how I prefer to use it but we're going to actually open it back up again because this makes it a little easier to learn we get this cool little waveform to show that it's recording and we'll see down here we have voice selected this means it's acting like normal dictation you can talk and then watch your words fly on the screen when you're done part for the course but I can switch the mode as well which does the same thing as switching the AI prompted in Mac whisper for example if I select email right here my transcript is run through an AI prompt that formats it with proper salutations paragraphing and sign offs if I select Notes The Prompt is going to organize everything I said into bullet points with sections and headers and as you can see this is behaving a lot like mac whisper the difference however is Mac whisper only lets you feed your transcript through AI super whisper lets us include information from our clipboard our active application and our computer audio these inputs along with the right prompt open up a world of possibilities for you and the power is in your hands to create the modes that help you the most but for me with great power came great confusion on what I'm even supposed to do with it it does come with a handful of built-in modes but they felt a little basic to me and with all these inputs that we can add into AI in real time with our voice I wanted to see how far I can push it so I made these five modes that are actually changing my life they're in the description if you want to check them out first a better dictation this is the same prompt that I used with Mac whisper for really clean clean flow like dictation however there's one thing I was able to do here that Mac whisper cannot do and that is reference the text in the active application to spell names correctly this is basically the same functionality that whisper flow had when it got Caitlyn's name correctly in the email it accounts for stutters it'll add random bullet points or numbering it'll do punctuation emojis Etc on my M1 Mac it was about a half a second or 1 second slower than Flo but what Flo can't do is this ask AI mode so basically I'm able to turn on super whisper select this mode and now I can ask AI anything about about my active application or what's on my clipboard or just anything at all even if it's not related to the other two it's my personal favorite because I basically can just ask AI anything at any point on my computer about anything hey there future Tyler I realized that I probably just gave you a lot of information that was not very specific so I'm going to show you ask AI in super whisper with this example let's go ahead and plan a trip to Japan so we'll say Japan travels and let's say I activate the ask AI let me bring this over here two monitors so I have ask AI selected in super whisper I will start it and say I'm going to Japan and I want to visit three cities that are not popular but worth going and underrated for tourist travels I like the forest and nature in calm areas can you list three cities in Japan I should go to and then I will hit enter so basically I'm just you know asking a question and then it's literally going to Output the answer because the prompt is basically allowing AI to discern all right just answer the question but I can also do something like copy this right now and then start it again can you make a table of the Cities mentioned in the clipboard and creates a itinerary so a row is either going to be morning noon or evening and each column is going to be a city and then within the cells I want a a specific place within each City I can go to in that time frame see if it kiss this I butchered that I apologize everyone but here we go so basically it was able to uh I was able to copy this and based on the prompt again it knew uh to just reference the clipboard because I told it explicitly to check out the three cities in the clipboard so we have Morning Noon evening we have these two or these three cities perfect and it gave specific uh places I can go in each of these time zone so it's great and then another thing it can do is reference the active application so I'm not even going to copy anything and instead I'm just going to ask I have never been to these cities before in the active application can you let me know anything I should look out for for each of these cities as a traveler who does not speak Japanese and has never been to Japan before and as we can see it is literally just able to you know reference the the text that is in this notion document or if I was in a Word document or anything like that it could see it and yeah that's basically ask so you really are just able to just hit the function key or whatever your activation key is and just go to town tell AI what you want it to do and it'll basically do it number three is the meeting summarizer so you can select this mode within super whisper have it be handsfree so we just have it recording the whole time and then it will listen to any meeting on any app and at the end of it it will provide a summary next steps and any open or pending items for next time it can even detect different speakers which is pretty impressive next up we have the scripting mode so sometimes talking out loud is just is better than thinking inside your head or typing on a keyboard and for me this is especially true for scripting content because I want to feel relatable and I do that better when I'm talking instead of typing so with super whisper I can activate this mode and it will basically remove all of my mess up so I can just say the same line over and over again iron it out and then I basically tell the AI hey just ignore all of the previous takes only keep the last best one and I will output my entire script that I just said out loud and keep only the best takes this has saved me hours and finally we have rewrite clipboard if you ever have a hard time writing something you'll be able to copy it to the clipboard select this mode and then ask AI to change it however you want for example I could say make my clipboard more concise or I can get crazy and say create three succinct and friendly variations and then it will output all three in bullet form and I can use these outputs to inspire my writing hey future Tyler here surprise I actually made a sixth mode that I wanted to share with you guys because I found it too useful and a little bit unique from the other ones it is a journal mode so it basically allows me to just journal with my voice as like a train of thought as opposed to you know writing things down in a notebook which I'm you know it's not really for me but basically what I would do actually I'm going to include a little tip here if I hit command shift K whenever super whisper is open I'll be able to quickly change the mode that I'm in so I'll hit command 7 here to switch to the journal mode and now when I start super whisper it will be in journal mode and I will just talk out loud basically as if I'm talking to you know like a therapist and then what the AI will output is a full trans transcript of everything I said verbatim but I'll also get a therapist style summary that basically allows me to you know get a glance at what I was thinking that day it'll give me action items I'm actually going to dive into the example here it'll give action items that I implied or explicitly you know mentioned in the journal while talking and then finally it will ask two thought-provoking reflective questions and honestly I found this like incredibly helpful I think this is going to be a big thing that people use AI for in the future so if you journal or want to try journaling then definitely use this mode again Linked In the description pretty wild right like these are all very different kind of functionality but it all kind of has the same sort of features at their core and it's up to you or us as the users to create these which I think is really fun and amazing now with super whisper there are a ton of different models you can choose from both over the Internet or on your computer directly like whisper through open AI you get the standard one for free but the advanced ones that are going to be faster or just a little bit smarter are going to come with the pro version of super whisper which is a subscription and that subscription is going to cost you either $88.50 per month or if you pay by year $85 or you can actually get a lifetime subscription of whisper flow with unlimited use of cloud and local AI models for $250 I know that sounds like a lot but AI typically costs developers a lot of money to you know let their users or enable their users to use and the fact that you can get unlimited use of AI in an app for a onetime fee is quite rare so I could actually see this going away pretty soon soon the pro version also comes with the ability to add your clipboard record from system audio and identify speakers and one great part about super whisper is you can change the AI for each mode for example if it's something relatively simple then we can use 40 mini for my dictation prompt since that doesn't really take that much brain power but for the meeting summarizer we can use a more powerful model which is going to make the output even better Mac whisper at least right now does not currently support this you basically just choose one model which means one set of speed and intelligence for all of your prompts super whisper also takes the personal dictionary a step further you can add your words manually in the vocabulary section so it always gets spelling correctly but there's also the very clever text replacement section so instead of telling the AI exactly how to spell words this one will let you replace them with something more useful such as you know saying Instagram link to add your actual Instagram link or if you're constantly sending calendly invites then this would be really useful for something like that super whisper also has an IOS app though I'm going to be honest I didn't really test it too much because it was a little bit buggy but basically you should be able to create recordings and send those recordings through custom AI prompts just like you can on the computer now I give the soul Creator massive props for building something so powerful and so flexible I really think this is a forward-looking tool but the unfortunate truth is the user experience is a little bit confusing and it was the most confusing for me to figure out how to set up hopefully if you want to give this a shot the prompts I've provided will give you a good basis but if you want a full tutorial then let me know I would be happy to do that as well anyway which one's your favorite are you going to be using any of these and if so how I would love to hear and um yeah let me know any feedback in the comments below for next time I do a deep dive on apps like these peace