Guide to Building Effective AI Agents

I learned how to build AI agents for you. I have spent hundreds of hours building AI agents, and I actually run a program called Lonely Octopus, where we teach people AI skills and give them the opportunity to build AI agents for companies as well. So in this video, I'm going to attempt to distill everything I've learned into a comprehensive guide, with frameworks and a variety of tools for building any type of agent you want, whether you're someone who doesn't know how to code and wants to stick with no-code tools, or a seasoned software engineer looking to build your next AI startup. I'll also be walking through real examples of AI agents built using different tools. As per usual, there will be little assessments throughout this video to help you retain the information as we go. Now, without further ado, let's get started. A portion of this video is sponsored by HubSpot.

Here's the exact structure of the video. First, I'm going to introduce the crucial components that make up an AI agent: what they are, some of the tools for each category, and how to choose which tool to use for each category. Next, we're going to go into the nitty-gritty and talk about some of the common agentic workflows that people are using today. I'll also include a crash course on prompt engineering for agents specifically, because the prompt is literally the thing that's going to make or break your agent. I'll then walk you through full examples of AI agents implemented using both no-code tools and full code. But what is the use of building these AI agents if they don't actually serve a purpose? That's why I'll also cover how to figure out what kinds of AI agents, AI startups, or businesses you should be building, as well as specific, tech-enabled suggestions for what to build. The progress made in voice, video, and image agents has enabled so many cool use cases.

Agents are coming. Agents. Hi, you fellas.

Let's first
define what an AI agent is. An AI agent is a system that perceives its environment, processes information, and autonomously takes actions to achieve specific goals. From a more human perspective, we often think of an AI agent as the AI counterpart to a human role or a task that a human performs. That's why you often hear about AI agents in the context of coding agents like Cursor or Windsurf, which are AI-powered code editors with an agent mode that can autonomously perform coding tasks with a model such as Claude Sonnet 3.7 or Gemini 2.5 Pro. Another very common AI agent use case is the customer service chatbot. Many companies are now experimenting with customer service agents that can handle inquiries, communicate with the customer, file a complaint for them, or resolve specific issues.

Now, this is the definition and the experience of an AI agent, but when it comes to implementation, there are actually a lot of different ways to implement these agents, and there's a lot of nuance to it. I'll give you a little preview of what I mean. I'll go into a lot more detail when I cover the exact implementation of different agents, but for now I just want you to note that when we say "AI agent," we're not just talking about an AI sitting there doing its AI agent things by itself. It's oftentimes a bunch of sub-agents that do specific things and ultimately come together in a multi-agent system to form what we perceive as the complete agent. For example, a classic implementation of a customer service agent is often split into, first, a sub-agent that handles the customer queries: it interacts with the customer, figures out what the issue is, and then tags it to be passed along to a more specialized sub-agent. For example, my recent phone billing payment issue would be tagged as a billing and payments issue and passed along to another sub-agent that is specialized in
dealing with billing and payments. There would also be other sub-agents specialized in IT, sales, and the other things that customer service at phone companies does. By the way, this type of agentic workflow is called routing, and it has proven to be very effective for this type of problem. We'll go into more detail about routing and other types of agentic workflows in a bit, but I hope this gives you a little understanding of how agents actually work under the hood, which is very important to know as we build them.

Also, just to answer a question you might be thinking of: why do we have to have these multi-agent systems, these different types of implementations? The reason is actually quite intuitive if you think about agents the same way you think about humans in a company. Humans have different roles. You don't have just one human trying to do everything at the same time; that human would get very confused, not be able to prioritize what they're supposed to do, and not be very good at any specific thing. It's the same for agents. When we have different agents that are specialized in different things, the result of it all coming together is going to be far better than having a single AI agent try to do everything.

All right, so now I want to take a step back and give you a framework for understanding the components of AI agents. Say you're making a burger. A burger is made of different components: there's a bun, a patty, vegetables, and condiments. You could switch out the type of bun, the type of vegetables, the type of patty, and the type of condiments, but you do need to have all these components for your burger to function as a burger, as opposed to a weird sandwich or a hot dog. Same thing for agents: there are different components, and you can switch out each component for different things, but ultimately you need to have these components for it to be an agent. Now, unlike the components of a
burger, which have been long established, the components that make up an AI agent are still relatively new, so people have varying definitions. But the most comprehensive and well-defined one comes from OpenAI. As they explain, building agents involves assembling components across several domains, such as models, tools, knowledge and memory, audio and speech, guardrails, and orchestration, and OpenAI provides composable primitives for each. Obviously OpenAI is going to list its own things there first, but for each of these components there are actually a lot of other tools available out there as well. Depending on the type of agent you want to build, some are better than others, and I will go into more detail about each of these components.

But first, I just want to make a note: if you ever feel super overwhelmed because there's a new tool or new technology coming out every single day, do not panic. It's okay, because whatever this new innovation or tool is that is revolutionizing AI agents, it's still going to be part of this framework. It's like a new type of condiment in the condiments category that just happens to be a little bit more spicy or something. I hope that makes sense.

Anyway, let's now actually move on to each of these different components. OpenAI has this handy-dandy little table. First, you have the models component. These are your AI models, your large language models, which are the core intelligence capable of reasoning, making decisions, and processing different modalities. Of course, the examples that OpenAI gives us are o1, o3-mini, GPT-4.5, GPT-4o, etc. Depending on the specific type of agent you're building, you want to choose a different type of model. Within the OpenAI ecosystem, GPT-4o is your flagship model. It's a thinking model that's really great at reasoning, multi-step problem solving, and complex
decision-making, great at answering most questions. Now, if you want something that is more intensive (the trade-off is that it's going to be slower and more expensive), you have GPT-4.5, which is good for writing and exploring new ideas. You also have o3-mini, which has advanced reasoning capabilities but is also faster, and o3-mini-high, which is particularly good for coding and logic. Outside of the OpenAI ecosystem, Claude 3.7 Sonnet is usually the go-to model for people who do a lot of coding, reasoning, and STEM-based work, although Gemini 2.5 Pro is challenging this right now. But honestly, in a month, or whenever it is that you watch this video, the rankings have probably all shifted anyway. Overall, if you care the most about things being cheap, then you probably want to go with an open-source model and host it yourself, and if you care about things being fast, you want to go for smaller models. Most Google models, at least as of the time of this filming, also have longer context windows, if you care a lot about maintaining a large context window. There are also a lot of websites out there that rank these different models' performance, like Vellum, for example (I don't know if that's how you pronounce it). So depending on your use case, you can just check out the rankings and see which model suits your needs best.

Next up is the tools category. Do not underestimate the importance of tools. Your model is simply your base model; what really starts making models powerful is adding on different capabilities, like the ability to use tools. Tools allow agents to interface with the world, like being able to search the web, for example. All of the different applications that you see out there can potentially be turned into tools for the AI. You can give it access to Google products like your Gmail and your calendar, you can give it access to the things on your hard drive, you can give it access to what's happening on
your screen, and you can give it access to your favorite apps like Slack, Discord, YouTube, Salesforce, Zapier, whatever. You can also build your own custom tools and give them to the AI agent. If you use OpenAI's Agents SDK (you do need to know how to code to use this), it gives you the ability to define your own tools, as well as some built-in tools like web search, file search, and computer use. You may have also heard of something called MCP, which is all the rage these days. It was built by Anthropic, it stands for Model Context Protocol, and it's a protocol that standardizes the way you provide things like tools to your large language model. This is quite a leap forward, because previously it was quite difficult for developers to provide their agents with different tools: different software products configure their services in different ways, so as a developer you had to figure it out and piece it together. MCP has made that a lot easier.

Now, do not worry if you're not a code-y person. There are also a lot of no-code or low-code tools that have built in the ability to provide tools to your models. Some of the examples I'll show you later, like n8n, allow you to very easily drag and drop different tools and connect them to your large language models. For example, if you're trying to build a market research agent, it would need a tool to search the internet, a tool to analyze the data it gathers, and, if you wanted it to send an email report to you, a tool to access your email.

Now, moving on to knowledge and memory. There are two different types. The first is called the knowledge base, or static memory. This allows you to give your AI model static facts, policies, and documents: information that it can reference and access, and that remains relatively static over time. This is important if you're building something like an AI agent that does
legal tasks. It may need specific legal documents for a particular case or a particular company, and maybe certain policies that are relevant for that company as well. The other type of memory is persistent memory. This is memory that allows an AI agent to track conversation histories or user interactions beyond a single session. This is really important for a lot of chatbot use cases. Say you have an AI personal assistant: you want to make sure that it will still remember what happened yesterday. Again, OpenAI provides its own hosted services, like vector stores, file search, and embeddings. There are also open-source versions, where you can host your own databases, and you can also do RAG, which is retrieval-augmented generation, in different ways. I'm not going to go into too much detail about this, but some solutions that people look into are Pinecone, which is cloud-native and optimized for vector search, and Weaviate, which is open source. Again, if you're leaning more towards a no-code solution, you don't really have to worry about this; it's usually already taken care of by that solution. n8n, for example, already allows you to deal with this without having to figure out all the complex code-y stuff.

Next up is audio and speech. It's pretty interesting that OpenAI splits this into its own separate category, while many other frameworks don't include it specifically. I think the reason they do this is that there have been so many innovations recently in audio. Basically, giving your agent audio and speech abilities allows it to interact using natural language. This is really important for chatbot AI agents, because being able to communicate directly using natural language can be a much better user experience. Within the OpenAI ecosystem, they have their own ways of implementing this, while outside of that ecosystem, what
people seem to use a lot, at least right now, is ElevenLabs, which is used for voice cloning and generation. Oh, and for audio transcription (audio to text), people do stick with Whisper, which is an OpenAI model. That's as of right now. Like I said, these things change a lot; it's more important for you to understand the general category, the general component, than the specific tools within it.

The next component is guardrails. Guardrails are really important for preventing irrelevant, harmful, or undesirable behavior. Once you create this agent, you've got to make sure that it's actually doing what it's supposed to be doing and not something else. If you have a customer service agent, you want to make sure that it is in fact talking about customer service stuff and not giving you haikus or something like that. Outside of the OpenAI ecosystem, what's popular right now is Guardrails AI and LangChain guardrails. There are honestly a lot of different options in this category. Again, if you are using no-code tools, I think it's important for you to understand this component, but a lot of no-code tools already have solutions built into their platform.

Finally, there is orchestration, and this is something that's super overlooked. Remember how we were talking about different sub-agents? Orchestration is how you chain together different sub-agents in order to come up with a final result. It also involves deploying the agent, so it's able to do its thing in production, then monitoring and improving it. Once you deploy the agent, you don't just run away and never look at it again, right? Over time the models keep changing, a lot of these technologies change as well, and the data keeps changing, so you need to keep monitoring and making sure that your agent is behaving the way it's supposed to. There are also a lot of different tools in this category. Oftentimes there's a framework, and then
the orchestration part is built into that framework. OpenAI has its own system. There's also CrewAI, which is another framework for implementing multi-agent systems; it also has its own system for orchestrating and finally deploying them. LangChain is also very popular for managing different agent interactions and deploying them, as is LlamaIndex, which is particularly useful if you're creating an AI agent that has a lot to do with documents, static memory, and knowledge bases.

Here is also a little mnemonic to help you remember the different components that make up an AI agent, which is going to be immediately useful, because right now we are going to do our first little assessment. I'm going to put some questions on screen. Comment your answers below to make sure that you retain all the information we went through.

Okay, so this is a very practical guide to building AI agents. HubSpot offers a very practical free guide to building AI agents from a business perspective. I think this free resource is a really great complement to everything we're covering today, because it goes in depth on how to take these AI agents and make sure they're driving maximum business success. The playbook explains how AI agents are being used in businesses today, with actual examples, use cases, and common pitfalls, and it also discusses the future of work. It includes a checklist that helps your organization think through each phase of implementing AI agents, from identifying the highest return-on-investment opportunities to defining success metrics, as well as integration and scaling. I highly recommend that you check it out at this link over here, also linked in the description. Thank you so much, HubSpot, for creating these free practical resources and for sponsoring this portion of the video. Now, back to the video.

All right, now that we know the components that make up AI agents, let's move on to the implementations. If you remember what I said a bit earlier, AI
agents are oftentimes not just a singular entity. They're actually broken down into different sub-agents that interact with each other. My favorite resource that covers these common agentic workflows and agent systems is the "Building Effective Agents" guide from Anthropic, so let's go through it.

First of all, you have the basic building block of agentic systems. This is what Anthropic calls the augmented LLM. From this image, you can see that you have an input, you have the LLM, and you have the output, and the LLM is able to generate its own search queries, select appropriate tools, and determine what information to retain through memory. If you were paying attention earlier, you'll see that there are overlaps between the components in this augmented LLM and OpenAI's components. This version is a little more bare-bones: it doesn't address things like guardrails or orchestration. But you can see that there is definitely overlap, and that's okay. When it comes to things like testing and deployment, just remember the OpenAI components. Just FYI, in terms of terminology, these augmented LLM building blocks are often called sub-agents as well. So now let's see how these building blocks, these sub-agents, fit together and work with each other to form your bigger AI agent. We're going to start with the simplest agentic workflows and go all the way to the more complex and the truly autonomous.

All right, so the simplest common agentic workflow is called prompt chaining. Prompt chaining decomposes a task into a sequence of steps, where each sub-agent processes the output of the previous one. In its simplest form, this is just like an assembly line, but you can also add little gates where you split off into different paths. The logic is the same: you have an input, a sub-agent does something with that input and passes it along to another sub-agent, who does something else, and maybe to another one, and so on, until you finally get an output. This kind of
implementation is ideal for situations where the task can be easily broken down into subtasks. An example of when prompt chaining could be useful is if you want your AI agent to generate a report. The input could be a description of what the user wants. The first sub-agent takes that and generates an outline, then passes it along to another sub-agent, which checks the outline against specific criteria and passes it along to a writer sub-agent, which actually writes the report, and then maybe to an editor sub-agent, which edits the report. The final output would be a report that follows the specified criteria.

Routing is another type of workflow, where you have an input coming in and a sub-agent dedicated to directing that input to a specific follow-up task. Each of these tasks is handled by a sub-agent that is specific to that task, and then finally you get the output after the processing. Routing works really well for complex tasks where there are distinct categories that are better handled separately. A classic example of when routing is useful is a customer service bot that gets different types of queries: general questions, refund requests, technical support, whatever it is that people ask customer service. Based on the nature of the query, the first sub-agent should be able to route the task to the sub-agent that is specialized for it. If it's a refund request, it would be routed to the specialist sub-agent for refunds; if it's a technical support query, it would be routed to the sub-agent that specializes in handling technical support questions. Another common use case is routing different questions to different types of models, since some models are better at certain things than others. If it's a difficult STEM-related question, you might be routing it to
Claude Sonnet 3.7, or if it's an easy question where you value speed, you might be routing it to Gemini Flash.

The next workflow is parallelization (oh god, I can't pronounce this). This is when you have sub-agents working simultaneously on a task and then have all of their outputs aggregated together. This specific agentic workflow usually has two key variations. The first is sectioning, which is breaking a task into independent subtasks that are run in parallel. The second is voting, which is running the same task multiple times using different sub-agents to get diverse outputs that you aggregate together. An example of sectioning is if you're trying to evaluate how well a new model performs for a given prompt: each sub-agent could evaluate a different aspect of the model's performance, so one of them could evaluate speed, another accuracy, and so on. An example of voting is reviewing a piece of code for vulnerabilities: you have different sub-agents evaluating the code, and ultimately you aggregate their outputs into a vote to decide whether this is in fact a vulnerability or not.

The next workflow, and we're getting increasingly complex, is orchestrator-workers. The orchestrator-workers workflow actually looks pretty similar to parallelization, but what's different is that you don't have a predetermined list of subtasks. Instead, an orchestrator sub-agent breaks the task down as it goes and hands the pieces to worker sub-agents. So this is especially useful for more complex problems where you can't predict exactly which subtasks are going to be needed. For example, if you're building agents that involve coding, oftentimes you don't know the exact number of files that need to be changed or the exact nature of each change, so you need to be dynamically making changes to multiple different files. Another example is search tasks. If you have a research assistant agent, this would involve gathering and analyzing a lot of different types of information from a lot of different
sources, which cannot be predetermined ahead of time.

Even more complex is the evaluator-optimizer workflow. This is approaching more autonomous situations, where you're giving the AI agent a lot more autonomy and freedom in determining what it should be doing. You have some sort of input, and the first sub-agent generates a solution based on that and passes it along to an evaluator sub-agent. The evaluator sub-agent evaluates it. If it's accepted, that will be the output; if the evaluator feels it's not good enough, it sends it back to the first sub-agent, telling it that it's rejected, along with some feedback for improvement. This is a circular loop that you keep running until the evaluator sub-agent thinks the solution is good enough and passes it along as the output. This workflow is particularly useful when there are clear evaluation criteria and when you can see iterative refinement and improvement over time. An example where the evaluator-optimizer workflow is useful is literary translation. There may be nuances that the translator sub-agent cannot capture the first time around, so the evaluator sub-agent sends it feedback and tells it to keep going until it's able to capture all the nuances in the language. Another example is a complex search task that you're trying to aggregate into some form of final report. You might be doing research, and the evaluator sub-agent would say it's not deep enough, keep going, until you've gathered all the information necessary to fully cover your super complex report.

And finally, we have the truly autonomous agent implementation. This one is tricky, because it is actually the simplest implementation-wise, but it can result in very different types of, and potentially very complex, solutions. The agent
will begin its work with some form of human interaction, and once the task is clear, the agent will be completely independent. It will perform some sort of action or actions that produce some form of reaction from the environment, and the agent has to somehow figure out for itself, from the environment, what the result of its actions is. For example, if it decides to use a tool or execute some code, it needs to figure out for itself whether it's making progress towards the ultimate completion of the task or not. It's going to keep doing that, getting feedback from the environment and judging how it's progressing, until ultimately it feels like it has completed the task it was assigned.

This very autonomous, freedom-giving type of agent implementation is usually used for very open-ended problems, where it's very difficult to predict the number of steps it should take or the exact path to the final result. You're basically just telling an agent, "Hey, do this thing," and it has to figure out for itself how to do the thing: what tasks are involved, whether or not it's making progress towards the thing, and then at some point deciding that it has in fact completed the thing and coming back to you. You can get really crazy good results from this, but oftentimes you also get some really crazy stuff in general that can come from this. Some examples from the Anthropic article include a coding agent that's able to resolve SWE-bench (software engineering benchmark) tasks, which involve editing a lot of different files based on a task description, and their computer use implementation, where Claude was able to use a computer, with access to all of the different functionality of this very complex machine, to accomplish specific tasks. Here's a diagram that illustrates the path that a coding AI agent took in order to complete its task. You can see that there are a lot of different back-and-forth
interactions with the environment, coming back and refining and everything like that, before going back to a human. As the article suggests, this kind of truly autonomous implementation is not something you generally want to do, because in most situations you can go with a more predetermined agentic workflow, which would yield more predictable results and be a lot cheaper. The article keeps saying that you should always go with the simplest implementation possible. If you can achieve your AI agent's goals through prompt chaining or routing, don't do things that are more complex. It's just a general rule of thumb when you're building AI agents, and actually when engineering and building things in general: don't overengineer.

Okay, so we've covered all the different workflows. Now I want to do a quick little crash course on prompt engineering for AI agents from a practical perspective. For these AI agents, the prompts matter so much. The prompt is really what holds everything together. You can have your agent with all these tools and access to all these really cool things, but if you don't have a good prompt, you're not able to pull it all together. That's why I'm going to emphasize this part. When you're prompting an AI agent, you need to have the full prompt, all of it, just there; you can't interactively correct it and add more information throughout the process. So there are six components that you should consider putting into your AI agent prompt.

The first thing to specify is the role. This is where you tell it that it's an AI research assistant, but you also want to include things like the tone and how it should behave. For example, you could write, "You are an AI research assistant tasked with summarizing the latest news in artificial intelligence. Your style is succinct, direct, and focused on essential information." Next up is the task, and you can write: given a search term related to AI news,
produce a concise summary of the key points. Then we have the input. This is where you specify what the AI research assistant will be receiving. In this case, you can just write that the input is a specified AI-related search term provided by the user. But you can imagine that there could be other inputs, like certain graphs and different documents; you want to let the AI assistant know exactly what it will be receiving.

Fourth is the output. This is where you go into detail about what you want the AI research assistant to come up with. What is it supposed to ultimately look like? What's the final deliverable? In this case, you can write, "Provide only a succinct, information-dense summary capturing the essence of recent AI-related news relevant to the search term. The summary must be concise, approximately two to three short paragraphs, totaling no more than 300 words." Now it knows exactly what it's supposed to output.

The fifth step of the framework is constraints. This is a really, really important part to include in your prompt: not just what the agent is supposed to do, but also what it should not be doing. You could write, "Focus on capturing the main points succinctly. Complete sentences and perfect grammar are not necessary. Ignore fluff, background information, and commentary. Do not include your own analysis or opinions." We don't care about the AI agent's opinions; we only want to focus on the facts.

Finally, you have capabilities and reminders. This is where you tell the AI what it has access to, like certain tools, as well as provide reminders for things that it should really keep top of mind. In this example, we gave the AI agent the ability to do web search, so we can tell it, "You have access to the web search tool to find and retrieve recent news articles relevant to the search term." Also, we want to remind it
that it needs to be very aware of the current date. A common issue with a lot of LLMs is that they're not really aware of what date or time it currently is. Since we're only interested in searching for things that are relevant right now, we want to make sure the agent is aware of what time it is and what the search window is. So we might write, "You must be deeply aware of the current date to ensure the relevance of news, summarizing only information published within the past seven days." A general tip is that the more important something is, the lower down in the prompt you want to put the reminder. It's just the way the AI processes that information: it has a bias towards the most recent things.

That was the crash course on AI agent prompt engineering. I hope you're not mad at me for making you actually learn the foundations first, because I do find that a lot of people who are vibe coding these days don't actually know the foundations. You end up building something and it's just not that great, or, if it's something that you want to tweak slightly, you end up making a lot of stupid mistakes because you don't understand the foundations. So now you're equipped with the information to actually go build something, with the knowledge and the confidence that it is in fact the best implementation. Here's a little quiz that I will now put on screen. Please answer these questions in the comment section to make sure that you're retaining all of the information I'm presenting.

In the next section, I'm going to show you actual implementations of AI agents. I have included some no-code/low-code examples as well as fully coded examples, so there should be something for everybody.

This is a customer support AI agent, and we implemented it using n8n. n8n is a no-code/low-code platform that is super easy to use, and that you
can use it to create different AI agents. In this case, we implemented this AI agent using a multi-agent system that follows the routing agentic pattern, which we talked about earlier. The way it works is that a customer sends an email inquiry, and then we have a text classifier, powered in this case by an OpenAI model, that routes the inquiry as technical support, billing, or general inquiry. Each of these has its own specific workflow after that.

Let's see how it actually works. Let's go to my email over here, and I'm going to write an email to customer support. In this case it's going to go to [email protected]. I'm going to say "refund" because I am angry: "Hello, I want a refund." Click send. You can see that the email is here, and it classifies it as a billing situation. We have the AI agent, and the AI agent is able to use the email tool to respond to the inquiry. If we check our email again, we see the agent has responded to us: "Hello, thank you for reaching out regarding a request for a refund. To assist you effectively..." blah blah blah. It asks for all this information, and then you can go ahead and send that information to the agent for it to process your refund.

If it's classified as technical support, it also has this workflow. If it decides that it can answer your technical support question directly using documentation, it can email you back the response directly. But here we also have an option where, if it can't figure out how to support you from a technical support perspective, it escalates and sends a message on Discord like this: "Hello team, customer needs help. Please investigate further. The email ID is..." this ID over here. So a real human agent would be able to jump in and start helping the customer. It's really important to have this here, because you always want to have some way in your AI agent to escalate to an actual human. And of course, if it's a general inquiry, it would route to this branch over here, and then it
would send a general email asking for additional information.

This is another AI agent. It is an AI news aggregator agent. The way it works is that it's scheduled for 7:00 a.m. every day, and it gathers news from different newsletters as well as Reddit. Then it aggregates all of that information and ultimately comes up with a summary that it sends to me on WhatsApp. This is an example of a parallelization workflow pattern. It's not 7:00 a.m., but I'm just going to trigger the workflow right now and have it do its thing. It's going to be running everything over here. I want to make a note that even though it is a parallelization workflow, a limitation of n8n is that it actually still runs sequentially. If you implement this using a coded tool like OpenAI's Agents SDK, for example, which I will show you an example of in just a little bit, it would actually run in parallel. So just to let you know: technically the design is parallelization, but it isn't able to run that way because of the platform's limitations.

Okay, so after running, it sends me a notification on WhatsApp where it gives me aggregated information from all of the different news sources: "OpenAI launches GPT-5 alpha", AI ethics, Google's AI ethics, regulatory developments, blah blah blah, all these different things that are happening. In the prompt I specified that it should cite its sources, so if I wanted to go in and learn more about each of these news reports, I could just click in and look at the actual source itself. This is a really helpful AI agent to have, because in this prompt over here you can see that I can specify exactly what it is I'm interested in, like "AI-related search term provided by the newsletter", for example: whatever it is I want, how I want it to be summarized, how I want everything to be aggregated together. So it's a
really handy little tool. I think it would be really useful for you as well if you are someone who has to go through a lot of information every day.

Final n8n example: this is a multi-input daily expenses tracker AI agent. That is such a mouthful. The way it works is that you interact with it using WhatsApp. You can send it pictures of receipts for whatever it is that you've spent money on, and you can send it text as well: if you spent $10, you can just tell it that you spent $10. It takes all of that information and aggregates everything to give you a final expense report every single day. It also stores it in memory on Google Sheets. And finally, at 9:00 p.m. every single day, it sends you a summary on WhatsApp of how much money you've spent.

For example, here I said that I spent $10 on a potato. I don't know why; $10 on a potato is very expensive. Then it puts this on my expense tracker, so: potato over here, $10. Here are all the other things that I've bought; you can see that I've bought a lot of things these days. And at night it tells me that my consumption "has focused on living expenses, specifically with the purchase of potatoes totaling $10. This indicates a straightforward and essential spending pattern with no other itemized purchase recorded for the day." On some of my previous days, when I bought more than just the one potato, it says here, like on April 7, 2025, "the spending showed a significant emphasis on food, with large purchases like steak and chocolate totaling $4,000, making food the most dominant category. Minor living expenses were also recorded with the purchase of peanuts." Okay, that is not exactly correct, as you can see. Maybe we still need to modify this prompt a little bit. But yeah, this is an example of how you can track your expenses. Based upon my explanation of how this works, what agentic workflow design pattern do you think the multi-input daily expenses tracker AI agent is implemented with? Put that in the comments.

I wanted to show you an example that is implemented using code now. Specifically, this was implemented using OpenAI's Agents SDK in Python, and it is a financial research assistant that takes in an inquiry, searches the internet, gathers information about it, and aggregates it. It also has voice functionality and translation functionality. This follows the routing agentic design workflow pattern, where we have a main manager. Instead of just showing you the code to explain this, I'm actually going to use Cursor to show you how the AI agent works and also to run it. It's a little preview of my vibe coding video that's going to be happening maybe in two weeks, so stay tuned for that.

Okay, so I'm going to say, "Could you please explain the way that the financial research assistant agent works?" So we have a main orchestrator, which is the financial research manager, and the core workflow steps are that it plans searches, performs searches, writes the report, and verifies the report. The way it does this is that after the manager kicks off the program, it passes the query on to a planner agent, which breaks down the user's query into specific search terms. Each search term contains a query and a reason for searching, and the planner returns a financial search plan with multiple search items. It then passes the search terms to a search agent, which performs each of these searches and collects and aggregates all the search results. Then we go on to the analysis phase, which uses specialized agents for different aspects. We have two agents over here: the financials agent, which analyzes key financial metrics, and the risk agent, which identifies potential red flags, and both
agents will return analysis summaries. They then pass all of these analysis summaries to the report writing phase, where a writer agent synthesizes all that information: it combines the search terms together with the financial and risk analysis and generates a structured report in Markdown, with a short summary and follow-up questions. Then we have a verifier agent, which checks the report's accuracy and completeness. We also included voice interaction functionality, so you're able to ask it questions about the generated report using audio. And finally you get your output, the results for your financial report. You can see that it's implemented based upon the prompt chaining agentic workflow, where the main orchestrator manager kicks off the query and passes it along to the planner agent, the search agent, and the other agents, until finally you get a financial report.txt with all of the results contained within it.

Let's actually run this now. "Let's run the financial research agent." Whatever, I can't spell, it's fine. By the way, if you've never seen an AI coding editor at work before, this is kind of what it's like. Honestly, since I started using Cursor, Windsurf, and AI coding agents in general, it has been a huge game changer for how people code and run code. All right, we will let it do its thing. It says, "I'll help you run the financial research agent. First, let me check the workspace to ensure we have everything we need," blah blah blah. So let it do that. Okay, it's telling me that I need to install some things, so we'll just do that. Installing dependencies, running into errors, okay, run more things.

Five minutes later: okay, after installing all of these dependencies, it says that we have the server running. Let's now run the financial research agent. I'm just going to write, "What are the key financial metrics for Tesla?" So we're
going to run this. Oh no, it didn't work. Honestly, a lot of vibe coding is just running things and then letting it install stuff and fix its own problems, so we're going to patiently wait for it to work. Okay, it says "Enter a financial research query." Enter. Oh, looks like we don't have an OpenAI key. Let me put that in. "The key metrics for Tesla." It is starting the financial research and doing its thing: "Will perform seven searches," searching, planning report structure, and there we go, it looks like we have the report. All right, the financial agent has successfully generated a comprehensive report, and we can find that report over here.

Instead of having to read through everything, I'm going to use the voice functionality that has been implemented. So, run the voice functionality: "Tell me about the key metrics in the report." "Sure, here are the key financial metrics mentioned in the report. One, revenues: Tesla recorded revenues of $24.93 billion. The substantial revenue figure is largely attributed to the successful sales of their Model 3 and Model Y vehicles, as well as strategic expansions in Berlin and Texas factories. Two..." So you can communicate directly using voice. And finally, I want to show you guys how you can translate your report into Spanish. This uses MCP, which gives it access to a tool that can translate the report into Spanish, which it did over here. So this is an example of a coded implementation, and if you want to check out the code, I'll link it in the description so you can play around with it yourself too. Remember that there are a lot of different ways to implement an AI agent. Choose what makes the most sense for the AI agent that you're building as well as your own skill level.

By the way, if you are interested in learning more about AI agents and how to build them, I wanted to let you know that I'll be launching an AI agents boot camp in the next few weeks. It's a four-week-long
program that is really hands-on, where you're going to be building your own AI agents like the ones that you see in this video, as well as ones that are more advanced and more customized towards specific use cases. So if you're interested, please do check out the link over here, also linked in the description.

Instead of just ending the video right now and being like, "All right guys, go build your AI agents," I want to include this final section where I share with you how you should be thinking about what kind of AI agents you want to build in the first place. Because ultimately, we're trying to build AI agents not just for funsies, hopefully. Or maybe it is, I don't know, and that's fine. But for a lot of us, we're trying to build AI agents so that they can be useful for us, useful for a business, useful for an enterprise, whatever. Maybe some of you guys also want to start your own AI agent businesses or startups. By the way, if you haven't already, please do check out the Y Combinator YouTube channel. I have learned so much from it in terms of figuring out what kind of AI agents to build, what kind of startups to be doing, and what kind of things to be aware of while playing around in the AI space. Their videos are really, really worth watching. But I'm going to share with you the major insight that I got from watching this video, which is how to find your AI startup ideas.

The easiest way of figuring out a useful AI agent to build is by starting with yourself first. What is it that you're currently doing that, if you were to offload it to an AI agent, would make your life so much easier? Again, don't worry about tools and frameworks and tech stacks right now. Just think about what would make your life so much easier. For example, I work with a very lovely team and agency that takes care of the sponsorships that I do, and one of the people on the team actually messaged me on Slack saying that she
wanted to build an AI agent that can access her emails, screen what are considered good leads versus bad leads, and only respond to emails that are considered good leads. I thought that this was a great idea, and I was like, "Yes, you should totally do this." And you can totally do this with no code using n8n as well. You can use the prompt I shared earlier to figure out which agentic workflow is the most applicable in this specific situation, and then you can go build it using a no-code tool.

But what if you're someone who is not currently working and solving problems every day? Maybe you have just graduated, or you're still a student. Don't worry, YC also has really great advice for this. In this case, what you want to do is go undercover. Seeing as you yourself don't have the experience to understand what can be automated, instead of just thinking of something in your head, the best approach is to go and meet up with someone who is in fact working, like someone who either owns their own business or has a job. Just ask if you can shadow them, and try to figure out their problems. The thing is, oftentimes they might not even know their own problems, because they're so deeply entrenched in whatever they're doing on a day-to-day basis that they don't realize there could be ways of doing things that are so much easier and so much better if they incorporated AI into their workflow. But you're coming in with a fresh pair of eyes. So look at what they're doing and try to identify where you can build an AI agent to offload some of their tasks and automate some of what they're doing, so that they're able to accomplish their goals even better. Once you start doing that and developing that, you oftentimes start to realize that whatever issue you had, or somebody else had, is something that many, many people have. And there you go, that's how you can start working on
something that could eventually turn into a business or a startup as well. And finally, if you just want some high-level guidance, the absolute gold that I got from one of the YC videos is that for every SaaS (software as a service) company that you see out there, there will be an AI agent equivalent of it. Literally, for every company that is a SaaS unicorn, you could imagine there's a vertical AI unicorn equivalent. So there you go, that is such clear overarching guidance: look at all the SaaS companies that exist right now, think about what the AI agent equivalent of that company is, and create that.

Finally, I want to talk about the specific tech-enabled innovations that you can be working on right now. As always, the AI industry is moving so quickly, and there are so many new technologies being developed every day. But the major fundamental development that we can see right now in 2025 is the huge leap forward in voice and audio. Audio generation is just freaking unreal right now. Here's a little excerpt from Sesame to show you what I mean. This is actually from a friend who showed me this, and I was just freaking mind-blown: "...hobbies to meet people? Well, joining a club or online community can be really fun, especially if you're into gaming or crafting. Volunteering is also a great way to connect with awesome people who care about the same things as you. And hey, if you're watching this, don't forget to subscribe to Tina's channel for more awesome tips." This is also why OpenAI's own SDK has a whole category dedicated to voice agents, because there are just so many use cases enabled by that. There are also massive developments in image models, like Rev, Gemini Flash image generation, and GPT-4o image generation, and there are also video models like Sora. So anything related to image and video is also ripe for disruption. Now, ending this
video with a final general piece of advice: there's always so much stuff happening in this industry. If you ever feel overwhelmed by what is happening, try to relax, calm down, and think back to the frameworks and components that I presented today. There's a reason why I created this video the way I did, where I'm not just showing you tutorials and telling you about the new things and new agents that people are building. It's because, with all of this going on, if you just focus on understanding the fundamental components, the fundamental frameworks, and the fundamental technologies, you're able to categorize everything that comes on top of that as either important for you to learn about or not important. So keep up with the actual big innovations in this category, things like model innovations (Gemini 2.5 Pro recently came out, for example) and MCP, which enables better tool use. A lot of the other stuff you don't really need to pay so much attention to; that's hype. Keep learning, keep doing your own projects, build out your own AI agents, and when the time comes, when the opportunity comes where your skill set and your interests align with what is in demand in the world, you'll be off building a successful AI agent business or startup, or just a side hustle or fun project. Be patient, my friend.

All right, as promised, here is the final little assessment. Please write your answers in the comments. Thank you so much for watching to the end of this very long, very intensive video. I really hope that it has been helpful, and I will see you guys in the next video or live stream.
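
For anyone who wants to see the routing pattern from the customer support example in code, here is a minimal sketch in Python. It is only an illustration under stated assumptions, not the n8n workflow from the video: the keyword-based `classify` function stands in for the OpenAI-powered text classifier, the `DOCS` lookup stands in for the documentation search, and every name in it (`Reply`, `DOCS`, `handle_billing`, `handle_technical`, `handle_general`, `route`) is hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Reply:
    route: str               # which branch handled the email
    body: str                # text that would be emailed back
    escalated: bool = False  # True when a human should take over


def classify(email_text: str) -> str:
    """Stand-in for the LLM text classifier (assumption: simple keyword rules)."""
    text = email_text.lower()
    if any(word in text for word in ("refund", "invoice", "charge", "billing")):
        return "billing"
    if any(word in text for word in ("error", "bug", "crash", "not working")):
        return "technical_support"
    return "general_inquiry"


# Hypothetical documentation snippets the technical branch can answer from.
DOCS = {"reset password": "Use the 'Forgot password' link on the sign-in page."}


def handle_billing(email_text: str) -> Reply:
    return Reply("billing", "Thank you for reaching out about your refund. "
                            "Please reply with your order details.")


def handle_technical(email_text: str) -> Reply:
    text = email_text.lower()
    for topic, answer in DOCS.items():
        if topic in text:
            return Reply("technical_support", answer)
    # No documentation match: hand over to a human, as the Discord step does.
    return Reply("technical_support",
                 "A team member will follow up with you shortly.",
                 escalated=True)


def handle_general(email_text: str) -> Reply:
    return Reply("general_inquiry", "Could you share a few more details?")


HANDLERS = {
    "billing": handle_billing,
    "technical_support": handle_technical,
    "general_inquiry": handle_general,
}


def route(email_text: str) -> Reply:
    """Classify the email, then dispatch it to the matching specialized handler."""
    return HANDLERS[classify(email_text)](email_text)
```

Calling `route("Hello, I want a refund")` goes down the billing branch, while a technical question with no matching documentation comes back with `escalated` set, which is the point where a real system would notify a human.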

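The difference between a parallelization workflow that truly runs in parallel and one that runs its branches one after another, as mentioned for the news aggregator, can be sketched with Python's standard `asyncio` module. This is a rough sketch under stated assumptions rather than the actual agent: `fetch_source` only sleeps to simulate a slow network call, and the source names, delays, and helper names (`SOURCES`, `run_sequential`, `run_parallel`, `timed`, `aggregate`) are all made up for illustration.

```python
import asyncio
import time

# Hypothetical news sources mapped to a simulated latency in seconds.
SOURCES = {"newsletter_a": 0.2, "newsletter_b": 0.2, "reddit": 0.2}


async def fetch_source(name: str, delay: float) -> str:
    """Pretend to fetch one source; the sleep stands in for network time."""
    await asyncio.sleep(delay)
    return f"{name}: headline"


async def run_sequential() -> list:
    # Each fetch waits for the previous one, so total time is the sum of delays.
    return [await fetch_source(name, delay) for name, delay in SOURCES.items()]


async def run_parallel() -> list:
    # asyncio.gather starts all fetches together and returns results in input order,
    # so total time is roughly the slowest single fetch.
    tasks = (fetch_source(name, delay) for name, delay in SOURCES.items())
    return list(await asyncio.gather(*tasks))


def timed(coroutine_function):
    """Run an async function to completion and report how long it took."""
    start = time.perf_counter()
    result = asyncio.run(coroutine_function())
    return result, time.perf_counter() - start


def aggregate(items: list) -> str:
    """Join the gathered items into one summary, one bullet per source."""
    return "\n".join(f"- {item}" for item in items)
```

Running `timed(run_sequential)` and `timed(run_parallel)` returns the same items, but the parallel version should finish in noticeably less time, which is the behavior you get back when the branches are not forced to run one by one.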
i learned how to build AI agents for you i have spent hundreds of hours building AI agents and I actually run a program called Lonely Octopus where we teach people AI skills and give them the opportunity to build AI agents for companies as well so in this video I'm going to attempt to distill down everything that I've learned to give you that comprehensive guide with frameworks and a variety of different tools to build any type of agent you want whether you're someone who doesn't know how to code and wants to stick with no code tools or if you're a seasoned software engineer looking to build your next AI startup I'll also be walking through real examples of AI agents built using different tools as per usual there'll be little assessments throughout this video to help you retain the information as we go through things now without further ado let's get started a portion of this video is sponsored by HubSpot here's the exact structure of the video first I'm going to introduce the crucial components that make up an AI agent talk about what they are some of the tools for each category and how to choose which tool you want to be using for each category next we're going to go into the nitty-gritty and talk about some of the common agentic workflows that people are using today i'll also be including a crash course on prompt engineering for agents specifically because the prompt is literally the thing that's going to make or break your agent i'll then walk you through full examples of AI agents implemented using both no code tools as well as full code but what is the use of building these AI agents if they don't actually serve a purpose that's why I'll also be covering how to figure out what kinds of AI agents what kind of AI startups or businesses that you should be building as well as tech enabled specific suggestions for what to build the progress made in voice video and image agents has enabled so many cool use cases agents are coming agents hi you fellas let's first 
define what is an AI agent an AI agent is a system that perceives its environment processes information and autonomously takes actions to achieve specific goals now from a more human perspective often times we tend to think about AI agents as an AI counterpart to a human role or a task that a human performs that's why you often hear about AI agents in the context of a coding AI agent like cursor or windsurf which are AI powered code editors that have a agent mode that can autonomously perform coding tasks either with claude sonnet 3.7 or Gemini 2.5 pro another very common AI agent use case are customer service chat bots many companies are now experimenting with customer service agents that are able to do things like handle inquiries communicate with the customer file a complaint for them or to resolve specific issues now this is the definition and the experience of an AI agent but when it comes to implementation there's actually a lot of different ways to implement these agents and there's a lot of nuance to it i'll give you a little preview about what I mean by this now I'll be going into a lot more detail about this when I'm going to be covering the exact implementation of different agents but for now I just want you to note that when we say like AI agent we're not just talking about you know an AI just sitting there doing its AI agent things by itself it's oftentimes a bunch of sub aents that do specific things and ultimately come together in multi- aent systems to form what we perceive as the actual like complete agent for example a classic implementation of a customer service agent is oftentimes split into first a sub aent that handles the customer queries like interacts with the customer figures out what an issue is and then tags it to be passed along to a more specialized sub aent like for example my recent phone billing payment issue this would be tagged as a billing and payments issue and passed along to another sub agent that will be specialized in 
dealing with billing and payments there would also be other sub aents specialized in IT and sales and other things that customer service in phone companies do by the way this type of agentic workflow is called routing and it's proven to be very very effective at this type of problem anyways we'll go into more detail about routing and other types of agentic workflows in a bit but yes I hope this gives you a little bit of understanding about how agents actually work under the hood which is very important to know as we build them also just to answer this question which you might be thinking of why is it that we have to have these multi- aent systems these different types of implementations and the reason for this is actually quite intuitive if you think about agents the same way that you think about humans in a company humans have different roles you don't have just one human that is trying to do everything at the same time that human will get very confused and not be able to prioritize what they're supposed to do and not be very good at any specific thing and it's the same for agents when we have different agents that are specializ in different things the results of it all coming together is going to be far better than just having a single AI agent try to do everything all right so now I want to take a step back and give you a framework for understanding the components of AI agents sort of like say if you're making a burger a burger is made out of different components there's a bun a patty vegetables and condiments you could switch out the type of bun the type of vegetables the type of patty and the type of condiments but you do need to have all these components for your burger to function as a burger as opposed to a weird sandwich or a hot dog same thing for agents there are different components and you can switch out the different components for different things but ultimately you need to have these components for it to be an agent now unlike the components of a 
burger that have been long established the components that make up an AI agent is still relatively new so people kind of have like varying different definitions but the most comprehensive and well- definfined one comes from OpenAI as they explain building agents involves assembling components across several domains such as models tools knowledge and memory audio and speech guardrails and orchestration and OpenAI provides composible primitives for each yeah you know obviously OpenAI is going to list its own things there first but for each of these components there are actually a lot of other tools that are available out there as well depending on the type of agent that you want to build some are better than others and I will go into more detail about each of these components but first I just want to make a note that if you ever feel super overwhelmed because there's just like a new tool or new technology that's coming out like every single day do not panic don't feel overwhelmed it's okay because whatever this new like innovation or tool thing is that is revolutionizing AI agents just realize that it's still going to be part of this framework it's like a new type of condiment in the condiments category that just happens to be a little bit more spicy or something i hope that makes sense i hope you get what I mean by that anyways let's now actually move on to each of these different components openai has this handy dandy little table so first you have the models component these are your AI models your large language models that are the core intelligence capable of reasoning making decisions and processing different modalities of course the examples that Open Eye gives us are the 01 03 mini GPD 4.5 GPD 40 etc now depending on the specific type of agent that you're building you want to choose a different type of model within the OpenAI ecosystem gpd 40 is your flagship model it's a thinking model that's really great at reasoning multi-step problem solving and complex 
decision-m great at answering most questions now if you want something that is more intensive the trade-off is that it's going to be slower and more expensive you have GPT 4.5 it's good for writing and exploring new ideas you also have 03 mini that has advanced reasoning capabilities but it's also faster and 03 Mini High that is particularly good for coding and logic outside of the OpenAI ecosystem Claw 3.7 Sonnet is usually the go-to model for people who do a lot of coding and reasoning and STEM subject based stuff although Gemini 2.5 Pro is challenging this right now but honestly like in a month or whatever it is that you watch this video probably the rankings have all shifted anyway but overall speaking if you care the most about things being cheap then you probably want to go with an open- source model and host it yourself and if you want to go with things being fast you want to go for smaller models and most Google models at least as of the time of this filming also has longer context windows if you care a lot about maintaining a high context window anyways there are a lot of websites out there that actually rank these different model performances like Vim for example i don't know if that's how you pronounce it Vellum so depending on what your use case is you can actually just check out the rankings and see which model suits your needs the best next up is the tools category now do not underestimate the importance of tools your model is simply your base model but what really starts making models powerful is adding on different capabilities like the ability of using tools tools allow agents to interface with the world like being able to search the web for example and all of the different applications that you see out there these can potentially be turned into tools for the AI like you can give it access to Google products like your Gmail your calendar you can give it access to the things that are in your hard drive you can give it access to what's happening on 
your screen you can give it access to your favorite apps like Slack or Discord YouTube Salesforce Zapier whatever you can also build your own custom tools that you can give to the AI agent as well if you use Open AI's agents SDK uh you do need to know how to code to be able to use this they give you the ability of defining your own tools as well as some built-in tools like web search file search and computer use you may have also heard of something called MCP which is kind of all the rage these days that was built by Anthropic it stands for model context protocol and it's a protocol that standardizes the way that you can provide things like tools to your large language model this is quite a leap forward because previously it's quite difficult for developers to provide their agents with different tools because different softwares configure their services in different ways so as a developer you kind of had to like figure it out and piece it together but basically MCP has made it a lot easier now do not worry if you're not a cody person there's also a lot of no code or low code tools that have inbuilt within them the ability for you to provide tools to your models some of the examples I'll show you later like N8N for example it allows you to very easily drag and drop different tools and connect them to your large language models for example if you're trying to build a market research agent it would need to have a tool to be able to search the internet a tool to be able to analyze the data that it gathers and maybe if you wanted to send a email report to you also would need a tool to be able to access your email now moving on to knowledge and memory so there's two different types the first one is called the knowledge base or static memory this allows you to give your AI model static facts policies documents just information that I can reference and access that remains relatively static over time this is important if you're building something like an AI agent that does 
legal tasks it may need to have specific legal documents for a particular case for a particular company and maybe like certain policies that are relevant for that specific company as well the other type of memory is persistent memory so this is memory that will allow an AI agent to be able to track conversation histories or user interactions past just a single session this is really important for a lot of chatbot use cases like say if you have an AI personal assistant you want to make sure that the personal assistant will still remember what happened like yesterday again OpenAI provides its own hosted services like vector stores file search and embeddings there's also open- source versions of this where you can host your own databases and then you can also perform different ways of doing rag which is retrieval augmented generation not going to go into way too much detail about this but some solutions that people look into would be Pine Cone which is cloudnative and optimized for vector search or Weeat which is open source again if you're leaning more towards a no code solution you don't really have to worry about this is usually already taken care of by that solution like N for example already allows you to deal with this without you having to like figure out all the complex cody stuff next up is audio and speech so it's pretty interesting because OpenAI does split this into its own separate category while many other kind of like frameworks don't really include this one specifically and I think the reason they do this is because there's just been such innovations recently in audio formats but basically giving your agent ability to have audio and speech allows it to interact with natural language this is really important for chatbot AI agents because having that ability to communicate directly using natural language can be a much better user experience within the OpenAI ecosystem they have their own ways of implementing this while outside of that ecosystem what 
people seem to use a lot, at least right now, is ElevenLabs, which is used for voice cloning and generation. Oh, and for audio transcription (audio to text), people do stick with Whisper, which is an OpenAI model. That's as of right now; like I said, these things change a lot. It's more important for you to understand the general category, the general component, as opposed to the specific tools within it.

Next component is guardrails. Guardrails are really important in order to prevent irrelevant, harmful, or undesirable behavior. Once you create this agent, you've got to make sure that it's actually doing what it's supposed to be doing and not something else. If you have a customer service agent, you want to make sure that it is in fact talking about customer service stuff and not giving you haikus or something like that. Outside of the OpenAI ecosystem, what's popular right now is Guardrails AI and LangChain guardrails. There are honestly a lot of different options in this category. Again, if you are using no-code tools, I think it's important for you to understand this category, this component, but a lot of no-code tools already have solutions built into their platform.

Finally, there is orchestration, and this is something that's super overlooked. Remember how we were talking about different sub-agents? Orchestration is how you chain together different sub-agents in order to come up with a final result. It also involves deploying the agent so it's able to do its thing in production, monitoring it, and improving it. Once you deploy the agent, you don't just run away and never look at it again, right? Over time the models keep changing, a lot of these technologies change as well, and the data keeps changing, so you need to keep monitoring and making sure that your agent is behaving the way it's supposed to. There are also a lot of different tools in this category. Oftentimes there's a framework, and then
the orchestration part of it is built into that framework. OpenAI has its own system. There's also CrewAI, which is another framework for implementing multi-agent systems; it also has its own system for orchestrating and finally deploying. LangChain is also very popular for managing different agent interactions and deploying them, as is LlamaIndex, which is particularly useful if you're creating an AI agent that has a lot to do with documents, static memory, and knowledge bases.

Here is also a little mnemonic for you to remember the different components that make up an AI agent, which is going to be immediately useful, because right now we are going to do our first little assessment. I'm going to put some questions on screen; comment your answers below to make sure that you retain all the information that we went through.

Okay, so this is a very practical guide to building AI agents. HubSpot offers a very practical free guide to building AI agents from a business perspective. I think this free resource is a really great complement to everything we're covering today, because it goes in depth on how to take these AI agents and make sure they're driving maximum business success. The playbook explains how AI agents are being used in businesses today, with actual examples and use cases and common pitfalls, and also discusses the future of work. It includes a checklist that helps your organization think through each phase of implementing AI agents, from identifying the highest return-on-investment opportunities to defining success metrics, as well as integration and scaling. I highly recommend that you check it out at this link over here, also linked in the description. Thank you so much, HubSpot, for creating these free practical resources and for sponsoring this portion of the video. Now back to the video.

All right, now that we know the components that make up AI agents, let's move on to the implementations. If you remember what I said a bit earlier, AI
agents are oftentimes not just a singular entity; they're actually broken down into different sub-agents that interact with each other. My favorite resource that covers these common agentic workflows and agent systems is the Building Effective Agents guide from Anthropic, so let's go through it.

First of all, you have the basic building block of agentic systems. This is what Anthropic calls the augmented LLM. From this image you can see that you have an input, you have the LLM, and you have the output, and the LLM is able to generate its own search queries, select appropriate tools, and determine what information it needs to retain through memory. If you were paying attention earlier, you'll see that there are overlaps between the components in this augmented LLM and OpenAI's components. This version is a little more bare-bones (it doesn't address things like guardrails or orchestration), but there is definitely overlap. That's okay; when it comes to things like testing and deployment, just remember the OpenAI components for those specific things. Just FYI, in terms of terminology, these augmented LLM building blocks are often called sub-agents as well.

So now let's actually see how these building blocks, these sub-agents, fit into each other and work with each other to form your bigger AI agent. We're going to start with the simplest agentic workflows and go all the way to the more complex and the truly autonomous.

All right, so the simplest common agentic workflow is called prompt chaining. Prompt chaining decomposes a task into a sequence of steps, where each sub-agent processes the output of the previous one. In its simplest form this is just like an assembly line, but you can also add in little gates where you can split it off into different things. The logic is the same: you have an input, a sub-agent does something with that input and passes it along to another sub-agent, which does something else, and maybe on to another one, et cetera, until you finally get an output. This kind of
implementation is ideal for situations where the task can be easily decomposed into subtasks. An example of when prompt chaining could be useful is if you want your AI agent to generate a report. The input could be the description of what the user wants. A sub-agent takes that and maybe generates an outline, passes it along to another sub-agent that checks the outline against specific criteria, then on to a writer sub-agent that actually writes the report, and then maybe to an editor sub-agent that edits the report. The final output would be the report that follows the criteria that were specified.

Routing is another type of workflow, where you have an input coming in and a sub-agent dedicated to directing that input to a specific follow-up task. Each of these tasks is governed by a sub-agent that is specific to that task. Then, finally, you get the output after the processing. Routing works really well for complex tasks where there are distinct categories that are better handled separately. A classic example of when routing is useful is a customer service bot that gets different types of queries: general questions, refund requests, technical support, whatever it is that people ask customer service. Based on the nature of the query, the first sub-agent should be able to route it to the sub-agent that is specialized for that task. If it's a refund request, it would be routed to the specialist sub-agent for refunds; if it's a technical support query, it would be routed to the sub-agent that specializes in handling technical support questions. Another common use case is routing different questions to different types of models, since some models are better at certain things than others. If it's a difficult STEM-related question, you might be routing it to
Claude 3.7 Sonnet, or if it's an easy question where you value speed, you might be routing it to Gemini Flash.

Next workflow is parallelization. This is when you have sub-agents working simultaneously on a task and then have all of their outputs aggregated together. This specific agentic workflow usually has two key variations. The first is sectioning, which is breaking a task into independent subtasks that are run in parallel. The second is voting, which is running the same task multiple times using different sub-agents to get diverse outputs that you aggregate together. An example of sectioning is evaluating how well a new model performs on a given prompt: each sub-agent could evaluate a different aspect of the model's performance, with one evaluating speed, another evaluating accuracy, et cetera. An example of voting is reviewing a piece of code for vulnerabilities: you have different sub-agents evaluating the code, and ultimately you aggregate their outputs into a vote to decide whether something is in fact a vulnerability or not.

Next workflow, and we're getting increasingly complex, is orchestrator-workers. The orchestrator-workers workflow actually looks pretty similar to parallelization, but what's different about it is that you don't have a predetermined list of subtasks. So this is especially useful for more complex problems where you can't predict exactly which subtasks are going to be needed. For example, if you're building agents that involve coding, oftentimes you don't know the exact number of files that need to be changed or the exact nature of each change, so you need to be dynamically making changes to multiple different files. Another example is search tasks: if you have a research assistant agent, this would involve gathering and analyzing a lot of different types of information from a lot of different
sources, which cannot be predetermined ahead of time.

Even more complex is the evaluator-optimizer workflow. This is approaching more autonomous situations, where you're giving the AI agent a lot more autonomy and freedom in determining what it should be doing. You have some sort of input, and the first sub-agent generates a solution based on that and passes it along to an evaluator sub-agent. The evaluator sub-agent evaluates it, and if it's accepted, that will be the output. If the evaluator feels it's not good enough, it sends it back to the first sub-agent, telling it that it's rejected, along with some feedback to improve. This is a circular loop that you keep running until the evaluator sub-agent thinks the solution is good enough and passes it along to the output. This workflow is particularly useful when there are clear evaluation criteria and when you can see iterative refinement and improvement over time. An example where the evaluator-optimizer workflow is useful is literary translation: there may be nuances that the translator sub-agent cannot capture the first time around, so the evaluator sub-agent sends it feedback and tells it to keep going until it's able to capture all the nuances in the language. Another example is a complex search task that you're trying to aggregate into some form of ultimate report. You might be doing research, and the evaluator sub-agent would say it's not deep enough, keep going, keep going, until you've gathered all the information needed to fully capture your super complex report.

And finally, we have the truly autonomous agent implementation. This one is tricky, because it is actually the simplest implementation-wise, but it can result in very different and potentially very complex solutions. The agent
will begin its work with some form of human interaction, and once the task is clear, the agent will be completely independent. It will perform some sort of action or actions that produce some form of reaction from the environment, and the agent has to figure out for itself, from the environment, what the result of its actions is. For example, if it decides to use a tool or to execute some code, it needs to figure out for itself whether it's making progress towards completing the task or not. It's going to keep doing that, getting feedback from the environment and judging how it's progressing, until ultimately it feels like it has completed the task it was assigned. This kind of very autonomous, freedom-giving agent implementation is usually used for very open-ended problems, where it's very difficult to predict the number of steps it should take or the exact path to get to the final result. You're basically just telling an agent, "Hey, do this thing," and it has to figure out for itself how to do the thing: what tasks are involved, whether it's making progress or not, and then at some point deciding that it has in fact completed the thing and coming back to you. You can get really crazy good results from this, but oftentimes you also get some really crazy stuff that can come from this. Some examples from the Anthropic article include a coding agent that's able to resolve SWE-bench tasks, which involve editing a lot of different files based on a task description, and their computer use implementation, where Claude was able to use a computer, with access to all of the different functionalities of this very complex machine, to accomplish specific tasks. Here's a diagram that illustrates the path that a coding AI agent took in order to complete its task. You can see that there are a lot of different back-and-forth
interactions with the environment, coming back and refining and everything like that, before going back to a human. As the article suggests, this kind of truly autonomous implementation is not something that you generally want to do, because in most situations you can go with a more predetermined agentic workflow, and it would yield more predictable results and be a lot cheaper. The article keeps saying repeatedly that you should always go with the simplest implementation possible. If you can achieve your AI agent's goals through prompt chaining or routing, don't do things that are more complex. It's a general rule of thumb when you're building AI agents, and actually when you're engineering and building things in general: don't overengineer.

Okay, so we've covered all the different workflows. Now I want to do a quick little crash course on prompt engineering for AI agents from a practical perspective. For these AI agents, the prompts matter so much; the prompt is really what holds everything together. You can have your agent with all these tools and access to all these really cool things, but if you don't have a good prompt, you're not able to pull all of this together. That's why I'm going to emphasize this part. When you're prompting an AI agent, you need to have the full prompt, all of it, right there; you can't interactively correct it and add more information throughout the process. There are six components that you should consider putting into your AI agent prompt.

The first thing to specify is the role. This is where you tell it that it's an AI research assistant, but you also want to include things like the tone and how it should be behaving. For example, you could write: "You are an AI research assistant tasked with summarizing the latest news in artificial intelligence. Your style is succinct, direct, and focused on essential information." Next up is the task, and you can write: given a search term related to AI news,
produce a concise summary of the key points. Then we have the input. This is where you specify what the AI research assistant will be receiving. In this case you can just write that the input is a specified AI-related search term provided by the user, but you can imagine that there could be other inputs, like certain graphs and different documents. You want to let the AI assistant know exactly what it will be receiving. Fourth is the output. This is where you go into detail about what you want the AI research assistant to come up with. What is it supposed to ultimately look like? What's the final deliverable? In this case you can write: "Provide only a succinct, information-dense summary capturing the essence of recent AI-related news relevant to the search term. The summary must be concise, approximately two to three short paragraphs, totaling no more than 300 words." Now it knows exactly what it's supposed to output.

The fifth step of the framework is constraints. This is a really, really important part to include in your prompt: not just what it's supposed to do, but also what it should not be doing. You could write: "Focus on capturing the main point succinctly. Complete sentences and perfect grammar are not necessary. Ignore fluff, background information, and commentary. Do not include your own analysis or opinions." We don't care about the AI agent's opinions; we only want to focus on the facts.

Finally, you have capabilities and reminders. This is where you tell the AI what it has access to, like certain tools, and provide reminders for things that it should really keep top of mind, things that are really important. In this example we gave the AI agent the ability to do web search, so we can tell it: "You have access to the web search tool to find and retrieve recent news articles relevant to the search term." Also, we want to remind it
that it needs to be very aware of the current date. A common issue with a lot of LLMs is that they're not really aware of what date or time it currently is. Since we're only interested in things that are relevant right now, we want to make sure it's aware of what time it is and what the search window is. So we might write: "You must be deeply aware of the current date to ensure the relevance of news, summarizing only information published within the past seven days." A general tip is that the more important something is, the lower down in the prompt you want to put the reminder. It's just the way the AI processes information: it has a bias towards the most recent things.

That was the crash course on AI agent prompt engineering. I hope you guys are not mad at me for making you learn the foundations first. I do find that a lot of people who are doing vibe coding these days don't actually know the foundations, and you end up building something that's just not that great, or, if it's something you want to tweak slightly, you end up making a lot of stupid mistakes because you don't understand the foundations. So now you're equipped with the information to go build something, with the knowledge and the confidence that it is in fact the best implementation. Here's a little quiz that I will now put on screen. Please answer these questions in the comment section to make sure that you're retaining all of this information.

Now, in the next section, I'm going to show you actual implementations of AI agents. I have included some no-code/low-code examples as well as fully coded examples, so there should be something for everybody. This is a customer support AI agent, and we implemented it using n8n. n8n is a no-code/low-code platform that is super easy to use, and you
can use it to create different AI agents. In this case we implemented the AI agent as a multi-agent system that follows the routing agentic pattern, which we talked about earlier. The way it works is that a customer sends an email inquiry, and then we have a text classifier, powered in this case by an OpenAI model, that routes the inquiry as technical support, billing, or general inquiry. Each of these has its own specific workflow after that.

Let's see how it actually works. Let's go to my email over here, and I'm going to write an email to customer support. In this case it's going to go to cloud@lontopus.com. I'm going to say "refund," because I am angry: "Hello, I want a refund." Yes, click send. You can see that the email's here. It classifies it as a billing situation, we have the AI agent, and the AI agent is able to use email to respond to the inquiry. If we check our email again, we see the agent has responded to us: "Hello, thank you for reaching out regarding a request for a refund. To assist you effectively," blah blah blah, give all this information. Then you can go ahead and send that information to the agent for it to process your refund.

If it's classified as technical support, it also has this workflow. If it decides that it can answer your technical support question directly using documentation, it can email you back the response directly. But here we also have an option where, if it can't figure out how to support you from a technical support perspective, it would escalate this and send it on Discord, like this: "Hello team, customer needs help, please investigate further. The email ID is," this ID over here. So a real human agent would be able to jump in and start helping the customer. It's really important to have this here, because you always want to have some way in your AI agent to escalate to an actual human. And of course, if it's a general inquiry, it would route to this branch over here, and then it
would send a general email asking for additional information.

This is another AI agent: an AI news aggregator agent. The way it works is that it's scheduled at 7:00 a.m. every day, and it goes and gathers news from different newsletters as well as Reddit. Then it aggregates all of that information together and ultimately comes up with a summary that it sends to me on WhatsApp. This is an example of a parallelization workflow pattern. It's not 7 a.m., but I'm just going to trigger the workflow right now and have it do its thing. It's going to be running everything over here. I want to make a note that even though it is a parallelization workflow, the limitation of n8n is that it actually still runs sequentially. If you implement this using a coded tool like OpenAI's Agents SDK, for example, which I will show you an example of in just a little bit, it would actually run in parallel. So in this case, just to let you know, technically it is parallelization, but it isn't able to run that way because of the platform's limitations.

Okay, so after running, it sends me a notification on WhatsApp where it gives me aggregated information from all of the different news sources: OpenAI launches GPT-5 alpha, AI ethics, Google's AI ethics, regulatory developments, blah blah blah, all these different things that are happening. In the prompt I specified that it should cite its sources, so if I wanted to go in and learn more about each of these news reports, I could just click in and look at the actual source itself. This is a really helpful AI agent to have, because in this prompt over here you can see that I can specify exactly what I'm interested in, like "AI-related search term provided by the newsletter," for example, or whatever it is I want, how I want it to be summarized, and how I want everything to be aggregated together. So it's a
really handy little tool. I think it would be really useful for you as well if you are someone who also has to go through a lot of information every day.

Final n8n example: this is a multi-input daily expenses tracker AI agent. That is such a mouthful. The way it works is that you interact with it using WhatsApp. You can send it pictures of receipts for whatever you've spent, and you can send it text as well: if you spent $10, you can just tell it that you spent $10. It's able to take all of that information and ultimately aggregate everything together to give you a final expense tracking report every single day. It also stores it in memory on Google Sheets, and it will also send you that report on WhatsApp. And finally, at 9:00 p.m. every single day, it sends you a summary on WhatsApp of how much money you've spent. For example, here I said that I spent $10 on a potato. I don't know why; $10 on a potato is very expensive. It then puts this on my expense tracker, so potato over here, $10. Here are all the other things that I've bought; you can see that I've bought a lot of things these days. At night it tells me that my consumption has focused on living expenses, specifically with the purchase of potatoes totaling $10, and that this indicates a straightforward and essential spending pattern with no other itemized purchase recorded for the day. On some of my previous days, when I bought more than just the one potato, it says here that on April 7, 2025, the spending showed a significant emphasis on food, with large purchases like steak and chocolate totaling $4,000, making food the most dominant category, and that minor living expenses were also recorded with the purchase of peanuts. Okay, that is not exactly correct, as you can see; maybe we still need to modify this prompt a little bit. But yeah, this is an example of how you can track your expenses. Based upon my explanation of how this works, what agentic workflow design pattern do you think the multi-input daily expenses tracker AI agent is implemented with? Put that in the comments.

I also wanted to show you an example that is implemented using code. Specifically, this was implemented using OpenAI's Agents SDK, and it's done in Python. It's a financial research assistant that is able to take in an inquiry, search the internet, gather information about it, and aggregate it. It also has voice functionality as well as language and translation functionality. This follows the routing agentic design workflow pattern, where we have a main manager. Instead of just showing you the code to explain this, I'm actually going to use Cursor to show you how the AI agent works and also to run it. It's just a little preview of my vibe coding video that's going to be happening maybe in two weeks, so stay tuned for that.

Okay, so I'm going to say, "Could you please explain the way that the financial research assistant agent works?" So we have a main orchestrator, which is the financial research manager, and the core workflow steps are that it plans searches, performs searches, writes the report, and verifies the report. The way it does this is that after the manager kicks off the program, it passes it on to a planner agent. It uses the planner agent to break down the user's query into specific search terms. Each search term contains a query and a reason for searching, and it returns a financial search plan with multiple search items. It then passes the search terms along to a search agent, which performs each of these searches and collects and aggregates all the search results. Then we go on to the analysis phase, which uses specialized agents for different aspects. We have two agents over here: the financials agent, which analyzes key financial metrics, and the risk agent, which identifies potential red flags, and both
agents will return analysis summaries. Then they pass all of these analysis summaries along to the report writing phase, where you have a writer agent that synthesizes all that information, combines the search terms with the financial and risk analysis, and generates a structured report in Markdown with a short summary and follow-up questions. Then we have a verifier agent, which checks the report's accuracy and completeness. We also included a voice interaction functionality, so you're able to ask it questions about the generated report using audio. And finally you'll get your output, the results for your financial report. You can see that it's implemented based upon the prompt chaining agentic workflow, where the main orchestrator manager kicks off the query and passes it along to the planner agent, the search agent, and many other agents, until finally you get a financial report.txt with all of the results contained within it.

Let's actually run this now: "Let's run the financial research agent." Whatever, I can't spell, it's fine. By the way, if you've never seen an AI coding editor at work before, this is kind of what it's like. Honestly, since I started using Cursor, Windsurf, and AI coding agents in general, it has been a huge game changer for how people code and run code. All right, we'll let it do its thing. It says, "I'll help you run the financial research agent. First let me check the workspace to ensure we have everything we need," blah blah blah. So let it do that. Okay, it's telling me that I need to install some things, so we'll just do that. Installing dependencies, running into errors, okay, run more things. Five minutes later: okay, after installing all of these dependencies, it says that we have the server running. Let's now run the financial research agent. I'm just going to write, "What are the key financial metrics for Tesla?" So we're
going to run this. Oh no, it didn't work. Honestly, a lot of vibe coding is just running things and then letting it install stuff and fix its own problems, so we're going to patiently wait for it to work. Okay, it says, "Enter a financial research query." Enter. Oh, looks like we don't have an OpenAI key; let me put that in. "The key metrics for Tesla." It is starting the financial research and starting to do its thing: it will perform seven searches, searching, planning the report structure, and there we go, it looks like we have the report. All right, the financial agent has successfully generated a comprehensive report, and we can find that report over here.

Instead of having to read through everything, I'm going to use the voice functionality that has been implemented. So, run the voice functionality: "Tell me about the key metrics in the report." "Sure, here are the key financial metrics mentioned in the report. One, revenues: Tesla recorded revenues of $24.93 billion. The substantial revenue figure is largely attributed to the successful sales of their Model 3 and Model Y vehicles, as well as strategic expansions in Berlin and Texas factories. Two..." So you can communicate directly using voice. And finally, I want to show you guys how you can translate your report into Spanish. This uses MCP, which gives it access to a tool that can translate the report into Spanish, which it did over here. So this is an example of a coded implementation, and if you want to check out the code, I'll link it in the description so you can play around with it yourself too. Remember that there are a lot of different ways to implement an AI agent. Choose what makes the most sense for the AI agent that you're building, as well as for your own skill level.

By the way, if you are interested in learning more about AI agents and how to build them, I wanted to let you know that I'll be launching an AI agents boot camp in the next few weeks. It's a four-week-long
program that is really hands-on, where you're going to be building your own AI agents like the ones you see in this video, as well as ones that are more advanced and more customized towards specific use cases. So if you're interested, please do check out the link over here, also linked in the description.

Instead of just ending the video right now and being like, "All right guys, go build your AI agents," I want to include this final section where I share how you should be thinking about what kind of AI agents you want to build in the first place. Ultimately, we're trying to build AI agents not just for funsies, hopefully. Or maybe it is, I don't know, and that's fine. But a lot of us are trying to build AI agents so that they can be useful for us, useful for a business, useful for an enterprise, whatever. Maybe some of you also want to start your own AI agent businesses or startups. By the way, if you haven't already, please do check out the Y Combinator YouTube channel. I have learned so much from it in terms of figuring out what kind of AI agents to build, what kind of startups to be doing, and what kind of things to be aware of while playing around in the AI space. Their videos are really, really worth watching. But I'm going to share with you the major insight that I got from watching this video, which is how to find your AI startup ideas.

The easiest way of figuring out a useful AI agent to build is by starting with yourself. What is it that you're currently doing that, if you were to offload it to an AI agent, would make your life so much easier? Again, don't worry about tools and frameworks and tech stack right now. Just think about what would make your life so much easier if it were done for you. For example, I work with a very lovely team and agency that takes care of the sponsorships that I do, and one of the people on the team actually messaged me on Slack saying that she
wanted to build an AI agent that is able to access her emails, screen what are considered good leads versus bad leads, and only respond to emails that are considered to be good leads. I thought that this was a great idea, and I was like, "Yes, you should totally do this." And you can totally do this through no code using n8n as well. You can use the prompt I shared earlier to figure out which agentic workflow is the most applicable in this specific situation, and then you can go build it using a no-code tool. But what if you're someone who is not currently working and solving problems every day? Like, maybe you have just graduated, or you're currently still a student. Don't worry, YC also has really great advice for this. In this case, what you want to do is go undercover. Seeing as you yourself don't have the experience to understand what can be automated, instead of just thinking of something in your head, the best approach is to go and meet up with someone who is in fact working, like someone who either owns their own business or has a job or something like that. Just ask if you can shadow them and try to figure out their problems. The thing is, oftentimes they might not even know their own problems, because they're so deeply entrenched in whatever it is that they're doing on a day-to-day basis that they don't even realize there could be ways of doing things that are so much easier and so much better if they incorporated AI into their workflow. But you, you're coming in with a fresh pair of eyes. So look at what they're doing and try to identify where you can build an AI agent and offload some of their tasks, automate some of what they're doing, so that they're able to accomplish their goals even better. Once you start doing that and developing that, you oftentimes start to realize that whatever issue it is that you had, or somebody else had, is something that many, many people have. And there you go, that's how you can start working on 
something that could eventually turn into a business or a startup as well. And finally, if you just want some high-level guidance, the absolute gold that I got from one of the YC videos as well is that for every SaaS company that you see out there, software-as-a-service company that you see out there, there will be an AI agent equivalent of that. Literally every company that is a SaaS unicorn, you could imagine there's a vertical AI unicorn equivalent. So there you go, that is such clear overarching guidance. Look at all the SaaS companies that are available right now, think about what the AI agent equivalent to that company is, and create that. Finally, I want to talk about the specific tech-enabled innovations that you can be working on right now. As always, the AI industry is just moving so quickly, and there are so many new technologies being developed every day. But the major fundamental developments that we can see right now in 2025 are huge leaps forward in voice and audio. Audio generation is just freaking unreal right now. Here's a little excerpt to show you what I mean, from Sesame. This is actually from a friend who showed me this, and I was just freaking mind-blown. "...hobbies to meet people? Well, joining a club or online community can be really fun, especially if you're into gaming or crafting. Volunteering is also a great way to connect with awesome people who care about the same things as you. And hey, if you're watching this, don't forget to subscribe to Tina's channel for more awesome tips." This is also why OpenAI itself and its SDK have a whole category dedicated to voice agents, because there are just so many use cases that are enabled by that. There are also massive developments in image models, like Rev, Gemini Flash image generation, as well as GPT-4o image generation, and there are also video models like Sora. So anything related to image and video, these are all things that are also ripe for disruption. Now, ending this 
video with a final general piece of advice: there's always so much stuff happening in this industry. If you ever feel overwhelmed by what is happening, try to relax, calm down, and think back to the frameworks and components that I presented today. There's a reason why I created this video where I'm not just showing you tutorials of things and telling you about the new things and the new agents that people are building. It's because, with all of this going on, if you just focus on understanding the fundamental components, the fundamental frameworks, and the fundamental technologies, then you're able to categorize everything that comes on top of that in your mind as being actually important for you to learn about or not. So keep up with the actual big innovations in this category, things like actual model innovations (Gemini 2.5 Pro recently came out, for example) and MCP, which enables better tool use. A lot of the other stuff is hype that you don't really need to pay so much attention to. Keep learning, keep doing your own projects, build out your own AI agents, and when the time comes, when the opportunity comes where your skill set and your interests align with what is in demand in the world right now, you'll be off building a successful AI agent business or startup, or just a side hustle or fun project as well. Be patient, my friend. All right, as promised, here is the final little assessment. Please write your answers to these in the comments. Now, thank you so much for watching to the end of this very long, very intensive video. I really hope that it has been helpful, and I will see you guys in the next video or live stream.
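
For anyone following along in text, here is a rough sketch of the email lead-screening idea mentioned earlier in this section: look at each email, decide whether it is a good lead or a bad lead, and only draft replies to the good ones. Everything in it is an assumption made for illustration. The `Email` class, the keyword lists, `score_lead`, `screen_inbox`, and `draft_reply` are hypothetical names, and the keyword scoring is just a stand-in for the step where a real agent would ask a language model to judge the email. It does not connect to a real inbox and is not the n8n workflow from the video.

```python
from dataclasses import dataclass


@dataclass
class Email:
    sender: str
    subject: str
    body: str


# Hypothetical keyword lists. A real agent would likely ask an LLM to
# judge each email; this simple scoring is only a placeholder for that step.
GOOD_SIGNALS = ("budget", "sponsorship", "partnership", "rate card")
BAD_SIGNALS = ("unsubscribe", "guest post", "free exposure")


def score_lead(email: Email) -> int:
    """Return good-signal count minus bad-signal count for one email."""
    text = f"{email.subject} {email.body}".lower()
    good = sum(1 for phrase in GOOD_SIGNALS if phrase in text)
    bad = sum(1 for phrase in BAD_SIGNALS if phrase in text)
    return good - bad


def screen_inbox(emails: list[Email], threshold: int = 1) -> tuple[list[Email], list[Email]]:
    """Split emails into (good_leads, bad_leads) using the score threshold."""
    good_leads, bad_leads = [], []
    for email in emails:
        if score_lead(email) >= threshold:
            good_leads.append(email)
        else:
            bad_leads.append(email)
    return good_leads, bad_leads


def draft_reply(email: Email) -> str:
    """Draft a reply for a good lead; bad leads never reach this step."""
    return f"Thanks for reaching out about '{email.subject}'. I'd love to hear more."


if __name__ == "__main__":
    inbox = [
        Email("ana@brand.example", "Sponsorship inquiry", "We have a budget for Q3."),
        Email("bob@spam.example", "Guest post offer", "Free exposure for your channel!"),
    ]
    good_leads, bad_leads = screen_inbox(inbox)
    for lead in good_leads:
        print(draft_reply(lead))
```

The point of the sketch is the shape of the workflow (classify first, then act only on one branch), which stays the same whether the classification step is a keyword check, an LLM call, or a node in a no-code tool.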
