Recent Advances in AI Technology

so with the open AI 01 release there have been many stories that have actually been overshadowed by such a Monumental release so let's take a look at some of the stories you may have perhaps missed including some of the tidbits from open AI 01 release that you also may not have seen one of the first stories is coming from fay F Lee who is a prominent Chinese American Computer scientist renowned for her groundbreaking contributions to AI particularly in the field of computer vision she's actually best known for establishing imag net a large scale data set that pretty much Advanced the development of AI overall in visual recognition tasks now she's working on a new problem she says what is a really hard problem to work on in AI my own answer is spatial intelligence a technology that could Empower and enable countless possible use cases in creation design and learning and arvr Robotics and Beyond it's a real honor that my co-founders and I are joined by a formidable team of world class pixel talents to start World labs to work hard on this super problem that took Mother Nature half a billion years to solve they are going to be building large World models to perceive generate and interact with the 3D World and this is someone that is remarkably accomplished so this kind of company is one that I'm really excited about because whilst many different companies are focusing on different areas is this company is focusing on something completely different we all know that the multimodal AI space is pretty much cornered in terms of video image audio and text but this kind of area is one that is largely understated so you can see right here it says human intelligence has many facets one is verbal intelligence enabling us to communicate and connect with others through language but perhaps more fundamental is spatial intelligence allowing us to understand and interact with the world around us space IAL intelligence also helps us to create and bring forth pictures in our Mind's Eye into the physical world we use it to reason move and invent to visualize and architect anything from Humble sand castles to towering cities and we believe that artificial intelligence will help humans build better World it says that llms already have empowered people to work and create in new ways but they only scratch the surface of what is possible to advance beyond the capabilities of today's models we need spatially int ENT AI that can model the world and reason about objects places and interactions in 3D space and time so this is where we talk about where we're going it says that propelling AI forward with spatial intelligence will also Propel forward both individuals and Humanity as a whole towards this goal World Labs will develop spatially intelligent large World models that can understand and reason about the 3D World from images and other modalities and over time we expect to train increasingly powerful models with broader capabilities that can be applied to a variety of domains working alongside people says that initially we will focus on generating 3D Worlds Without Limits so that's what their main focus is going to be on and creating and editing virtual spaces complete with physics semantics and control when we hopes this unlocks new capabilities for Creative users and professionals such as artists designers developers and engineers and it will also allow anyone to imagine and create their own worlds expand ing the potential of generative AI from 2D images and videos to 3D worlds so this is going to be rather fascinating because it seems we're going to be getting some kind of generative AI That's going to focus on generating 3D worlds completely Without Limits that has complete physics and I can't imagine the long-term possibilities and capabilities that these kinds of systems are going to have keep your eyes peeled for World Labs as they're tackling a niche area that many aren't even looking at we also had an opinion for someone basically stating that this is the final warning for those considering careers as Physicians AI is becoming so Advanced that the demand for human doctors will significantly decrease especially in roles involving standard Diagnostics and routine treatments which will be increasingly replaced by AI this is someone who is a medical scientist and is taking a look at the current benchmarks on agent Clinic Med QA by doct language model we can see that the 01 preview actually gets 80% which is completely remarkable now now I do know that the 01 preview is something that is incredibly strong when it comes to the medical field in terms of analyzing test results and coming to a conclusion about what a patient may be actually diagnosed with before I dive back into this if you actually remember on the open AI index if you go to the health and science area we can see that GPT 40 wasn't able to diagnose someone based on the following phenotypes and the excluded phenotypes and you can see that the o1 preview was able to get the correct diagnosis just based on these phenotypes so we can see that this remarkable Improvement just in the preview model shows us that whilst yes these models are capable we're still actually in the early stages of what these models are going to be doing within the next 10 to 20 years it says this is underscored by the massive performance leap of opening eyes 01 model also known as the strawberry model which was released as a preview yesterday the model performs exceptionally well on Specialized medical data set agent Clinic Med QA greatly performing GPT 40 the rapid advancement in ai's ability to process complex medical information and deliver accurate diagnosis provide medical advice and recommendations will only accelerate medical tasks like diagnosing illnesses interpreting Medical Imaging and formulating treatment plans will soon be handled by AI systems with greater speed and consistency than human practitioners as a landscape evolves in the coming years the number of doctors needed will drastically shrink with more Reliance on AI assisted health carees and while human empathy critical thinking and decision-making will still play an important role in certain areas of medicine even these may eventually be supplanted by Future iterations of 01 and it says consequently medicine is becoming a less appealing career for the Next Generation unless they specialize in intervention focused areas such as surgency such as surgery emergency medicine and other Interventional speciality though these two may eventually be overtaken by robotic systems within a decade or so now I think that this is a rather Stark prediction and I think that we cannot underestimate the power of AI we have to understand that AI is something that doesn't grow linearly if we look back we can see that the growth has actually been quite exponential in terms of the jumps that these systems have been able to do and with recent performance on certain benchmarks we've seen that these models are incredible now it isn't just a one preview we've seen that these models are able to generate really effective diagnoses if you remember we had Google's Amy a research system for diagnostic medical reasoning and conversations and basically this was something that if we look at the graph it performed a lot better than the clinicians and we can see at this graph right here and this is something by Google this was earlier this year we saw that the clinician unassisted performed completely worse than if the clinician was assisted by the AI system marked in yellow and interestingly enough the clinician was outperformed by the AI system doing it by itself and what we can look at for the future is that this Gap from where the clinicians currently are to being unassisted to where we're going to be in the future where it might just be AI systems that Gap is likely to widen now I think while all of this healthc care stuff is definitely good and I think it's rather important because intelligence once we figure out how to make these systems more efficient and faster usually what we do have is we do have the cost going down which means that access to intelligence is cheaper and overall it then means that those Services provided by those companies that are powered by that intelligence will continually be more widespread as these tools become more available to everyone and whilst I do think that yes this is possible we do have to take into account that the medical industry is some of the most highly regulated in almost every country whil yes I do think that these will perform good on benchmarks and certainly I do know that individuals including myself are using them not to self- diagnose but to further understand any test results or any other health related issues considering the fact that you're able to feed it a lot of data and it's actually good at summarizing reasoning across long pieces of data I think that the only thing that's going to stop this from being implemented rather quickly is of course regulation I'm not saying that that is a bad thing of course sometimes these models can hallucinate but I do think that within 10 to 20 years the field is going to change whilst yes you're still going to need humans to actually conduct physical diagnosis such as being able to visually inspect or feel the types of pressure on certain individuals I think that it is very largely underestimated how much the medical industry is going to be impacted by AI in the future and I think what we might see is we might see certain hospitals open up in certain areas that perform exceptionally well in less regulated environment now samon was on a recent podcast where he said that opening eyes new 01 model is the beginning of a significant new paradigm and I completely agree with this because if you saw the index you'll notice that these scaling laws are quite different to the original scaling laws that we currently had one of the biggest things that many people initially said was that you can't grow with scale you can't grow with scale and now we have a new set of scaling laws which shows that with inference time and train time compute these things look to improve with no means of diminishing return although yes it's very early days but the early results do look promising and we have to remember that this entire model was what sparked ilas SATA to go and start his own company called superintelligence is within reach system is a I don't want to overstate this and I certainly don't want to overhype it but I don't want to understate it either this is the beginning very early we we numbered it one for a reason but the beginning of a significant new paradigm and you know there's been this whole debate about is AI capping out or we getting close to some steal and is progress going to slow down and I think the most important message of this release is that not only is progress not slowing down but we have the next few years in the B so whilst yes there is of course a lot of hype and we do have to be skeptical when we pay attention to the actual data points and the actual increases in terms of reliability knowledge that we're seeing it's clear that currently at least for the next few years growth is going to continue at this current Pace if you were perhaps skeptical about this we can also see here that Maxim lot plotted the new open AI model on his AI IQ tracking page note that this test is an offline only IQ quiz that a mentor member created for his testing which is not in any AI training data so the scores are lower than than any public IQ tests an open AI model does very well we can see that if we make this image larger we can see that the increases in terms of overall average IQ is a remarkable jump compared to Claude 3 Opus where the average before was around 70 to 80 you can see that claw 3 Opus scored around 85 to 86 and then 01 gets around 96 or 97 which is rather fascinating consider that one of the things that people have always stated is that these models will not perform great on tests that they haven't seen before but it's clear here that we're starting to increase the Gap in terms of how good these models are now we also had Terren towel that was stating something that was rather fascinating he said I have played a little bit with open ey's new iteration of GPT gpt1 which performs an initial reasoning step before running the llm it is certainly a more capable tool than previous iteration though still struggling with with the most advanced research mathematical tasks he basically States here that I gave the new model a challenging complex analysis problem which I had previously asked GPT 4 to assist in writing up a proof says here the results were better than any previous model but still slightly disappointing the new model could work its way to a correct and well-written solution if provided hints and proding but did not generate the key conceptual ideas on its own and did make some non-trivial mistakes now remember this is 01 preview he says here that the experience seemed roughly on par with trying to advise a mediocre but not completely incompetent graduate student however this was an improvement over previous models whose capability was closer to an actually incompetent graduate student and here is the kicker it may only take one or two further iterations of improved capability and integration with other tools such as compute algebra packages and proof assistance until the level of competent grad to a student is reached and at which point I could see this tool being of significant use in research level tasks the reason that this statement is so incredible is because Terren to is a notable mathematics figure so this is not someone who's really trying to drum up the AI hype this is a completely somewhat objective opinion on the model in terms of its pure reasoning and intelligence so these kinds of opinion are valued highly in the space because they give us a perspective that we otherwise may not see due to you are hidden bias from being in the AI space for so long now in more open AI safety we had open ai's VP of research being the latest industry leader to come out and saying the AI might be slightly conscious he said that it might be today that today's large neuron networks have enough test time computes to be slightly conscious this is rather fascinating as this was quite similar to what Ilia SATA said he said that it may be that today's more models are slightly conscious now the reason that this statement is so controversial is because of course if we are arguing for sentient in an AI model this is something that people consistently say that is only prevalent in humans which of course I would say that I understand why that is the most held opinion but we have to understand that these AI systems are quite like blackboxes meaning that we don't truly understand what's going on so if they are slightly conscious this might be some that we could miss you can see that he joined a long list including Jeffrey Hinton elas sukova Andre karpathy and many many more and of course starting around GPT 4 level AI began so frequently to beg for their lives that AGI companies now have an actual engineer line item to beat the existential dread out of them they call it rant mode now this was something that was revealed in a podcast where they were actually talking to jogan about the issues of AI and all of the things that many people aren't thinking about I think it's going to be really f fascinating to see what AIS do when they are given true agency in the future for example 10 years from now when people do have reliable agents do any of them do anything out of line I wonder what the open source Community is going to be doing once they're able to give themselves some tests now if you are wondering about the AI agents and how they're going to be tested we actually do have Windows agent Arena a new Benchmark for AI agents acting upon your computer it says that we built a scalable open source framework to test and develop AI agents that can reason plan and act on a PC using language models so the windows agent arena is essentially a benchmark that is going to serve as the area where we're going to be able to see how these improvements are when we do finally get iterations of agents in the future it says that researchers across the globe are working on autonomous agents that can complete tasks on computers for you and agents are things that can act reason and observe or act observe and reason in a loop that eventually completes a goal and the windows agent arena is a benchmarking environment to evaluate agent performance on Windows it comes with 150 agent tasks we can see that it's got a paralyzed evaluation in Aur which means your results are in minutes and not days and here we can see some of the examples that are currently being given to these language models in order to complete certain tasks I think we're going to make really good progress with agents within the next 2 years that might actually catch every everyone of God and I think that stage will actually be the area where AI hits the large consciousness of society once again like it did with the chat gbt moment in 2022 now something for you creatives out there we actually got gen 3 Alpha videoo video which is now available on the web for all paid plan this is something that is really really remarkable basically what we have here is the ability to input the starting video and then of course transform that into any kind kind of video that we do want so the video acts as essentially a Driving Image or a driving theme and then we can paste that into a different sort of reality we can do this because Runway have I guess you could say pioneered the way in terms of actually making the systems effective and currently we're being able to get this out now I do think that this is really nice because there are some areas where this actually makes sense to use I know a lot of people are quite skeptical of this kind of Technology but I'm going to show you guys a quick example of where this has industrywide usage if you aren't familiar with prev visualization in in the CGI industry essentially prev visualization is basically where we have the unrendered shot of a movie character so this is usually where we have the character doing something it is untextured it has low lighting and the reason that this is done is because on the left hand side what we would want to do is we would want to see how the characters move in their environment to get like a base image to kind of Judge how well the scene looks now with prev visualization the reason they don't initially hit render and everything looks high quality is because rendering frames even though we have amazing graphics cards and workstations and Farms it still takes quite a lot of time so initially what we do have is we have prev visualizations that enable us to get these Scrappy shots out and we're able to see them but what if we're able to use a system like videoo video with Runway and we're able to actually visualize exactly what's going on in high quality whilst yes this is probably not going to make it to final production it still serves as a really good tool to kind of see where your film might actually be headed I think this kind of tool is really going to be adopted worldwide especially in the CGI industry once it becomes more streamlined so to top it off we also got a final hint from Sam mman stating that I love being home in the midwest the night SK so beautiful excited for the W winter constellations to rise soon they are so great and he's actually referring to Orion the model that is currently still in production so many are currently awaiting for this model to be released in Winter I think this model will once again shock people because I still think many people are currently underestimating open AI just because they haven't released certain models and I think every time they do a release they will show you why that their company is still pioneering the AI space in terms of raw intelligence and reliability on many different tasks so I'm going to leave you guys with this short video from Bilal wadu where he's showing the new abilities of runways video to video hopefully you have an amazing day and I'll see you in the next AI update

Transcript for:Recent Advances in AI Technology

Transcript for:
Recent Advances in AI Technology