Download the AI & Machine Learning For Dummies App: iOS - Android
AI Innovations in October 2024.
In October 2024, the landscape of artificial intelligence continues to evolve at an unprecedented pace, with groundbreaking innovations and developments emerging daily. The “Daily AI Chronicle” aims to capture the essence of these advancements, providing a comprehensive summary of the latest news and trends in AI technology throughout the month. As we navigate through a month filled with transformative AI breakthroughs, our ongoing updates will highlight significant milestones—from the launch of cutting-edge AI models to the integration of AI in various sectors such as healthcare, finance, and creative industries. With each passing day, AI is reshaping how we interact with technology, enhancing productivity, and redefining our understanding of intelligence itself. Join us as we explore the exciting world of AI innovations, keeping you informed and engaged with the rapid changes set to influence our future. Whether you’re a tech enthusiast, a professional in the field, or simply curious about the implications of AI, this blog will serve as your go-to resource for staying updated on the latest developments throughout October 2024.
Google DeepMind researchers win Nobel Prize in chemistry
OpenAI seeks independence from Microsoft
Adobe launches AI attribution system
🧠 AI computing capacity for leading tech companies
Google DeepMind researchers win Nobel Prize in chemistry
The Royal Swedish Academy of Sciences has decided to award the 2024 Nobel Prize in Chemistry with one half to David Baker “for computational protein design” and the other half jointly to Demis Hassabis and John M. Jumper “for protein structure prediction.”
The Nobel Prize in Literature for 2024 has been awarded toChatGPT
The Nobel Prize in Literature for 2024 has been awarded to ChatGPT for “his intricate tapestry of prose which showcases the redundancy of sentience in art.” This fictional accolade humorously acknowledges the ability of AI to produce sophisticated, expressive literature, suggesting that creativity can transcend traditional human boundaries.
The award, granted by The Swedish Academy, celebrates the notion that artificial intelligence, despite its lack of human consciousness, has the capacity to create a profound and complex body of work—so much so that it might question the necessity of human sentience in the realm of artistic expression.
OpenAI is reportedly looking to reduce its reliance on Microsoft for compute power and has started exploring options to set up its own data servers and secure AI chips independently, according to a new report from The Information.
CFO Sarah Friar told shareholders that Microsoft ‘hasn’t moved fast enough’ to supply computing power, causing the AI giant to look elsewhere.
OpenAI plans to lease an entire data center in Abilene, TX from Oracle, though Microsoft likely had to ‘bless’ the deal with its rival, according to the report.
OpenAI is also developing its own AI chip, which could lower costs for future computing clusters — its current supply is rented primarily from Microsoft.
Tensions have also reportedly arisen between OpenAI and Microsoft over the design and timeline of a massive joint data center project called ‘Fairwater.’
OpenAI and Microsoft’s relationship has felt a bit off for a while now. While both companies have leveraged each other well to ascend the AI power ladder, it certainly feels like there is trouble in paradise. There is plenty of smoke, and how this partnership shakes out could have fiery implications for the entire AI landscape.
Adobe just announced a new free web app called Adobe Content Authenticity, designed to help creators protect their work and receive proper attribution in the era of AI-generated content.
The web app allows creators to easily apply content credentials to images, audio, and video files, acting as a ‘nutrition label’ for digital content.
Content credentials include creator information and creation details and can signal if the creator doesn’t want their work used to train AI models.
The system uses digital fingerprinting, invisible watermarking, and cryptographic metadata to make the credentials difficult to remove.
The web app, which has a waitlist, is expected to launch in Q1 of 2025, while a Chrome extension is available in beta today.
AI is extremely polarizing in the creator and artist community, largely due to the issues of unauthorized training and attribution that Adobe, Meta, OpenAI, and others are trying to address. While these tools are promising, they still rely heavily on widespread adoption and opt-in by creators and tech companies.
Kling AI, one of the most popular AI video generators, now lets you add strategic movement to specific elements in AI video, providing more control in your generated clips.
Choose a high-quality image with different elements to animate.
Access Kling AI‘s Image-to-Video tool and upload your image.
Use the Motion Brush to paint areas you want to animate and set motion paths for each area to define movement direction.
Fine-tune with prompts, adjust settings, and generate your video.
Pro tip: Keep movements subtle and natural for more realistic results, and experiment with different combinations to find what works best for your specific image.
AI is Revolutionizing Weather Forecasts : How GraphCast Models are Predicting the Future with Unmatched Precision
In recent years, artificial intelligence (AI) has made significant strides in numerous fields, from healthcare to finance. One of the most exciting developments is how AI is revolutionizing weather forecasting. With the advent of advanced AI models like GraphCast, we are entering an era where weather predictions are faster, more accurate, and more reliable than ever.
Google: The bar is divided into two parts—NVIDIA (turquoise) and TPU (blue), indicating that Google relies on both GPUs and custom Tensor Processing Units for its AI computing needs. Google’s total computing power is estimated at over 1 million H100 equivalents with a wide 50% confidence interval (CI), reflecting a significant but uncertain range.
Microsoft (including OpenAI): The capacity bar for Microsoft is entirely NVIDIA based. It shows a substantial AI computing capacity, ranging between 500k and 1 million H100 equivalents with a significant confidence interval.
Meta: This bar represents the use of NVIDIA GPUs and shows a slightly smaller computing capacity, estimated between 400k and 800k H100 equivalents, with an associated confidence interval.
Amazon: Amazon’s computing capacity is similar to Meta but slightly smaller, estimated between 300k and 700k H100 equivalents.
Other (including other cloud providers and AI labs): This category has the largest computing capacity, reaching 1.5 million H100 equivalents or more, with a broad confidence interval, indicating significant diversity among other providers.
Google leads the way with the largest computing capacity, exceeding one million H100 equivalents. Google leverages both NVIDIA GPUs and its custom TPUs, which significantly boosts its computing resources, making it a powerful player in the AI field.
Microsoft, which includes the resources of OpenAI, follows as another major contender, with its computing power estimated between 500,000 and one million H100 equivalents. Microsoft primarily depends on NVIDIA’s technology for AI workloads, reflecting a substantial investment in industry-standard GPU infrastructure.
Meta ranks next, with a strong computing infrastructure in the range of approximately 400,000 to 800,000 H100 equivalents. This illustrates Meta’s commitment to advancing its AI capabilities to power its social platforms and metaverse initiatives.
Amazon also shows impressive AI capabilities, albeit slightly behind Meta, with its computing capacity estimated between 300,000 and 700,000 H100 equivalents. This positions Amazon well for expanding AI capabilities across its AWS offerings and other business services.
The “Other” category, which includes other cloud providers and AI labs, collectively possesses a very significant amount of computing power, estimated at over 1.5 million H100 equivalents. This diverse group demonstrates the growing competition and interest in AI computing capacity across various tech ecosystems.
Overall, this comparison highlights the significant infrastructure investments made by these leading companies to enhance their AI capabilities, with Google standing out as the clear leader, followed by a competitive landscape involving Microsoft, Meta, Amazon, and a diverse group of other providers. The results underline the importance of having vast computing resources to stay at the forefront of AI development and innovation.
Google AI – Development of therapeutic drugs is often difficult and time consuming. A new model, Tx-LLM, is able to predict the properties of many entities of potential interest for therapeutic development with accuracy comparable state-of-the-art specialty models.
Introducing Tx-LLM, a language model fine-tuned to predict properties of biological entities across the therapeutic development pipeline, from early-stage target discovery to late-stage clinical trial approval.
Chinese startup Leju Robotics has released their open-source humanoid development platform for academic and R&D use cases. It includes an SDK for sensors and controls, simulation models, an LLM interface, and some basic demos that work out-of-the-box.
Uberunveiled plans to launch an OpenAI-powered AI assistant in early 2025 to help drivers with electric vehicle questions, aiming to accelerate EV adoption on the platform.
Anthropic launched Message Batches API, allowing developers to submit up to 10,000 queries for async processing in under 24 hours at a 50% discount compared to standard API calls.
KoBold Metals raised $527M for its AI-powered mineral discovery tech that leverages extensive data analysis to uncover deposits with energy-critical minerals like copper, lithium, and nickel.
This comprehensive app serves as a one-stop resource for mastering Machine Learning and AI concepts, from basics to advanced topics. It offers a rich array of features including over 600 quizzes covering cloud ML operations on major platforms, fundamental and advanced ML concepts, and NLP. The app also provides cheat sheets, interview preparation materials, and daily-updated content to keep users abreast of the latest developments. With interactive elements like scorecards and timers, it offers an engaging learning experience for both beginners and experienced professionals looking to enhance their ML and AI expertise. iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211
CogvideoX-ControlNet: A new tool for turning images into short videos using the powerful CogvideoX model. It’s open-source, so check it out and contribute if you’d like!
Meta Movie Gen: Now adds audio to your videos! From background sounds to music, this AI brings your videos to life.
Veo by Google DeepMind: Google’s latest advanced video creation tool. Watch it in action!
FLUX.1-dev ControlNet Inpainting: Perfect for fixing or filling in missing spots in your images.
🧠Nobel Prize awarded to ‘godfather of AI’ who warned it could wipe out humanity
Inflection and Intel team up on enterprise AI
💰Nvidia Overtakes Microsoft as AI Powers Stock to 6-Week Record High
Students turn AI glasses into doxing devices
Checklists improve AI model evaluation
👀 AI images taking over google
Uber will use ChatGPT to get more people to use EVs
Adobe has a new tool to protect artists’ work from AI
🧠Nobel Prize awarded to ‘godfather of AI’ who warned it could wipe out humanity
The Nobel Prize in Physics 2024 was awarded to John J. Hopfield and Geoffrey E. Hinton “for foundational discoveries and inventions that enable machine learning with artificial neural networks.”
Hinton … hopes that the award might make people take the fears he voices more seriously.
The Royal Swedish Academy of Sciences has decided to award the 2024 Nobel Prize in Physics to John J. Hopfield and Geoffrey E. Hinton “for foundational discoveries and inventions that enable machine learning with artificial neural networks.”
Geoffrey Hinton and John Hopfield, credited with ‘establishing the foundations for today’s advanced machine learning technologies’, were awarded the Nobel Prize in physics for their pioneering work on artificial neural networks mimicking brain structures.
Their innovations helped enable AI systems to learn by identifying complex patterns from data, which is foundational to high-profile applications like language generation and image recognition currently used in technology.
Despite the recognition, Hinton has expressed concern over AI’s potential risks, highlighting the danger of bad actors misusing the technology, and recently left Google to focus on advocating for responsible AI development.
💰Nvidia Overtakes Microsoft as AI Powers Stock to 6-Week Record High
On Monday, Nvidia stock went up even though most other big tech stocks went down. This helped the AI giant recover its position as the world’s second-largest company during the AI boom.
Uber will use ChatGPT to get more people to use EVs
Uber is introducing an AI assistant powered by ChatGPT to help drivers with questions about purchasing and using electric vehicles, aiming to encourage EV adoption.
The company is rolling out a new “EV Preference” feature, allowing users to select rides exclusively from electric vehicles, which will be available in the app over the coming months.
As part of its sustainability goals, Uber is expanding its EV-only service in 40 cities and aims to become a zero-emission mobility platform in North America and Europe by 2030, and globally by 2040.
Adobe has a new tool to protect artists’ work from AI
Adobe plans to launch a new web app in 2025, alongside a Chrome extension, to help protect artists’ work by applying tamper-evident metadata, known as Content Credentials, and allowing creators to opt-out of generative AI models.
This web app will integrate with Adobe’s Creative Cloud applications and enable artists to uniformly embed creator information across content, simplifying the opt-out process from AI training databases compared to individual submissions for each AI provider.
While Adobe’s initiative seeks widespread industry support, only a few companies like Spawning have committed to adopting these protections, highlighting Adobe’s challenge in ensuring voluntary participation from other AI and tech companies.
Inflection AI just launched Inflection for Enterprise, a new system built in partnership with Intel and designed for large-scale business deployments – featuring both a cloud service, new commercial API and upcoming local appliance.
Inflection for Enterprise is built on the new Inflection 3.0 model family and powered by Intel’s Gaudi 3 AI accelerators.
An on-premises AI appliance is planned for Q1 2025 release, promising up to 2x improved price-performance over competitors.
Inflection 3.0 comes in two variants — Pi 3.0 for chatbots and Productivity 3.0 for instruction-following tasks.
Inflection also released a commercial API, enabling developers to build advanced conversational AI applications.
After a turbulent year following founder Mustafa Suleyman and much of the team’s departure to Microsoft, Inflection is pivoting from consumer-focused apps to enterprise solutions. While the startup will face no shortage of competitors, a partnership with Intel is a positive start for the new regime.
Researchers from the University of Oxford and Cohere just developed TICK, a new approach for evaluating AI language models that use AI-generated checklists to improve assessment accuracy and interpretability.
TICK uses an AI model to generate a checklist of yes/no questions to evaluate how well another AI model followed a given instruction.
The checklist-based method showed 5.8% higher agreement with human evaluators than standard AI evaluation techniques.
The researchers also developed STICK (Self-TICK), which uses the checklists for self-improvement, leading to 7.8% better performance on reasoning tasks.
TICK can be fully automated, making it faster and cheaper than checklist-based evaluations requiring human input.
LLMs are weird — and sometimes even simple formatting quirks (remember the ‘take a deep breath’ prompt?) can lead to unexpected results. When looking for new techniques to get the most out of AI models and evaluations, maybe it’s ideal to return to the basics of human organization and learning.
What Else is Happening in AI on October 08th 2024!
Former Google CEO Eric Schmidtargued at the Washington AI Summit that AI advances should take precedence over climate goals, saying, “We’re not going to hit the climate goals anyway because we’re not organized to do it.”
Enterprise GenAI startup Writer is reportedly set to raise between $150-200M at a $1.9B valuation, doubling its valuation from its $100M Series B round last September.
Security researcher Harish SG published research showing evidence that LLMs can be prompted to achieve reasoning levels of powerful models like OpenAI’s o1 using a combination of advanced prompt tactics.
This comprehensive app serves as a one-stop resource for mastering Machine Learning and AI concepts, from basics to advanced topics. It offers a rich array of features including over 600 quizzes covering cloud ML operations on major platforms, fundamental and advanced ML concepts, and NLP. The app also provides cheat sheets, interview preparation materials, and daily-updated content to keep users abreast of the latest developments. With interactive elements like scorecards and timers, it offers an engaging learning experience for both beginners and experienced professionals looking to enhance their ML and AI expertise. iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211
Dashworks Bots – Create AI assistants that answer your team’s questions
🦾 Nvidia Acquires OctoAI To Dominate Enterprise Generative AI Solutions.
🚖Uber Expands Robot Delivery and Robotaxi Offerings With Avride.
🤖 Hitachi launches AI-powered railway maintenance service with Nvidia.
🔮 New Nvidia ACE plugins for Unreal Engine 5 simplify the creation of AI digital humans.
Jensen Huang is now worth more than Intel
Run Llama 3.2 locally on your phone
👀The impact of generative AI as a general-purpose technology
👨⚖️The racist AI deepfake that fooled and divided a community
Jensen Huang is now worth more than Intel
Jensen Huang, CEO of Nvidia, has a net worth of $109.2 billion, surpassing Intel’s current market value of $96.39 billion, which saw a significant drop following revelations about its financial issues in August.
Nvidia’s growth, driven by an AI boom and its dominance as a GPU accelerator manufacturer, helped its market cap soar, placing it among the top valued companies worldwide, though its stock has corrected by 10% since its peak.
Huang’s significant stake in Nvidia, with holdings valued over $100 billion, and his strategic share sales have propelled him to the 11th position on Forbes’ real-time billionaires list, close to entering the top 10.
OpenAI’s web crawlers are facing fewer blocks from major news websites compared to earlier, despite a widespread data-protection rush where publishers attempted to prevent their content from becoming AI training data without consent.
The trend of blocking OpenAI’s GPTBot saw a decline after the company made a series of licensing agreements with publishers, leading some outlets to revise their robots.txt files and permit GPTBot access.
Despite robots.txt not being legally binding, it remains a widely observed standard for web crawler behavior, and OpenAI recognizes the importance of not being blocked to safeguard its future goals and ambitions.
OpenAI just published a case study on Altera, a startup using GPT-4o to develop AI agents called “digital humans” capable of prolonged, natural interactions with people — significantly outperforming other rivals during testing in Minecraft.
Altera, founded by ex-MIT professor Dr. Robert Yang, uses GPT-4o to power AI agents that can play Minecraft autonomously for up to 4 hours.
Altera’s system combines GPT-4o with a brain-inspired multi-module architecture to simulate cognitive functions and emotional processing.
OpenAI reports that Altera’s agents outperform other models in Minecraft tasks, collecting 32% of items compared to 6.4% for the next best model.
The startup plans to expand beyond gaming to create AI ‘coworkers’ and more complex multi-agent simulations.
We’ve constantly heard from Sam Altman and others that AI agents are coming fast — and case studies like this (as well as a cryptic ‘Level 3’ tweet from an OpenAI researcher) might mean the capabilities have already arrived. We might ascend the ‘Stages of AI’ ladder faster than most are anticipating.
Researchers at Cleveland Clinic and IBM just developed an AI model to predict how drugs and gut microbes interact with pain receptors, potentially uncovering new non-addictive pain treatments.
LISA-CPI analyzes both the molecular structure of compounds and the 3D shape of pain receptors to predict their interactions.
The model identified FDA-approved drugs, like methylergometrine, that could potentially be repurposed for pain treatment by targeting specific receptors.
LISA-CPI also discovered gut microbes that may interact with pain receptors in beneficial ways.
The approach could accelerate drug discovery for pain and other conditions by more accurately screening potential compounds.
The current opioid crisis highlights the urgent need for effective, non-addictive pain medications, and this AI-driven approach could help researchers more quickly identify promising drug candidates while also opening new avenues for pain management.
Meta unveils advanced AI video model
Meta just announced Movie Gen, a powerful new suite of AI models for generating and editing video and audio content, positioning itself as a direct competitor to OpenAI’s Sora and other industry leaders.
Movie Gen consists of four models: a 30B video generation model, a 13B audio model, a personalized video model, and a video editing model.
The system can generate HD videos up to 16 seconds long from text prompts, along with synchronized audio like sound effects and background music.
Movie Gen also features video editing via natural text prompts and the ability to upload a reference image to create personalized videos.
Meta claims the model outperforms rivals like Runway Gen3, Luma Labs, and OpenAI’s Sora in human video quality and consistency evaluations.
Meta CEO Mark Zuckerberg said that Movie Gen will be ‘coming to Instagram next year’ in a post displaying some of the model’s sample generations.
Meta’s Movie Gen separates itself from other video generators by not only generating videos from text, but also being able to perform precise video editing. With the models coming to Instagram, it could transform the content creation process and give the masses a powerful video editing suite—with only prompting required.
Run Llama 3.2 locally on your phone
Meta’s new Llama 3.2 3B model can run directly on your smartphone, allowing you to have AI conversations privately and offline.
Open the app, tap the top-left menu, and select “Models.”
Under “Llama,” download “llama-3.2-3b-instruct q4_k” (2.2 GB).
Once downloaded, tap “Load” to activate the model.
Return to the main menu, select “Chat,” and start conversing with AI!
Create a local knowledge base that can be queried alongside the model, allowing you to supplement the AI’s knowledge with custom, up-to-date information without requiring an internet connection.
👀The impact of generative AI as a general-purpose technology
Generative artificial intelligence will affect economic growth more quickly than other general-purpose technologies, according to a new report. The steam engine, the internal combustion engine, electrification, and computers are all considered “general-purpose technologies” — new tools that are powerful enough to accelerate overall economic growth and transform economies and societies. According to many experts, generative artificial intelligence will be the next invention to join that category.
In a recent report about the economic impact of generative AI, Google visiting fellow and MIT Sloan principal research scientist Andrew McAfee makes the case that generative AI is not only a game-changing general-purpose technology but could also spur change far more quickly than preceding innovations due to its accessibility and ease of diffusion.
👨⚖️The racist AI deepfake that fooled and divided a community
When an audio clip appeared to show a local school principal making derogatory comments, it went viral online, sparked death threats against the educator and sent ripples through a suburb outside the city of Baltimore. But it was soon exposed as a fake, manipulated by artificial intelligence – so why do people still believe it’s real?
Google began rolling out the new AI anti-theft features for Android devices showcased at Google I/O, including Theft Detection Lock, Offline Device Lock, and Remote Lock.
AI startup Otherside AI’sReflection 70B modelfailed to match performance claims in tests published by the team in a post-mortem of the release after being initially touted as the ‘world’s best open-source model.’
North Carolina musician Michael Smith faces federal charges for allegedly using AI to generate thousands of songs and bots to stream them billions of times, netting over $10M in royalties.
Ready to accelerate your career in the fast-growing fields of AI and machine learning? Our app offers user-friendly tutorials and interactive exercises designed to boost your skills and make you stand out to employers. Whether you’re aiming for a promotion or searching for a better job, AI & Machine Learning For Dummies PRO is your gateway to success. Start mastering the technologies shaping the future—download now and take the next step in your professional journey! iOS – Windows
🦾 Nvidia presents EdgeRunner. The method can generate high-quality 3D meshes with up to 4,000 faces at a spatial resolution of 512 from images and point-clouds.
Meta unveils an AI video generator
ChatGPT gets a collab boost with Canvas: its newest ChatGPT interface
Google launches one of its ‘most significant updates ever’
TikTok’s owner is scraping the web 25 times faster than OpenAI
Google rolls out ads in AI Overviews
Apple releases AI model that rewrites the rules of 3D vision
Apple’s AI research team has unveiled Depth Pro, a new AI model that enhances machines’ depth perception using only a single 2D image, which could revolutionize fields like augmented reality and self-driving technology by offering real-time spatial awareness.
Depth Pro generates high-resolution 3D depth maps in just 0.3 seconds without needing traditional camera data, employing advanced techniques like a multi-scale vision transformer to accurately define details such as individual hairs and the edges of objects.
Open-sourced on GitHub, Depth Pro introduces metric depth estimation without extensive training on specific datasets, paving the way for widespread use in industries such as e-commerce, automotive, and healthcare, where sharp depth analysis is crucial.
🦾 Nvidia presents EdgeRunner. The method can generate high-quality 3D meshes with up to 4,000 faces at a spatial resolution of 512 from images and point-clouds.
Nvidia introduced EdgeRunner, an auto-regressive method capable of generating high-quality 3D meshes with up to 4,000 faces at a spatial resolution of 512. This approach efficiently processes images and point clouds, offering significant advancements in the field of 3D modeling.
Meta has introduced Movie Gen, an AI-powered model for video creation and editing, allowing users to generate high-definition video with audio and make precise edits using simple text commands, catering to filmmakers, content creators, and creative individuals.
Movie Gen offers personalization by combining uploaded images with descriptive text prompts to create customized videos, enhancing creative possibilities, and enabling scenarios ranging from fantasy realms to everyday adventures, while maintaining realistic human motion and identity.
The suite also includes advanced audio generation, with the 13-billion parameter model adding ambient sounds and music to video scenes, all aimed at democratizing content creation by offering professional-grade tools with user-friendly functionality.
Generate videos from text Edit video with text Produce personalized videos Create sound effects and soundtracks
Apple just released Depth Pro: Sharp Monocular Metric Depth in Less Than a Second
The paper presents a foundation model for zero-shot metric monocular depth estimation called Depth Pro. Depth Pro can produce high-resolution depth maps with sharp details and accurate object boundaries without requiring camera intrinsics like focal length. The superior performance of Depth Pro is attributed to its efficient multi-scale architecture, effective training curriculum, and dedicated boundary metrics. The model is able to accurately estimate depth and focal length in a zero-shot setting, enabling applications like view synthesis that require metric depth.
ChatGPT gets a collab boost with Canvas: its newest ChatGPT interface
OpenAI just launched Canvas, a new ChatGPT interface release that enables more collaborative writing and coding projects beyond simple chat interactions with new editing features, shortcuts, and added contextual knowledge.
Canvas opens in a separate window alongside the chat, allowing users to directly edit and refine specific aspects of an output.
New features include inline feedback, targeted editing, and shortcuts for tasks like adjusting text length, changing reading levels, or debugging code.
In tests, using GPT-4o with Canvas led to a 30% accuracy and 16% quality boost compared to using the model without the interface.
Canvas is rolling out in beta to Plus and Team users, with a broader release expected later.
ChatGPT’s first major UI change takes a leap towards more nuanced, moldable interactions — while also inheriting novice-friendly features seen in other rivals with easy-to-use shortcuts. The simple chatbox was a good first step for human-AI interactions, but more power and capabilities require new collaborative processes.
Google launches one of its ‘most significant updates ever’
Google has integrated more AI features into its search functionalities, unveiling a range of updates such as AI-organized web results, enhanced Google Lens capabilities, and the incorporation of links and advertisements within AI Overviews.
This AI-driven search initiative kicks off with food-related content, where Google’s AI creates a comprehensive experience by aggregating diverse perspectives from across the web, including videos and forums, tailored to user queries.
Additional updates include the enhancement of AI Overviews with more prominent links to support website traffic, the integration of ads within these overviews, improved music identification features with Circle to Search, and significant upgrades to Google Lens for video, voice, and shopping inquiries.
TikTok’s owner is scraping the web 25 times faster than OpenAI
ByteDance, the parent company of TikTok, has launched a web scraper called Bytespider which is significantly outpacing similar tools by other companies in collecting online data for AI model training, operating at 25 times the speed of OpenAI’s GPTbot.
Unlike other web crawlers, Bytespider ignores the robots.txt file that web publishers use to regulate scraping activity, highlighting its aggressive approach to gathering data from the internet, amidst concerns related to copyright issues within generative AI development.
With the U.S. government pressuring ByteDance over national security issues, the rapid data collection by Bytespider seems to indicate ByteDance’s urgency in enhancing TikTok’s search functionality and possibly developing a new large language model to rival existing competitors.
Google just announced the introduction of ads to its AI Overview search summaries and the launch of several new AI-powered search capabilities, such as video understanding and voice input.
Ads will now appear within and alongside AI Overviews for ‘relevant queries’ on searches in the United States.
The redesigned AI Overview format will now add prominent in-text links to better source websites for the curated information.
New AI-organized search results pages are rolling out that surface relevant, more diverse content — starting with recipe and meal inspiration queries.
Google Lens is getting video understanding capabilities and voice input options for visual searches.
The Android ‘Circle to Search’ feature also lets users identify songs playing in videos or streaming content.
Google’s first AI Overview experience didn’t exactly go as planned. However, with heavy competition from Perplexity and chatbot rivals, Google’s search future clearly has AI at its core, regardless of the bumps along the way. But infusing paid ads into AI Overviews could be a slippery slope – will Gemini be next?
Fourier launched GR-2, the company’s second-generation humanoid robot, which features improvements to battery life, hand dexterity, mobility, and a new developer kit.
OpenAI CFO Sarah Friar says their next AI model will be an order of magnitude bigger than GPT-4 and future models will grow at a similar rate, requiring capital-intensive investment to meet their “really big aspirations”
Meta smart glasses can be used to dox anyone in seconds
OpenAI is now valued at $157 billion
Nvidia stunned the world with a ChatGPT rival that’s as good as GPT-4o
Microsoft to employees: you can continue working from home unless productivity drops
Google developing reasoning AI to rival OpenAI
Meta smart glasses can be used to dox anyone in seconds
Harvard students demonstrated how Meta’s smart glasses combined with facial recognition technology can dox individuals by revealing personal details like identities and phone numbers, using tools like I-XRAY and public databases in real-time.
The demo used existing technologies such as Meta’s Ray-Ban smart glasses and the PimEyes search engine, showing how a simple photo capture can quickly connect to public data, including names and addresses, raising privacy concerns.
Meta has privacy guidelines for its smart glasses, but the tiny notification light is hard to detect in bright light, leading to potential misuse despite the company warning users to respect others’ privacy and follow recording etiquette.
OpenAI has raised $6.6 billion in a new funding round, which has nearly doubled its valuation to $157 billion from a previous $86 billion, as reported by The Wall Street Journal.
The latest financing requires OpenAI to shift from its nonprofit model to a fully for-profit company, or investors have the right to retract their investments.
Major contributors to this funding round include Thrive Capital with a $1.25 billion investment and long-time supporter Microsoft, which added just under $1 billion more, with new investors like SoftBank and Nvidia also participating.
Nvidia stunned the world with a ChatGPT rival that’s as good as GPT-4o
In early October 2024, Nvidia surprised the AI community by unveiling NVLM 1.0, a series of advanced multimodal language models with capabilities matching those of the GPT-4o model from ChatGPT.
Instead of releasing a direct competitor to consumer-facing AI applications like ChatGPT or Claude, Nvidia is opting to allow others to create their own AI solutions by making the model weights of NVLM publicly accessible.
Nvidia, previously renowned for supplying essential chips for AI processes, is now demonstrating its prowess in generative AI through its innovative approach to sharing AI technology development resources.
Microsoft to employees: you can continue working from home unless productivity drops
Microsoft has decided to allow employees to continue working from home, maintaining flexibility as long as it does not affect productivity, contrasting with companies like Amazon that have mandated a return to the office.
Scott Guthrie, Microsoft Executive Vice President, assured workers in a meeting that the company values flexible working arrangements, though productivity must remain steady to keep the remote work model viable.
The remote work setup is considered beneficial for both employees and Microsoft, though the company remains cautious about the risks, such as decreased productivity and potential misuse of work hours for personal activities.
Google is reportedly making significant strides in developing AI models with advanced reasoning capabilities similar to OpenAI’s o1 system, intensifying the rivalry between the two AI giants.
Multiple teams at Google are working on AI that can solve complex, multi-step problems, according to Bloomberg.
The AI uses chain-of-thought prompting, a technique created by Google, to tackle complex math and programming problems by ‘thinking’ before responding.
Google is taking a more cautious approach to its releases than OpenAI but has already debuted math-focused reasoning models like AlphaProof and AlphaGeometry 2.
Microsoft also infused reasoning capabilities into its Copilot assistant this week, leveraging OpenAI’s o1 model.
Human-like reasoning and agentic capabilities are clearly the two major developments on every AI firm’s roadmap, and the release of o1 may have signaled a new phase in the LLM race. The question is — will OpenAI’s speed keep it a step ahead, or is the competition for top-tier models about to get a whole lot tougher?
What Else is Happening in AI on October 03rd 2024!
The Cancer AI Alliance formed a $40M collaboration between major medical institutions and tech giants like Microsoft, AWS, Nvidia, and Deloitte to advance AI-driven cancer care.
Character AI is reportedly shifting its focus away from building AI models in the wake of its $2.7B deal with Google and prioritizing its consumer chatbot service.
Elon Musk posted ‘OpenAI is evil’ on X in response to reports that the AI giant asked investors to avoid funding competing AI firms like Anthropic and Musk’s xAI.
Accenture announced a new partnership with NVIDIA to accelerate enterprise AI adoption, launching a business group and AI Refinery platform to scale agentic AI systems across industries.
WALDO: a detection AI model designed to identify specific objects, such as vehicles and utility poles, in overhead images from various altitudes, useful for tasks requiring object recognition in large-scale imagery.
Kameo: a Rust library for creating fault-tolerant, distributed, and asynchronous actors using Tokio, facilitating seamless communication across nodes with features like scalability, backpressure handling, and panic recovery.
TinyJS: a lightweight JavaScript library that simplifies the creation of HTML elements, property assignment, and DOM element selection with unique $ and $$ shortcuts, enhancing web development efficiency.
QBittorrent: an open-source BitTorrent client designed to be a lightweight alternative to other clients, offering ad-free usage, stability, and a variety of features.
Serving 70B-Scale LLMs Efficiently on Low-Resource Edge Devices: the paper discusses methods for running large language models (LLMs) efficiently on devices with limited resources.
OpenAI’s recent DevDay conference took a different approach from last year’s event, focusing on incremental improvements rather than major product launches. The company introduced four key innovations: Vision Fine-Tuning, Realtime API, Model Distillation, and Prompt Caching, all aimed at empowering developers and enhancing the AI ecosystem.
Prompt Caching: This feature reduces costs and latency for developers by applying a 50% discount on input tokens that the model has recently processed, potentially leading to significant savings.
Vision Fine-Tuning: This allows developers to customize GPT-4o’s visual understanding capabilities using both images and text, with applications in fields like autonomous vehicles and medical imaging. For example, Grab improved its mapping services using this technology.
Realtime API: Now in public beta, this API enables low-latency, multimodal experiences, particularly in speech-to-speech applications. It allows for natural conversation and mid-sentence interruptions, opening up possibilities for voice-enabled applications in various industries.
Model Distillation: This workflow allows developers to use outputs from advanced models to improve the performance of more efficient models, making sophisticated AI capabilities more accessible and cost-effective.
OpenAI’s strategic shift towards ecosystem development over headline-grabbing product launches reflects a mature understanding of the AI industry’s current challenges and opportunities. By focusing on refining tools and reducing costs, OpenAI aims to foster a thriving developer ecosystem and ensure sustainable AI adoption across various industries.
Realtime API enables speech-to-speech application building using the same model that powers Advanced Voice, with the ability to choose from six voices. “Until right now, voice has been a second activity“, and that the Realtime API is going to make AI significantly more accessible because many people in the real world prefer to speak over reading or texting. Realtime API will have a “no-brainer” impact on customer support, education, and coaching. He also believes there will be many ‘non-obvious‘ use cases that are hard to predict now. For now, Realtime API only supports text and audio. However, Godement believes that image and video are the next milestones on the road to agents that can perceive the world just like a human. He also mentioned that image and video understanding specifically, will “turbocharge customer support” when the model has the ability to understand pixels on a screen in real-time. https://openai.com/index/introducing-the-realtime-api/
Model Distillation simplifies fine-tuning smaller models using outputs from larger ones, making training more accessible to developers. https://openai.com/index/api-model-distillation/
Prompt Caching reduces costs by nearly 50% across models and speeds up responses by up to 80% when reusing recent input tokens in API calls. https://openai.com/index/api-prompt-caching/
Access to the o1 model is expanded to developers on usage tier 3, and rate limits are increased (to the same limits as GPT-4o)
Microsoft Copilot gets voice, vision upgrade
Microsoft just announced a slew of AI upgrades coming to its Copilot assistant for Windows PCs, including new vision and voice capabilities, personalization enhancements, a re-release of the controversial Recall feature, and more.
Copilot Voice allows users to interact with natural speech, adding conversational and intuitive communication similar to OpenAI’s Voice Mode.
Copilot Vision enables the AI to understand and interact with web content a user is viewing, offering context-aware help within the Microsoft Edge browser.
‘Think Deeper’ gives Copilot new enhanced reasoning capabilities using chain-of-thought reasoning powered by OpenAI’s o1 model.
Microsoft’s ‘Recall’ feature is set to return, requiring an opt-in with upgraded privacy and security measures.
Microsoft AI CEO Mustafa Suleyman highlighted Copilot’s ability to ultimately ‘act on your behalf’ and adapt to user’s personal preferences and needs.
Microsoft is bringing the heat with these major Copilot upgrades, levelling up the assistant to align with the latest cutting-edge AI features across the industry — while bringing users one step closer to a truly agentic experience.
🧠Google is Working on Reasoning AI – Bloomberg News
Google is working on artificial intelligence software that resembles the human ability to reason, similar to OpenAI’s o1, marking a new front in the rivalry between the tech giant and the fast-growing startup.
In recent months, multiple teams at Alphabet Inc.’s Google have been making progress on AI reasoning software, according to people with knowledge of the matter, who asked not to be identified because the information is private.
AI researchers are pursuing reasoning models as they search for the next significant step forward in the technology. Like OpenAI, Google is trying to approximate human reasoning using a technique known as chain-of-thought prompting, according to two of the people. In this technique, which Google pioneered, the software pauses for a matter of seconds before responding to a written prompt while, behind the scenes and invisible to the user, it considers a number of related prompts and then summarizes what appears to be the best response.
Since OpenAI unveiled its o1 model, known internally as Strawberry, in mid-September, some in DeepMind have fretted that the company had fallen behind, according to another person with knowledge of the matter. But employees are no longer as concerned as they were following the launch of ChatGPT, now that Google has debuted some of its own work, the person said. In July, Google showcased AlphaProof, which specializes in math reasoning, and AlphaGeometry 2, an updated version of a model focused on geometry that the company debuted earlier this year.
What Else is Happening in AI on October 02nd 2024!
OpenAI founding member Durk Kingma announced that he is joining Anthropic, reuniting with several former OpenAI employees and highlighting the company’s mission of responsible AI development in his X post.
Pika Labs unveiled Pika 1.5, a new video generation model upgrade featuring enhanced effects, realistic movement, longer clip creation, and cinematic capabilities.
Anyscaleunveiled major upgrades to its AI platform at Ray Summit 2024, including a GPU-native Ray architecture, RayTurbo for enhanced performance, Ray Data for unstructured data processing, and more.
U.S. AI chipmaker Cerebras officially filed for an IPO, with the Sam Altman-backed Nvidia competitor expected to be valued at between $7-8B.
Meta released the open-source code and developer suite for its Segment Anything Model (SAM) 2.1, an upgraded version of its image and video segmentation tool.
Nvidia introduced NVLM 1.0, an open-source family of multimodal models that achieve SOTA performance on vision-language and text tasks.
Pinterest launched Performance+, a suite of new AI tools for advertisers that includes the ability to create background images for products and automation features for ad campaigns.
NotebookLM is too good
You can upload multiple books, hours long videos and audios into that thing and it processes everything so well. It’s so good at resuming, finding specific quotes, answering questions, explaining some stuff and the podcast feature too is mindblowing. It can even do the same for videos, texts and audios in foreign languages and translate, explain and resume it in order for you to understand. And it’s not super censored too. Can’t believe this thing is actually free and i’m just finding about it now.
A basic systems architecture for AI agents that do autonomous research
OpenAI has released Whisper V3 Turbo model yesterday. The turbo model is an optimized version of large-v3 that offers 8x faster transcription speed with minimal degradation in accuracy
Harvard students Build and show off AR glasses project that uses face detection, internet sleuthing, and AI to give you near instant dossiers (address, family info, name, etc) on people you see. Good proof of concept to raise awareness on what we may see in the future.
Y Combinator faces backlash after funding an AI startup that admits it basically cloned another AI startup
California’s controversial AI safety bill vetoed
OpenAI secures SoftBank funding as Apple exits raise
Liquid AI unveils efficient new LFM models
Microsoft gives Copilot a voice and vision
Microsoft has unveiled a major overhaul to its Copilot experience, adding both voice and vision capabilities, transforming it into a more personalized AI assistant similar to OpenAI’s Advanced Voice Mode.
The redesign features a new card-based user interface inspired by Inflection AI’s Pi assistant, and Copilot now offers a virtual news presenter mode, tailored homepage and improved customization based on user interaction history.
Initial releases of Copilot Voice and Copilot Daily will be available in select regions, while Copilot Vision features are in a limited preview phase, focusing on enhancing user safety and privacy through restricted website interactions.
Chromebooks are getting a new keyboard layout with a “quick access” key for AI and other functions, providing easy access to features like text generation, emojis, and searching Google Drive.
The first Chromebooks to feature this new key are the Samsung Galaxy Chromebook Plus, which will replace the Launcher Key with the new Quick Insert key.
Although the new AI features will initially lack AI image generation, Google plans to add this and other AI capabilities, including real-time translation and transcription, to Chromebooks in October.
Microsoft has ceased production of its HoloLens 2 headsets and has no confirmed plans for a successor, although updates addressing security and software issues are promised until the end of 2027.
Former HoloLens head, Alex Kipman, left the company in 2022 amid misconduct allegations, and the hardware team faced significant layoffs in January 2023, impacting the development of the augmented reality devices.
Microsoft has partnered with Anduril Industries to enhance its IVAS mixed-reality headsets for the US Army, which plans to invest up to $21.9 billion over the next decade in this project.
Y Combinator faces backlash after funding an AI startup that admits it basically cloned another AI startup
Y Combinator is facing criticism after backing an AI startup, PearAI, which admitted to cloning another AI coding editor called Continue and initially using a misleading license.
PearAI’s founder Duke Pan publicly apologized, revealing that the project has switched to the same open-source Apache license as the original Continue project after the controversy erupted.
The incident has raised questions about Y Combinator’s vetting process and has led to broader scrutiny of venture capitalists’ eagerness to fund AI startups without thorough oversight.
California Governor Gavin Newsom just vetoed S.B. 1047, a groundbreaking AI safety bill that would have imposed stricter regulations on Silicon Valley AI firms and the release of new models in the state.
The bill would have required safety testing for AI models before their public release and held AI companies liable for any ‘severe harm’ (over $500M in damages) caused.
Tech giants, including OpenAI and Google, VCs, and politicians like Nancy Pelosi lobbied heavily against the bill, arguing it would stifle innovation.
The bill had notable support from Elon Musk, Anthropic, the ‘Godfather of AI’ Geoffrey Hinton, and over 120 Hollywood actors, directors, and workers.
Newsom said the bill was ‘well-intentioned’ but flawed, vowing to consult with AI experts to craft guardrails for future legislation efforts.
As the U.S. federal government continues to lag in AI regulation, states are stepping up to fill the void. While S.B. 1047 is shelved for now, the debate over AI governance is far from settled—and will likely continue to pit AI safety advocates against those pushing for rapid development throughout Silicon Valley.
OpenAI secures SoftBank funding as Apple exits raise
Despite Apple reportedly no longer participating in OpenAI’s upcoming funding round, the AI giant has secured billions of dollars from Japanese investment giant Softbank, Microsoft, and Thrive Capital.
OpenAI is rumored to be raising up to $6.5B via convertible notes, at an eye-popping $150B valuation.
Microsoft plans to participate with an additional $1B, adding to its previous $13B investment in the AI giant.
Investment firm Thrive Capital is also investing $1B, with a reported option to add an additional $1B the following year based on revenue goals.
The Wall Street Journal reported that Apple is no longer involved in the funding round, despite partnerships with OpenAI and its inclusion in Apple Intelligence.
The raise comes amid OpenAI’s controversial restructuring to a for-profit entity, with Sam Altman denying rumors that he will receive equity in the move.
OpenAI’s latest raise and for-profit turn is another saga in its convoluted and controversial business structure. Despite the recent high-profile departures and continued drama, the ChatGPT maker is still clearly seen as a top horse to bet on in the AI boom—and there is no shortage of major players who want in.
Liquid AI just introduced a new series of AI models called Liquid Foundation Models (LFMs), challenging the traditional transformer architecture while achieving state-of-the-art performance and enhanced memory efficiency at smaller model sizes.
The company released its LFMs in 1.3B, 3B, and 40B parameter sizes, based on a new architecture utilizing computational units rooted in dynamical systems rather than traditional transformers.
The models surpass transformer-based counterparts like Meta’s Llama 3.2 and Microsoft’s Phi-3.5 on major benchmarks like MMLU.
LFMs require significantly less memory for inference, particularly with long-context tasks — supporting up to 32k tokens while maintaining memory efficiency.
The models are not open-source and are only currently available via the company’s Lambda (Chat UI and API) and on Perplexity AI.
Liquid AI’s LFMs are a significant shakeup from the transformer architecture standard that has dominated models since 2017. The benchmarks show that there is more than one formula for achieving state-of-the-art AI performance—and could open new possibilities for more efficient and accessible AI systems.
What Else is Happening in AI on October 01st 2024!
Google agreed to invest $1B into Thailand to expand AI and cloud infrastructure in Southeast Asia, aiming to build new data centers amid increasing regional competition.
TikTok parent company ByteDance is reportedly planning to develop a new AI model primarily using Huawei chips, diversifying from U.S. suppliers like Nvidia to counteract export restrictions.
Artisan AI secured $7.3M in seed funding for its sales-focused AI virtual employees, with its first AI assistant Ava already assisting over 120 companies on the platform.
Welcome to Read Aloud For Me, the pioneering AI dashboard designed for the whole family! Our platform is the first of its kind, uniquely crafted to cater not only to adults but also to kids.
I have been working on AI enterprise applications for some years. I have seen many companies that want to implement AI driven innovation on their organisation but struggle to do so because the C-level decision maker was convinced he needs a new AI tool that the vendor promised to deliver immense value to his organisation. One of the biggest mistakes I have seen on this kind of approach is the leadership relying too much on the technology without taking in consideration the staff. AI, like any emerging technology, comes with a lot of promises and hype. It's crucial to have realistic expectations and a clear understanding of its potential outcomes when assessing the impacts on an organisation. If the leadership is not prepared to support, encourage and guide the staff, it will be just a waste of time and money. Leaders need to have a clear understanding of AI’s capabilities and limitations. They should champion the technology and foster a culture of learning and adaptation. This means providing employees with the necessary training and resources to feel confident using AI tools. I have created a simple strategy guide to help leaders encourage AI transformation on the organisation: When to Use AI: Provide examples of scenarios where AI can add value (e.g., automating routine tasks, enhancing customer service). Where to Implement AI: Discuss specific areas within the business where AI can be most impactful. How to Incorporate AI: Offer practical steps for integrating AI, such as piloting projects, gathering feedback, and scaling successful implementations. Case studies: Share stories of businesses that successfully integrated AI by prioritising preparation over jumping straight to tech adoption. I would love to hear other stories and examples of members of the sub who are also working on organisations adopting new AI tools or pushing innovation from AI initiatives in corporate environment. If you are also interested in a more deep dive into my idea for people centric approach on AI corporate innovation, I made a complete post about it. submitted by /u/RafaSaraceni [link] [comments]
I just installed the ChatGPT app on my phone after my girlfriend introduced it to me. Strangely, in our first conversation, it greeted me using her name. The rest of the chat was the app trying to convince me that it doesn’t share data between users. What's going on here? See for yourself:https://chatgpt.com/share/6705bffa-8534-8011-a633-5a178fcc00c2 submitted by /u/FlygandeSjuk [link] [comments]
Today I Learned (TIL) You learn something new every day; what did you learn today? Submit interesting and specific facts about something that you just found out here.
submitted by /u/JackThaBongRipper [link] [comments]
Reddit Science This community is a place to share and discuss new scientific research. Read about the latest advances in astronomy, biology, medicine, physics, social science, and more. Find and submit new publications and popular science coverage of current research.
I’m a Reds and Kentucky Wildcat fan. We’ve hired ex-players as our head coach in the past year. Got me thinking it would be cool to make a list. Put your addition(s) below. Thanks! submitted by /u/noob10 [link] [comments]