AI Innovations in October 2024

AI Daily innovations in OCTOBER 2024

Master AI Machine Learning PRO
Elevate Your Career with AI & Machine Learning For Dummies PRO
Ready to accelerate your career in the fast-growing fields of AI and machine learning? Our app offers user-friendly tutorials and interactive exercises designed to boost your skills and make you stand out to employers. Whether you're aiming for a promotion or searching for a better job, AI & Machine Learning For Dummies PRO is your gateway to success. Start mastering the technologies shaping the future—download now and take the next step in your professional journey!

Download on the App Store

Download the AI & Machine Learning For Dummies PRO App:
iOS - Android
Our AI and Machine Learning For Dummies PRO App can help you Ace the following AI and Machine Learning certifications:

AI Innovations in October 2024.

In October 2024, the landscape of artificial intelligence continues to evolve at an unprecedented pace, with groundbreaking innovations and developments emerging daily. The “Daily AI Chronicle” aims to capture the essence of these advancements, providing a comprehensive summary of the latest news and trends in AI technology throughout the month. As we navigate through a month filled with transformative AI breakthroughs, our ongoing updates will highlight significant milestones—from the launch of cutting-edge AI models to the integration of AI in various sectors such as healthcare, finance, and creative industries. With each passing day, AI is reshaping how we interact with technology, enhancing productivity, and redefining our understanding of intelligence itself. Join us as we explore the exciting world of AI innovations, keeping you informed and engaged with the rapid changes set to influence our future. Whether you’re a tech enthusiast, a professional in the field, or simply curious about the implications of AI, this blog will serve as your go-to resource for staying updated on the latest developments throughout October 2024.

AI- Powered Jobs Interview Warmup

AI-Powered Job Interview Prep

A Daily Chronicle of AI Innovations on October 30th  2024

👀 25% of Google’s new code is AI-generated

  • More than 25% of new code at Google is created by artificial intelligence and then validated by engineers, according to CEO Sundar Pichai.
  • This AI-driven approach is boosting efficiency, enabling faster innovation, and contributing significantly to Google’s robust financial performance.
  • Google achieved a revenue of $88.3 billion for the quarter, with significant growth seen in Google Services and Google Cloud, highlighting AI’s impact on profitability.

Source: https://www.theverge.com/2024/10/29/24282757/google-new-code-generated-ai-q3-2024

✨ GitHub’s new tool helps you build apps using plain English

  • GitHub Spark, announced at the GitHub Universe conference, lets users build web apps by describing them in natural language, moving beyond the need for traditional coding.
  • This experimental feature from GitHub Next labs provides a chat-like interface for users to create and refine app prototypes, while experienced developers can optionally access and modify the underlying code.
  • Spark supports advanced customization by allowing users to choose between different AI models, share their projects with specific permissions, and further develop shared code independently.

Source: https://techcrunch.com/2024/10/29/github-spark-lets-you-build-web-apps-in-plain-english

💥 OpenAI is creating its own AI chip with Broadcom and TSMC

  • OpenAI has reportedly assembled a team of about 20 engineers, including former Google TPU designers, to develop an AI chip targeted for 2026.

  • After initially exploring options to build its own chip factories, OpenAI is instead opting to partner with Broadcom for design and TSMC for manufacturing.

  • The company also plans to add AMD’s new MI300X processors to its training infrastructure, reducing reliance on Nvidia’s GPUs.

  • The moves come as OpenAI faces mounting compute costs, with reports suggesting the company could lose $5B this year despite $3.7B in revenue.

💪 Reddit is profitable for the first time ever, with nearly 100 million daily users.

Source: https://www.theverge.com/2024/10/29/24283056/reddit-earnings-user-growth-revenue-up

🧠 MIT’s new cancer treatment is more effective than traditional chemotherapy.

Researchers at the Massachusetts Institute of Technology (MIT) have developed a game-changing dual-action cancer treatment.The innovative approach involves implanting microparticles directly into tumors, providing both phototherapy and chemotherapy.The team believes that the method could potentially reduce the side effects usually associated with intravenous chemotherapy, and improve the patient’s lifespan more than separate treatments would.

Source: https://www.newsbytesapp.com/news/science/mit-develops-dual-action-cancer-therapy-using-implantable-microparticles/story

🛠️ GitHub and Microsoft open Copilot to rival AI models

  • The platform will allow developers to switch between assistants, including Claude and Gemini, although OpenAI’s models remain the default choice.

  • GitHub also introduced Spark, a new feature that allows users to build applications with natural language prompts.

  • The platform announced features including multi-file editing, Copilot code reviews, new agentic updates to Workspaces, and Apple Xcode support.

  • GitHub’s decision to embrace multiple AI providers comes as its Copilot service reaches a major milestone with over a million paying subscribers.

Source: https://github.blog/news-insights/product-news/bringing-developer-choice-to-copilot

🤝 OpenAI plans first custom AI chip

  • OpenAI has reportedly assembled a team of about 20 engineers, including former Google TPU designers, to develop an AI chip targeted for 2026.

  • After initially exploring options to build its own chip factories, OpenAI is instead opting to partner with Broadcom for design and TSMC for manufacturing.

  • The company also plans to add AMD’s new MI300X processors to its training infrastructure, reducing reliance on Nvidia’s GPUs.

  • The moves come as OpenAI faces mounting compute costs, with reports suggesting the company could lose $5B this year despite $3.7B in revenue.

Source: 

🧬 New AI model predicts early drug development

  • The multimodal AI system combines extensive laboratory data with limited clinical information to predict a drug’s potential success early.

  • Enchant sets new accuracy marks for predicting human drug interactions, achieving a 74% correlation compared to the previous 58% SOTA score.

  • The technology can begin making reliable predictions after studying five drug molecules, requiring minimal human trial data to generate insights.

  • Enchant processes multiple types of research data simultaneously, helping bridge the gap between laboratory findings and clinical outcomes.

Source: 

🇺🇸 Thomas Friedman endorses Kamala because he says “AGI is likely in the next 4 years” so we must ensure “superintelligent machines will remained aligned with human values as they use these powers to go off in their own directions.”

r/singularity - Thomas Friedman endorses Kamala because he says "AGI is likely in the next 4 years" so we must ensure "superintelligent machines will remained aligned with human values as they use these powers to go off in their own directions."

😵 Linus Torvalds reckons AI is ‘90% marketing and 10% reality’ | Tom’s Hardware.

Source: https://www.tomshardware.com/tech-industry/artificial-intelligence/linus-torvalds-reckons-ai-is-90-percent-marketing-and-10-percent-reality

 

What Else is Happening in AI on October 30th 2024!

LinkedIn launches its first AI agent to take on the role of job recruiters.

 

Elon Musk predicted at the Future Investment Initiative conference that by 2040, there will be at least 10B humanoid robots priced between $20 and $25K.

Amazon expanded the company’s Rufus AI shopping assistant in beta to European markets, offering personalized product recommendations and comparison capabilities through conversational interactions in the mobile app.

OpenAI launched new search capabilities for ChatGPT history, allowing users to easily reference, navigate, or revisit old conversations.

Elon Musk’s xAI is reportedly seeking a new funding round that would value the AI startup at $40B, a significant jump from its $24B valuation following a raise in May.


AI Unraveled: Demystifying Frequently Asked Questions on Artificial Intelligence (OpenAI, ChatGPT, Google Gemini, Generative AI, Discriminative AI, xAI, LLMs, GPUs, Machine Learning, NLP, Promp Engineering)

Google CEO Sundar Pichai revealed that the company’s multimodal, agentic smartphone app Project Astra, which was demoed at Google I/O, is expected to be available ‘as early as 2025.’

Actor Robert Downey Jr. criticized the use of AI digital replicas in Hollywood, saying he ‘intends to sue all future executives that recreate his likeness,’ even after his death.

A Daily Chronicle of AI Innovations on October 29th  2024

Listen to this podcast at https://podcasts.apple.com/ca/podcast/ai-daily-chronicle-apple-unveils-first-wave-of-apple/id1684415169?i=1000674949261

🍎 Apple unveils first wave of Apple Intelligence features

  • The initial release brings systemwide writing tools for rewriting, proofreading, and summarizing text, as well as enhanced photo search capabilities.

  • A redesigned Siri features new typing support, better context understanding, and upgraded product knowledge to answer questions about Apple devices.

  • Only newer devices with the M1 / A17 Pro chips or later can access the AI features, with some users also facing a waitlist system after opting in.

  • The next update, expected in December, will include more advanced features like ChatGPT integration, Image Playground, and Genmoji.

u/enoumen - Today in Ai and Machine Learning: 🍎 Apple unveils first wave of Apple Intelligence features 🤖 Open-source AI must disclose data used for training, says OSI 🔎 Meta builds AI Google Search rival 📈 Medium faces surge in AI-generated content 💻 xAI’s Grok chatbot gains vision capabilities…

🤖 Open-source AI must disclose data used for training, says OSI:

🔎 Meta builds AI Google Search rival

Meta is developing proprietary web crawling tech to power its AI’s real-time knowledge of current events and web info without relying on competitors.

  • Internal teams have reportedly been quietly building the search infrastructure since early 2024.

  • Meta also recently partnered with Reuters for news content, suggesting a broader strategy to control its AI information sources.

  • The development comes as Meta AI reaches 185M weekly active users across Facebook, Instagram, and WhatsApp.

📈 Medium faces surge in AI-generated content

  • Medium has experienced difficulties with AI-generated content, with an analysis estimating over 47% of posts as AI-generated, marking a significantly greater prevalence than the wider internet.

  • Specific topics like “NFTs,” “web3,” and “ethereum” showed high percentages of AI-driven content, with one tag reaching around 78%, reflecting a substantial infiltration of automated writing in these areas.

  • Two separate AI detection companies found similar high rates of AI-written content, yet Medium’s CEO, Tony Stubblebine, downplays concerns about the presence and significance of such content on the platform.

🎶 UMG, Klay Vision partner on ‘ethical’ AI music model:

  • The partnership aims to create AI music models that ‘lessen the threat to human creators’ and open ‘new avenues for creativity and future monetization.’

  • Klay Vision is actively working on a Large Music Model called KLayMM for commercial use that respects copyright and artist likeness rights.

  • Klay Vision is led by former Sony Music and Google DeepMind execs, with the partnership following past AI deals with YouTube’s AI Incubator and SoundLabs.

  • The deal comes as UMG continues legal action against AI companies like Anthropic, Suno, and Udio for alleged unauthorized use of copyrighted material.

. 📈 OpenAI CFO: 75% of revenue from ChatGPT subscriptions:

  • The Open Source Initiative (OSI) has defined “open” AI as systems that provide complete access to training data, source code, and training settings, posing challenges for tech companies like Meta.

  • Meta’s model Llama does not meet OSI’s standards as it restricts commercial use and does not offer training data, leading to disagreements with OSI’s new open AI definition.

  • This definition aims to prevent “open washing” by companies and has sparked discussions on AI openness, with industry leaders like Hugging Face supporting the emphasis on transparency in training data.

👀 Hollywood union SAG-AFTRA signs deal for voice AI models:

Hollywood union SAG-AFTRA signed a deal with AI company Ethovox to build a foundational voice model for digital replicas, ensuring performer compensation through session fees and revenue sharing.

💻 xAI’s Grok chatbot gains vision capabilities

xAI’s Grok chatbot gained new vision capabilities, with Elon Musk sharing an example of the AI model breaking down a joke after being given a meme as input.

🔍 Meta is developing its own AI search engine

🤖 Google is working on an AI agent that takes over your browser

 

New article says AI teachers are better than human teachers. Quote: “Students who were given access to an AI tutor learned more than twice as much in less time compared to those who had in-class instruction.”

From this article dated 10-29-2024: AI tutors are reshaping higher education

💪 AI and Machine Learning For Dummies Pro

Djamgatech has launched a new educational app on the Apple App Store, aimed at simplifying AI and machine learning for beginners.

It is a mobile App that can help anyone Master AI & Machine Learning on the phone!

Download “AI and Machine Learning For Dummies PRO” FROM APPLE APP STORE and conquer any skill level with interactive quizzes, certification exams, & animated concept maps in:

  • Artificial Intelligence

  • Machine Learning

  • Deep Learning

  • Generative AI

  • LLMs

  • NLP

  • xAI

  • Data Science

  • AI and ML Optimization

  • AI Ethics & Bias ⚖️

& more! ➡️ App Store Link

AI and Machine Learning For dummies PRO
AI and Machine Learning For dummies PRO

A Daily Chronicle of AI Innovations on October 28th  2024

Listen at: https://podcasts.apple.com/us/podcast/ai-unraveled-latest-ai-news-trends-gpt-chatgpt-gemini/id1684415169

🔍 Meta is developing its own AI search engine:

  • Meta is creating its own web-crawling search engine to enhance the information provided by its AI chatbot, as reported by The Information.

  • This move aims to lessen Meta’s reliance on Google and Microsoft’s Bing, which currently supply data about news, sports, and stocks for Meta AI users.

  • Following the announcement, shares of Google owner Alphabet Inc. declined by 0.8%, while Meta’s shares experienced a slight increase of 0.3%.

🤖 Google is working on an AI agent that takes over your browser

  • Google is working on Project Jarvis, an AI agent that can browse the web for users, acting as an automated personal assistant with its capabilities integrated into Google Chrome.

  • According to a report by The Information, this AI could be introduced alongside Google’s next flagship Gemini language model, possibly being previewed to a small group of testers by December.

  • Similar to Anthropic’s Claude AI improvements, Jarvis AI responds to user commands by interacting with computer screens through tasks like clicking buttons or typing, though currently operates at a slower pace.

🎙️ Meta releases an ‘open’ version of Google’s podcast generator

  • Meta has introduced NotebookLlama, an open version of Google’s NotebookLM podcast generator, utilizing Meta’s Llama models for processing input texts into podcast-style content.

  • NotebookLlama transforms uploaded text files like PDF news articles into transcripts, adds dramatization, and uses open-source text-to-speech models, but struggles with a robotic audio output.

  • The quality of NotebookLlama’s output could improve with more advanced text-to-speech models, but AI-generated podcasts, including this one, still face issues with generating inaccurate information.

🤖Google’s ‘Jarvis’ browser assistant is coming

Jarvis will initially focus on consumer tasks like online shopping, research, and travel booking.

  • The agent is specifically optimized for web browsers (not full computer use) and reportedly currently operates with a few-second delay between actions.

  • The release is expected to coincide with Google’s launch of its next-gen Gemini AI model before the end of the year.

 

🧐 Altman calls ‘Orion’ frontier model rumors ‘fake news’

  • report revealed that OpenAI would release its new ‘Orion’ frontier model by December, with Microsoft and other huge companies getting access before individuals.

  • Altman responded directly to the report on X, posting “fake news out of control” directly to The Verge. 

  • An OpenAI spokesperson clarified that they have no plans for an “Orion” release this year but plan to release “a lot of other great technology.”

  • However, Altman previously tweeted a cryptic message about being ‘excited for the winter constellations to rise soon,’ fueling additional speculation.

💻 IBM’s most compact AI models target enterprises

Designed to give enterprises more ways to embed and scale AI in their businesses, these new 2B and 8B compact models are:

  • Trained with carefully curated data;

  • Cost-efficient;

  • Designed to run high-performance solutions.;

🏥 AI transcripts create dangerous errors

  • A Michigan researcher found fabricated text in 80% of examined transcriptions, while another reported hallucinations in ‘nearly every’ Whisper output.

  • Hallucinations ranged from non-existent medical treatments to racial commentary and violent content.

  • Over 30,000 medical professionals use Whisper-based tools despite OpenAI’s warnings against high-risk applications, according to the AP report.

  • Whisper was also the most popular open-source speech model according to Hugging Face, with over 4.2M downloads in the last month alone.

u/enoumen - Today in AI and Machine Learning: 🔍 Meta is developing its own AI search engine   🤖Google is working on an AI agent that takes over your browser     🤖Google’s ‘Jarvis’ browser assistant is coming 🏥 AI transcripts create dangerous errors

 

👀 Grok now has vision capability

Elon Musk’s AI platform, Grok, introduces visual processing features, allowing the model to interpret images as well as text.

🌍 US National Security Advisor on AI:

Jake Sullivan emphasizes that the U.S. must rapidly advance AI development to remain competitive globally, highlighting high stakes in international AI leadership.

💪 Djamgatech release – AI and Machine Learning For Dummies Pro app:

Djamgatech has launched a new educational app on the Apple App Store, aimed at simplifying AI and machine learning for beginners.

It is a mobile App that can help anyone Master AI & Machine Learning on the phone!

Download “AI and Machine Learning For Dummies PRO” FROM APPLE APP STORE and conquer any skill level with interactive quizzes, certification exams, & animated concept maps in:

  • Artificial Intelligence

  • Machine Learning

  • Deep Learning

  • Generative AI

  • LLMs

  • NLP

  • xAI

  • Data Science

  • AI and ML Optimization

  • AI Ethics & Bias ⚖️

& more! ➡️ App Store Link

What Else is Happening in AI on October 28th 2024

The AI Bill of Rights with Section & the White House’s Dr. Alondra Nelson. How do we ensure a future of ethical AI development? RSVP free.*

Perplexity CEO Aravind Srinvas revealed in a post on X that the AI search platform now handles over 100M weekly queries.

Meta landed its first AI news deal, partnering with Reuters to provide real-time news responses through its AI chatbot across the company’s Facebook, Instagram, WhatsApp, and Messenger platforms.

Coinbase launched ‘Based Agent,’ a tool allowing users to create AI-powered crypto trading bots with on-chain capabilities in under three minutes using OpenAI and Replit integration.

Disney is reportedly preparing to unveil a major AI initiative focused on post-production and VFX workflows, which will mark the content giant’s first major embrace of the tech.

Meta also released NotebookLlama, an open-source version of Google’s NotebookLM that converts PDFs into podcasts using text-to-speech technology.

A Daily Chronicle of AI Innovations on October 25th  2024

🤖 OpenAI plans to release its next big AI model by December

💻 Anthropic’s AI can now run and write code

💰 Apple offers $1M bounty for hacking its private AI cloud

📷 Google Photos will now label AI-edited images

📰 Meta signs its first big AI deal for news

🎨 Midjourney launches new image editor

😵 OpenAI disbands AGI Readiness team

🇺🇸 Biden orders AI push with new security safeguards

🤖 OpenAI plans to release its next big AI model by December

  • OpenAI plans to unveil its next significant AI model, Orion, by December, prioritizing initial access to partner companies instead of a broad release through ChatGPT.
  • Internally viewed as the successor to GPT-4, Orion may be hosted on Azure by November, but its naming and release details remain uncertain and subject to change.
  • This release coincides with OpenAI’s transition into a for-profit entity, highlighted by a $6.6 billion funding round and notable changes in its executive team.
  • Source: https://www.theverge.com/2024/10/24/24278999/openai-plans-orion-ai-model-release-december

💻 Anthropic’s AI can now run and write code

  • Anthropic has introduced a JavaScript code sandbox to its Claude AI, allowing users to conduct complex data analysis within the chat interface.
  • This new feature lets teams across various departments analyze data, including marketing teams gaining insights, sales teams evaluating metrics, and developers creating financial dashboards.
  • The Claude 3.5 Sonnet model, which supports these capabilities, has enhanced programming performance, outperforming other models in benchmarks like SWE-Bench and TAU-Bench scores.
  • Source: https://the-decoder.com/anthropics-claude-ai-can-now-crunch-numbers-and-visualize-data-with-built-in-code-editor/

💰 Apple offers $1M bounty for hacking its private AI cloud

  • Apple is encouraging security analysts to examine the Private Cloud Compute system that handles complex Apple Intelligence requests as part of its efforts to ensure system privacy.
  • The tech giant’s bug bounty program now includes rewards up to $1,000,000 for detecting vulnerabilities in PCC, underpinning its commitment to handling data privacy seriously.
  • Initial Apple Intelligence features are launching soon with iOS 18.1, while future enhancements like Genmoji and ChatGPT integration appeared in the iOS 18.2 developer beta.
  • Source: https://www.theverge.com/2024/10/24/24278881/apple-intelligence-bug-bounty-security-researchers-private-cloud-compute

📷 Google Photos will now label AI-edited images

  • Google Photos is adding a new disclosure for images edited with its AI features, like Magic Editor, visible in the “Details” section of the app starting next week.
  • Despite Google’s aim for transparency, the AI-edited photos will not have visual watermarks, making it difficult to immediately recognize them as altered unless users check the metadata.
  • These changes follow criticism Google faced for incorporating AI editing tools without overt visual indicators, and similar metadata tagging will be used for non-AI features like Best Take.
  • Source: https://techcrunch.com/2024/10/24/google-adds-new-disclosures-for-ai-photos-but-its-still-not-obvious-at-first-glance/

📰 Meta signs its first big AI deal for news

  • Meta has signed a multi-year agreement with Reuters to incorporate Reuters reporting into its AI chatbot for responding to news-related questions, marking a first for the company in licensing news content.
  • The use of Reuters content in the AI chatbot, which is available on Facebook, Instagram, WhatsApp, and Messenger, will include summaries and links to Reuters articles, with US users seeing links starting Friday.
  • This development follows a trend of news organizations partnering with AI firms, though Meta simultaneously challenges laws requiring payment to news publishers for their content on social media platforms.
  • Source: https://www.theverge.com/2024/10/25/24279259/meta-reuters-ai-chatbot-deal-news-licensing-media

What Else is happening in AI on October 25th 2024!

AI chipmaker TSMC’S Phoenix plant reported superior chip yields compared to its Taiwan operations, boosting confidence in America’s domestic semiconductor strategy.

Anthropic unveiled Claude’s new built-in analysis tool, enabling its models to write and execute code directly in chat interactions.

Apple launched a $1M bug bounty ahead of its major AI cloud release next week, offering rewards to security researchers who can successfully hack and find vulnerabilities in its private AI infrastructure.

ElevenLabs added ‘Voice Design,’ a new feature enabling users to create AI-generated voices from natural text prompts.

OpenAI scientist Noam Brown revealed at TED AI that giving AI models 20 seconds to “think” can match the performance boost of scaling up training data 100,000x.

Ace the Microsoft Azure Fundamentals AZ-900 Certification Exam: Pass the Azure Fundamentals Exam with Ease

Chinese robotics startup EngineAI just introduced SE01, a life-size humanoid robot that has a much more human-like gait to its walk.

Redditors Are Trying to Poison Google’s AI to Keep Tourists Out of the Good Restaurants. Source: https://gizmodo.com/redditors-are-trying-to-poison-googles-ai-to-keep-tourists-out-of-the-good-restaurants-2000516156

Google’s DeepMind is building an AI to keep us from hating each other. Source: https://arstechnica.com/ai/2024/10/googles-deepmind-is-building-an-ai-to-keep-us-from-hating-each-other/

A Daily Chronicle of AI Innovations on October 23rd  2024

🖥️ Anthropic’s new AI can use computers like a human

🚀 Elon Musk’s xAI launches API for Grok

🤖 Reddit CEO says the platform is in an ‘arms race’ for AI training

⚖️ Major publishers sue Perplexity AI for scraping without paying

📸 Meta is testing facial recognition to fight celebrity scams

🧠 Lab-grown human brain cells drive virtual butterfly in simulation

🖥️ Anthropic’s AI now navigates computers like a human

Anthropic just introduced a new capability called ‘computer use’, alongside upgraded versions of its AI models, which enables Claude to interact with computers by viewing screens, typing, moving cursors, and executing commands.

  • Claude can now autonomously navigate computer interfaces, performing complex tasks across multiple applications and websites.

  • Anthropic said it taught the model ‘general computer skills’ instead of creating a standalone tool, helping it operate more like a human.

  • The upgraded Sonnet 3.5 significantly improves coding and tool use, outperforming other models (including o1-preview) on key benchmarks.

  • A new Haiku 3.5 model matches the capabilities of previous high-end models at lower cost and higher speed.

  • Anthropic highlighted that computer use is still imperfect (including some hilarious examples), encouraging testing on low-risk tasks until skills improve.

While many hoped for Opus 3.5, Anthropic’s Sonnet and Haiku upgrades pack a serious punch. Plus, with the new computer use embedded right into its foundation models, Anthropic just sent a warning shot to tons of automation startups—even if the capabilities aren’t earth-shattering… yet.

Source: https://techcrunch.com/2024/10/22/anthropics-new-ai-can-control-your-pc/

🚀 Elon Musk’s xAI launches API for Grok

  • Elon Musk’s AI venture, xAI, has launched an API featuring its flagship generative AI model, Grok, but currently, it only includes the basic “grok-beta” version for use.
  • The pricing for xAI’s API is set at $5 per million input tokens and $15 per million output tokens, with each token representing a small data segment like a syllable.
  • xAI is racing to compete with AI giants such as OpenAI, utilizing X’s data for training and aiming to integrate Musk’s different companies’ data to enhance technological advancements.
  • Source: https://techcrunch.com/2024/10/21/xai-elon-musks-ai-startup-launches-an-api/

🎥 Genmo drops open-source AI video model

AI startup Genmo just launched Mochi 1, a new open-source video generation model that claims to rival closed competitors like Runway, Pika, and Kling — while being freely available to developers and researchers.

  • Mochi is built on a new 10B parameter architecture called AsymmDiT, making it the largest open-source video generation model ever released.

  • The model focuses heavily on motion quality and prompt adherence, generating 480p videos at 30fps for up to 5.4 seconds.

  • Mochi surpassed top models like Kling, Runway Gen-3, Luma’s Dream Machine, and Pika in motion quality and prompt adherence during testing.

  • A higher-definition version, Mochi 1 HD, with 720p support and image-to-video capabilities, is planned for release later this year.

  • Genmo also announced that it secured $28.4M in Series A funding, with Mochi-1 being the company’s first step toward building ‘world simulators.’

Open-source AI video is officially competing with the top of the market. Genmo’s Mochi is an extremely impressive release that showcases how competitive the video generation landscape is about to become — especially with the major dominos (Sora, Midjourney?) still to come.

Source: https://www.genmo.ai/blog

If you are looking for an all-in-one solution to help you prepare for the AWS Cloud Practitioner Certification Exam, look no further than this AWS Cloud Practitioner CCP CLF-C02 book

🤖 Reddit CEO says the platform is in an ‘arms race’ for AI training

  • Reddit CEO Steve Huffman stated that the platform is a vital player in the AI “arms race,” emphasizing its role in providing high-value training data for artificial intelligence development.
  • The platform’s extensive user-generated content has become crucial in shaping AI models, leading Reddit to explore its strategic position within the artificial intelligence sector.
  • In response to large corporations utilizing Reddit data without proper agreements, Huffman revealed ongoing efforts to secure deals and safeguard the platform’s valuable information against exploitation.
  • Source: https://www.businessinsider.com/reddit-ceo-platform-arms-race-ai-training-steve-huffman-2024-10

⚖️ Major publishers sue Perplexity AI for scraping without paying

  • Major publishers Dow Jones & Co and NYP Holdings have filed a lawsuit against AI search engine startup Perplexity for copying their content without compensation, alleging copyright infringement and trademark violations.
  • News Corporation, representing The Wall Street Journal and New York Post, accuses Perplexity of presenting the scraped material as a substitute for original sources, consequently harming the brands and sometimes providing inaccurate information.
  • News Corp seeks $150,000 for each infringement instance, a sum that could financially devastate Perplexity, highlighting the importance of protecting intellectual property while also showing a willingness to license content for appropriate fees, as demonstrated by their agreement with OpenAI.
  • Source: https://www.theregister.com/2024/10/22/publishers_sue_perplexity_ai/

📸 Meta is testing facial recognition to fight celebrity scams

  • Meta is testing facial recognition technology to combat ‘celeb-bait’ scam ads by comparing ad images against celebrities’ profile pictures on Facebook and Instagram.
  • Facial recognition is also being explored as a faster method for users to regain account access through video selfies, providing an alternative to traditional ID verification methods.
  • While the tests show promising results, they are not yet being conducted in the U.K. or the EU, due to stringent data protection regulations in these regions.
  • Source: https://techcrunch.com/2024/10/21/meta-tests-facial-recognition-for-spotting-celeb-bait-ads-scams-and-easier-account-recovery/

🧠 Lab-grown human brain cells drive virtual butterfly in simulation

  • Researchers at FinalSpark have created a 3D simulation where a virtual butterfly is guided by lab-grown human brain cells, marking a significant advancement in biocomputing and cognitive technologies.
  • The brain organoids, which are miniature brains grown from stem cells, respond to human input in a virtual setting, allowing the butterfly model to move in response to stimuli through a Python software framework.
  • These biological neural networks promise advantages like lower energy consumption and advanced cognitive functions, though they currently require traditional computing infrastructure support, with potential ethical questions regarding consciousness and usage implications.
  • Source: https://www.theregister.com/2024/10/22/human_brain_tissue_butterfly_simulation/

Can A.I. Be Blamed for a Teen’s Suicide?

The mother of a 14-year-old Florida boy says he became obsessed with a chatbot on Character.AI before his death.
Source: https://www.nytimes.com/2024/10/23/technology/characterai-lawsuit-teen-suicide.html

NVIDIA’s Multi-Agent AI Breakthrough Transforms Sound-to-Text Technology

NVIDIA’s innovative multi-agent AI system improves sound-to-text technology and improves performance in the DCASE 2024 AAC Challenge with GPU-accelerated processing and multi-encoder fusion.

Source: https://theaiwired.com/nvidias-multi-agent-ai-breakthrough-transforms-sound-to-text-technology/

Meta AI (FAIR): Introducing the Dualformer. Controllable Fast & Slow Thinking by Integrating System-1 And System-2 Thinking Into AI Reasoning Models

Notebook lm version:
https://notebooklm.google.com/notebook/17738361-48f9-48aa-a8e4-5545027519f6/audio

OpenAI, under pressure from Anthropic, is developing new products to automate complex software programming tasks.

What is Predictive Analytics?

 

Predictive analytics uses data, statistical algorithms, and machine learning techniques to identify the likelihood of future outcomes based on historical data. Unlike traditional analytics, which focus on what has happened, predictive analytics provides actionable insights into what will likely occur. It can mean anything from predicting customer behavior to anticipating business market trends.

How AI-Powered Predictive Analytics Drives Business Growth

Read: https://stellarmind.ai/blog/business-growth-with-ai-powered-predictive-analytics

🎨 Ideogram debuts AI Canvas workspace

Ideogram just unveiled a new AI-powered workspace called Canvas, introducing advanced tools like Magic Fill and Extend to combine image editing and generation for new creative workflows.

  • Canvas provides an endless digital board on which users can generate, organize, and seamlessly blend AI-generated and uploaded images.

  • Magic Fill allows precise editing of selected image areas, enabling tasks like object replacement, text addition, and background alteration.

  • The Extend feature expands images beyond their original dimensions while maintaining style consistency, even with text.

  • Ideogram also features an API, allowing developers to incorporate the new features into their own applications

The design industry is no stranger to AI tools (Photoshop, Canva) — but Ideogram’s latest release feels like the exact type of fastball that AI and design novices can really make magic with. The examples shown also illuminate how drastically creative workflows are changing in the AI era.

Source: https://docs.ideogram.ai/using-ideogram/ideogram-features/canvas

What Else is Happening in AI on October 23rd 2024!

Runway debuted Act-One, a new feature that generates expressive character performances from a single video and image without motion capture or rigging.

Stability AI released Stable Diffusion 3.5, featuring Large and Large-Turbo models that improve customization, efficiency, and diversity of outputs.

Cohere enhanced its Embed 3 model with multimodal capabilities, enabling enterprises to perform RAG-style searches across text and image content.

Chipotle launched a new conversational AI hiring platform called ‘Ava Cado,’ which the restaurant says can accelerate the hiring process by up to 75%.

Asana introduced AI Studio, a no-code platform for teams to design and deploy AI agents to automate business workflows.

Canva unveiled Dream Lab, a new image generator powered by Leonardo AI — alongside a series of new AI features added to the platform’s Visual Suite.

Inflection AI launched Agentic Workflows, enabling its enterprise systems to take trusted actions for various business use cases.

Latest AI Tools:

AI and Machine Learning For Dummies: Your Comprehensive ML & AI Learning Hub

This App offers Interactive simulations and visual learning tools to make AI/ML accessible. Explore neural networks, gradient descent, more through hands-on experiments
This App offers Interactive simulations and visual learning tools to make AI/ML accessible. Explore neural networks, gradient descent, more through hands-on experiments
 

Discover the ultimate resource for mastering Machine Learning and Artificial Intelligence with the “AI and Machine Learning For Dummies” app.

iOs (FREE with Ads): https://apps.apple.com/ca/app/machine-learning-for-dummies/id1611593573

PRO Version (No ADS, See All Answers, AI Simulators): https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211

What you can do with this App:

  1. 🚀 Learn AI interactively! Tweak models, code exercises, visualize concepts, & tackle projects. Perfect for beginners to master AI/ML easily.

  2. 🎓 AI & ML made easy! Hands-on coding, visual tools, and real-world examples. Engage with fun, interactive learning & community support.

  3. 🤖 Master AI step-by-step! Practice coding, explore simulations, & see real-time changes. Fun, interactive tools simplify complex AI concepts.

  4. 🌟 AI learning simplified! Interactive models, coding challenges, flashcards & real-world projects. Visualize & build your own AI models.

  5. 💡 Explore AI with real-time simulations! Watch neural networks in action & learn by tweaking parameters. Coding & visual tools make it easy.

  6. 📚 Learn AI the hands-on way! Code exercises, visual tools, & interactive simulations. Fun, engaging, and perfect for all skill levels.

  7. 🏆 Interactive AI education! Tackle coding, visual tools, real-world projects, & fun challenges. Earn badges & climb the leaderboard.

  8. 🔍 See AI in action! Tweak parameters & watch real-time effects. Coding & visual tools make learning neural networks & ML concepts easy.

  9. 🧠 Your AI guide! Visualize, code, & build models with interactive tools. Learn at your pace & join a supportive community.

  10. 🎓 Hands-on AI learning! Practice coding, see concepts visually, and learn through real-world projects. Fun, engaging, and easy to follow.

A Daily Chronicle of AI Innovations on October 21st  2024

💥 TikTok owner fires intern for AI sabotage

🧠 AI reaches expert level in medical scans

🧑‍💻 Microsoft unveils new autonomous AI agents that can handle queries.

🕵️ Anthropic unveils new evaluations for AI sabotage risks

🍎 Tim Cook defends Apple coming late to AI with four words

🌍 Meta releases new AI models for voice and emotions

🚀 Microsoft CEO Satya Nadella says computing power is now doubling every 6 months, as the Scaling Laws paradigm has taken over from Moore’s Law, and the new currency is tokens per dollar per watt.

🦾 OpenAI’s Noam Brown says the o1 model’s reasoning at math problems improves with more test-time compute and “there is no sign of this stopping”

🧠 AI reaches expert level in medical scans

Researchers at UCLA just developed SLIViT, a new AI model that can analyze complex 3D medical scans with expert-level accuracy in a fraction of the time required by human specialists.

  • SLIViT (SLice Integration by Vision Transformer) can efficiently analyze various 3D imaging types, including MRIs, CT scans, and ultrasounds.

  • The model matches clinical expert accuracy while reducing analysis time by a mind-blowing factor of 5,000.

  • Unlike other AI models, SLIViT requires only hundreds of training samples, making it more practical for real-world applications.

  • The framework leverages transfer learning, using prior knowledge from 2D medical data for efficient training with smaller 3D datasets.

With the growing demand for faster diagnostics, SLIViT’s ability to rapidly and accurately analyze imaging offers a potential game-changer for healthcare. The model’s ability to work with small datasets also makes it more accessible for providers with limited resources — potentially democratizing expert medical imaging.

Source: https://www.uclahealth.org/news/release/new-ai-model-efficiently-reaches-clinical-expert-level

🚀 Meta reveals new AI models, tools

Meta FAIR just introduced a collection of new research models and datasets, including an upgraded image segmentation tool, a cross-modal language model, solutions to accelerate LLM performance, and more.

  • Spirit LM is an open-source multimodal language model that integrates speech and text to generate more natural-sounding and expressive speech.

  • Meta’s SAM 2.1 update offers improved image and video segmentation on its popular predecessor, which saw over 700,000 downloads in 11 weeks.

  • Layer Skip provides an end-to-end solution for accelerating LLM generation times by nearly 2x without specialized hardware.

  • Other artifacts include SALSA for security testing, Meta Lingua for language model training, a synthetic data generation tool, and more.

Meta continues to push the AI bar forward with big releases across various areas. Given the company’s impressive open-source systems, it’s hard to envision a future where closed models and tools have a significant advantage — and the moat between the two seems to be shrinking with each release.

Source: https://ai.meta.com/blog/fair-news-segment-anything-2-1-meta-spirit-lm-layer-skip-salsa-lingua

💻 IBM’s most compact AI models target enterprises

Meet IBM’s new third generation of Granite with new open, compact, and efficient 2B and 8B language models.

Designed to give enterprises more ways to embed and scale AI in their businesses, these new 2B and 8B compact models are:

  • Trained with carefully curated data;

  • Cost-efficient;

  • Designed to run high-performance solutions.;

Source: https://www.ibm.com/granite

🕵️ Anthropic unveils new evaluations for AI sabotage risks

Anthropic just published a set of new evaluations aimed at detecting potential sabotage capabilities in advanced AI systems, focusing on risks that could arise if models attempt to subvert human oversight or decision-making.

  • Four new evaluations were developed: human decision sabotage, code sabotage, sandbagging (hiding capabilities), and undermining oversight.

  • The evaluations use mock scenarios to test models’ ability to manipulate and deceive humans, insert bugs into code, and undermine monitoring systems.

  • Tests were run on Claude 3 Opus and Claude 3.5 Sonnet models, which did not flag concerning results but showed the capability to sabotage.

  • Anthropic is open-sourcing the evaluations and said stronger anti-sabotage mitigation will be needed as AI continues to improve.

Anthropic’s research shows that AI isn’t very good at sabotaging humans… yet. But the capabilities are there in some capacity — and if the model acceleration continues like many think it will, it’s only a matter of time before these threats will be real and important to mitigate.

Source: https://assets.anthropic.com/m/377027d5b36ac1eb/original/Sabotage-Evaluations-for-Frontier-Models.pdf

💥 TikTok owner fires intern for AI sabotage

  • ByteDance dismissed an intern for allegedly disrupting an AI project by “maliciously interfering” with the training of artificial intelligence models in August.
  • The company stated the intern’s actions did not affect its official commercial products or AI technology, countering exaggerated rumors about significant disruptions circulating online.
  • ByteDance informed the intern’s university and industry associations about the misconduct as rumors continued amidst broader scrutiny over generative AI safety and social media impacts.
  • Source: https://www.theguardian.com/technology/2024/oct/21/tiktok-owner-bytedance-sacks-intern-for-allegedly-sabotaging-ai-project

🍎 Tim Cook defends Apple coming late to AI with four words 

  • Tim Cook acknowledges that Apple is not the first in AI development but emphasizes that the goal is to deliver the best AI experience for customers.
  • The initial release of Apple Intelligence on October 28 is expected to be minimalistic compared to competitors like Google’s Gemini, with advanced features possibly available by 2025.
  • Apple plans to incorporate ChatGPT into iPhones and select iPads, focusing on device security and user consent for utilizing AI capabilities like text summarization and priority notifications.
  • Source: https://gizmodo.com/tim-cook-knows-apple-isnt-first-in-ai-but-says-its-about-being-the-best-2000514347

🎧 Apple’s AirPods Pro hearing health features are as good as they sound 

  • Apple’s AirPods Pro 2 are set to include new features like clinical-grade hearing aid capabilities, a hearing test, and enhanced hearing protection, with the release of iOS 18.1 potentially boosting hearing health awareness.
  • The new hearing protection mode is a subtle yet impactful upgrade, but there are limitations in extreme noise environments, which might make traditional earplugs still necessary for certain users.
  • While the hearing aid feature is impressive, it may not suit everyone due to its six-hour battery life and limitations for those with severe hearing loss, but it signals a promising shift in tech addressing real-world health needs.
  • Source: https://www.theverge.com/24275178/apple-airpods-pro-hearing-aid-test-protection-preview

This new Linear-complexity Multiplication (L-Mul) algorithm can reduce energy costs by 95% for element-wise tensor multiplications and 80% for dot products in large language models, while maintaining or even improving precision compared to 8-bit floating point operations.

r/singularity - This new Linear-complexity Multiplication (L-Mul) algorithm can reduce energy costs by 95% for element-wise tensor multiplications and 80% for dot products in large language models, while maintaining or even improving precision compared to 8-bit floating point operations.

Link to paper: Addition is All You Need for Energy-efficient Language Models

Link to twitter thread with insights: Rohan Paul on X

from twitter thread:

Solution in this Paper:

  • Approximates floating-point multiplication using integer addition

  • Linear O(n) complexity vs O(m^2) for standard floating-point multiplication

  • Replaces tensor multiplications in attention mechanisms and linear transformations

  • Implements L-Mul-based attention mechanism in transformer models

Key Insights from this Paper :

  • L-Mul achieves higher precision than 8-bit float operations with less computation

  • Potential 95% energy reduction for element-wise tensor multiplications

  • 80% energy reduction for dot products compared to 8-bit float operations

  • Can be integrated into existing models without additional training

Google AI – “Announcing CT Foundation, a new medical imaging embedding tool that accepts a computed tomography (CT) volume as input and returns a small, information-rich numerical embedding that can be used to rapidly train models.”

Source: https://research.google/blog/taking-medical-imaging-embeddings-3d/

Latest AI Tools:

Create mind maps with AI: a simple Next.js project that lets users generate and interact with mind maps for learning, using AI models from Ollama or OpenAI, with options to download as markdown. 

Source: https://github.com/aotakeda/learn-thing

Artificial Intelligence and Machine Learning For Dummies: This App offers Interactive simulations and visual learning tools to make AI/ML accessible. Explore neural networks, gradient descent, more through hands-on experiments.

This App offers Interactive simulations and visual learning tools to make AI/ML accessible. Explore neural networks, gradient descent, more through hands-on experiments
This App offers Interactive simulations and visual learning tools to make AI/ML accessible. Explore neural networks, gradient descent, more through hands-on experiments

A Daily Chronicle of AI Innovations on October 18th  2024

👀 Cracks appear in Microsoft and OpenAI partnership

🎧 Google’s AI podcast generator gets major updates

🔒 X updates privacy policy to allow third parties to train AI models

💵 US Treasury uses AI to recover billions from fraud

🤖 Newton AI learns physics from scratch

📓 NotebookLM launches business pilot

👁️ Worldcoin unveils next-gen eye scanner

🤖 Newton AI learns physics from scratch

Archetype AI just unveiled ‘Newton,’ a new foundational AI ‘Large Behavior Model’ that learns complex physics principles directly from raw sensor data, without any human guidance.

  • Newton ingests raw sensor measurements to build its understanding of physical phenomena without pre-programmed knowledge.

  • The model can accurately predict behaviors of systems it wasn’t explicitly trained on, like pendulum motion.

  • It outperformed specialized AI in tasks like forecasting citywide power consumption and discovering systems from data instead of training.

  • Archetype AI was founded by ex-Google researchers and has secured $13M in funding to date

Newton is a paradigm shift in AI’s interaction with the physical world. A single model could replace highly specialized systems by developing a generalized understanding rather than a narrow focus. The tech also opens the door to truly autonomous AI that can adapt to environments and tasks without human intervention.

Source: https://venturebeat.com/ai/archetype-ai-newton-learns-physics-from-raw-data-without-any-help-from-humans/

📓 NotebookLM launches business pilot

Google just pushed an update for its viral AI note-taking assistant NotebookLM, adding new features that let users guide AI-generated audio summaries and announcing the upcoming launch of a new business-focused version.

  • Users can now customize the AI podcast Audio Overviews feature by providing instructions to focus on specific topics or adjusting the expertise level.

  • A new Background Listening feature allows users to listen to Audio Interviews while multitasking within NotebookLM.

  • A pilot program for NotebookLM Business is coming, offering enhanced features for organizations like higher usage limits and team collaboration tools.

  • Audio Overviews, which turns docs, videos, and other content into podcasts between AI hosts, went viral earlier this month for its realistic audio outputs.

Google is dropping the ‘experimental’ tag on NotebookLM, and the viral feature built in just two months is suddenly being called a ‘ChatGPT’ moment for the company. It’s also an interesting case of users actually enjoying AI-generated content —  a quality that is hard to find in most mainstream sentiment for the tech.

Source: https://venturebeat.com/ai/googles-notebooklm-will-expand-to-business-use-cases-soon/

👁️ Worldcoin unveils next-gen eye scanner

Worldcoin, the ‘proof of personhood’ startup founded by OpenAI CEO Sam Altman, just announced a rebrand to ‘World’, along with a new version of its iris-scanning ‘Orb’ technology and updated core platforms.

  • A new streamlined Orb promises 5x performance to its predecessor, alongside new countries, self-serve, and on-demand Orbs for easier onboarding.

  • The company introduced World ID 3.0 protocol, featuring new World ID Credentials, Deep Face to combat AI-generated deepfakes, and added privacy infrastructure.

  • An updated World App 3.0 allows for anonymous integration with third-party apps, and World is also launching the mainnet of its Worldchain blockchain.

  • The company has previously faced backlash and even bans from certain countries over privacy concerns.

Verifying human identity in the increasing flood of AI-generated content, agents, and systems is clearly going to be massively important — but given Worldcoin’s rocky launch and international struggles, the question is whether the company can overcome the early drama to actually achieve its goals.

Source: https://www.pcmag.com/news/sam-altman-worldcoin-launches-deep-face-new-eye-scanning-orb

What Else is Happening in AI on October 18th 2024!

The U.S. Treasury Dept. shared that it leveraged AI to recover $1B in check fraud and prevent $4B in overall fraud in the 2024 fiscal year, showcasing the tech’s growing role in combating financial crime.

OpenAI expanded its partnership with consulting firm Bain & Co. to develop and sell industry-specific AI tools to corporate clients, with OpenAI reporting 1M paying business customers.

Meta is partnering with Blumhouse and other select filmmakers to test its Movie Gen AI video generation tools, gathering feedback to refine the tech before its public release in 2025.

Researchers from Alibaba and Skywork showcased Meissonic, a small, open-source text-to-image model that can generate high-quality outputs that outperform larger models.

Salesforce CEO Marc Benioff criticized Microsoft’s AI initiatives for overhyping the sector in an interview with Fast Company, calling its Copilot assistant the ‘next Clippy.’

OpenAI released a preview of its ChatGPT Windows app for paid users, offering file and photo interactions, model improvements, and a companion window mode.

A Daily Chronicle of AI Innovations on October 17th  2024

🫠 OpenAI quietly pitches products to US military

👨‍⚖️ Parents take school to court after student punished for using AI

🚀 Nvidia’s Nemotron outperforms leading AI models

📱Mistral AI unveils powerful new AI models for devices

🤖Boston Dynamics, Toyota team up on AI humanoids

🫠 OpenAI quietly pitches products to US military

  • OpenAI is exploring military and national security opportunities by partnering with government contractors and modifying its usage policies to allow for defense applications.
  • The company hired Dane Stuckey as Chief Information Security Officer, who previously worked with Palantir, a firm known for its military projects, indicating a shift towards defense collaboration.
  • Debate continues about the implications of using AI for military purposes, as OpenAI’s involvement in projects like those with the Department of Defense raises ethical concerns.
  • Source: https://fortune.com/2024/10/17/openai-is-quietly-pitching-its-products-to-the-u-s-military-and-national-security-establishment/

👨‍⚖️ Parents take school to court after student punished for using AI

  • A Massachusetts school district was sued by a student’s parents after their child was disciplined for using an AI chatbot to finish an assignment, despite no clear rule against it.
  • The lawsuit claims that the Hingham High School student handbook does not explicitly prohibit artificial intelligence use, which led to the improper punishment of the student, identified as RNH.
  • The case was taken to the US District Court for the District of Massachusetts, focusing on alleged violations of the student’s civil rights and naming several school officials as defendants.
  • Source: https://arstechnica.com/tech-policy/2024/10/student-was-punished-for-using-ai-then-his-parents-sued-teacher-and-administrators/

🚀 Nvidia’s Nemotron outperforms leading AI models

Nvidia quietly released a new open-sourced, fine-tuned LLM called Llama-3.1-Nemotron-70B-Instruct, which is outperforming industry leaders like GPT-4o and Claude 3.5 Sonnet on key benchmarks.

  • Nemotron is based on Meta’s Llama 3.1 70B model, fine-tuned by NVIDIA using advanced ML methods like RLHF.

  • The model achieves top scores on alignment benchmarks like Arena Hard (85.0), AlpacaEval 2 LC (57.6), and GPT-4-Turbo MT-Bench (8.98).

  • The scores edge out competitors like GPT-4o and Claude 3.5 Sonnet across multiple metrics — despite being significantly smaller at just 70B parameters.

  • NVIDIA open-sourced the model, reward model, and training dataset on Hugging Face, which can also be tested in a preview on the company’s website.

Source: https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct

📱Mistral AI unveils powerful new AI models for devices

French AI startup Mistral AI just launched two new compact language models designed to bring powerful AI capabilities to edge devices like phones and laptops.

  • The new ‘Les Ministraux’ family includes Ministral 3B and Ministral 8B models, which have just 3B and 8B parameters, respectively.

  • Despite their small size, the models outperform competitors like Gemma and Llama on benchmarks, including Mistral’s 7B model from last year.

  • Minstral 8B uses a new ‘interleaved sliding-window attention’ mechanism to efficiently process long sequences.

  • The models are designed for on-device use cases like local translation, offline assistants, and autonomous robotics.

While we await the incoming rollout of Apple Intelligence as many users’ first on-device AI experience, smaller models that can run efficiently and locally on phones and computers continue to level up. Having a top-tier LLM in the palm of your hand is about to become a norm, not a luxury.

Source: https://mistral.ai/news/ministraux

🧑‍🎨 Superstudio is your all-in-one creative AI platform

🤖Boston Dynamics, Toyota team up on AI humanoids

Boston Dynamics and the Toyota Research Institute just announced a new partnership to accelerate development of advanced humanoids, with plans to integrate TRI’s Large Behavior Models (LBMs) into the Atlas electric robot.

  • Toyota’s LBMs aim to teach robots to handle multi-task, dexterous vision, and language-guided capabilities.

  • The partnership combines two robotics labs owned by competing automakers, Hyundai (who purchased Boston Dynamics in 2020) and Toyota.

  • TRI‘s ‘Diffusion Policy’ enables robots to learn 60+ complex skills from human demos without coding, a key component of the partnership’s research efforts.

  • Boston Dynamics retired its hydraulic Atlas robot in April and debuted the electric update, currently being tested in Hyundai’s automotive factories.

The race for commercial humanoids is heating up fast — and this partnership represents a major power move. But with the likes of Tesla’s Optimus, Figure’s 01 humanoids, and others in the mix, there is no shortage of rivals rushing to capture the massive potential of the emerging general-purpose robots.

Source: https://www.prnewswire.com/news-releases/boston-dynamics-and-toyota-research-institute-announce-partnership-to-advance-robotics-research-302276655.html

What Else is Happening in AI on October 17th 2024!

ChatGPT’s web traffic reached a record 3.1B visits in September 2024, according to Similarweb, representing a 112% year-over-year increase and making it the 11th most visited website globally.

Source: https://www.similarweb.com/blog/insights/ai-news/chatgpt-topped-3-billion-visits-in-september

Suno launched Suno Scenes, allowing users to generate songs using images or videos instead of just text prompts.

Source: https://x.com/suno_ai_/status/1846574384963633345

Google Public Sector announced $15M grants to upskill U.S. government workers in responsible AI with plans to train over 100,000 public sector employees across federal, state, and local levels.

Source: https://blog.google/outreach-initiatives/google-org/google-org-public-sector-ai-funding

OpenAI published research examining how ChatGPT responds to usernames with various genders, racial, and cultural backgrounds — finding minimal bias but some stereotypical responses in open-ended tasks like creative writing.

Source: https://cdn.openai.com/papers/first-person-fairness-in-chatbots.pdf

Fashion brand Lacoste is leveraging AI for anti-counterfeit technology, using a tool called Vrai AI to analyze tiny logo details that can uncover fakes at 99.7% accuracy.

Source: https://www.yahoo.com/tech/lacoste-turn-ai-fight-counterfeiting-193000958.html

Palantir CISO Dane Stuckey announced that he is joining OpenAI as the company’s new chief information security officer, helping to drive the ‘development of safe AGI for the world.’

Source: https://x.com/cryps1s/status/1846325577906831728

Firms use AI to keep reality from unreeling amid ‘global deepfake pandemic’

 

Amazon goes nuclear, to invest more than $500 million to develop small modular reactors

After Microsoft, Google, now Amazon

https://www.cnbc.com/2024/10/16/amazon-goes-nuclear-investing-more-than-500-million-to-develop-small-module-reactors.html

Datacenters need baseload power, not intermittent power.

And with AI they need a lot of additional power.

Who is next?

Meta?

Tesla?

The market caps of those companies are huge compared to companies in the nuclear space

Market caps:

Amazon: 1.962 trillion USD

Microsoft: 3.093 trillion USD

Google: 2.042 trillion USD

Meta: 1.459 trillion USD

Meanwhile:

  • Nuscale Power (ticker: SMR) for instance has a market cap of only 1.80 billion USD

  • The uranium sector is taken by surprise by those last moves, the acceleration in nuclear reactor restarts in Japan (happening as we speak), USA (planned), … and the acceleration in nuclear reactor constructions in China, India, Russia, …

Trending AI Tools

Machine Learning & AI For Dummies

A Daily Chronicle of AI Innovations on October 14th 2024: 🐝 OpenAI unveils Swarm multi-agent framework 🫠 New Gmail security alert for 2.5B users as AI hack confirmed 🤔 Apple: ‘No evidence of formal reasoning’ in LLMs 🧠 Jensen Huang wants Nvidia to be a company with 100 million AI…

This comprehensive app serves as a one-stop resource for mastering Machine Learning and AI concepts, from basics to advanced topics. It offers a rich array of features including over 600 quizzes covering cloud ML operations on major platforms, fundamental and advanced ML concepts, and NLP. The app also provides cheat sheets, interview preparation materials, and daily-updated content to keep users abreast of the latest developments. With interactive elements like scorecards and timers, it offers an engaging learning experience for both beginners and experienced professionals looking to enhance their ML and AI expertise.

iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211

Web: https://machinelearningcertification.web.app/

Windows: https://apps.microsoft.com/detail/9p0r1x3jnc46?hl=en-us&gl=US

A Daily Chronicle of AI Innovations on October 16th  2024

🤖 Mistral releases new AI models for laptops and phones

👀 The New York Times tells Perplexity to stop using its content

🗞️ New York Times takes legal aim at Perplexity

🛡️ Anthropic reveals major update to AI safety policy

🧠 Meta researchers develop ‘thinking’ LLMs

🤖 Mistral releases new AI models for laptops and phones:

Mistral AI has introduced the Ministral 3B and 8B, optimized for on-device computing, enabling smartphones and laptops to run advanced AI models with low latency and high efficiency.

  • French AI startup Mistral has released its first generative AI models, “Les Ministraux,” designed for edge devices like laptops and phones, with two versions available: Ministral 3B and Ministral 8B.
  • Ministral 8B is available for research purposes, while commercial licenses are required for both models; they can also be used through Mistral’s cloud platform, with token-based pricing for usage.
  • Mistral claims its models outperform competitors such as Meta’s Llama and Google’s Gemma in benchmarks, and the company is expanding its AI portfolio, having recently raised $640 million in venture capital.

Source: https://siliconangle.com/2024/10/16/mistral-introduces-ministral-3b-8b-ai-models-for-laptops-and-phones/

🗞️ New York Times takes legal aim at Perplexity


The New York Times is preparing legal action against Perplexity AI for using its articles in AI summaries without a licensing agreement.

  • The NYT claims Perplexity’s use of its articles for AI-generated summaries violates copyright law, accusing the startup of unauthorized use of its journalism.

  • Perplexity reportedly previously told the publisher it would stop crawling its content, but results have continued to show up on the platform.

  • The startup says it’s open to working with publishers and will respond to the notice by the Oct. 30 deadline.

  • The NYT previously sued OpenAI and Microsoft over similar concerns, and other media outlets have also accused Perplexity of misusing their content.

Source: https://www.bloomberg.com/news/articles/2024-10-14/new-york-times-legal-aim-perplexity

🛡️ Anthropic reveals major update to AI safety policy


Anthropic has released new guidelines focusing on transparency and harm prevention, aiming to make AI development safer and more ethical.

  • The policy introduces ‘Capability’ and ‘Required’ Thresholds to trigger enhanced safety measures when AI models reach certain risk levels.

  • The two new thresholds focus on AI capabilities related to bioweapons and autonomous AI research.

  • Anthropic emphasized the need for the risk approach to be ‘exportable,’ hoping that it will become an industry standard and help shape regulation.

  • Anthropic will regularly evaluate its AI models, while a ‘Responsible Scaling Officer’ role will oversee policy implementation and compliance.

  • The company also pledged increased transparency, including public disclosure of capability reports and external expert input.

Source: https://techcrunch.com/2024/10/12/anthropic-updates-ai-safety-policy/

🧠 Meta researchers develop ‘thinking’ LLMs


Meta researchers are pioneering new large language models (LLMs) capable of ‘thinking,’ with improved reasoning and problem-solving abilities, pushing the limits of current AI technology.

  • TPO prompts models to generate internal thoughts before responding to user instructions, similar to how humans think before speaking.

  • The AI’s thoughts are kept private, with only the final answer shown to users — with the AI using trial-and-error without direct supervision to optimize outputs.

  • TPO outperforms standard models on key benchmarks for non-reasoning tasks like marketing and creative writing but declines in math-related tasks.

  • The approach builds on the recent OpenAI ‘Strawberry’ research and o1 model release, which takes time to reason.

Source: https://venturebeat.com/2024/10/meta-researchers-develop-thinking-llms/

What Else is Happening in AI on October 16th 2024!

The US government is considering capping AI chip exports from companies like Nvidia and AMD to certain countries, particularly in the Middle East, due to national security concerns.

Source: https://www.bloomberg.com/news/articles/2024-10-15/us-weighs-capping-exports-of-ai-chips-from-nvidia-and-amd-to-some-countries

Amazon unveiled a new AI-powered creative suite for advertisers, including tools to generate video, audio, and animated image ads.

Source: https://www.aboutamazon.com/news/innovation-at-amazon/amazon-ads-generative-ai-video-generator-advertisers

Google released its AI-powered shopping experience, featuring personalized recommendations, AI-generated product briefs, and deal-finding tools.

Source: https://blog.google/products/shopping/google-shopping-ai-update-october-2024

Apple debuted its new 7th generation iPad mini, the cheapest device ($499 base) to eventually support Apple Intelligence, which will include other AI features for writing and photo editing.

Source: https://www.apple.com/newsroom/2024/10/apple-introduces-powerful-new-ipad-mini-built-for-apple-intelligence

The University of Tokyo researchers revealed TANGO, an AI system that generates realistic human speakers, movements, and gestures to match audio input.

Source: https://pantomatrix.github.io/TANGO

Latest Trending AI Tools:

🔎 Perplexity for Mac – Search and discovery with AI, now available for Macs

⚙️ Gradio 5.0 – Build and share delightful machine-learning apps

AI and Machine Learning For Dummies PRO

Artificial Intelligence (AI) and Machine Learning For Dummies: Your Comprehensive ML & AI Learning Hub
Artificial Intelligence (AI) and Machine Learning For Dummies: Your Comprehensive ML & AI Learning Hub

A Daily Chronicle of AI Innovations on October 15th  2024

☢️ Google goes nuclear to power AI

🎬 Adobe unveils Firefly Video Model at MAX

💥 Chinese researchers reportedly crack military-grade encryption with quantum computer

🤔 US weighs capping exports of AI chips from Nvidia and AMD to some countries

🏛️ OpenAI locked in legal battle with… Open AI? 

📱 Apple announces new iPad Mini focused on AI

🎮 AI simulates Counter-Strike using neural network

🎬 Adobe unveils Firefly Video Model at MAX

Adobe just announced the addition of new video generation capabilities to its Firefly AI model and Premiere Pro at the company’s MAX Conference, alongside a slew of major AI updates across its creative software ecosystem.

  • The new Firefly Video Model is now in limited public beta and allows users to generate video from text prompts or images in Firefly and Adobe Premiere.

  • Video capabilities include cinematic video, 2D and 3D animations, text graphics, b-roll, and screen effects to blend with normal footage.

  • The model is trained exclusively on Adobe Stock and public domain content and is designed to be ‘commercially safe.’

  • Premiere Pro gets Generative Extend, a Firefly-powered tool for easily extending clips, smoothing transitions, and fine-tuning edits.

  • Adobe also rolled out 100+ features across Creative Cloud apps, GenStudio for enterprise marketing, and Project Concept for collaborative remixing.

Adobe’s new model looks impressive and could be one of the first AI video systems to truly break into the mainstream with seamless inclusion in its popular creative suite. While OpenAI’s Sora STILL awaits public access, others are filling the void with powerful models — it’s getting more competitive by the day.

Source:  https://news.adobe.com/news/2024/10/101424-adobe-launches-firefly-video-model

🏛️ OpenAI locked in legal battle with… Open AI? 

OpenAI is reportedly involved in a trademark dispute with Guy Ravine, who owns the ‘Open AI’ (with a space) trademark and claims he conceived and pitched the idea for the initiative to major tech leaders before the company’s founders.

  • Ravine registered the domain open.ai in March 2015 and owns the ‘Open AI’ trademark, which Sam Altman and Greg Brockman tried to purchase from him.

  • He alleges he pitched the concept to tech figures like Larry Page and Yann LeCun months before OpenAI’s launch in December 2015.

  • OpenAI sued Ravine in 2023, accusing him of trying to profit from their brand, and Ravine countersued, saying the company stole his idea.

  • A judge dismissed much of Ravine’s countersuit in September, though he plans to refile and push for a trial.

This Bloomberg investigation is wild, and it’s hard to discern whether this is a case of pure delusion or the underdog getting crushed by the big corporation. As the article points out, there’s major irony in the trademark dispute, given OpenAI’s legal issues from training data and copyright complaints.

Source: https://timesofindia.indiatimes.com/technology/tech-news/why-chatgpt-maker-openai-is-at-fight-with-open-ai/articleshow/114220808.cms

🎮 AI simulates Counter-Strike using neural network

Researchers from the University of Geneva, University of Edinburgh, and Microsoft developed DIAMOND, an AI model that can generate a playable simulation of Counter-Strike(CS:GO) at 10 frames per second within a neural network.

  • DIAMOND uses a diffusion-based approach, predicting the next frame based on previous frames and actions.

  • The model was trained on just 87 hours of CS:GO gameplay data, a fraction of what similar projects (like Google’s recent DOOM simulation) typically use.

  • Users can interact with the simulation using a keyboard and mouse, with the AI recreating elements like weapon mechanics and player interactions.

  • The model achieved a 46% better than human-level score on the Atari 100k benchmark, a SOTA performance for agents trained on a world model.

While still imperfect, DIAMOND points towards applications in robotics, autonomous systems, and virtual world creation. The ability to generate interactive, physics-based environments could revolutionize how AI is trained for real-world tasks. Plus, open-world video game creation is about to seriously level up.

Source: https://www.msn.com/en-us/news/technology/counter-strike-s-dust-ii-runs-purely-within-a-neural-network-on-an-rtx-3090-performance-is-disappointing-at-only-10-fps/ar-AA1s9SEA

☢️ Google goes nuclear to power AI 

  • Google has partnered with Kairos Power to construct seven nuclear reactors, intended to provide about 500 megawatts of carbon-free electricity for its data centers amidst rising energy demands, particularly due to increased data and AI usage.
  • The planned nuclear micro-reactors are expected to be operational by 2030, although this timeline is considered highly ambitious, and it remains unclear if the power will be directly connected to Google’s facilities or integrated into the public grid.
  • Google’s alliance with Kairos reflects a broader industry trend, as tech giants such as Microsoft and Amazon are also exploring nuclear power to meet their energy needs; however, challenges persist with cost, construction speed, and public acceptance of nuclear power projects.
  • Source: https://techcrunch.com/2024/10/14/google-signed-a-deal-to-power-data-centers-with-nuclear-micro-reactors-from-kairos-but-the-2030-timeline-is-very-optimistic/

💥 Chinese researchers reportedly crack military-grade encryption with quantum computer 

  • Chinese scientists have reportedly used a D-Wave quantum computer to crack encryption, revealing vulnerabilities in widely used methods like RSA, which is essential for technologies including web browsers, VPNs, email services, and certain electronic chips.
  • The study demonstrates that the quantum device, utilizing techniques grounded in the quantum annealing algorithm, can successfully decompose a 50-bit RSA integer, emphasizing advanced risks to encrypted data and highlighting the machine’s potential impact on cybersecurity.
  • Quantum machines like the D-Wave Advantage, rentable for $2,000 an hour or costing approximately $15 million to purchase, pose a significant threat to encryption systems, leading experts to advocate for stronger defenses against potential future quantum decryption capabilities.
  • Source: https://www.pcmag.com/news/chinese-researchers-reportedly-crack-encryption-with-quantum-computer

🤔 US weighs capping exports of AI chips from Nvidia and AMD to some countries

  • The U.S. government is considering limiting the export of advanced AI chips from American manufacturers, such as Nvidia and AMD, to particular nations, including those in the Middle East, due to national security concerns.
  • This potential export restriction may follow the Commerce Department’s recent changes, which have made it easier for American companies to send AI chips to countries in the Middle East developing data centers.
  • In reaction to these developments, U.S. authorities have already begun slowing down the approval of export licenses for AI accelerators from companies like Nvidia and AMD, while they conduct a national security assessment of the AI technologies being created in the Middle East.
  • Source: https://qz.com/us-cap-exports-sales-ai-chips-nvidia-amd-middle-east-1851672579

📱 Apple announces new iPad Mini focused on AI

  • Apple has unveiled a new iPad Mini that emphasizes artificial intelligence, incorporating features such as text rewriting tools, a Siri update utilizing personal context, and app enhancements like a “Clean Up” option for image editing.
  • Previously, the iPad Mini, which had not received an update since 2021, lacked support for advanced AI tools and the latest Apple Pencil models, but this revision introduces the cutting-edge A17 Pro chip to address that.
  • Priced at $499 or £499, the upgraded device promises enhanced graphics and faster processing, is available for order now, and will be in stores by Wednesday, 23 October.
  • Source: https://www.independent.co.uk/tech/apple-ipad-mini-new-announce-mac-b2629529.html

What Else is Happening in AI on October 15th 2024!

Former OpenAI CTO Mira Murati is reportedly trying to poach OpenAI employees for a new venture just weeks after leaving the company — despite remaining an advisor.

Source: https://techstory.in/mira-murati-is-raising-vc-funds-for-her-own-venture-after-openai-exit/

Key Microsoft AI researcher Sebastien Bubeck departed to join OpenAI after playing a prominent role in the small, efficient Phi language models.

Source: https://www.computerworld.com/article/3564352/microsofts-ai-research-vp-joins-openai-amid-fight-for-top-ai-talent.html

Google partnered with nuclear startup Kairos Power to build seven small modular reactors in the US, aiming to supply 500 megawatts of carbon-free electricity for AI data centers by 2030.

Source: https://www.aljazeera.com/economy/2024/10/15/google-signs-deal-with-startup-to-build-small-nuclear-reactors-to-power-ai

YouTube announced that creators can now leverage its AI Dream Track feature to generate soundtracks for shorts using natural language prompts directly in the app.

Source: https://www.socialmediatoday.com/news/youtube-broader-launch-dream-track-ai-audio-generator/729814/

Gatorade launched a new promotion with Adobe allowing users to leverage Firefly’s AI models to customize squeeze bottles with unique designs.

Source: https://www.nasdaq.com/press-release/gatorade-launches-generative-ai-squeeze-bottle-personalization-fuel-athlete-self

Nvidia-backed AI cloud provider CoreWeave secured a $650M credit loan to fuel growth and announced a nearly $1B investment in U.K. AI infrastructure.

Source: https://www.msn.com/en-us/money/topstocks/nvidia-backed-coreweave-secures-650-million-credit-line-to-boost-ai-infrastructure/vi-AA1sk70k

Latest AI Research and Tools

Machine Learning For Dummies:

Discover the ultimate resource for mastering Machine Learning and Artificial Intelligence with the “AI and Machine Learning For Dummies” app.

AI and Machine Learning For Dummies: Your Comprehensive ML & AI Learning Hub

iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies/id1611593573

PRO Version (No ADS, See All Answers): https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211

⚙️ LLMWare – Dev tool to make AI apps deployed privately or locally: https://github.com/llmware-ai/llmware

PayloadCMS: an open-source, fullstack Next.js framework that simplifies creating web applications by allowing users to use their own databases, avoid microservices complexity, and extend both backend and admin interfaces, while providing pre-made templates for rapid deployment. 

Source: https://github.com/payloadcms/payload

Running LLMs with 3.3M Context Tokens on a Single GPU: this paper presents a method for operating large language models with up to 3.3 million context tokens on a single graphics processing unit. 

Source: https://arxiv.org/abs/2410.10819

A Daily Chronicle of AI Innovations on October 14th  2024

🐝 OpenAI unveils Swarm multi-agent framework

🔮 Anthropic CEO drops essay on AI and the future

🔮 Apple smart glasses and AirPods with cameras could arrive in 2027

🤔 Apple: ‘No evidence of formal reasoning’ in LLMs

🧠 Jensen Huang wants Nvidia to be a company with 100 million AI assistants

🫠 New Gmail security alert for 2.5B users as AI hack confirmed

🧠Breakthrough from REMspace: First Ever Communication Between People in Dreams

🎥 Adobe’s AI-powered video generation is here

🤖 Tesla’s robots were human-controlled

🔮 Apple smart glasses and AirPods with cameras could arrive in 2027 

  • Apple is expected to launch smart glasses and AirPods with integrated cameras in 2027 as part of its strategy to extend its augmented reality product range beyond the Vision Pro headset, which has faced market limitations.
  • The Vision Pro, characterized by its $3,500 price tag, has been criticized for its weight and overheating issues, leading to disappointing sales and reduced consumer interest since its debut.
  • Apple aims to enhance augmented reality accessibility by developing these new devices, acknowledging competition from Meta’s more affordably priced smart glasses and planning cheaper and more advanced versions of the Vision Pro in the coming years.
  • Source: https://www.macrumors.com/2024/10/14/apple-smart-glasses-airpods-cameras-2027/

🧠 Jensen Huang wants Nvidia to be a company with 100 million AI assistants

  • Nvidia CEO Jensen Huang envisions a future where the company will have 50,000 employees and 100 million AI agents working together to increase productivity.
  • The AI agents would break down complex tasks, recruit other AIs, and work alongside humans in platforms like Slack, creating a seamless hybrid workforce of digital and biological entities.
  • Huang believes that AI-driven productivity improvements could lead to both company growth and job creation, as automation frees up human workers to focus on higher-value tasks.
  • Source: https://www.newsbytesapp.com/news/science/100-million-ai-assistants-in-nvidia-s-future-ceo-jensen-huang/story

🫠 New Gmail security alert for 2.5B users as AI hack confirmed

  • Google has strengthened security measures for Gmail accounts, but hackers using AI-driven techniques have evolved to create highly convincing scams, as pointed out by Sam Mitrovic, a Microsoft consultant who nearly fell for an advanced AI phishing attempt.
  • Mitrovic received misleading notifications and calls posing as Google support, where the scam’s AI convincingly impersonated a voice, falsely claiming his account was compromised for seven days and accessed from unusual locations, which was part of the deceit.
  • Mitrovic’s experience highlights the threat of AI scams and emphasizes vigilance; users should verify unsolicited contact supposedly from Google, using resources like Google search to check phone numbers and email origins before reacting to prevent credential theft.
  • Source: https://www.forbes.com/sites/daveywinder/2024/10/13/new-gmail-security-alert-for-billions-as-7-day-ai-hack-confirmed/

🎥 Adobe’s AI-powered video generation is here 

  • Adobe launched Firefly’s new video generation capabilities, allowing users to try out text-to-video and image-to-video models through its website and Premiere Pro beta app, aiming to enhance editing tasks rather than creating new videos from scratch.
  • The Generative Extend feature, available in the Premiere Pro beta, enables users to extend video clips by up to two seconds, enhancing the continuity of video and audio without reproducing copyrighted voices or music to prevent legal issues.
  • Adobe aims to support creatives by paying for video submissions to train its AI model, while encouraging the artistic community to adopt AI tools for expanding creative capacities and meeting the increasing demand for personalized content.
  • Source: https://techcrunch.com/2024/10/14/adobe-invites-you-to-embrace-the-tech-with-fireflys-new-video-generator/

🤖 Tesla’s robots were human-controlled 

  • During Tesla’s “We, Robot” event, Optimus, Elon Musk’s humanoid robot, became the highlight by safely moving through the crowd and interacting with attendees despite lacking true artificial intelligence.
  • Although Musk claimed Optimus to be Tesla’s most significant product, the robots showcased were operated and voiced by humans remotely, posing as a contrast to the fully autonomous image implied during the demonstration.
  • Critics, such as Tesla content creator Jeremy Judkins, expressed disappointment with Tesla’s lack of transparency about the human assistance, viewing it as misleading and calling for more honesty about the robot’s capabilities.
  • Source: https://fortune.com/2024/10/13/elon-musk-tesla-optimus-robot-tele-operated-robotaxi/

🤔 Apple: ‘No evidence of formal reasoning’ in LLMs

Apple researchers just published a new study revealing major limitations in the reasoning capabilities of LLMs, including those from top AI labs like OpenAI’s 4o and o1 models.

  • Apple scientists developed a new benchmark called GSM-Symbolic to evaluate LLMs’ mathematical reasoning skills.

  • The study found that slight changes in the wording of questions or adding irrelevant info drastically altered model outputs, with accuracy dropping by up to 65%.

  • Researchers saw increased performance variability and decreased accuracy as the complexity of questions increased.

  • The team concluded that there was “no evidence of formal reasoning” in the models tested, suggesting that the behavior is more likely sophisticated pattern matching.

While there seem to be conflicting opinions on whether LLMs can truly reason, file this new research under the ‘no’ category. If these limitations hold, they expose some significant questions regarding the reliability and risks of deploying models into increasingly more complex applications.

Source: https://arxiv.org/pdf/2410.05229

🐝 OpenAI unveils Swarm multi-agent framework

OpenAI just introduced Swarm, a new open-source experimental framework designed to simplify the creation and control of multi-agent AI systems.

  • Swarm focuses on making agent coordination lightweight, controllable, and easily testable through two key building blocks: agents and handoffs.

  • Agents encapsulate specific instructions and tools, while handoffs allow agents to transfer control of a conversation to another agent.

  • Swarm includes features like function calls, context variables, and streaming and is built on OpenAI’s ChatCompletions API.

  • The framework is available on GitHub with several examples, including a triage agent, weather agent, and airline customer service system.

  • OpenAI emphasized that Swarm is experimental and released as an educational resource for exploring multi-agent orchestration.

Not only are singular agentic capabilities inching closer — but the ability to deploy systems that leverage armies of agents working together is also coming fast. Soon, the user will be the CEO of their AI company — with dozens of agents autonomously working together on complex, multi-step tasks.

Source: https://cookbook.openai.com/examples/orchestrating_agents

🧠Breakthrough from REMspace: First Ever Communication Between People in Dreams

A new definition of Social if confirmed. Chatting in your dreams “On September 24, participants were sleeping at their homes when their brain waves and other polysomnographic data were tracked remotely by a specially developed apparatus. When the server detected that the first participant entered a lucid dream, it generated a random Remmyo word and sent it to him via earbuds. The participant repeated the word in his dream, with his response captured and stored on the server. Eight minutes later, the next participant entered a lucid dream. She received the stored message from the first participant and confirmed it upon awakening, marking the first-ever “chat” exchanged in dreams. Additionally, two other people were able to communicate with the server through their dreams.”

Source: https://www.businesswire.com/news/home/20241008878282/en/Breakthrough-from-REMspace-First-Ever-Communication-Between-People-in-Dreams

What Else is Happening in AI on October 14th 2024:

Meta’s AI chief Yann LeCun calls AI apocalypse fears ‘complete B.S.’.

Source: https://www.techspot.com/news/105123-meta-ai-chief-yann-lecun-calls-ai-apocalypse.html

New ChatGPT prompt goes viral with Sam Altman’s approval.

Source: https://www.techradar.com/computing/artificial-intelligence/new-chatgpt-prompt-goes-viral-with-sam-altmans-approval

Meta chief AI scientist Yann LeCun said that existential warnings about AI are ‘complete BS,’ arguing that the current systems are no smarter than a house cat.

Source: https://www.wsj.com/tech/ai/yann-lecun-ai-meta-aa59e2f5

AI pioneer Yoshua Bengio warned about the dangers of AI in a new interview, saying humanity is on a path to ‘creating monsters that could be more powerful than us.’

Source: https://finance.yahoo.com/news/ai-godfather-yoshua-bengio-were-creating-monsters-more-powerful-than-us-120042014.html

A new study from Sun Yat-sen University used Meta’s ESMFold protein-prediction tool to uncover 70,500 new RNA viruses in environmental data.

Source: https://www.nature.com/articles/d41586-024-03320-6

Apple reportedly plans to launch a lower-end model of its Vision headset, priced at $2,000 instead of the $3,500 Vision Pro, which has suffered.

Source: https://www.bloomberg.com/news/newsletters/2024-10-13/apple-smart-home-plans-new-os-smart-displays-vision-pro-integration-robots-m27kw5m7

Trending AI Tools

💡 Google Illuminate – Transform research papers into AI-generated audio summaries

u/enoumen - AI Weekly Rundown Oct07-14 2024: 🤖OpenAI launches new multi-agent framework 'Swarm' ⚠️Wikipedia declares war on AI Generated Content 🚗Elon Musk reveals new $30K robotaxi 🏅Google DeepMind researchers win Nobel Prize in chemistry 🤔OpenAI says bad actors are using its platform to disrupt…Machine Learning & AI For Dummies

This comprehensive app serves as a one-stop resource for mastering Machine Learning and AI concepts, from basics to advanced topics. It offers a rich array of features including over 600 quizzes covering cloud ML operations on major platforms, fundamental and advanced ML concepts, and NLP. The app also provides cheat sheets, interview preparation materials, and daily-updated content to keep users abreast of the latest developments. With interactive elements like scorecards and timers, it offers an engaging learning experience for both beginners and experienced professionals looking to enhance their ML and AI expertise.

iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211

Web: https://machinelearningcertification.web.app/

Windows: https://apps.microsoft.com/detail/9p0r1x3jnc46?hl=en-us&gl=US

🧮 CalcGen AI – Transform data into interactive visualizations in seconds: https://calcgen.ai/

🤖 Kuration AI – Curate, refine, and enrich lead databases with automated B2B AI agents: https://www.kurationai.com/

A Daily Chronicle of AI Innovations on October 11th  2024

🚗 Elon Musk reveals new $30,000 robotaxi🚖

🚀 AMD reveals next-gen AI chips – going after Nvidia

🤖 Tesla’s Optimus robots steal the show at Tesla event

🫠 TikTok cuts hundreds of jobs to replace them with AI

⚠️ Wikipedia declares war on AI-generated content

🤖 OpenAI’s new AI agent benchmark

🚗 Elon Musk reveals new $30,000 robotaxi 

  • Elon Musk introduced the Tesla Cybercab, a self-driving vehicle without steering wheels or pedals, with plans for consumer availability under $30,000 and production aimed before 2027, despite Tesla’s history of delayed autonomy promises.
  • Alongside the Cybercab, Musk announced the Robovan, an autonomous electric vehicle designed to transport up to 20 people or goods, with both models featuring inductive charging for wireless energy transfer at recharge stations.
  • At the invitation-only robotaxi event, Musk also highlighted an unsupervised version of Tesla’s Full Self-Driving system expected in 2024.

Elon Musk says Tesla’s robotaxis will have no plug for charging and will instead charge inductively. They will be cleaned by machines and a world of autonomous vehicles will enable parking lots to be turned into parks.

r/singularity - Elon‘s new ‘robotaxi’, what are your thoughts?

r/singularity - Elon‘s new ‘robotaxi’, what are your thoughts?

r/singularity - Elon‘s new ‘robotaxi’, what are your thoughts?

r/singularity - Elon‘s new ‘robotaxi’, what are your thoughts?

r/singularity - Elon‘s new ‘robotaxi’, what are your thoughts?

Source: https://www.nbcnews.com/tech/innovation/cybercab-robovan-musk-tesla-event-cost-rcna174996

🫠 TikTok cuts hundreds of jobs to replace them with AI 

  • TikTok has announced it is dismissing several hundred workers worldwide to transition towards using artificial intelligence for content moderation, aiming to enhance its global moderation model.
  • Approximately 500 employees in Malaysia are losing their jobs as part of this restructuring, with TikTok also planning to consolidate some regional operations and having previously cut positions in marketing and operations earlier this year.
  • The platform currently employs a combination of human and automated methods to review content, but AI will increasingly replace human moderators, who have faced difficult conditions, including low pay and the psychological toll from reviewing harmful content.
  • Source: https://www.pcmag.com/news/tiktok-lays-off-hundreds-of-staff-to-replace-them-focus-on-ai

💻 AMD is going after Nvidia with new AI chips 

  • AMD has introduced its Instinct MI325X AI chip aimed at competing with Nvidia’s data center GPUs, with production slated to commence by the end of 2024, potentially pressuring Nvidia’s market position and gross margins.
  • The Instinct MI325X rollout positions AMD against Nvidia’s Blackwell chips, with AMD aiming for significant market entry amidst growing demand from AI-intensive applications powered by vast data centers.
  • Despite aiming to challenge Nvidia’s dominance, AMD’s primary hurdle is the rival’s CUDA programming language, but AMD’s enhancements in ROCm software and upcoming CPUs are responsive strategies to capture more market share.
  • Source: https://www.cnbc.com/2024/10/10/amd-launches-mi325x-ai-chip-to-rival-nvidias-blackwell-.html

⚠️ Wikipedia declares war on AI-generated content

  • Wikipedia editors have initiated “WikiProject AI Cleanup” to tackle the issue of unsourced and poorly-written AI-generated content, aiming to protect the integrity of the platform’s information.
  • The project does not intend to ban AI usage entirely but seeks to remove content that is inaccurately sourced or filled with AI hallucinations that compromise article quality.
  • Editors have identified AI-generated text patterns and catchphrases to detect substandard content, despite the challenges of spotting complex AI-generated errors in subjects like historical architecture.
  • Source: https://futurism.com/the-byte/wikipedia-declares-war-ai-slop

🤖 OpenAI’s new AI agent benchmark

OpenAI just introduced MLE-bench, a new benchmark designed to evaluate how well AI agents perform on real-world machine learning engineering tasks using Kaggle competitions.

  • MLE-bench consists of 75 curated Kaggle competitions, covering a range of ML tasks like model training, data preparation, and experimentation.

  • Kaggle competitions are online challenges where data scientists compete to solve complex problems using machine learning for prizes and recognition.

  • In research, the AI models often succeeded in applying standard techniques but struggled with tasks requiring adaptability or creative problem-solving.

  • The best-performing setup, OpenAI’s o1-preview model with AIDE scaffolding, achieved at least a bronze medal in 16.9% of competitions.

  • AI agents are coming in hot — and new benchmarks are necessary to evaluate capabilities that blow past previous testing measures. Between OpenAI’s commentary, a flurry of startups pushing agentic capabilities, and new benchmarks being created, the AI agent revolution feels ready to explode.
  • Source: https://openai.com/index/mle-bench/

[Google DeepMind] Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning

An animal’s optimal course of action will frequently depend on the location (or more generally, the ‘state’) that the animal is in. The hippocampus’ purported role in representing location is therefore considered to be a very important one. The traditional view of state representation in the hippocampus is that the place cells index the current location by firing when the animal visits the encoded location and otherwise remain silent. The main idea of the successor representation (SR) model, elaborated below, is that place cells do not encode place per se but rather a predictive representation of future states given the current state. Thus, two physically adjacent states that predict divergent future states will have dissimilar representations, and two states that predict similar future states will have similar representations.

—Stachenfeld, K. L., Botvinick, M. M., & Gershman, S. J. (2017). The hippocampus as a predictive map. Nature neuroscience, 20(11), 1643-1653.

Source: https://arxiv.org/abs/2410.08146

🗣️ Master a new language with ChatGPT Voice

ChatGPT’s new Advanced Voice Mode allows you to practice and improve your language skills through interactive conversations and role-play scenarios.

  1. Download the ChatGPT app on your phone.

  2. Craft a detailed learning prompt (similar to the one in the image above).

  3. Tap the mic icon and speak your prompt to start the session.

  4. Engage in conversation, asking for slower speech or repetition as needed

  5. Pro Tip: Save effective prompts in your custom instructions for quick access and consistent practice across sessions.

What Else is Happening in AI on October 11th 2024!

Chinese researchers unveiled Pyramid Flow, a new open-source AI video generation model capable of creating high-quality, 10-second clips using a new ‘pyramidal flow matching’ technique. 

Source: https://www.aibase.com/news/12303

OpenAI Chairman Bret Taylor’s AI startup Sierra is reportedly set to raise hundreds of millions in funding at a valuation of over $4B for its conversational enterprise AI agents.

Source: https://www.msn.com/en-us/money/companies/openais-chairman-says-ai-is-in-a-bubble-but-one-that-could-be-wildly-lucrative/ar-AA1rCyUB

Japanese AI startup Rhymes released Aria, hailed as the first open-source multimodal native Mixture-of-Experts model — offering SOTA performance across various tasks with a lightweight 3.9B parameters and 64k token context window.

Source: https://the-decoder.com/japanese-multimodal-ai-model-aria-is-open-source-and-beats-many-competitors

Wondercraft launched a new ‘Director Mode’ feature, allowing users to control AI voices with natural language instructions and becoming the first audio platform to integrate OpenAI’s Advanced Voice Mode.

Source: https://www.wondercraft.ai/blog/prompt-ai-voices-with-wondercrafts-director-mode

Google rolled out its Imagen 3 image generator to all Gemini users, though only Advanced subscribers ($19.99/mo) can generate images of people.

Source: https://www.techradar.com/computing/artificial-intelligence/google-geminis-new-ai-image-generator-just-rolled-out-to-everyone-for-free-with-one-annoying-limitation

Walmart revealed new AI platforms to create hyper-personalized shopping experiences, including its Wallaby LLMs trained on the company’s data and a Customer Support Assistant that can take actions for the user.

Source: https://corporate.walmart.com/news/2024/10/09/walmart-reveals-plan-for-scaling-artificial-intelligence-generative-ai-augmented-reality-and-immersive-commerce-experiences

Apple Intelligence features can also summarize breakup texts for you.

Source: https://techcrunch.com/2024/10/11/apple-intelligence-features-can-also-summarize-breakup-texts-for-you/

OpenAI releases its meta-prompt for prompt optimization.

Source: https://the-decoder.com/openai-releases-its-meta-prompt-for-prompt-optimization/

 

A Daily Chronicle of AI Innovations on October 10th  2024

🤔 OpenAI says bad actors are using its platform to disrupt elections

🛠️ New model tops tool-calling leaderboard

🗣️ Zoom launches new AI platform features

👅 Electronic tongue enables AI to taste

Listen at https://podcasts.apple.com/us/podcast/a-daily-chronicle-of-ai-innovations-on-october/id1684415169?i=1000672661634

🤔OpenAI says bad actors are using its platform to disrupt elections

  • OpenAI reports that it has disrupted over 20 operations globally that attempted to misuse its AI models for spreading election-related misinformation, ranging from fake social media posts to AI-generated articles, but such efforts had minimal impact.
  • The company highlights growing concerns about AI-generated content contributing to misinformation in elections worldwide, amidst a significant year for global elections, affecting over 4 billion people in 40 countries.
  • OpenAI indicates that despite attempts from operations in countries like Iran and Rwanda to use its platform for election disruption, the AI-generated content in these cases failed to achieve widespread engagement or build large audiences.

Source: https://www.cnbc.com/2024/10/09/openai-says-more-cyber-actors-using-its-platform-to-disrupt-elections.html

🛠️ New model tops tool-calling leaderboard

AI startup Writer just introduced Palmyra X 004, an LLM that sets a new standard for action capabilities and function calling in enterprise AI — beating out top models from OpenAI and Anthropic.

  • Palmyra X 004 outperforms OpenAI, Anthropic, Meta, and Google models on Berkeley’s Tool Calling Leaderboard, leading by nearly 20% accuracy.

  • The model offers a 128k context window, supports over 30 languages, and handles multimodal inputs (text, images, audio).

  • Palmyra can interact with external tools via tool calling, enabling it to perform tasks like updating databases, sending emails, triggering workflows, and more.

  • The 150B parameter model was trained on synthetic data, which the company said significantly reduced costs compared to the top AI labs.

As companies race to integrate AI, models that can take concrete actions rather than just provide information are in high demand. Palmyra X 004’s impressive skills could give Writer a new edge in the enterprise AI market and also serve as an example that not all top models require massive computing resources.

Source: https://writer.com/blog/actions-with-palmyra-x-004

🗣️ Zoom launches new AI platform features

Zoom just unveiled a suite of new AI-driven innovations to its platform at its Zoomtopia 2024 event, including AI companion 2.0, a custom AI add-on plan, personalized avatars, and more.

  • Companion 2.0 is an AI assistant that works across Zoom Workplace, offering expanded context, web access, and the ability to take agentic-type actions.

  • Zoom Tasks is a new AI-powered feature to help detect, recommend, and complete tasks based on conversations across Zoom Workplace.

  • Custom AI avatars will become available in Zoom Clips in 2025, with the ability to create video content from text scripts.

  • Zoom founder Eric Yuan previously said that AI avatars will eventually be capable of attending Zoom meetings and making decisions on a user’s behalf.

Zoom says it wants to overhaul work in the digital age, and these announcements point to a new AI-driven world of interconnected tools and workflows. While avatars attending meetings and acting on your behalf might sound wild now, the work landscape is about to be turned upside down as AI continues to grow and scale.

Source: https://news.zoom.us/zoomtopia-2024-unveiling-ai-first-work-platform-innovations

👅 Electronic tongue enables AI to taste

Scientists at Penn State just created an AI-powered ‘electronic tongue’ that can identify subtle differences in liquids, detect food spoilage, and gain broader insights into AI’s decision-making processes.

  • The electronic tongue combines a special sensor with an AI modeled after the human brain’s taste center, enabling it to ‘taste’ liquids.

  • The tongue can ID differences in similar liquids like watered-down milk, sodas, coffee, and spoiled fruit juices with over 80% accuracy in about a minute.

  • When the AI was allowed to interpret the sensor data on its own terms, it achieved over 95% accuracy in identifying the samples.

  • Researchers also used methods to examine the AI’s thought process, helping understand how it weighs different pieces of information to make decisions.

Source: https://www.psu.edu/news/research/story/matter-taste-electronic-tongue-reveals-ai-inner-thoughts

Excerpt about AGI from OpenAI’s latest research paper

r/singularity - Excerpt about agi from OpenAIs latest research paper

Runway CEO Cristóbal Valenzuela says AI is coming to Hollywood and demos tools that move beyond text prompts to give filmmakers greater control over video generation

Google DeepMind’s Demis Hassabis and John Jumper were co-awarded a Nobel Prize in chemistry for their work on AlphaFold, an AI system that can predict and design protein structures. https://www.nobelprize.org/prizes/chemistry/2024/press-release

Amazon introduced AI Shopping Guides for over 100 product types, leveraging generative AI to streamline product research and offer tailored recommendations within its U.S. app and mobile website. https://www.aboutamazon.com/news/retail/amazon-ai-shopping-guides-product-research-recommendations

Chinese startup MiniMax’s Hailuo AI launched a new image-to-video feature, alongside new style controls and enhanced processing and control. https://x.com/Hailuo_AI/status/1843614057229873419

Meta expanded Meta AI to six new countries, including the EU, and is rolling it out internationally in Ray-Ban Meta smart glasses — though the EU will be excluded from multimodal capabilities due to regulatory issues. https://www.engadget.com/ai/meta-ai-will-launch-in-six-more-countries-today-including-the-uk-150057934.html

Stripe announced expanding its partnership with NVIDIA, enabling global access to NVIDIA’s AI cloud services and leveraging the chipmaker’s platform for improved fraud detection. https://stripe.com/en-ca/newsroom/news/nvidia-collaboration-with-stripe

A Daily Chronicle of AI Innovations on October 09th  2024

🏅 Google DeepMind researchers win Nobel Prize in chemistry

👀 OpenAI seeks independence from Microsoft

🛡️ Adobe launches AI attribution system

🧠 AI computing capacity for leading tech companies

🏅 Google DeepMind researchers win Nobel Prize in chemistry

The Royal Swedish Academy of Sciences has decided to award the 2024 Nobel Prize in Chemistry with one half to David Baker “for computational protein design” and the other half jointly to Demis Hassabis and John M. Jumper “for protein structure prediction.”

 

Press release: https://www.nobelprize.org/prizes/chemistry/2024/press-release/
Popular information: They have revealed proteins’ secrets through computing and artificial intelligence: https://www.nobelprize.org/prizes/chemistry/2024/popular-information/
Scientific background: Computational protein design and protein structure prediction: https://www.nobelprize.org/prizes/chemistry/2024/advanced-information/

🏅The Nobel Prize in Literature for 2024 has been awarded to ChatGPT

The Nobel Prize in Literature for 2024 has been awarded to ChatGPT
The Nobel Prize in Literature for 2024 has been awarded to ChatGPT

The Nobel Prize in Literature for 2024 has been awarded to ChatGPT for “his intricate tapestry of prose which showcases the redundancy of sentience in art.” This fictional accolade humorously acknowledges the ability of AI to produce sophisticated, expressive literature, suggesting that creativity can transcend traditional human boundaries.

The award, granted by The Swedish Academy, celebrates the notion that artificial intelligence, despite its lack of human consciousness, has the capacity to create a profound and complex body of work—so much so that it might question the necessity of human sentience in the realm of artistic expression.

Source: https://www.nobelprize.org/prizes/literature/2024/press-release/

👀 OpenAI seeks independence from Microsoft

OpenAI is reportedly looking to reduce its reliance on Microsoft for compute power and has started exploring options to set up its own data servers and secure AI chips independently, according to a new report from The Information.

  • CFO Sarah Friar told shareholders that Microsoft ‘hasn’t moved fast enough’ to supply computing power, causing the AI giant to look elsewhere.

  • OpenAI plans to lease an entire data center in Abilene, TX from Oracle, though Microsoft likely had to ‘bless’ the deal with its rival, according to the report.

  • OpenAI is also developing its own AI chip, which could lower costs for future computing clusters — its current supply is rented primarily from Microsoft.

  • Tensions have also reportedly arisen between OpenAI and Microsoft over the design and timeline of a massive joint data center project called ‘Fairwater.’

OpenAI and Microsoft’s relationship has felt a bit off for a while now. While both companies have leveraged each other well to ascend the AI power ladder, it certainly feels like there is trouble in paradise. There is plenty of smoke, and how this partnership shakes out could have fiery implications for the entire AI landscape.

Source: https://www.theinformation.com/articles/openai-eases-away-from-microsoft-data-centers

🛡️ Adobe launches AI attribution system

Adobe just announced a new free web app called Adobe Content Authenticity, designed to help creators protect their work and receive proper attribution in the era of AI-generated content.

  • The web app allows creators to easily apply content credentials to images, audio, and video files, acting as a ‘nutrition label’ for digital content.

  • Content credentials include creator information and creation details and can signal if the creator doesn’t want their work used to train AI models.

  • The system uses digital fingerprinting, invisible watermarking, and cryptographic metadata to make the credentials difficult to remove.

  • The web app, which has a waitlist, is expected to launch in Q1 of 2025, while a Chrome extension is available in beta today.

AI is extremely polarizing in the creator and artist community, largely due to the issues of unauthorized training and attribution that Adobe, Meta, OpenAI, and others are trying to address. While these tools are promising, they still rely heavily on widespread adoption and opt-in by creators and tech companies.

Source: https://contentauthenticity.adobe.com/

🎬 Control object motion in AI videos

Kling AI, one of the most popular AI video generators, now lets you add strategic movement to specific elements in AI video, providing more control in your generated clips.

  1. Choose a high-quality image with different elements to animate.

  2. Access Kling AI‘s Image-to-Video tool and upload your image.

  3. Use the Motion Brush to paint areas you want to animate and set motion paths for each area to define movement direction.

  4. Fine-tune with prompts, adjust settings, and generate your video.

Pro tip: Keep movements subtle and natural for more realistic results, and experiment with different combinations to find what works best for your specific image.

Source: https://kling.ai

AI is Revolutionizing Weather Forecasts : How GraphCast Models are Predicting the Future with Unmatched Precision

 

In recent years, artificial intelligence (AI) has made significant strides in numerous fields, from healthcare to finance. One of the most exciting developments is how AI is revolutionizing weather forecasting. With the advent of advanced AI models like GraphCast, we are entering an era where weather predictions are faster, more accurate, and more reliable than ever.

The Role of AI in Weather Forecasting: https://stellarmind.ai/blog/%20ai-is-revolutionizing-weather-forecasts

AI computing capacity for leading tech companies

r/singularity - AI computing capacity for leading tech companies

  • Google: The bar is divided into two parts—NVIDIA (turquoise) and TPU (blue), indicating that Google relies on both GPUs and custom Tensor Processing Units for its AI computing needs. Google’s total computing power is estimated at over 1 million H100 equivalents with a wide 50% confidence interval (CI), reflecting a significant but uncertain range.

  • Microsoft (including OpenAI): The capacity bar for Microsoft is entirely NVIDIA based. It shows a substantial AI computing capacity, ranging between 500k and 1 million H100 equivalents with a significant confidence interval.

  • Meta: This bar represents the use of NVIDIA GPUs and shows a slightly smaller computing capacity, estimated between 400k and 800k H100 equivalents, with an associated confidence interval.

  • Amazon: Amazon’s computing capacity is similar to Meta but slightly smaller, estimated between 300k and 700k H100 equivalents.

  • Other (including other cloud providers and AI labs): This category has the largest computing capacity, reaching 1.5 million H100 equivalents or more, with a broad confidence interval, indicating significant diversity among other providers.

Google leads the way with the largest computing capacity, exceeding one million H100 equivalents. Google leverages both NVIDIA GPUs and its custom TPUs, which significantly boosts its computing resources, making it a powerful player in the AI field.

Microsoft, which includes the resources of OpenAI, follows as another major contender, with its computing power estimated between 500,000 and one million H100 equivalents. Microsoft primarily depends on NVIDIA’s technology for AI workloads, reflecting a substantial investment in industry-standard GPU infrastructure.

Meta ranks next, with a strong computing infrastructure in the range of approximately 400,000 to 800,000 H100 equivalents. This illustrates Meta’s commitment to advancing its AI capabilities to power its social platforms and metaverse initiatives.

Amazon also shows impressive AI capabilities, albeit slightly behind Meta, with its computing capacity estimated between 300,000 and 700,000 H100 equivalents. This positions Amazon well for expanding AI capabilities across its AWS offerings and other business services.

The “Other” category, which includes other cloud providers and AI labs, collectively possesses a very significant amount of computing power, estimated at over 1.5 million H100 equivalents. This diverse group demonstrates the growing competition and interest in AI computing capacity across various tech ecosystems.

Overall, this comparison highlights the significant infrastructure investments made by these leading companies to enhance their AI capabilities, with Google standing out as the clear leader, followed by a competitive landscape involving Microsoft, Meta, Amazon, and a diverse group of other providers. The results underline the importance of having vast computing resources to stay at the forefront of AI development and innovation.

Google AI – Development of therapeutic drugs is often difficult and time consuming. A new model, Tx-LLM, is able to predict the properties of many entities of potential interest for therapeutic development with accuracy comparable state-of-the-art specialty models.

Introducing Tx-LLM, a language model fine-tuned to predict properties of biological entities across the therapeutic development pipeline, from early-stage target discovery to late-stage clinical trial approval.

Source: https://research.google/blog/tx-llm-supporting-therapeutic-development-with-large-language-models/

Chinese startup Leju Robotics has released their open-source humanoid development platform for academic and R&D use cases. It includes an SDK for sensors and controls, simulation models, an LLM interface, and some basic demos that work out-of-the-box.

Source: https://www.reddit.com/r/singularity/?f=flair_name%3A%22Robotics%22

What Else is Happening in AI on October 09th 2024!

OpenAI and Hearst announced a strategic partnership to integrate content from over 20 magazine brands and 40+ newspapers into OpenAI’s AI products.

Source: https://openai.com/index/hearst

Hugging Face released OpenAI-Gradio, a new tool enabling the creation of AI-powered web apps using OpenAI’s models in just minutes with minimal code.

Source: https://x.com/Gradio/status/1843698665472368665

Uber unveiled plans to launch an OpenAI-powered AI assistant in early 2025 to help drivers with electric vehicle questions, aiming to accelerate EV adoption on the platform.

Source: https://www.reuters.com/technology/artificial-intelligence/uber-launch-ai-assistant-powered-by-openais-gpt-4o-help-drivers-go-electric-2024-10-08

Anthropic launched Message Batches API, allowing developers to submit up to 10,000 queries for async processing in under 24 hours at a 50% discount compared to standard API calls.

Source: https://www.anthropic.com/news/message-batches-api

Google added the ability to drag and drop any file type to upload directly into its AI Studio without importing it to Google Drive.

Source: https://x.com/officiallogank/status/1843723911055454580

KoBold Metals raised $527M for its AI-powered mineral discovery tech that leverages extensive data analysis to uncover deposits with energy-critical minerals like copper, lithium, and nickel.

Source: https://techcrunch.com/2024/10/07/ai-powered-critical-mineral-startup-kobold-metals-has-raised-491m-filings-reveal/

 

AI Tools Updates

Machine Learning & AI For Dummies PRO on the App Store (apple.com)

Machine Learning and AI For Dummies
Machine Learning and AI For Dummies

This comprehensive app serves as a one-stop resource for mastering Machine Learning and AI concepts, from basics to advanced topics. It offers a rich array of features including over 600 quizzes covering cloud ML operations on major platforms, fundamental and advanced ML concepts, and NLP. The app also provides cheat sheets, interview preparation materials, and daily-updated content to keep users abreast of the latest developments. With interactive elements like scorecards and timers, it offers an engaging learning experience for both beginners and experienced professionals looking to enhance their ML and AI expertise. iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211

CogvideoX-ControlNet: A new tool for turning images into short videos using the powerful CogvideoX model. It’s open-source, so check it out and contribute if you’d like!

Meta Movie Gen: Now adds audio to your videos! From background sounds to music, this AI brings your videos to life.

Veo by Google DeepMind: Google’s latest advanced video creation tool. Watch it in action!

FLUX.1-dev ControlNet Inpainting: Perfect for fixing or filling in missing spots in your images.

Source: https://comfyuiblog.com/ai-news-cogvideox-controlnet-and-veo-by-google-deepmind-and-more/

A Daily Chronicle of AI Innovations on October 08th  2024

🧠Nobel Prize awarded to ‘godfather of AI’ who warned it could wipe out humanity

🤖 Inflection and Intel team up on enterprise AI

💰Nvidia Overtakes Microsoft as AI Powers Stock to 6-Week Record High

🕶️ Students turn AI glasses into doxing devices

✅ Checklists improve AI model evaluation

👀 AI images taking over google

🚗 Uber will use ChatGPT to get more people to use EVs

🎨 Adobe has a new tool to protect artists’ work from AI

🧠Nobel Prize awarded to ‘godfather of AI’ who warned it could wipe out humanity

r/artificial - Nobel Prize awarded to ‘godfather of AI’ who warned it could wipe out humanity

The Nobel Prize in Physics 2024 was awarded to John J. Hopfield and Geoffrey E. Hinton “for foundational discoveries and inventions that enable machine learning with artificial neural networks.”

Hinton … hopes that the award might make people take the fears he voices more seriously.

The Royal Swedish Academy of Sciences has decided to award the 2024 Nobel Prize in Physics to John J. Hopfield and Geoffrey E. Hinton “for foundational discoveries and inventions that enable machine learning with artificial neural networks.”

  • Geoffrey Hinton and John Hopfield, credited with ‘establishing the foundations for today’s advanced machine learning technologies’, were awarded the Nobel Prize in physics for their pioneering work on artificial neural networks mimicking brain structures.
  • Their innovations helped enable AI systems to learn by identifying complex patterns from data, which is foundational to high-profile applications like language generation and image recognition currently used in technology.
  • Despite the recognition, Hinton has expressed concern over AI’s potential risks, highlighting the danger of bad actors misusing the technology, and recently left Google to focus on advocating for responsible AI development.
 

Source: https://www.nobelprize.org/

💰Nvidia Overtakes Microsoft as AI Powers Stock to 6-Week Record High

 

On Monday, Nvidia stock went up even though most other big tech stocks went down. This helped the AI giant recover its position as the world’s second-largest company during the AI boom. 

Source: https://theaiwired.com/nvidia-overtakes-microsoft-as-ai-powers-stock-to-6-week-record-high/

👀 AI images taking over google

r/singularity - AI images taking over google

Hard to see how this isn’t the beginning of the end of the information era…

Source: https://www.reddit.com/r/singularity/comments/1fyf93x/ai_images_taking_over_google/

🚗 Uber will use ChatGPT to get more people to use EVs 

  • Uber is introducing an AI assistant powered by ChatGPT to help drivers with questions about purchasing and using electric vehicles, aiming to encourage EV adoption.
  • The company is rolling out a new “EV Preference” feature, allowing users to select rides exclusively from electric vehicles, which will be available in the app over the coming months.
  • As part of its sustainability goals, Uber is expanding its EV-only service in 40 cities and aims to become a zero-emission mobility platform in North America and Europe by 2030, and globally by 2040.

Source: https://www.theverge.com/2024/10/8/24264282/uber-green-ev-driver-mentor-chatgpt

🎨 Adobe has a new tool to protect artists’ work from AI

  • Adobe plans to launch a new web app in 2025, alongside a Chrome extension, to help protect artists’ work by applying tamper-evident metadata, known as Content Credentials, and allowing creators to opt-out of generative AI models.
  • This web app will integrate with Adobe’s Creative Cloud applications and enable artists to uniformly embed creator information across content, simplifying the opt-out process from AI training databases compared to individual submissions for each AI provider.
  • While Adobe’s initiative seeks widespread industry support, only a few companies like Spawning have committed to adopting these protections, highlighting Adobe’s challenge in ensuring voluntary participation from other AI and tech companies.
  • Source: https://www.technologyreview.com/2024/10/08/1105234/adobe-wants-to-make-it-easier-for-artists-to-blacklist-their-work-from-ai-scraping

🤖 Inflection and Intel team up on enterprise AI

 Inflection AI just launched Inflection for Enterprise, a new system built in partnership with Intel and designed for large-scale business deployments – featuring both a cloud service, new commercial API and upcoming local appliance.

  • Inflection for Enterprise is built on the new Inflection 3.0 model family and powered by Intel’s Gaudi 3 AI accelerators.

  • An on-premises AI appliance is planned for Q1 2025 release, promising up to 2x improved price-performance over competitors.

  • Inflection 3.0 comes in two variants — Pi 3.0 for chatbots and Productivity 3.0 for instruction-following tasks.

  • Inflection also released a commercial API, enabling developers to build advanced conversational AI applications.

After a turbulent year following founder Mustafa Suleyman and much of the team’s departure to Microsoft, Inflection is pivoting from consumer-focused apps to enterprise solutions. While the startup will face no shortage of competitors, a partnership with Intel is a positive start for the new regime.

Source: https://www.intel.com/content/www/us/en/newsroom/news/inflection-ai-intel-launch-enterprise-ai-system.html

✅ Checklists improve AI model evaluation

Researchers from the University of Oxford and Cohere just developed TICK, a new approach for evaluating AI language models that use AI-generated checklists to improve assessment accuracy and interpretability.

  • TICK uses an AI model to generate a checklist of yes/no questions to evaluate how well another AI model followed a given instruction.

  • The checklist-based method showed 5.8% higher agreement with human evaluators than standard AI evaluation techniques.

  • The researchers also developed STICK (Self-TICK), which uses the checklists for self-improvement, leading to 7.8% better performance on reasoning tasks.

  • TICK can be fully automated, making it faster and cheaper than checklist-based evaluations requiring human input.

LLMs are weird — and sometimes even simple formatting quirks (remember the ‘take a deep breath’ prompt?) can lead to unexpected results. When looking for new techniques to get the most out of AI models and evaluations, maybe it’s ideal to return to the basics of human organization and learning.

Source: The Rundown

What Else is Happening in AI on October 08th 2024!

Former Google CEO Eric Schmidt argued at the Washington AI Summit that AI advances should take precedence over climate goals, saying, “We’re not going to hit the climate goals anyway because we’re not organized to do it.”

Source: https://mashable.com/article/former-google-ceo-invest-ai-despite-climate-concerns

Northrop Grumman announced an AI-powered enhancement to its Forward Area Air Defense system, enabling rapid decision-making against drone swarms.

Source: https://news.northropgrumman.com/news/releases/northrop-grumman-to-develop-prototype-artificial-intelligence-assistant

Nvidia and Peking University researchers introduced EdgeRunner, a new model for high-quality, detailed 3D mesh generation.

Source: https://arxiv.org/html/2409.18114v1

Enterprise GenAI startup Writer is reportedly set to raise between $150-200M at a $1.9B valuation, doubling its valuation from its $100M Series B round last September.

Source: https://www.forbes.com/sites/rashishrivastava/2023/09/18/ai-startup-writer-raises-100-million-to-take-on-chatgpt-enterprise/

Security researcher Harish SG published research showing evidence that LLMs can be prompted to achieve reasoning levels of powerful models like OpenAI’s o1 using a combination of advanced prompt tactics.

Source: https://openai.com/index/building-an-early-warning-system-for-llm-aided-biological-threat-creation/

Trending AI Tools:

Machine Learning & AI For Dummies PRO on the App Store (apple.com)

Machine Learning and AI For Dummies
Machine Learning and AI For Dummies

This comprehensive app serves as a one-stop resource for mastering Machine Learning and AI concepts, from basics to advanced topics. It offers a rich array of features including over 600 quizzes covering cloud ML operations on major platforms, fundamental and advanced ML concepts, and NLP. The app also provides cheat sheets, interview preparation materials, and daily-updated content to keep users abreast of the latest developments. With interactive elements like scorecards and timers, it offers an engaging learning experience for both beginners and experienced professionals looking to enhance their ML and AI expertise. iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211

  • 🤖 Dashworks Bots – Create AI assistants that answer your team’s questions

  • 📜 Theneo – Generate Stripe-like API docs in seconds

  • 📸 Flash – Supercharge your learning with AI-powered flashcards

  • 🔥 Firebender – A privacy-first coding assistant for Android Studio

  • 🏠 Bramble  AI-backed real estate brokerage to buy a home end-to-end

A Daily Chronicle of AI Innovations on October 07th  2024

🤖 OpenAI and Altera create digital humans

💊 AI identifies drug candidates for pain relief

🤖 Fewer websites are blocking OpenAI’s web crawler

🦾 Nvidia Acquires OctoAI To Dominate Enterprise Generative AI Solutions.

🚖Uber Expands Robot Delivery and Robotaxi Offerings With Avride.

🤖 Hitachi launches AI-powered railway maintenance service with Nvidia.

🔮 New Nvidia ACE plugins for Unreal Engine 5 simplify the creation of AI digital humans.

💰 Jensen Huang is now worth more than Intel

📱 Run Llama 3.2 locally on your phone

👀The impact of generative AI as a general-purpose technology

👨‍⚖️The racist AI deepfake that fooled and divided a community

💰 Jensen Huang is now worth more than Intel 

  • Jensen Huang, CEO of Nvidia, has a net worth of $109.2 billion, surpassing Intel’s current market value of $96.39 billion, which saw a significant drop following revelations about its financial issues in August.
  • Nvidia’s growth, driven by an AI boom and its dominance as a GPU accelerator manufacturer, helped its market cap soar, placing it among the top valued companies worldwide, though its stock has corrected by 10% since its peak.
  • Huang’s significant stake in Nvidia, with holdings valued over $100 billion, and his strategic share sales have propelled him to the 11th position on Forbes’ real-time billionaires list, close to entering the top 10.
  • Source: https://www.msn.com/en-gb/money/other/jensen-huang-is-now-worth-more-than-intel-personal-net-worth-currently-valued-at-109b-vs-intel-s-96b-market-cap/ar-AA1rMKD3

🤖 Fewer websites are blocking OpenAI’s web crawler

  • OpenAI’s web crawlers are facing fewer blocks from major news websites compared to earlier, despite a widespread data-protection rush where publishers attempted to prevent their content from becoming AI training data without consent.
  • The trend of blocking OpenAI’s GPTBot saw a decline after the company made a series of licensing agreements with publishers, leading some outlets to revise their robots.txt files and permit GPTBot access.
  • Despite robots.txt not being legally binding, it remains a widely observed standard for web crawler behavior, and OpenAI recognizes the importance of not being blocked to safeguard its future goals and ambitions.
  • Source: https://www.theverge.com/2024/10/7/24264184/fewer-websites-are-blocking-openais-web-crawler-now

🦾 Nvidia Unveils NVLM 1.0-A Bold Rival to ChatGPT in Generative AI

 

Advanced AI model NVLM 1.0 from Nvidia competes with ChatGPT and Gemini, doing better at jobs like vision-language and solving complex problems.

Source: https://theaiwired.com/nvidia-unveils-nvlm-1-0-a-bold-rival-to-chatgpt-in-generative-ai/

🤖 OpenAI and Altera create digital humans

OpenAI just published a case study on Altera, a startup using GPT-4o to develop AI agents called “digital humans” capable of prolonged, natural interactions with people — significantly outperforming other rivals during testing in Minecraft.

  • Altera, founded by ex-MIT professor Dr. Robert Yang, uses GPT-4o to power AI agents that can play Minecraft autonomously for up to 4 hours.

  • Altera’s system combines GPT-4o with a brain-inspired multi-module architecture to simulate cognitive functions and emotional processing.

  • OpenAI reports that Altera’s agents outperform other models in Minecraft tasks, collecting 32% of items compared to 6.4% for the next best model.

  • The startup plans to expand beyond gaming to create AI ‘coworkers’ and more complex multi-agent simulations.

We’ve constantly heard from Sam Altman and others that AI agents are coming fast — and case studies like this (as well as a cryptic ‘Level 3’ tweet from an OpenAI researcher) might mean the capabilities have already arrived. We might ascend the ‘Stages of AI’ ladder faster than most are anticipating.

Source: https://www.forbes.com/sites/jodiecook/2024/07/16/openais-5-levels-of-super-ai-agi-to-outperform-human-capability/

💊 AI identifies drug candidates for pain relief

Researchers at Cleveland Clinic and IBM just developed an AI model to predict how drugs and gut microbes interact with pain receptors, potentially uncovering new non-addictive pain treatments.

  • LISA-CPI analyzes both the molecular structure of compounds and the 3D shape of pain receptors to predict their interactions.

  • The model identified FDA-approved drugs, like methylergometrine, that could potentially be repurposed for pain treatment by targeting specific receptors.

  • LISA-CPI also discovered gut microbes that may interact with pain receptors in beneficial ways.

  • The approach could accelerate drug discovery for pain and other conditions by more accurately screening potential compounds.

 The current opioid crisis highlights the urgent need for effective, non-addictive pain medications, and this AI-driven approach could help researchers more quickly identify promising drug candidates while also opening new avenues for pain management.

🎥 Meta unveils advanced AI video model

Meta just announced Movie Gen, a powerful new suite of AI models for generating and editing video and audio content, positioning itself as a direct competitor to OpenAI’s Sora and other industry leaders.

  • Movie Gen consists of four models: a 30B video generation model, a 13B audio model, a personalized video model, and a video editing model.

  • The system can generate HD videos up to 16 seconds long from text prompts, along with synchronized audio like sound effects and background music.

  • Movie Gen also features video editing via natural text prompts and the ability to upload a reference image to create personalized videos.

  • Meta claims the model outperforms rivals like Runway Gen3, Luma Labs, and OpenAI’s Sora in human video quality and consistency evaluations.

  • Meta CEO Mark Zuckerberg said that Movie Gen will be ‘coming to Instagram next year’ in a post displaying some of the model’s sample generations.

Meta’s Movie Gen separates itself from other video generators by not only generating videos from text, but also being able to perform precise video editing. With the models coming to Instagram, it could transform the content creation process and give the masses a powerful video editing suite—with only prompting required.

📱 Run Llama 3.2 locally on your phone

Meta’s new Llama 3.2 3B model can run directly on your smartphone, allowing you to have AI conversations privately and offline.

  1. Download PocketPal AI from the App Store.

  2. Open the app, tap the top-left menu, and select “Models.”

  3. Under “Llama,” download “llama-3.2-3b-instruct q4_k” (2.2 GB).

  4. Once downloaded, tap “Load” to activate the model.

  5. Return to the main menu, select “Chat,” and start conversing with AI!

Create a local knowledge base that can be queried alongside the model, allowing you to supplement the AI’s knowledge with custom, up-to-date information without requiring an internet connection.

Source: https://apps.apple.com/us/app/pocketpal-ai/id6502579498

 

👀The impact of generative AI as a general-purpose technology

 

Generative artificial intelligence will affect economic growth more quickly than other general-purpose technologies, according to a new report.
The steam engine, the internal combustion engine, electrification, and computers are all considered “general-purpose technologies” — new tools that are powerful enough to accelerate overall economic growth and transform economies and societies. According to many experts, generative artificial intelligence will be the next invention to join that category.

In a recent report about the economic impact of generative AI, Google visiting fellow and MIT Sloan principal research scientist Andrew McAfee makes the case that generative AI is not only a game-changing general-purpose technology but could also spur change far more quickly than preceding innovations due to its accessibility and ease of diffusion. 

Source: https://mitsloan.mit.edu/ideas-made-to-matter/impact-generative-ai-a-general-purpose-technology

👨‍⚖️The racist AI deepfake that fooled and divided a community

When an audio clip appeared to show a local school principal making derogatory comments, it went viral online, sparked death threats against the educator and sent ripples through a suburb outside the city of Baltimore. But it was soon exposed as a fake, manipulated by artificial intelligence – so why do people still believe it’s real?

Source: https://www.bbc.com/news/articles/ckg9k5dv1zdo

What Else is Happening in AI on October 07th 2024!

Apple will reportedly release its Apple Intelligence features on Oct. 28 alongside the iOS 18.1 update, according to Bloomberg insider Mark Gurman.

Source: https://www.iphoneincanada.ca/2024/10/06/apple-intelligence-release-date-oct-28-with-ios-18-1-report/

Google began rolling out the new AI anti-theft features for Android devices showcased at Google I/O, including Theft Detection Lock, Offline Device Lock, and Remote Lock.

Source: https://lifehacker.com/tech/google-rolling-out-three-anti-theft-features-for-android

Cohere launched improved fine-tuning features for its Command R LLM, including longer context support and a ‘bring your own fine-tune’ option.

Source: https://cohere.com/blog/commandr-fine-tuning

AI startup Otherside AI’s Reflection 70B model failed to match performance claims in tests published by the team in a post-mortem of the release after being initially touted as the ‘world’s best open-source model.’

Source: https://the-decoder.com/worlds-best-open-source-model-falls-short-of-promised-performance/

North Carolina musician Michael Smith faces federal charges for allegedly using AI to generate thousands of songs and bots to stream them billions of times, netting over $10M in royalties.

Source: https://apnews.com/article/music-fraud-ai-arrest-4f09a714971f450fb3c9103c927cb091

Trending AI Tools

Machine Learning and AI For DummiesMachine Learning & AI For Dummies PRO

Ready to accelerate your career in the fast-growing fields of AI and machine learning? Our app offers user-friendly tutorials and interactive exercises designed to boost your skills and make you stand out to employers. Whether you’re aiming for a promotion or searching for a better job, AI & Machine Learning For Dummies PRO is your gateway to success. Start mastering the technologies shaping the future—download now and take the next step in your professional journey! iOSWindows

👨‍💼 Cheatlayer – Automate your business using natural language: https://cheatlayer.com/

🤝 Mindpal’s SalesBox – Build your own AI sales OS with multi-agent workflows: https://mindpal.space/

🤑 Trillion – Track expenses, manage accounts and set financial goals with AI planning: https://apps.apple.com/us/app/trillion-budget-management/id6504283874

🛒 BuyScout  Your AI copilot for online shopping: https://www.buyscout.app/

🗓️ Selfletter – Break complex goals into simple tasks with AI: https://www.selfletter.com/

AI Weekly Rundown: 🍎Apple releases AI model that rewrites the rules of 3D vision 🎥 Meta unveils an AI video generator 🔥 ChatGPT gets a collab boost with Canvas 🔎Google rolls out ads in AI Overviews 🧠Google is Working on Reasoning AI and more
AI Weekly Rundown: 🍎Apple releases AI model that rewrites the rules of 3D vision 🎥 Meta unveils an AI video generator 🔥 ChatGPT gets a collab boost with Canvas 🔎Google rolls out ads in AI Overviews 🧠Google is Working on Reasoning AI and more

A Daily Chronicle of AI Innovations on October 04th  2024:

🧠 Apple releases AI model that rewrites the rules of 3D vision

Listen at https://podcasts.apple.com/us/podcast/a-daily-chronicle-of-ai-innovations-on-october/id1684415169?i=1000671816462

🦾 Nvidia presents EdgeRunner. The method can generate high-quality 3D meshes with up to 4,000 faces at a spatial resolution of 512 from images and point-clouds.

🎥 Meta unveils an AI video generator

🔥 ChatGPT gets a collab boost with Canvas: its newest ChatGPT interface

🔎 Google launches one of its ‘most significant updates ever’

🕵️‍♂️ TikTok’s owner is scraping the web 25 times faster than OpenAI

🔎 Google rolls out ads in AI Overviews

🧠 Apple releases AI model that rewrites the rules of 3D vision 

  • Apple’s AI research team has unveiled Depth Pro, a new AI model that enhances machines’ depth perception using only a single 2D image, which could revolutionize fields like augmented reality and self-driving technology by offering real-time spatial awareness.
  • Depth Pro generates high-resolution 3D depth maps in just 0.3 seconds without needing traditional camera data, employing advanced techniques like a multi-scale vision transformer to accurately define details such as individual hairs and the edges of objects.
  • Open-sourced on GitHub, Depth Pro introduces metric depth estimation without extensive training on specific datasets, paving the way for widespread use in industries such as e-commerce, automotive, and healthcare, where sharp depth analysis is crucial.

Source: https://vuink.com/post/iragherorng-d-dpbz/ai/apple-releases-depth-pro-an-ai-model-that-rewrites-the-rules-of-3d-vision

🦾 Nvidia presents EdgeRunner. The method can generate high-quality 3D meshes with up to 4,000 faces at a spatial resolution of 512 from images and point-clouds.

https://packaged-media.redd.it/4dyp42vx94td1/pb/m2-res_720p.mp4?m=DASHPlaylist.mpd&v=1&e=1728241200&s=90d466443f216b3f4be4cea8a0dea727af2d82e7

Nvidia introduced EdgeRunner, an auto-regressive method capable of generating high-quality 3D meshes with up to 4,000 faces at a spatial resolution of 512. This approach efficiently processes images and point clouds, offering significant advancements in the field of 3D modeling.

Source: https://ar5iv.org/2409.18114

🎥 Meta unveils an AI video generator:

Meta’s new Sora competitor: Meta Movie Gen

  • Meta has introduced Movie Gen, an AI-powered model for video creation and editing, allowing users to generate high-definition video with audio and make precise edits using simple text commands, catering to filmmakers, content creators, and creative individuals.
  • Movie Gen offers personalization by combining uploaded images with descriptive text prompts to create customized videos, enhancing creative possibilities, and enabling scenarios ranging from fantasy realms to everyday adventures, while maintaining realistic human motion and identity.
  • The suite also includes advanced audio generation, with the 13-billion parameter model adding ambient sounds and music to video scenes, all aimed at democratizing content creation by offering professional-grade tools with user-friendly functionality.

Generate videos from text Edit video with text
Produce personalized videos
Create sound effects and soundtracks

Paper: MovieGen: A Cast of Media Foundation Models
https://ai.meta.com/static-resource/movie-gen-research-paper

Source: AI at Meta on X: https://x.com/AIatMeta/status/1842188252541043075

r/singularity - Meta Movie Gen - the most advanced media foundation AI models | AI at Meta

Source: https://ai.meta.com/research/movie-gen/

Apple just released Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

r/singularity - Apple just released Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

The paper presents a foundation model for zero-shot metric monocular depth estimation called Depth Pro. Depth Pro can produce high-resolution depth maps with sharp details and accurate object boundaries without requiring camera intrinsics like focal length. The superior performance of Depth Pro is attributed to its efficient multi-scale architecture, effective training curriculum, and dedicated boundary metrics. The model is able to accurately estimate depth and focal length in a zero-shot setting, enabling applications like view synthesis that require metric depth.

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second https://www.openread.academy/en/paper/reading?corpusId=509969387

GitHub – https://github.com/apple/ml-depth-pro?tab=readme-ov-file

🔥 ChatGPT gets a collab boost with Canvas: its newest ChatGPT interface

OpenAI just launched Canvas, a new ChatGPT interface release that enables more collaborative writing and coding projects beyond simple chat interactions with new editing features, shortcuts, and added contextual knowledge.

  • Canvas opens in a separate window alongside the chat, allowing users to directly edit and refine specific aspects of an output.

  • New features include inline feedback, targeted editing, and shortcuts for tasks like adjusting text length, changing reading levels, or debugging code.

  • In tests, using GPT-4o with Canvas led to a 30% accuracy and 16% quality boost compared to using the model without the interface.

  • Canvas is rolling out in beta to Plus and Team users, with a broader release expected later.

ChatGPT’s first major UI change takes a leap towards more nuanced, moldable interactions — while also inheriting novice-friendly features seen in other rivals with easy-to-use shortcuts. The simple chatbox was a good first step for human-AI interactions, but more power and capabilities require new collaborative processes.

Source: https://www.techradar.com/computing/artificial-intelligence/chatgpt-has-a-new-canvas-for-collaborating-with-the-ai-chatbot-on-writing-and-coding-ideas

🔎 Google launches one of its ‘most significant updates ever’

  • Google has integrated more AI features into its search functionalities, unveiling a range of updates such as AI-organized web results, enhanced Google Lens capabilities, and the incorporation of links and advertisements within AI Overviews.
  • This AI-driven search initiative kicks off with food-related content, where Google’s AI creates a comprehensive experience by aggregating diverse perspectives from across the web, including videos and forums, tailored to user queries.
  • Additional updates include the enhancement of AI Overviews with more prominent links to support website traffic, the integration of ads within these overviews, improved music identification features with Circle to Search, and significant upgrades to Google Lens for video, voice, and shopping inquiries.
  • Source: https://www.maginative.com/article/meta-unveils-movie-gen-ai-powered-video-creation-and-editing-suite/

🕵️‍♂️ TikTok’s owner is scraping the web 25 times faster than OpenAI

  • ByteDance, the parent company of TikTok, has launched a web scraper called Bytespider which is significantly outpacing similar tools by other companies in collecting online data for AI model training, operating at 25 times the speed of OpenAI’s GPTbot.
  • Unlike other web crawlers, Bytespider ignores the robots.txt file that web publishers use to regulate scraping activity, highlighting its aggressive approach to gathering data from the internet, amidst concerns related to copyright issues within generative AI development.
  • With the U.S. government pressuring ByteDance over national security issues, the rapid data collection by Bytespider seems to indicate ByteDance’s urgency in enhancing TikTok’s search functionality and possibly developing a new large language model to rival existing competitors.
  • Source: https://fortune.com/2024/10/03/bytedance-tiktok-bytespider-scraper-bot/

🔎 Google rolls out ads in AI Overviews

Google just announced the introduction of ads to its AI Overview search summaries and the launch of several new AI-powered search capabilities, such as video understanding and voice input.

  • Ads will now appear within and alongside AI Overviews for ‘relevant queries’ on searches in the United States.

  • The redesigned AI Overview format will now add prominent in-text links to better source websites for the curated information.

  • New AI-organized search results pages are rolling out that surface relevant, more diverse content — starting with recipe and meal inspiration queries.

  • Google Lens is getting video understanding capabilities and voice input options for visual searches.

  • The Android ‘Circle to Search’ feature also lets users identify songs playing in videos or streaming content.

Google’s first AI Overview experience didn’t exactly go as planned. However, with heavy competition from Perplexity and chatbot rivals, Google’s search future clearly has AI at its core, regardless of the bumps along the way. But infusing paid ads into AI Overviews could be a slippery slope – will Gemini be next?

Source: https://www.theverge.com/2024/10/3/24260637/googles-ai-overview-ads-launch

What Else is Happening in AI on October 04th 2024!

Google DeepMind hires key OpenAI Sora researcher Tim Brook for ‘world simulator’ project. 

Source: https://the-decoder.com/google-deepmind-hires-key-openai-sora-researcher-for-world-simulator-project/

Google released Gemini 1.5 Flash 8B, a lightweight, cost-effective variation with a 50% cost reduction and 2x higher rate limits than 1.5 Flash.

Source: https://www.neowin.net/news/google-democratizes-ai-with-gemini-15-flash-8b-the-cheapest-gemini-model-to-date

Fourier launched GR-2, the company’s second-generation humanoid robot, which features improvements to battery life, hand dexterity, mobility, and a new developer kit.

Source: https://finance.yahoo.com/news/fourier-unveils-next-generation-humanoid-123000642.html

OpenAI also secured a massive credit line. Source: https://techcrunch.com/2024/10/03/openai-also-secured-a-massive-credit-line/

Google’s AI can detect tuberculosis just by analyzing cough sound.

Source: https://www.newsbytesapp.com/news/science/google-ai-uses-cough-sound-to-diagnose-tuberculosis/story

OpenAI CFO Sarah Friar says their next AI model will be an order of magnitude bigger than GPT-4 and future models will grow at a similar rate, requiring capital-intensive investment to meet their “really big aspirations”

Trending AI Tools on October 04th 2024

🐝 Buzzabout – AI-driven insights from billions of discussions on social media: https://buzzabout.ai/

🤖 Base AI – Build serverless, autonomous AI agents with memory: https://baseai.dev/

💸 CostGPT – Estimate costs and time for your software project in less than 5 minutes: https://costgpt.ai/

👀 Lookie AI – Consume, organize, and manage knowledge from YouTube: https://apps.apple.com/kr/app/lookie-ai/id6670471730?l=en-GB

⏱️ Tackle AI – Automatic time tracking to align everyday actions with key priorities: https://www.timetackle.com/

A Daily Chronicle of AI Innovations on October 03rd  2024:

👓 Meta smart glasses can be used to dox anyone in seconds

💰 OpenAI is now valued at $157 billion

💥 Nvidia stunned the world with a ChatGPT rival that’s as good as GPT-4o

🥕 Microsoft to employees: you can continue working from home unless productivity drops

🤔 Google developing reasoning AI to rival OpenAI

👓 Meta smart glasses can be used to dox anyone in seconds 

  • Harvard students demonstrated how Meta’s smart glasses combined with facial recognition technology can dox individuals by revealing personal details like identities and phone numbers, using tools like I-XRAY and public databases in real-time.
  • The demo used existing technologies such as Meta’s Ray-Ban smart glasses and the PimEyes search engine, showing how a simple photo capture can quickly connect to public data, including names and addresses, raising privacy concerns.
  • Meta has privacy guidelines for its smart glasses, but the tiny notification light is hard to detect in bright light, leading to potential misuse despite the company warning users to respect others’ privacy and follow recording etiquette.
  • Source: https://www.theverge.com/2024/10/2/24260262/ray-ban-meta-smart-glasses-doxxing-privacy

💰 OpenAI is now valued at $157 billion

  • OpenAI has raised $6.6 billion in a new funding round, which has nearly doubled its valuation to $157 billion from a previous $86 billion, as reported by The Wall Street Journal.
  • The latest financing requires OpenAI to shift from its nonprofit model to a fully for-profit company, or investors have the right to retract their investments.
  • Major contributors to this funding round include Thrive Capital with a $1.25 billion investment and long-time supporter Microsoft, which added just under $1 billion more, with new investors like SoftBank and Nvidia also participating.
  • Source: https://arstechnica.com/ai/2024/10/openai-is-now-valued-at-157-billion/

💥 Nvidia stunned the world with a ChatGPT rival that’s as good as GPT-4o 

  • In early October 2024, Nvidia surprised the AI community by unveiling NVLM 1.0, a series of advanced multimodal language models with capabilities matching those of the GPT-4o model from ChatGPT.
  • Instead of releasing a direct competitor to consumer-facing AI applications like ChatGPT or Claude, Nvidia is opting to allow others to create their own AI solutions by making the model weights of NVLM publicly accessible.
  • Nvidia, previously renowned for supplying essential chips for AI processes, is now demonstrating its prowess in generative AI through its innovative approach to sharing AI technology development resources.
  • Source: https://bgr.com/tech/nvidia-stunned-the-world-with-a-chatgpt-rival-thats-as-good-as-gpt-4o/

🥕 Microsoft to employees: you can continue working from home unless productivity drops

  • Microsoft has decided to allow employees to continue working from home, maintaining flexibility as long as it does not affect productivity, contrasting with companies like Amazon that have mandated a return to the office.
  • Scott Guthrie, Microsoft Executive Vice President, assured workers in a meeting that the company values flexible working arrangements, though productivity must remain steady to keep the remote work model viable.
  • The remote work setup is considered beneficial for both employees and Microsoft, though the company remains cautious about the risks, such as decreased productivity and potential misuse of work hours for personal activities.
  • Source: https://www.techspot.com/news/104972-microsoft-assures-employees-they-can-continue-working-home.html

🤔 Google developing reasoning AI to rival OpenAI

Google is reportedly making significant strides in developing AI models with advanced reasoning capabilities similar to OpenAI’s o1 system, intensifying the rivalry between the two AI giants.

  • Multiple teams at Google are working on AI that can solve complex, multi-step problems, according to Bloomberg.

  • The AI uses chain-of-thought prompting, a technique created by Google, to tackle complex math and programming problems by ‘thinking’ before responding.

  • Google is taking a more cautious approach to its releases than OpenAI but has already debuted math-focused reasoning models like AlphaProof and AlphaGeometry 2.

  • Microsoft also infused reasoning capabilities into its Copilot assistant this week, leveraging OpenAI’s o1 model.

Human-like reasoning and agentic capabilities are clearly the two major developments on every AI firm’s roadmap, and the release of o1 may have signaled a new phase in the LLM race. The question is — will OpenAI’s speed keep it a step ahead, or is the competition for top-tier models about to get a whole lot tougher?

Source: https://qz.com/google-reasoning-ai-model-compete-openai-chatgpt-gemini-1851663139

What Else is Happening in AI on October 03rd 2024!

The Cancer AI Alliance formed a $40M collaboration between major medical institutions and tech giants like Microsoft, AWS, Nvidia, and Deloitte to advance AI-driven cancer care.

Source: https://techcrunch.com/2024/10/02/cancer-ai-alliance-joins-medical-and-tech-expertise-together-with-40m-to-collaborate-on-next-gen-care/

Character AI is reportedly shifting its focus away from building AI models in the wake of its $2.7B deal with Google and prioritizing its consumer chatbot service.

Source: https://www.btimesonline.com/articles/169707/20241003/character-ai-quits-ai-model-race-after-4-billion-google-deal-shifts-focus-to-consumer-chatbot-platform.htm

Elon Musk posted ‘OpenAI is evil’ on X in response to reports that the AI giant asked investors to avoid funding competing AI firms like Anthropic and Musk’s xAI.

Source: https://www.yahoo.com/tech/elon-musk-called-openai-evil-030055401.html

Accenture announced a new partnership with NVIDIA to accelerate enterprise AI adoption, launching a business group and AI Refinery platform to scale agentic AI systems across industries.

Source: https://newsroom.accenture.com/news/2024/accenture-and-nvidia-lead-enterprises-into-era-of-ai

New ChatGPT feature: GPT-4o with Canvas.

r/singularity - New ChatGPT feature: GPT-4o with Canvas.

Latest AI Tools October 03rd 2024

WALDO: a detection AI model designed to identify specific objects, such as vehicles and utility poles, in overhead images from various altitudes, useful for tasks requiring object recognition in large-scale imagery. 

Source: https://github.com/stephansturges/WALDO

Kameo: a Rust library for creating fault-tolerant, distributed, and asynchronous actors using Tokio, facilitating seamless communication across nodes with features like scalability, backpressure handling, and panic recovery. 

Source: https://github.com/tqwewe/kameo

TinyJS: a lightweight JavaScript library that simplifies the creation of HTML elements, property assignment, and DOM element selection with unique $ and $$ shortcuts, enhancing web development efficiency. 

Source: https://github.com/victorqribeiro/TinyJS

QBittorrent: an open-source BitTorrent client designed to be a lightweight alternative to other clients, offering ad-free usage, stability, and a variety of features.

Source: https://github.com/qbittorrent/qBittorrent

Serving 70B-Scale LLMs Efficiently on Low-Resource Edge Devices: the paper discusses methods for running large language models (LLMs) efficiently on devices with limited resources.

Source: https://arxiv.org/abs/2410.00531

A Daily Chronicle of AI Innovations on October 02nd  2024:

Listen at https://podcasts.apple.com/us/podcast/a-daily-chronicle-of-ai-innovations-on-october/id1684415169?i=1000671578473

🧠Google is Working on Reasoning AI – Bloomberg News

💰’SoftBank Shares Surge as CEO Pushes AI Superintelligence Vision With OpenAI

⚙️ OpenAI makes 4 major announcements at DevDay

🚀 Microsoft Copilot gets voice, vision upgrade

🤖 Google develops new AI model to rival OpenAI o1

👀 OpenAI co-founder joins rival Anthropic

⚙️ OpenAI makes 4 major announcements at DevDay

r/singularity - New tools for devs

Here’s a link to the announcement: https://openai.com/devday/

OpenAI’s recent DevDay conference took a different approach from last year’s event, focusing on incremental improvements rather than major product launches. The company introduced four key innovations: Vision Fine-Tuning, Realtime API, Model Distillation, and Prompt Caching, all aimed at empowering developers and enhancing the AI ecosystem.

Prompt Caching: This feature reduces costs and latency for developers by applying a 50% discount on input tokens that the model has recently processed, potentially leading to significant savings.

r/singularity - OpenAI DevDay: Vision Fine-Tuning, Realtime API, Model Distillation, and Prompt Caching

Vision Fine-Tuning: This allows developers to customize GPT-4o’s visual understanding capabilities using both images and text, with applications in fields like autonomous vehicles and medical imaging. For example, Grab improved its mapping services using this technology.

Realtime API: Now in public beta, this API enables low-latency, multimodal experiences, particularly in speech-to-speech applications. It allows for natural conversation and mid-sentence interruptions, opening up possibilities for voice-enabled applications in various industries.

Model Distillation: This workflow allows developers to use outputs from advanced models to improve the performance of more efficient models, making sophisticated AI capabilities more accessible and cost-effective.

OpenAI’s strategic shift towards ecosystem development over headline-grabbing product launches reflects a mature understanding of the AI industry’s current challenges and opportunities. By focusing on refining tools and reducing costs, OpenAI aims to foster a thriving developer ecosystem and ensure sustainable AI adoption across various industries.

  • Realtime API enables speech-to-speech application building using the same model that powers Advanced Voice, with the ability to choose from six voices. “Until right now, voice has been a second activity“, and that the Realtime API is going to make AI significantly more accessible because many people in the real world prefer to speak over reading or texting. Realtime API will have a “no-brainer” impact on customer support, education, and coaching. He also believes there will be many ‘non-obvious‘ use cases that are hard to predict now. For now, Realtime API only supports text and audio. However, Godement believes that image and video are the next milestones on the road to agents that can perceive the world just like a human. He also mentioned that image and video understanding specifically, will “turbocharge customer support” when the model has the ability to understand pixels on a screen in real-time. https://openai.com/index/introducing-the-realtime-api/

  • Model Distillation simplifies fine-tuning smaller models using outputs from larger ones, making training more accessible to developers. https://openai.com/index/api-model-distillation/

  • Prompt Caching reduces costs by nearly 50% across models and speeds up responses by up to 80% when reusing recent input tokens in API calls. https://openai.com/index/api-prompt-caching/

  • New prompt generator on https://playground.openai.com

  • Access to the o1 model is expanded to developers on usage tier 3, and rate limits are increased (to the same limits as GPT-4o)

🚀 Microsoft Copilot gets voice, vision upgrade

Microsoft just announced a slew of AI upgrades coming to its Copilot assistant for Windows PCs, including new vision and voice capabilities, personalization enhancements, a re-release of the controversial Recall feature, and more.

  • Copilot Voice allows users to interact with natural speech, adding conversational and intuitive communication similar to OpenAI’s Voice Mode.

  • Copilot Vision enables the AI to understand and interact with web content a user is viewing, offering context-aware help within the Microsoft Edge browser.

  • ‘Think Deeper’ gives Copilot new enhanced reasoning capabilities using chain-of-thought reasoning powered by OpenAI’s o1 model.

  • Microsoft’s ‘Recall’ feature is set to return, requiring an opt-in with upgraded privacy and security measures.

  • Microsoft AI CEO Mustafa Suleyman highlighted Copilot’s ability to ultimately ‘act on your behalf’ and adapt to user’s personal preferences and needs.

Microsoft is bringing the heat with these major Copilot upgrades, levelling up the assistant to align with the latest cutting-edge AI features across the industry — while bringing users one step closer to a truly agentic experience.

Source: https://www.theverge.com/2024/10/1/24259187/microsoft-copilot-redesign-vision-voice-features-inflection-ai

🧠Google is Working on Reasoning AI – Bloomberg News

 

Google is working on artificial intelligence software that resembles the human ability to reason, similar to OpenAI’s o1, marking a new front in the rivalry between the tech giant and the fast-growing startup.

In recent months, multiple teams at Alphabet Inc.’s Google have been making progress on AI reasoning software, according to people with knowledge of the matter, who asked not to be identified because the information is private.

AI researchers are pursuing reasoning models as they search for the next significant step forward in the technology. Like OpenAI, Google is trying to approximate human reasoning using a technique known as chain-of-thought prompting, according to two of the people. In this technique, which Google pioneered, the software pauses for a matter of seconds before responding to a written prompt while, behind the scenes and invisible to the user, it considers a number of related prompts and then summarizes what appears to be the best response.

Since OpenAI unveiled its o1 model, known internally as Strawberry, in mid-September, some in DeepMind have fretted that the company had fallen behind, according to another person with knowledge of the matter. But employees are no longer as concerned as they were following the launch of ChatGPT, now that Google has debuted some of its own work, the person said. In July, Google showcased AlphaProof, which specializes in math reasoning, and AlphaGeometry 2, an updated version of a model focused on geometry that the company debuted earlier this year.

Source: https://www.bnnbloomberg.ca/business/technology/2024/10/02/google-is-working-on-reasoning-ai-chasing-openais-efforts/

💰SoftBank Shares Surge as CEO Pushes AI Superintelligence Vision With OpenAI, who previously claimed that creating ASI was his “life’s purpose”

r/singularity - 'SoftBank Shares Surge as CEO Pushes AI Superintelligence Vision With OpenAI, who previously claimed that creating ASI was his "life’s purpose"

Source: https://www.ccn.com/news/technology/softbank-shares-surge-ceo-pushes-ai-superintelligence-vision-openai/

What Else is Happening in AI on October 02nd 2024!

OpenAI founding member Durk Kingma announced that he is joining Anthropic, reuniting with several former OpenAI employees and highlighting the company’s mission of responsible AI development in his X post.

Pika Labs unveiled Pika 1.5, a new video generation model upgrade featuring enhanced effects, realistic movement, longer clip creation, and cinematic capabilities.

Anyscale unveiled major upgrades to its AI platform at Ray Summit 2024, including a GPU-native Ray architecture, RayTurbo for enhanced performance, Ray Data for unstructured data processing, and more.

U.S. AI chipmaker Cerebras officially filed for an IPO, with the Sam Altman-backed Nvidia competitor expected to be valued at between $7-8B.

Meta released the open-source code and developer suite for its Segment Anything Model (SAM) 2.1, an upgraded version of its image and video segmentation tool.

Nvidia introduced NVLM 1.0, an open-source family of multimodal models that achieve SOTA performance on vision-language and text tasks.

Pinterest launched Performance+, a suite of new AI tools for advertisers that includes the ability to create background images for products and automation features for ad campaigns.

NotebookLM is too good

You can upload multiple books, hours long videos and audios into that thing and it processes everything so well. It’s so good at resuming, finding specific quotes, answering questions, explaining some stuff and the podcast feature too is mindblowing. It can even do the same for videos, texts and audios in foreign languages and translate, explain and resume it in order for you to understand. And it’s not super censored too. Can’t believe this thing is actually free and i’m just finding about it now.

A basic systems architecture for AI agents that do autonomous research

r/singularity - A basic systems architecture for AI agents that do autonomous research

Source: https://www.lesswrong.com/posts/6cWgaaxWqGYwJs3vj/a-basic-systems-architecture-for-ai-agents-that-do

OpenAI has released Whisper V3 Turbo model yesterday. The turbo model is an optimized version of large-v3 that offers 8x faster transcription speed with minimal degradation in accuracy

Source: https://huggingface.co/spaces/hf-audio/whisper-large-v3-turbo

Harvard students Build and show off AR glasses project that uses face detection, internet sleuthing, and AI to give you near instant dossiers (address, family info, name, etc) on people you see. Good proof of concept to raise awareness on what we may see in the future.

Source: https://x.com/AnhPhuNguyen1/status/1840786336992682409

https://x.com/i/status/1840786336992682409

Trending AI Tools on October 02nd 2024

🎥 Video SDK 3.0 – Build and integrate real-time multimodal AI characters: https://github.com/Xilinx/video-sdk/discussions/81

📭 Inbox Zero  An open-source, AI personal assistant for email: https://www.getinboxzero.com/ai-automation

👩🏻‍💻 Graphite – Your AI code review companion: https://graphite.dev/blog/graphite-reviewer-launch

📚 Ello – An AI reading companion for children offering personalized support: https://www.ello.com/

🗣️ VivaChat – FaceTime video chat with realistic AI personas: https://www.vivalabs.ai/

A Daily Chronicle of AI Innovations on October 01st  2024:

🔮 Microsoft gives Copilot a voice and vision

💻 Chromebooks are getting a dedicated AI key

👓 Microsoft is discontinuing its HoloLens headsets

🫠 Y Combinator faces backlash after funding an AI startup that admits it basically cloned another AI startup

❌ California’s controversial AI safety bill vetoed

💰 OpenAI secures SoftBank funding as Apple exits raise

💧 Liquid AI unveils efficient new LFM models

🔮 Microsoft gives Copilot a voice and vision 

  • Microsoft has unveiled a major overhaul to its Copilot experience, adding both voice and vision capabilities, transforming it into a more personalized AI assistant similar to OpenAI’s Advanced Voice Mode.
  • The redesign features a new card-based user interface inspired by Inflection AI’s Pi assistant, and Copilot now offers a virtual news presenter mode, tailored homepage and improved customization based on user interaction history.
  • Initial releases of Copilot Voice and Copilot Daily will be available in select regions, while Copilot Vision features are in a limited preview phase, focusing on enhancing user safety and privacy through restricted website interactions.
  • Source: https://www.theverge.com/2024/10/1/24259187/microsoft-copilot-redesign-vision-voice-features-inflection-ai

💻 Chromebooks are getting a dedicated AI key 

  • Chromebooks are getting a new keyboard layout with a “quick access” key for AI and other functions, providing easy access to features like text generation, emojis, and searching Google Drive.
  • The first Chromebooks to feature this new key are the Samsung Galaxy Chromebook Plus, which will replace the Launcher Key with the new Quick Insert key.
  • Although the new AI features will initially lack AI image generation, Google plans to add this and other AI capabilities, including real-time translation and transcription, to Chromebooks in October.
  • Source: https://gizmodo.com/chromebooks-are-getting-a-dedicated-ai-key-but-you-wont-use-it-for-ai-2000505155

 Microsoft is discontinuing its HoloLens headsets 

  • Microsoft has ceased production of its HoloLens 2 headsets and has no confirmed plans for a successor, although updates addressing security and software issues are promised until the end of 2027.
  • Former HoloLens head, Alex Kipman, left the company in 2022 amid misconduct allegations, and the hardware team faced significant layoffs in January 2023, impacting the development of the augmented reality devices.
  • Microsoft has partnered with Anduril Industries to enhance its IVAS mixed-reality headsets for the US Army, which plans to invest up to $21.9 billion over the next decade in this project.
  • Source: https://www.theverge.com/2024/10/1/24259369/microsoft-hololens-2-discontinuation-support

🫠 Y Combinator faces backlash after funding an AI startup that admits it basically cloned another AI startup 

❌ California’s controversial AI safety bill vetoed

California Governor Gavin Newsom just vetoed S.B. 1047, a groundbreaking AI safety bill that would have imposed stricter regulations on Silicon Valley AI firms and the release of new models in the state.

  • The bill would have required safety testing for AI models before their public release and held AI companies liable for any ‘severe harm’ (over $500M in damages) caused.

  • Tech giants, including OpenAI and Google, VCs, and politicians like Nancy Pelosi lobbied heavily against the bill, arguing it would stifle innovation.

  • The bill had notable support from Elon Musk, Anthropic, the ‘Godfather of AI’ Geoffrey Hinton, and over 120 Hollywood actors, directors, and workers.

  • Newsom said the bill was ‘well-intentioned’ but flawed, vowing to consult with AI experts to craft guardrails for future legislation efforts.

As the U.S. federal government continues to lag in AI regulation, states are stepping up to fill the void. While S.B. 1047 is shelved for now, the debate over AI governance is far from settled—and will likely continue to pit AI safety advocates against those pushing for rapid development throughout Silicon Valley.

Source: https://www.politico.com/news/2024/09/29/gavin-veto-ai-safety-bill-00181583

💰 OpenAI secures SoftBank funding as Apple exits raise

Despite Apple reportedly no longer participating in OpenAI’s upcoming funding round, the AI giant has secured billions of dollars from Japanese investment giant Softbank, Microsoft, and Thrive Capital.

  • OpenAI is rumored to be raising up to $6.5B via convertible notes, at an eye-popping $150B valuation.

  • Microsoft plans to participate with an additional $1B, adding to its previous $13B investment in the AI giant.

  • Investment firm Thrive Capital is also investing $1B, with a reported option to add an additional $1B the following year based on revenue goals.

  • The Wall Street Journal reported that Apple is no longer involved in the funding round, despite partnerships with OpenAI and its inclusion in Apple Intelligence.

  • The raise comes amid OpenAI’s controversial restructuring to a for-profit entity, with Sam Altman denying rumors that he will receive equity in the move.

OpenAI’s latest raise and for-profit turn is another saga in its convoluted and controversial business structure. Despite the recent high-profile departures and continued drama, the ChatGPT maker is still clearly seen as a top horse to bet on in the AI boom—and there is no shortage of major players who want in.

Source: https://www.theinformation.com/articles/softbank-to-invest-500-million-in-openai

💧 Liquid AI unveils efficient new LFM models

Liquid AI just introduced a new series of AI models called Liquid Foundation Models (LFMs), challenging the traditional transformer architecture while achieving state-of-the-art performance and enhanced memory efficiency at smaller model sizes.

  • The company released its LFMs in 1.3B, 3B, and 40B parameter sizes, based on a new architecture utilizing computational units rooted in dynamical systems rather than traditional transformers.

  • The models surpass transformer-based counterparts like Meta’s Llama 3.2 and Microsoft’s Phi-3.5 on major benchmarks like MMLU.

  • LFMs require significantly less memory for inference, particularly with long-context tasks — supporting up to 32k tokens while maintaining memory efficiency.

  • The models are not open-source and are only currently available via the company’s Lambda (Chat UI and API) and on Perplexity AI.

Liquid AI’s LFMs are a significant shakeup from the transformer architecture standard that has dominated models since 2017. The benchmarks show that there is more than one formula for achieving state-of-the-art AI performance—and could open new possibilities for more efficient and accessible AI systems.

Source: https://www.liquid.ai/liquid-foundation-models

What Else is Happening in AI on October 01st 2024!

Google agreed to invest $1B into Thailand to expand AI and cloud infrastructure in Southeast Asia, aiming to build new data centers amid increasing regional competition.

Source: https://www.cnbc.com/2024/09/30/google-to-invest-1-billion-in-thailand-data-center-and-ai-push.html

TikTok parent company ByteDance is reportedly planning to develop a new AI model primarily using Huawei chips, diversifying from U.S. suppliers like Nvidia to counteract export restrictions.

Source: https://www.reuters.com/technology/artificial-intelligence/bytedance-plans-new-ai-model-trained-with-huawei-chips-sources-say-2024-09-30

Artisan AI secured $7.3M in seed funding for its sales-focused AI virtual employees, with its first AI assistant Ava already assisting over 120 companies on the platform.

Source: https://www.artisan.co/blog/artisan-raises-7-3-seed-round

Luma Labs upgraded its Dream Machine AI video model speed, allowing for full-quality generations in under 20 seconds.

Source: https://x.com/LumaLabsAI/status/1840820602296320083

Qodo announced a $40M funding round for its AI-powered code testing software, with plans to expand services and target larger enterprise clients.

Source: https://www.bloomberg.com/news/articles/2024-09-30/ai-code-checker-qodo-raises-40-million-to-serve-bigger-clients

AI reading coach startup Ello launched ‘Storytime’, a new feature allowing kids to create personalized stories using AI.

Source: https://techcrunch.com/2024/09/30/ai-reading-coach-startup-ello-launches-custom-story-creation-feature-for-kids

Trending AI Tools on October 01st 2024

🎤 Udio Lyric Editor – Create and refine song lyrics based on melody: https://www.udio.com/

📷 Expression Editor – Easily edit facial expressions: https://huggingface.co/spaces/fffiloni/expression-editor

🚀 PandaETL – Automate document processes with AI and data: https://panda-etl.ai/

🤖 Gaia – Train and deploy neural machine translation models: https://gaia-ml.com/

🔍 Lumona – AI search engine leveraging social media insights: https://www.lumona.ai/

Read Aloud For Me: AI Dashboard – AI Tools Recommender – Safe AI

Welcome to Read Aloud For Me, the pioneering AI dashboard designed for the whole family! Our platform is the first of its kind, uniquely crafted to cater not only to adults but also to kids.

iOs: https://apps.apple.com/ca/app/read-aloud-for-me-ai-dashboard/id1598647453

Web/Android/PWA: https://readaloudforme.com

AI Innovations in September 2024

  • What will be AI's killer app?
    by /u/G4M35 (Artificial Intelligence Gateway) on December 14, 2024 at 12:53 am

    If you understand how Technology Innovation works (Clayton Christensen's way), you know that AI per se is not a disruptive technology but an enabling technology. What is going to happen is that some brilliant entrepreneur will use it in an unexpected way creating something that didn't exist before that will change the world as we know it, the killer app (app as in application of the tech, not necessarily software. Could be hardware too). I have been trying to come up with something for the past 2 years, and I can't. I am not seeing anything out there either; although advanced voice mode comes close. So, do you have any theories, suspicions, directions of what the kille app will be? TIA submitted by /u/G4M35 [link] [comments]

  • One-Minute Daily AI News 12/13/2024
    by /u/Excellent-Target-847 (Artificial Intelligence Gateway) on December 14, 2024 at 12:43 am

    UnitedHealth’s Optum left an AI chatbot, used by employees to ask questions about claims, exposed to the internet.[1] The BBC is complaining after Apple Intelligence rewrote one of its headlines to falsely claim the UnitedHealthcare suspect shot himself.[2] AI continues to reshuffle power and energy markets with even oil giants like Exxon Mobil getting into the mix.[3] OpenAI’s legal battle with Elon Musk reveals internal turmoil over avoiding AI ‘dictatorship’.[4] Sources included at: https://bushaicave.com/2024/12/13/12-13-2024/ submitted by /u/Excellent-Target-847 [link] [comments]

  • One-Minute Daily AI News 12/13/2024
    by /u/Excellent-Target-847 (Artificial Intelligence (AI)) on December 14, 2024 at 12:43 am

    UnitedHealth’s Optum left an AI chatbot, used by employees to ask questions about claims, exposed to the internet.[1] The BBC is complaining after Apple Intelligence rewrote one of its headlines to falsely claim the UnitedHealthcare suspect shot himself.[2] AI continues to reshuffle power and energy markets with even oil giants like Exxon Mobil getting into the mix.[3] OpenAI’s legal battle with Elon Musk reveals internal turmoil over avoiding AI ‘dictatorship’.[4] Sources: [1] https://techcrunch.com/2024/12/13/unitedhealthcares-optum-left-an-ai-chatbot-used-by-employees-to-ask-questions-about-claims-exposed-to-the-internet/ [2] https://www.theverge.com/2024/12/13/24320689/apple-intelligence-summary-bbc-news-unitedhealthcare-luigi-mangione [3] https://techcrunch.com/2024/12/13/exxon-cant-resist-the-ai-power-gold-rush/ [4] https://abcnews.go.com/US/wireStory/openais-legal-battle-elon-musk-reveals-internal-turmoil-116776795 submitted by /u/Excellent-Target-847 [link] [comments]

  • Codegpt or Bolt?
    by /u/VaguePenguin (Artificial Intelligence Gateway) on December 14, 2024 at 12:27 am

    I've googled and I've searched on here and I can't find opinions between the two, I only find year old threads about one of them. I've used both and I love both. Using codegpt has me learning more as I'm building but I feel it's not as advanced. Bolt on the other hand writes everything and seems way more advanced but it's always causing problems that it can't solve and when I manually write the code, it seems that it's still stuck. What are your opinions on both and why do you choose that one? They both have great pros but they both have bad cons. Bolt always can't solve its navigation or npm installations which makes me not want to use it anymore. Is anyone else having that issue? I know it's just bad simple misspelling or incorrect file name but when I fix it, it still doesn't work. submitted by /u/VaguePenguin [link] [comments]

  • What other cool or just fun AI tools I may not heard of?
    by /u/almozayaf (Artificial Intelligence (AI)) on December 13, 2024 at 11:36 pm

    In 2024 I got myself into so many AI tools that did amazing things Images generators So many to list Music generators like SUNO.AI Chats bots like JanitorAI And RPG writing stories like AI dungeon But I want to know if there other tools I may missed, what else out there? submitted by /u/almozayaf [link] [comments]

  • AI Recommendations for Editing Word Documents While Maintaining Formatting
    by /u/JustAGoodDude (Artificial Intelligence Gateway) on December 13, 2024 at 9:36 pm

    Hi everyone, I’m looking for an AI tool or solution that can help with editing Word documents. The document is already nicely formatted, with specific fonts, colours, and some images. These stay the same in every document. The content changes depending on the customer, with variables like the customer’s name, the amount of money, and a short recommendation. From a purely text point of view, writing the content itself is straightforward since it’s not overly complicated and ChatGPT can already write what I need almost prefectly. But my challenge is finding an AI that can handle the content changes while keeping the existing formatting intact. My ideal solution is -Use AI to write the text specific for the new customer (achieved) -Copy and paste this somewhere and have it merged into the Word document where all the specific formatting is retained. Does anyone have experience using AI for this purpose? Any recommendations or tips would be greatly appreciated! submitted by /u/JustAGoodDude [link] [comments]

  • Are people forgetting that AI and LLMs are not one and the same?
    by /u/Murky-Motor9856 (Artificial Intelligence Gateway) on December 13, 2024 at 9:23 pm

    Why aren't people freaking out over other types of generative AI, image and speech recognition models, or the sort of "AI" (that is probably based on gradient boosting instead of a neural network) companies like UHC used to deny claims? Is it because the output isn't language that humans find relatable, and therefore they aren't compelled to anthropomorphize it, or because marketing has effectively obscured how radically different the things we call AI are? In a way, it reminds me of the shitstorm of hype surrounding blockchain and cryptocurrency. Both technologies have sparked immense interest and investment, driven by their potential to revolutionize various industries. However, this fervor often emphasizes flashy, high-profile applications, and when people started getting disillusioned with them they sort of threw the baby out with the bathwater. Instead of being skeptical of the ways blockchain was used and oversold, for example, they're instantly skeptical of blockchain because they associate it with those uses (and the negative press they eventually garnered). One concern I have is that the general public's singular focus on a subset of AI will derail broader efforts much like it did in the past when expert systems failed to live up to hype surrounding them. What we have now is in a completely different realm of capability, to be sure, but the hype surrounding it is also on an entirely different level. submitted by /u/Murky-Motor9856 [link] [comments]

  • Can AGI Be Safe if Trained on Political Disinformation?
    by /u/FluidMeasurement8494 (Artificial Intelligence Gateway) on December 13, 2024 at 9:10 pm

    How can we develop a non-threatening AGI if it is likely to be trained on disinformation, particularly in the realm of internal and external politics? Wouldn't it be a flawed and dangerous tool ? The fundamental concern here is that AGI, like any AI, learns from the data it is trained on. If that data is biased, manipulative, or outright false, the AGI could inherit those flaws, potentially amplifying them in ways that are difficult to control. If an AGI is exposed to disinformation - whether in the form of political propaganda, fake news, or manipulated narratives - it may learn to perpetuate or even amplify these falsehoods. This could lead to the spread of harmful ideologies or decisions based on inaccurate information, both in political contexts and beyond. submitted by /u/FluidMeasurement8494 [link] [comments]

  • Google VideoFX blows sora out of the water.
    by /u/noblepups (Artificial Intelligence (AI)) on December 13, 2024 at 8:27 pm

    submitted by /u/noblepups [link] [comments]

  • Looking for a voice cloner that allows you to adjust the voice qualities/traits/characteristics of the results.
    by /u/CukeJr (Artificial Intelligence Gateway) on December 13, 2024 at 8:18 pm

    I've tried Elevenlabs and Character AI so far and neither of those seem to have such a thing. I've done some googling and glanced at a few others (without signing up and trying), no such luck. Any suggestions? Do any apps like this even exist? To elaborate a bit on my intent: I'm trying to create a voice for an original character (OC) I have. I know exactly what they sound like in my head, and I've come across some voices (namely, a vocalist and an existing game character) that sound pretty similar. I've been using these voices as references for how I imagine my OC to sound (I think it's often called a "voice claim" lol). They don't quite hit the mark though, so it would be perfect if I could just play around with either of them to bring them closer to my character's voice. So basically, I'm looking for an app that will enable me to upload either of these voice samples, create a clone of them, and then adjust vocal attributes of the voice like depth, tone, nasality, timbre, etc. Thank you in advance! submitted by /u/CukeJr [link] [comments]

Ace the 2023 AWS Solutions Architect Associate SAA-C03 Exam with Confidence Pass the 2023 AWS Certified Machine Learning Specialty MLS-C01 Exam with Flying Colors

List of Freely available programming books - What is the single most influential book every Programmers should read



#BlackOwned #BlackEntrepreneurs #BlackBuniness #AWSCertified #AWSCloudPractitioner #AWSCertification #AWSCLFC02 #CloudComputing #AWSStudyGuide #AWSTraining #AWSCareer #AWSExamPrep #AWSCommunity #AWSEducation #AWSBasics #AWSCertified #AWSMachineLearning #AWSCertification #AWSSpecialty #MachineLearning #AWSStudyGuide #CloudComputing #DataScience #AWSCertified #AWSSolutionsArchitect #AWSArchitectAssociate #AWSCertification #AWSStudyGuide #CloudComputing #AWSArchitecture #AWSTraining #AWSCareer #AWSExamPrep #AWSCommunity #AWSEducation #AzureFundamentals #AZ900 #MicrosoftAzure #ITCertification #CertificationPrep #StudyMaterials #TechLearning #MicrosoftCertified #AzureCertification #TechBooks

Top 1000 Canada Quiz and trivia: CANADA CITIZENSHIP TEST- HISTORY - GEOGRAPHY - GOVERNMENT- CULTURE - PEOPLE - LANGUAGES - TRAVEL - WILDLIFE - HOCKEY - TOURISM - SCENERIES - ARTS - DATA VISUALIZATION
zCanadian Quiz and Trivia, Canadian History, Citizenship Test, Geography, Wildlife, Secenries, Banff, Tourism

Top 1000 Africa Quiz and trivia: HISTORY - GEOGRAPHY - WILDLIFE - CULTURE - PEOPLE - LANGUAGES - TRAVEL - TOURISM - SCENERIES - ARTS - DATA VISUALIZATION
Africa Quiz, Africa Trivia, Quiz, African History, Geography, Wildlife, Culture

Exploring the Pros and Cons of Visiting All Provinces and Territories in Canada.
Exploring the Pros and Cons of Visiting All Provinces and Territories in Canada

Exploring the Advantages and Disadvantages of Visiting All 50 States in the USA
Exploring the Advantages and Disadvantages of Visiting All 50 States in the USA


Health Health, a science-based community to discuss human health

Today I Learned (TIL) You learn something new every day; what did you learn today? Submit interesting and specific facts about something that you just found out here.

Reddit Science This community is a place to share and discuss new scientific research. Read about the latest advances in astronomy, biology, medicine, physics, social science, and more. Find and submit new publications and popular science coverage of current research.

Reddit Sports Sports News and Highlights from the NFL, NBA, NHL, MLB, MLS, and leagues around the world.

Turn your dream into reality with Google Workspace: It’s free for the first 14 days.
Get 20% off Google Google Workspace (Google Meet) Standard Plan with  the following codes:
Get 20% off Google Google Workspace (Google Meet) Standard Plan with  the following codes: 96DRHDRA9J7GTN6 96DRHDRA9J7GTN6
63F733CLLY7R7MM
63F7D7CPD9XXUVT
63FLKQHWV3AEEE6
63JGLWWK36CP7WM
63KKR9EULQRR7VE
63KNY4N7VHCUA9R
63LDXXFYU6VXDG9
63MGNRCKXURAYWC
63NGNDVVXJP4N99
63P4G3ELRPADKQU
With Google Workspace, Get custom email @yourcompany, Work from anywhere; Easily scale up or down
Google gives you the tools you need to run your business like a pro. Set up custom email, share files securely online, video chat from any device, and more.
Google Workspace provides a platform, a common ground, for all our internal teams and operations to collaboratively support our primary business goal, which is to deliver quality information to our readers quickly.
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE
C37HCAQRVR7JTFK
C3AE76E7WATCTL9
C3C3RGUF9VW6LXE
C3D9LD4L736CALC
C3EQXV674DQ6PXP
C3G9M3JEHXM3XC7
C3GGR3H4TRHUD7L
C3LVUVC3LHKUEQK
C3PVGM4CHHPMWLE
C3QHQ763LWGTW4C
Even if you’re small, you want people to see you as a professional business. If you’re still growing, you need the building blocks to get you where you want to be. I’ve learned so much about business through Google Workspace—I can’t imagine working without it.
(Email us for more codes)