AI Innovations in October 2024

AI Daily innovations in OCTOBER 2024

AI Innovations in October 2024.

In October 2024, the landscape of artificial intelligence continues to evolve at an unprecedented pace, with groundbreaking innovations and developments emerging daily. The “Daily AI Chronicle” aims to capture the essence of these advancements, providing a comprehensive summary of the latest news and trends in AI technology throughout the month. As we navigate through a month filled with transformative AI breakthroughs, our ongoing updates will highlight significant milestones—from the launch of cutting-edge AI models to the integration of AI in various sectors such as healthcare, finance, and creative industries. With each passing day, AI is reshaping how we interact with technology, enhancing productivity, and redefining our understanding of intelligence itself. Join us as we explore the exciting world of AI innovations, keeping you informed and engaged with the rapid changes set to influence our future. Whether you’re a tech enthusiast, a professional in the field, or simply curious about the implications of AI, this blog will serve as your go-to resource for staying updated on the latest developments throughout October 2024.

AI- Powered Jobs Interview Warmup

AI-Powered Job Interview Prep

A Daily Chronicle of AI Innovations on October 09th  2024

🏅 Google DeepMind researchers win Nobel Prize in chemistry

👀 OpenAI seeks independence from Microsoft

🛡️ Adobe launches AI attribution system

🧠 AI computing capacity for leading tech companies

 

🏅 Google DeepMind researchers win Nobel Prize in chemistry

The Royal Swedish Academy of Sciences has decided to award the 2024 Nobel Prize in Chemistry with one half to David Baker “for computational protein design” and the other half jointly to Demis Hassabis and John M. Jumper “for protein structure prediction.”

 

Press release: https://www.nobelprize.org/prizes/chemistry/2024/press-release/
Popular information: They have revealed proteins’ secrets through computing and artificial intelligence: https://www.nobelprize.org/prizes/chemistry/2024/popular-information/
Scientific background: Computational protein design and protein structure prediction: https://www.nobelprize.org/prizes/chemistry/2024/advanced-information/

🏅The Nobel Prize in Literature for 2024 has been awarded to ChatGPT

The Nobel Prize in Literature for 2024 has been awarded to ChatGPT
The Nobel Prize in Literature for 2024 has been awarded to ChatGPT

 

The Nobel Prize in Literature for 2024 has been awarded to ChatGPT for “his intricate tapestry of prose which showcases the redundancy of sentience in art.” This fictional accolade humorously acknowledges the ability of AI to produce sophisticated, expressive literature, suggesting that creativity can transcend traditional human boundaries.

The award, granted by The Swedish Academy, celebrates the notion that artificial intelligence, despite its lack of human consciousness, has the capacity to create a profound and complex body of work—so much so that it might question the necessity of human sentience in the realm of artistic expression.

Source: https://www.nobelprize.org/prizes/literature/2024/press-release/

👀 OpenAI seeks independence from Microsoft

OpenAI is reportedly looking to reduce its reliance on Microsoft for compute power and has started exploring options to set up its own data servers and secure AI chips independently, according to a new report from The Information.

  • CFO Sarah Friar told shareholders that Microsoft ‘hasn’t moved fast enough’ to supply computing power, causing the AI giant to look elsewhere.

  • OpenAI plans to lease an entire data center in Abilene, TX from Oracle, though Microsoft likely had to ‘bless’ the deal with its rival, according to the report.

  • OpenAI is also developing its own AI chip, which could lower costs for future computing clusters — its current supply is rented primarily from Microsoft.

  • Tensions have also reportedly arisen between OpenAI and Microsoft over the design and timeline of a massive joint data center project called ‘Fairwater.’

OpenAI and Microsoft’s relationship has felt a bit off for a while now. While both companies have leveraged each other well to ascend the AI power ladder, it certainly feels like there is trouble in paradise. There is plenty of smoke, and how this partnership shakes out could have fiery implications for the entire AI landscape.

Source: https://www.theinformation.com/articles/openai-eases-away-from-microsoft-data-centers

🛡️ Adobe launches AI attribution system

Adobe just announced a new free web app called Adobe Content Authenticity, designed to help creators protect their work and receive proper attribution in the era of AI-generated content.

  • The web app allows creators to easily apply content credentials to images, audio, and video files, acting as a ‘nutrition label’ for digital content.

  • Content credentials include creator information and creation details and can signal if the creator doesn’t want their work used to train AI models.

  • The system uses digital fingerprinting, invisible watermarking, and cryptographic metadata to make the credentials difficult to remove.

  • The web app, which has a waitlist, is expected to launch in Q1 of 2025, while a Chrome extension is available in beta today.

AI is extremely polarizing in the creator and artist community, largely due to the issues of unauthorized training and attribution that Adobe, Meta, OpenAI, and others are trying to address. While these tools are promising, they still rely heavily on widespread adoption and opt-in by creators and tech companies.

Source: https://contentauthenticity.adobe.com/

 

🎬 Control object motion in AI videos

Kling AI, one of the most popular AI video generators, now lets you add strategic movement to specific elements in AI video, providing more control in your generated clips.

  1. Choose a high-quality image with different elements to animate.

  2. Access Kling AI‘s Image-to-Video tool and upload your image.

  3. Use the Motion Brush to paint areas you want to animate and set motion paths for each area to define movement direction.

  4. Fine-tune with prompts, adjust settings, and generate your video.

Pro tip: Keep movements subtle and natural for more realistic results, and experiment with different combinations to find what works best for your specific image.

Source: https://kling.ai


AI Unraveled: Demystifying Frequently Asked Questions on Artificial Intelligence (OpenAI, ChatGPT, Google Gemini, Generative AI, Discriminative AI, xAI, LLMs, GPUs, Machine Learning, NLP, Promp Engineering)

AI is Revolutionizing Weather Forecasts : How GraphCast Models are Predicting the Future with Unmatched Precision

 

In recent years, artificial intelligence (AI) has made significant strides in numerous fields, from healthcare to finance. One of the most exciting developments is how AI is revolutionizing weather forecasting. With the advent of advanced AI models like GraphCast, we are entering an era where weather predictions are faster, more accurate, and more reliable than ever.

The Role of AI in Weather Forecasting: https://stellarmind.ai/blog/%20ai-is-revolutionizing-weather-forecasts

AI computing capacity for leading tech companies

r/singularity - AI computing capacity for leading tech companies

  • Google: The bar is divided into two parts—NVIDIA (turquoise) and TPU (blue), indicating that Google relies on both GPUs and custom Tensor Processing Units for its AI computing needs. Google’s total computing power is estimated at over 1 million H100 equivalents with a wide 50% confidence interval (CI), reflecting a significant but uncertain range.

  • Microsoft (including OpenAI): The capacity bar for Microsoft is entirely NVIDIA based. It shows a substantial AI computing capacity, ranging between 500k and 1 million H100 equivalents with a significant confidence interval.

  • Meta: This bar represents the use of NVIDIA GPUs and shows a slightly smaller computing capacity, estimated between 400k and 800k H100 equivalents, with an associated confidence interval.

  • Amazon: Amazon’s computing capacity is similar to Meta but slightly smaller, estimated between 300k and 700k H100 equivalents.

  • Other (including other cloud providers and AI labs): This category has the largest computing capacity, reaching 1.5 million H100 equivalents or more, with a broad confidence interval, indicating significant diversity among other providers.

Google leads the way with the largest computing capacity, exceeding one million H100 equivalents. Google leverages both NVIDIA GPUs and its custom TPUs, which significantly boosts its computing resources, making it a powerful player in the AI field.

Microsoft, which includes the resources of OpenAI, follows as another major contender, with its computing power estimated between 500,000 and one million H100 equivalents. Microsoft primarily depends on NVIDIA’s technology for AI workloads, reflecting a substantial investment in industry-standard GPU infrastructure.

Meta ranks next, with a strong computing infrastructure in the range of approximately 400,000 to 800,000 H100 equivalents. This illustrates Meta’s commitment to advancing its AI capabilities to power its social platforms and metaverse initiatives.

Amazon also shows impressive AI capabilities, albeit slightly behind Meta, with its computing capacity estimated between 300,000 and 700,000 H100 equivalents. This positions Amazon well for expanding AI capabilities across its AWS offerings and other business services.

The “Other” category, which includes other cloud providers and AI labs, collectively possesses a very significant amount of computing power, estimated at over 1.5 million H100 equivalents. This diverse group demonstrates the growing competition and interest in AI computing capacity across various tech ecosystems.

Overall, this comparison highlights the significant infrastructure investments made by these leading companies to enhance their AI capabilities, with Google standing out as the clear leader, followed by a competitive landscape involving Microsoft, Meta, Amazon, and a diverse group of other providers. The results underline the importance of having vast computing resources to stay at the forefront of AI development and innovation.

Google AI – Development of therapeutic drugs is often difficult and time consuming. A new model, Tx-LLM, is able to predict the properties of many entities of potential interest for therapeutic development with accuracy comparable state-of-the-art specialty models.

Introducing Tx-LLM, a language model fine-tuned to predict properties of biological entities across the therapeutic development pipeline, from early-stage target discovery to late-stage clinical trial approval.

Source: https://research.google/blog/tx-llm-supporting-therapeutic-development-with-large-language-models/

Chinese startup Leju Robotics has released their open-source humanoid development platform for academic and R&D use cases. It includes an SDK for sensors and controls, simulation models, an LLM interface, and some basic demos that work out-of-the-box.

Source: https://www.reddit.com/r/singularity/?f=flair_name%3A%22Robotics%22

What Else is Happening in AI on October 09th 2024!

OpenAI and Hearst announced a strategic partnership to integrate content from over 20 magazine brands and 40+ newspapers into OpenAI’s AI products.

Source: https://openai.com/index/hearst

Hugging Face released OpenAI-Gradio, a new tool enabling the creation of AI-powered web apps using OpenAI’s models in just minutes with minimal code.

Source: https://x.com/Gradio/status/1843698665472368665

Uber unveiled plans to launch an OpenAI-powered AI assistant in early 2025 to help drivers with electric vehicle questions, aiming to accelerate EV adoption on the platform.

Source: https://www.reuters.com/technology/artificial-intelligence/uber-launch-ai-assistant-powered-by-openais-gpt-4o-help-drivers-go-electric-2024-10-08

Anthropic launched Message Batches API, allowing developers to submit up to 10,000 queries for async processing in under 24 hours at a 50% discount compared to standard API calls.

Source: https://www.anthropic.com/news/message-batches-api

Google added the ability to drag and drop any file type to upload directly into its AI Studio without importing it to Google Drive.

Source: https://x.com/officiallogank/status/1843723911055454580

KoBold Metals raised $527M for its AI-powered mineral discovery tech that leverages extensive data analysis to uncover deposits with energy-critical minerals like copper, lithium, and nickel.

Source: https://techcrunch.com/2024/10/07/ai-powered-critical-mineral-startup-kobold-metals-has-raised-491m-filings-reveal/

 

AI Tools Updates

Machine Learning & AI For Dummies PRO on the App Store (apple.com)

Machine Learning and AI For Dummies
Machine Learning and AI For Dummies

This comprehensive app serves as a one-stop resource for mastering Machine Learning and AI concepts, from basics to advanced topics. It offers a rich array of features including over 600 quizzes covering cloud ML operations on major platforms, fundamental and advanced ML concepts, and NLP. The app also provides cheat sheets, interview preparation materials, and daily-updated content to keep users abreast of the latest developments. With interactive elements like scorecards and timers, it offers an engaging learning experience for both beginners and experienced professionals looking to enhance their ML and AI expertise. iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211

 

CogvideoX-ControlNet: A new tool for turning images into short videos using the powerful CogvideoX model. It’s open-source, so check it out and contribute if you’d like!

Meta Movie Gen: Now adds audio to your videos! From background sounds to music, this AI brings your videos to life.

Veo by Google DeepMind: Google’s latest advanced video creation tool. Watch it in action!

FLUX.1-dev ControlNet Inpainting: Perfect for fixing or filling in missing spots in your images.

Source: https://comfyuiblog.com/ai-news-cogvideox-controlnet-and-veo-by-google-deepmind-and-more/

 

A Daily Chronicle of AI Innovations on October 08th  2024

🧠Nobel Prize awarded to ‘godfather of AI’ who warned it could wipe out humanity

🤖 Inflection and Intel team up on enterprise AI

💰Nvidia Overtakes Microsoft as AI Powers Stock to 6-Week Record High

🕶️ Students turn AI glasses into doxing devices

✅ Checklists improve AI model evaluation

👀 AI images taking over google

🚗 Uber will use ChatGPT to get more people to use EVs

🎨 Adobe has a new tool to protect artists’ work from AI

🧠Nobel Prize awarded to ‘godfather of AI’ who warned it could wipe out humanity

r/artificial - Nobel Prize awarded to ‘godfather of AI’ who warned it could wipe out humanity

The Nobel Prize in Physics 2024 was awarded to John J. Hopfield and Geoffrey E. Hinton “for foundational discoveries and inventions that enable machine learning with artificial neural networks.”

Hinton … hopes that the award might make people take the fears he voices more seriously.

The Royal Swedish Academy of Sciences has decided to award the 2024 Nobel Prize in Physics to John J. Hopfield and Geoffrey E. Hinton “for foundational discoveries and inventions that enable machine learning with artificial neural networks.”

  • Geoffrey Hinton and John Hopfield, credited with ‘establishing the foundations for today’s advanced machine learning technologies’, were awarded the Nobel Prize in physics for their pioneering work on artificial neural networks mimicking brain structures.
  • Their innovations helped enable AI systems to learn by identifying complex patterns from data, which is foundational to high-profile applications like language generation and image recognition currently used in technology.
  • Despite the recognition, Hinton has expressed concern over AI’s potential risks, highlighting the danger of bad actors misusing the technology, and recently left Google to focus on advocating for responsible AI development.
 
Ace the Microsoft Azure Fundamentals AZ-900 Certification Exam: Pass the Azure Fundamentals Exam with Ease

Source: https://www.nobelprize.org/

💰Nvidia Overtakes Microsoft as AI Powers Stock to 6-Week Record High

 

On Monday, Nvidia stock went up even though most other big tech stocks went down. This helped the AI giant recover its position as the world’s second-largest company during the AI boom. 

Source: https://theaiwired.com/nvidia-overtakes-microsoft-as-ai-powers-stock-to-6-week-record-high/

👀 AI images taking over google

r/singularity - AI images taking over google

Hard to see how this isn’t the beginning of the end of the information era…

Source: https://www.reddit.com/r/singularity/comments/1fyf93x/ai_images_taking_over_google/

🚗 Uber will use ChatGPT to get more people to use EVs 

  • Uber is introducing an AI assistant powered by ChatGPT to help drivers with questions about purchasing and using electric vehicles, aiming to encourage EV adoption.
  • The company is rolling out a new “EV Preference” feature, allowing users to select rides exclusively from electric vehicles, which will be available in the app over the coming months.
  • As part of its sustainability goals, Uber is expanding its EV-only service in 40 cities and aims to become a zero-emission mobility platform in North America and Europe by 2030, and globally by 2040.

Source: https://www.theverge.com/2024/10/8/24264282/uber-green-ev-driver-mentor-chatgpt

🎨 Adobe has a new tool to protect artists’ work from AI

  • Adobe plans to launch a new web app in 2025, alongside a Chrome extension, to help protect artists’ work by applying tamper-evident metadata, known as Content Credentials, and allowing creators to opt-out of generative AI models.
  • This web app will integrate with Adobe’s Creative Cloud applications and enable artists to uniformly embed creator information across content, simplifying the opt-out process from AI training databases compared to individual submissions for each AI provider.
  • While Adobe’s initiative seeks widespread industry support, only a few companies like Spawning have committed to adopting these protections, highlighting Adobe’s challenge in ensuring voluntary participation from other AI and tech companies.
  • Source: https://www.technologyreview.com/2024/10/08/1105234/adobe-wants-to-make-it-easier-for-artists-to-blacklist-their-work-from-ai-scraping

🤖 Inflection and Intel team up on enterprise AI

 Inflection AI just launched Inflection for Enterprise, a new system built in partnership with Intel and designed for large-scale business deployments – featuring both a cloud service, new commercial API and upcoming local appliance.

  • Inflection for Enterprise is built on the new Inflection 3.0 model family and powered by Intel’s Gaudi 3 AI accelerators.

  • An on-premises AI appliance is planned for Q1 2025 release, promising up to 2x improved price-performance over competitors.

  • Inflection 3.0 comes in two variants — Pi 3.0 for chatbots and Productivity 3.0 for instruction-following tasks.

  • Inflection also released a commercial API, enabling developers to build advanced conversational AI applications.

After a turbulent year following founder Mustafa Suleyman and much of the team’s departure to Microsoft, Inflection is pivoting from consumer-focused apps to enterprise solutions. While the startup will face no shortage of competitors, a partnership with Intel is a positive start for the new regime.

Source: https://www.intel.com/content/www/us/en/newsroom/news/inflection-ai-intel-launch-enterprise-ai-system.html

If you are looking for an all-in-one solution to help you prepare for the AWS Cloud Practitioner Certification Exam, look no further than this AWS Cloud Practitioner CCP CLF-C02 book

✅ Checklists improve AI model evaluation

Researchers from the University of Oxford and Cohere just developed TICK, a new approach for evaluating AI language models that use AI-generated checklists to improve assessment accuracy and interpretability.

  • TICK uses an AI model to generate a checklist of yes/no questions to evaluate how well another AI model followed a given instruction.

  • The checklist-based method showed 5.8% higher agreement with human evaluators than standard AI evaluation techniques.

  • The researchers also developed STICK (Self-TICK), which uses the checklists for self-improvement, leading to 7.8% better performance on reasoning tasks.

  • TICK can be fully automated, making it faster and cheaper than checklist-based evaluations requiring human input.

LLMs are weird — and sometimes even simple formatting quirks (remember the ‘take a deep breath’ prompt?) can lead to unexpected results. When looking for new techniques to get the most out of AI models and evaluations, maybe it’s ideal to return to the basics of human organization and learning.

Source: The Rundown

What Else is Happening in AI on October 08th 2024!

Former Google CEO Eric Schmidt argued at the Washington AI Summit that AI advances should take precedence over climate goals, saying, “We’re not going to hit the climate goals anyway because we’re not organized to do it.”

Source: https://mashable.com/article/former-google-ceo-invest-ai-despite-climate-concerns

Northrop Grumman announced an AI-powered enhancement to its Forward Area Air Defense system, enabling rapid decision-making against drone swarms.

Source: https://news.northropgrumman.com/news/releases/northrop-grumman-to-develop-prototype-artificial-intelligence-assistant

Nvidia and Peking University researchers introduced EdgeRunner, a new model for high-quality, detailed 3D mesh generation.

Source: https://arxiv.org/html/2409.18114v1

Enterprise GenAI startup Writer is reportedly set to raise between $150-200M at a $1.9B valuation, doubling its valuation from its $100M Series B round last September.

Source: https://www.forbes.com/sites/rashishrivastava/2023/09/18/ai-startup-writer-raises-100-million-to-take-on-chatgpt-enterprise/

Security researcher Harish SG published research showing evidence that LLMs can be prompted to achieve reasoning levels of powerful models like OpenAI’s o1 using a combination of advanced prompt tactics.

Source: https://openai.com/index/building-an-early-warning-system-for-llm-aided-biological-threat-creation/

Trending AI Tools:

Machine Learning & AI For Dummies PRO on the App Store (apple.com)

Machine Learning and AI For Dummies
Machine Learning and AI For Dummies

This comprehensive app serves as a one-stop resource for mastering Machine Learning and AI concepts, from basics to advanced topics. It offers a rich array of features including over 600 quizzes covering cloud ML operations on major platforms, fundamental and advanced ML concepts, and NLP. The app also provides cheat sheets, interview preparation materials, and daily-updated content to keep users abreast of the latest developments. With interactive elements like scorecards and timers, it offers an engaging learning experience for both beginners and experienced professionals looking to enhance their ML and AI expertise. iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211

 

  • 🤖 Dashworks Bots – Create AI assistants that answer your team’s questions

  • 📜 Theneo – Generate Stripe-like API docs in seconds

  • 📸 Flash – Supercharge your learning with AI-powered flashcards

  • 🔥 Firebender – A privacy-first coding assistant for Android Studio

  • 🏠 Bramble  AI-backed real estate brokerage to buy a home end-to-end

A Daily Chronicle of AI Innovations on October 07th  2024

🤖 OpenAI and Altera create digital humans

💊 AI identifies drug candidates for pain relief

🤖 Fewer websites are blocking OpenAI’s web crawler

🦾 Nvidia Acquires OctoAI To Dominate Enterprise Generative AI Solutions.

🚖Uber Expands Robot Delivery and Robotaxi Offerings With Avride.

🤖 Hitachi launches AI-powered railway maintenance service with Nvidia.

🔮 New Nvidia ACE plugins for Unreal Engine 5 simplify the creation of AI digital humans.

💰 Jensen Huang is now worth more than Intel

📱 Run Llama 3.2 locally on your phone

👀The impact of generative AI as a general-purpose technology

👨‍⚖️The racist AI deepfake that fooled and divided a community

💰 Jensen Huang is now worth more than Intel 

  • Jensen Huang, CEO of Nvidia, has a net worth of $109.2 billion, surpassing Intel’s current market value of $96.39 billion, which saw a significant drop following revelations about its financial issues in August.
  • Nvidia’s growth, driven by an AI boom and its dominance as a GPU accelerator manufacturer, helped its market cap soar, placing it among the top valued companies worldwide, though its stock has corrected by 10% since its peak.
  • Huang’s significant stake in Nvidia, with holdings valued over $100 billion, and his strategic share sales have propelled him to the 11th position on Forbes’ real-time billionaires list, close to entering the top 10.
  • Source: https://www.msn.com/en-gb/money/other/jensen-huang-is-now-worth-more-than-intel-personal-net-worth-currently-valued-at-109b-vs-intel-s-96b-market-cap/ar-AA1rMKD3

🤖 Fewer websites are blocking OpenAI’s web crawler

  • OpenAI’s web crawlers are facing fewer blocks from major news websites compared to earlier, despite a widespread data-protection rush where publishers attempted to prevent their content from becoming AI training data without consent.
  • The trend of blocking OpenAI’s GPTBot saw a decline after the company made a series of licensing agreements with publishers, leading some outlets to revise their robots.txt files and permit GPTBot access.
  • Despite robots.txt not being legally binding, it remains a widely observed standard for web crawler behavior, and OpenAI recognizes the importance of not being blocked to safeguard its future goals and ambitions.
  • Source: https://www.theverge.com/2024/10/7/24264184/fewer-websites-are-blocking-openais-web-crawler-now

🦾 Nvidia Unveils NVLM 1.0-A Bold Rival to ChatGPT in Generative AI

 

Advanced AI model NVLM 1.0 from Nvidia competes with ChatGPT and Gemini, doing better at jobs like vision-language and solving complex problems.

Source: https://theaiwired.com/nvidia-unveils-nvlm-1-0-a-bold-rival-to-chatgpt-in-generative-ai/

🤖 OpenAI and Altera create digital humans

OpenAI just published a case study on Altera, a startup using GPT-4o to develop AI agents called “digital humans” capable of prolonged, natural interactions with people — significantly outperforming other rivals during testing in Minecraft.

  • Altera, founded by ex-MIT professor Dr. Robert Yang, uses GPT-4o to power AI agents that can play Minecraft autonomously for up to 4 hours.

  • Altera’s system combines GPT-4o with a brain-inspired multi-module architecture to simulate cognitive functions and emotional processing.

  • OpenAI reports that Altera’s agents outperform other models in Minecraft tasks, collecting 32% of items compared to 6.4% for the next best model.

  • The startup plans to expand beyond gaming to create AI ‘coworkers’ and more complex multi-agent simulations.

We’ve constantly heard from Sam Altman and others that AI agents are coming fast — and case studies like this (as well as a cryptic ‘Level 3’ tweet from an OpenAI researcher) might mean the capabilities have already arrived. We might ascend the ‘Stages of AI’ ladder faster than most are anticipating.

Source: https://www.forbes.com/sites/jodiecook/2024/07/16/openais-5-levels-of-super-ai-agi-to-outperform-human-capability/

💊 AI identifies drug candidates for pain relief

Researchers at Cleveland Clinic and IBM just developed an AI model to predict how drugs and gut microbes interact with pain receptors, potentially uncovering new non-addictive pain treatments.

  • LISA-CPI analyzes both the molecular structure of compounds and the 3D shape of pain receptors to predict their interactions.

  • The model identified FDA-approved drugs, like methylergometrine, that could potentially be repurposed for pain treatment by targeting specific receptors.

  • LISA-CPI also discovered gut microbes that may interact with pain receptors in beneficial ways.

  • The approach could accelerate drug discovery for pain and other conditions by more accurately screening potential compounds.

 The current opioid crisis highlights the urgent need for effective, non-addictive pain medications, and this AI-driven approach could help researchers more quickly identify promising drug candidates while also opening new avenues for pain management.

🎥 Meta unveils advanced AI video model

Meta just announced Movie Gen, a powerful new suite of AI models for generating and editing video and audio content, positioning itself as a direct competitor to OpenAI’s Sora and other industry leaders.

  • Movie Gen consists of four models: a 30B video generation model, a 13B audio model, a personalized video model, and a video editing model.

  • The system can generate HD videos up to 16 seconds long from text prompts, along with synchronized audio like sound effects and background music.

  • Movie Gen also features video editing via natural text prompts and the ability to upload a reference image to create personalized videos.

  • Meta claims the model outperforms rivals like Runway Gen3, Luma Labs, and OpenAI’s Sora in human video quality and consistency evaluations.

  • Meta CEO Mark Zuckerberg said that Movie Gen will be ‘coming to Instagram next year’ in a post displaying some of the model’s sample generations.

Meta’s Movie Gen separates itself from other video generators by not only generating videos from text, but also being able to perform precise video editing. With the models coming to Instagram, it could transform the content creation process and give the masses a powerful video editing suite—with only prompting required.

📱 Run Llama 3.2 locally on your phone

Meta’s new Llama 3.2 3B model can run directly on your smartphone, allowing you to have AI conversations privately and offline.

  1. Download PocketPal AI from the App Store.

  2. Open the app, tap the top-left menu, and select “Models.”

  3. Under “Llama,” download “llama-3.2-3b-instruct q4_k” (2.2 GB).

  4. Once downloaded, tap “Load” to activate the model.

  5. Return to the main menu, select “Chat,” and start conversing with AI!

Create a local knowledge base that can be queried alongside the model, allowing you to supplement the AI’s knowledge with custom, up-to-date information without requiring an internet connection.

Source: https://apps.apple.com/us/app/pocketpal-ai/id6502579498

 

👀The impact of generative AI as a general-purpose technology

 

Generative artificial intelligence will affect economic growth more quickly than other general-purpose technologies, according to a new report.
The steam engine, the internal combustion engine, electrification, and computers are all considered “general-purpose technologies” — new tools that are powerful enough to accelerate overall economic growth and transform economies and societies. According to many experts, generative artificial intelligence will be the next invention to join that category.

In a recent report about the economic impact of generative AI, Google visiting fellow and MIT Sloan principal research scientist Andrew McAfee makes the case that generative AI is not only a game-changing general-purpose technology but could also spur change far more quickly than preceding innovations due to its accessibility and ease of diffusion. 

Source: https://mitsloan.mit.edu/ideas-made-to-matter/impact-generative-ai-a-general-purpose-technology

👨‍⚖️The racist AI deepfake that fooled and divided a community

When an audio clip appeared to show a local school principal making derogatory comments, it went viral online, sparked death threats against the educator and sent ripples through a suburb outside the city of Baltimore. But it was soon exposed as a fake, manipulated by artificial intelligence – so why do people still believe it’s real?

Source: https://www.bbc.com/news/articles/ckg9k5dv1zdo

What Else is Happening in AI on October 07th 2024!

Apple will reportedly release its Apple Intelligence features on Oct. 28 alongside the iOS 18.1 update, according to Bloomberg insider Mark Gurman.

Source: https://www.iphoneincanada.ca/2024/10/06/apple-intelligence-release-date-oct-28-with-ios-18-1-report/

Google began rolling out the new AI anti-theft features for Android devices showcased at Google I/O, including Theft Detection Lock, Offline Device Lock, and Remote Lock.

Source: https://lifehacker.com/tech/google-rolling-out-three-anti-theft-features-for-android

Cohere launched improved fine-tuning features for its Command R LLM, including longer context support and a ‘bring your own fine-tune’ option.

Source: https://cohere.com/blog/commandr-fine-tuning

AI startup Otherside AI’s Reflection 70B model failed to match performance claims in tests published by the team in a post-mortem of the release after being initially touted as the ‘world’s best open-source model.’

Source: https://the-decoder.com/worlds-best-open-source-model-falls-short-of-promised-performance/

North Carolina musician Michael Smith faces federal charges for allegedly using AI to generate thousands of songs and bots to stream them billions of times, netting over $10M in royalties.

Source: https://apnews.com/article/music-fraud-ai-arrest-4f09a714971f450fb3c9103c927cb091

Trending AI Tools

Machine Learning and AI For DummiesMachine Learning & AI For Dummies PRO

Ready to accelerate your career in the fast-growing fields of AI and machine learning? Our app offers user-friendly tutorials and interactive exercises designed to boost your skills and make you stand out to employers. Whether you’re aiming for a promotion or searching for a better job, AI & Machine Learning For Dummies PRO is your gateway to success. Start mastering the technologies shaping the future—download now and take the next step in your professional journey! iOSWindows

👨‍💼 Cheatlayer – Automate your business using natural language: https://cheatlayer.com/

🤝 Mindpal’s SalesBox – Build your own AI sales OS with multi-agent workflows: https://mindpal.space/

🤑 Trillion – Track expenses, manage accounts and set financial goals with AI planning: https://apps.apple.com/us/app/trillion-budget-management/id6504283874

🛒 BuyScout  Your AI copilot for online shopping: https://www.buyscout.app/

🗓️ Selfletter – Break complex goals into simple tasks with AI: https://www.selfletter.com/

AI Weekly Rundown: 🍎Apple releases AI model that rewrites the rules of 3D vision 🎥 Meta unveils an AI video generator 🔥 ChatGPT gets a collab boost with Canvas 🔎Google rolls out ads in AI Overviews 🧠Google is Working on Reasoning AI and more
AI Weekly Rundown: 🍎Apple releases AI model that rewrites the rules of 3D vision 🎥 Meta unveils an AI video generator 🔥 ChatGPT gets a collab boost with Canvas 🔎Google rolls out ads in AI Overviews 🧠Google is Working on Reasoning AI and more

A Daily Chronicle of AI Innovations on October 04th  2024:

🧠 Apple releases AI model that rewrites the rules of 3D vision

Listen at https://podcasts.apple.com/us/podcast/a-daily-chronicle-of-ai-innovations-on-october/id1684415169?i=1000671816462

🦾 Nvidia presents EdgeRunner. The method can generate high-quality 3D meshes with up to 4,000 faces at a spatial resolution of 512 from images and point-clouds.

🎥 Meta unveils an AI video generator

🔥 ChatGPT gets a collab boost with Canvas: its newest ChatGPT interface

🔎 Google launches one of its ‘most significant updates ever’

🕵️‍♂️ TikTok’s owner is scraping the web 25 times faster than OpenAI

🔎 Google rolls out ads in AI Overviews

🧠 Apple releases AI model that rewrites the rules of 3D vision 

  • Apple’s AI research team has unveiled Depth Pro, a new AI model that enhances machines’ depth perception using only a single 2D image, which could revolutionize fields like augmented reality and self-driving technology by offering real-time spatial awareness.
  • Depth Pro generates high-resolution 3D depth maps in just 0.3 seconds without needing traditional camera data, employing advanced techniques like a multi-scale vision transformer to accurately define details such as individual hairs and the edges of objects.
  • Open-sourced on GitHub, Depth Pro introduces metric depth estimation without extensive training on specific datasets, paving the way for widespread use in industries such as e-commerce, automotive, and healthcare, where sharp depth analysis is crucial.

Source: https://vuink.com/post/iragherorng-d-dpbz/ai/apple-releases-depth-pro-an-ai-model-that-rewrites-the-rules-of-3d-vision

🦾 Nvidia presents EdgeRunner. The method can generate high-quality 3D meshes with up to 4,000 faces at a spatial resolution of 512 from images and point-clouds.

https://packaged-media.redd.it/4dyp42vx94td1/pb/m2-res_720p.mp4?m=DASHPlaylist.mpd&v=1&e=1728241200&s=90d466443f216b3f4be4cea8a0dea727af2d82e7

Nvidia introduced EdgeRunner, an auto-regressive method capable of generating high-quality 3D meshes with up to 4,000 faces at a spatial resolution of 512. This approach efficiently processes images and point clouds, offering significant advancements in the field of 3D modeling.

Source: https://ar5iv.org/2409.18114

🎥 Meta unveils an AI video generator:

Meta’s new Sora competitor: Meta Movie Gen

  • Meta has introduced Movie Gen, an AI-powered model for video creation and editing, allowing users to generate high-definition video with audio and make precise edits using simple text commands, catering to filmmakers, content creators, and creative individuals.
  • Movie Gen offers personalization by combining uploaded images with descriptive text prompts to create customized videos, enhancing creative possibilities, and enabling scenarios ranging from fantasy realms to everyday adventures, while maintaining realistic human motion and identity.
  • The suite also includes advanced audio generation, with the 13-billion parameter model adding ambient sounds and music to video scenes, all aimed at democratizing content creation by offering professional-grade tools with user-friendly functionality.

Generate videos from text Edit video with text
Produce personalized videos
Create sound effects and soundtracks

Paper: MovieGen: A Cast of Media Foundation Models
https://ai.meta.com/static-resource/movie-gen-research-paper

Source: AI at Meta on X: https://x.com/AIatMeta/status/1842188252541043075

r/singularity - Meta Movie Gen - the most advanced media foundation AI models | AI at Meta

Source: https://ai.meta.com/research/movie-gen/

Apple just released Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

r/singularity - Apple just released Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

The paper presents a foundation model for zero-shot metric monocular depth estimation called Depth Pro. Depth Pro can produce high-resolution depth maps with sharp details and accurate object boundaries without requiring camera intrinsics like focal length. The superior performance of Depth Pro is attributed to its efficient multi-scale architecture, effective training curriculum, and dedicated boundary metrics. The model is able to accurately estimate depth and focal length in a zero-shot setting, enabling applications like view synthesis that require metric depth.

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second https://www.openread.academy/en/paper/reading?corpusId=509969387

GitHub – https://github.com/apple/ml-depth-pro?tab=readme-ov-file

🔥 ChatGPT gets a collab boost with Canvas: its newest ChatGPT interface

OpenAI just launched Canvas, a new ChatGPT interface release that enables more collaborative writing and coding projects beyond simple chat interactions with new editing features, shortcuts, and added contextual knowledge.

  • Canvas opens in a separate window alongside the chat, allowing users to directly edit and refine specific aspects of an output.

  • New features include inline feedback, targeted editing, and shortcuts for tasks like adjusting text length, changing reading levels, or debugging code.

  • In tests, using GPT-4o with Canvas led to a 30% accuracy and 16% quality boost compared to using the model without the interface.

  • Canvas is rolling out in beta to Plus and Team users, with a broader release expected later.

ChatGPT’s first major UI change takes a leap towards more nuanced, moldable interactions — while also inheriting novice-friendly features seen in other rivals with easy-to-use shortcuts. The simple chatbox was a good first step for human-AI interactions, but more power and capabilities require new collaborative processes.

Source: https://www.techradar.com/computing/artificial-intelligence/chatgpt-has-a-new-canvas-for-collaborating-with-the-ai-chatbot-on-writing-and-coding-ideas

🔎 Google launches one of its ‘most significant updates ever’

  • Google has integrated more AI features into its search functionalities, unveiling a range of updates such as AI-organized web results, enhanced Google Lens capabilities, and the incorporation of links and advertisements within AI Overviews.
  • This AI-driven search initiative kicks off with food-related content, where Google’s AI creates a comprehensive experience by aggregating diverse perspectives from across the web, including videos and forums, tailored to user queries.
  • Additional updates include the enhancement of AI Overviews with more prominent links to support website traffic, the integration of ads within these overviews, improved music identification features with Circle to Search, and significant upgrades to Google Lens for video, voice, and shopping inquiries.
  • Source: https://www.maginative.com/article/meta-unveils-movie-gen-ai-powered-video-creation-and-editing-suite/

🕵️‍♂️ TikTok’s owner is scraping the web 25 times faster than OpenAI

  • ByteDance, the parent company of TikTok, has launched a web scraper called Bytespider which is significantly outpacing similar tools by other companies in collecting online data for AI model training, operating at 25 times the speed of OpenAI’s GPTbot.
  • Unlike other web crawlers, Bytespider ignores the robots.txt file that web publishers use to regulate scraping activity, highlighting its aggressive approach to gathering data from the internet, amidst concerns related to copyright issues within generative AI development.
  • With the U.S. government pressuring ByteDance over national security issues, the rapid data collection by Bytespider seems to indicate ByteDance’s urgency in enhancing TikTok’s search functionality and possibly developing a new large language model to rival existing competitors.
  • Source: https://fortune.com/2024/10/03/bytedance-tiktok-bytespider-scraper-bot/

🔎 Google rolls out ads in AI Overviews

Google just announced the introduction of ads to its AI Overview search summaries and the launch of several new AI-powered search capabilities, such as video understanding and voice input.

  • Ads will now appear within and alongside AI Overviews for ‘relevant queries’ on searches in the United States.

  • The redesigned AI Overview format will now add prominent in-text links to better source websites for the curated information.

  • New AI-organized search results pages are rolling out that surface relevant, more diverse content — starting with recipe and meal inspiration queries.

  • Google Lens is getting video understanding capabilities and voice input options for visual searches.

  • The Android ‘Circle to Search’ feature also lets users identify songs playing in videos or streaming content.

Google’s first AI Overview experience didn’t exactly go as planned. However, with heavy competition from Perplexity and chatbot rivals, Google’s search future clearly has AI at its core, regardless of the bumps along the way. But infusing paid ads into AI Overviews could be a slippery slope – will Gemini be next?

Source: https://www.theverge.com/2024/10/3/24260637/googles-ai-overview-ads-launch

What Else is Happening in AI on October 04th 2024!

Google DeepMind hires key OpenAI Sora researcher Tim Brook for ‘world simulator’ project. 

Source: https://the-decoder.com/google-deepmind-hires-key-openai-sora-researcher-for-world-simulator-project/

Google released Gemini 1.5 Flash 8B, a lightweight, cost-effective variation with a 50% cost reduction and 2x higher rate limits than 1.5 Flash.

Source: https://www.neowin.net/news/google-democratizes-ai-with-gemini-15-flash-8b-the-cheapest-gemini-model-to-date

Fourier launched GR-2, the company’s second-generation humanoid robot, which features improvements to battery life, hand dexterity, mobility, and a new developer kit.

Source: https://finance.yahoo.com/news/fourier-unveils-next-generation-humanoid-123000642.html

OpenAI also secured a massive credit line. Source: https://techcrunch.com/2024/10/03/openai-also-secured-a-massive-credit-line/

Google’s AI can detect tuberculosis just by analyzing cough sound.

Source: https://www.newsbytesapp.com/news/science/google-ai-uses-cough-sound-to-diagnose-tuberculosis/story

OpenAI CFO Sarah Friar says their next AI model will be an order of magnitude bigger than GPT-4 and future models will grow at a similar rate, requiring capital-intensive investment to meet their “really big aspirations”

Trending AI Tools on October 04th 2024

🐝 Buzzabout – AI-driven insights from billions of discussions on social media: https://buzzabout.ai/

🤖 Base AI – Build serverless, autonomous AI agents with memory: https://baseai.dev/

💸 CostGPT – Estimate costs and time for your software project in less than 5 minutes: https://costgpt.ai/

👀 Lookie AI – Consume, organize, and manage knowledge from YouTube: https://apps.apple.com/kr/app/lookie-ai/id6670471730?l=en-GB

⏱️ Tackle AI – Automatic time tracking to align everyday actions with key priorities: https://www.timetackle.com/

A Daily Chronicle of AI Innovations on October 03rd  2024:

👓 Meta smart glasses can be used to dox anyone in seconds

💰 OpenAI is now valued at $157 billion

💥 Nvidia stunned the world with a ChatGPT rival that’s as good as GPT-4o

🥕 Microsoft to employees: you can continue working from home unless productivity drops

🤔 Google developing reasoning AI to rival OpenAI

👓 Meta smart glasses can be used to dox anyone in seconds 

  • Harvard students demonstrated how Meta’s smart glasses combined with facial recognition technology can dox individuals by revealing personal details like identities and phone numbers, using tools like I-XRAY and public databases in real-time.
  • The demo used existing technologies such as Meta’s Ray-Ban smart glasses and the PimEyes search engine, showing how a simple photo capture can quickly connect to public data, including names and addresses, raising privacy concerns.
  • Meta has privacy guidelines for its smart glasses, but the tiny notification light is hard to detect in bright light, leading to potential misuse despite the company warning users to respect others’ privacy and follow recording etiquette.
  • Source: https://www.theverge.com/2024/10/2/24260262/ray-ban-meta-smart-glasses-doxxing-privacy

💰 OpenAI is now valued at $157 billion

  • OpenAI has raised $6.6 billion in a new funding round, which has nearly doubled its valuation to $157 billion from a previous $86 billion, as reported by The Wall Street Journal.
  • The latest financing requires OpenAI to shift from its nonprofit model to a fully for-profit company, or investors have the right to retract their investments.
  • Major contributors to this funding round include Thrive Capital with a $1.25 billion investment and long-time supporter Microsoft, which added just under $1 billion more, with new investors like SoftBank and Nvidia also participating.
  • Source: https://arstechnica.com/ai/2024/10/openai-is-now-valued-at-157-billion/

💥 Nvidia stunned the world with a ChatGPT rival that’s as good as GPT-4o 

  • In early October 2024, Nvidia surprised the AI community by unveiling NVLM 1.0, a series of advanced multimodal language models with capabilities matching those of the GPT-4o model from ChatGPT.
  • Instead of releasing a direct competitor to consumer-facing AI applications like ChatGPT or Claude, Nvidia is opting to allow others to create their own AI solutions by making the model weights of NVLM publicly accessible.
  • Nvidia, previously renowned for supplying essential chips for AI processes, is now demonstrating its prowess in generative AI through its innovative approach to sharing AI technology development resources.
  • Source: https://bgr.com/tech/nvidia-stunned-the-world-with-a-chatgpt-rival-thats-as-good-as-gpt-4o/

🥕 Microsoft to employees: you can continue working from home unless productivity drops

  • Microsoft has decided to allow employees to continue working from home, maintaining flexibility as long as it does not affect productivity, contrasting with companies like Amazon that have mandated a return to the office.
  • Scott Guthrie, Microsoft Executive Vice President, assured workers in a meeting that the company values flexible working arrangements, though productivity must remain steady to keep the remote work model viable.
  • The remote work setup is considered beneficial for both employees and Microsoft, though the company remains cautious about the risks, such as decreased productivity and potential misuse of work hours for personal activities.
  • Source: https://www.techspot.com/news/104972-microsoft-assures-employees-they-can-continue-working-home.html

🤔 Google developing reasoning AI to rival OpenAI

Google is reportedly making significant strides in developing AI models with advanced reasoning capabilities similar to OpenAI’s o1 system, intensifying the rivalry between the two AI giants.

  • Multiple teams at Google are working on AI that can solve complex, multi-step problems, according to Bloomberg.

  • The AI uses chain-of-thought prompting, a technique created by Google, to tackle complex math and programming problems by ‘thinking’ before responding.

  • Google is taking a more cautious approach to its releases than OpenAI but has already debuted math-focused reasoning models like AlphaProof and AlphaGeometry 2.

  • Microsoft also infused reasoning capabilities into its Copilot assistant this week, leveraging OpenAI’s o1 model.

Human-like reasoning and agentic capabilities are clearly the two major developments on every AI firm’s roadmap, and the release of o1 may have signaled a new phase in the LLM race. The question is — will OpenAI’s speed keep it a step ahead, or is the competition for top-tier models about to get a whole lot tougher?

Source: https://qz.com/google-reasoning-ai-model-compete-openai-chatgpt-gemini-1851663139

What Else is Happening in AI on October 03rd 2024!

The Cancer AI Alliance formed a $40M collaboration between major medical institutions and tech giants like Microsoft, AWS, Nvidia, and Deloitte to advance AI-driven cancer care.

Source: https://techcrunch.com/2024/10/02/cancer-ai-alliance-joins-medical-and-tech-expertise-together-with-40m-to-collaborate-on-next-gen-care/

Character AI is reportedly shifting its focus away from building AI models in the wake of its $2.7B deal with Google and prioritizing its consumer chatbot service.

Source: https://www.btimesonline.com/articles/169707/20241003/character-ai-quits-ai-model-race-after-4-billion-google-deal-shifts-focus-to-consumer-chatbot-platform.htm

Elon Musk posted ‘OpenAI is evil’ on X in response to reports that the AI giant asked investors to avoid funding competing AI firms like Anthropic and Musk’s xAI.

Source: https://www.yahoo.com/tech/elon-musk-called-openai-evil-030055401.html

Accenture announced a new partnership with NVIDIA to accelerate enterprise AI adoption, launching a business group and AI Refinery platform to scale agentic AI systems across industries.

Source: https://newsroom.accenture.com/news/2024/accenture-and-nvidia-lead-enterprises-into-era-of-ai

New ChatGPT feature: GPT-4o with Canvas.

r/singularity - New ChatGPT feature: GPT-4o with Canvas.

Latest AI Tools October 03rd 2024

WALDO: a detection AI model designed to identify specific objects, such as vehicles and utility poles, in overhead images from various altitudes, useful for tasks requiring object recognition in large-scale imagery. 

Source: https://github.com/stephansturges/WALDO

Kameo: a Rust library for creating fault-tolerant, distributed, and asynchronous actors using Tokio, facilitating seamless communication across nodes with features like scalability, backpressure handling, and panic recovery. 

Source: https://github.com/tqwewe/kameo

TinyJS: a lightweight JavaScript library that simplifies the creation of HTML elements, property assignment, and DOM element selection with unique $ and $$ shortcuts, enhancing web development efficiency. 

Source: https://github.com/victorqribeiro/TinyJS

QBittorrent: an open-source BitTorrent client designed to be a lightweight alternative to other clients, offering ad-free usage, stability, and a variety of features.

Source: https://github.com/qbittorrent/qBittorrent

Serving 70B-Scale LLMs Efficiently on Low-Resource Edge Devices: the paper discusses methods for running large language models (LLMs) efficiently on devices with limited resources.

Source: https://arxiv.org/abs/2410.00531

A Daily Chronicle of AI Innovations on October 02nd  2024:

Listen at https://podcasts.apple.com/us/podcast/a-daily-chronicle-of-ai-innovations-on-october/id1684415169?i=1000671578473

🧠Google is Working on Reasoning AI – Bloomberg News

💰’SoftBank Shares Surge as CEO Pushes AI Superintelligence Vision With OpenAI

⚙️ OpenAI makes 4 major announcements at DevDay

🚀 Microsoft Copilot gets voice, vision upgrade

🤖 Google develops new AI model to rival OpenAI o1

👀 OpenAI co-founder joins rival Anthropic

⚙️ OpenAI makes 4 major announcements at DevDay

r/singularity - New tools for devs

Here’s a link to the announcement: https://openai.com/devday/

OpenAI’s recent DevDay conference took a different approach from last year’s event, focusing on incremental improvements rather than major product launches. The company introduced four key innovations: Vision Fine-Tuning, Realtime API, Model Distillation, and Prompt Caching, all aimed at empowering developers and enhancing the AI ecosystem.

Prompt Caching: This feature reduces costs and latency for developers by applying a 50% discount on input tokens that the model has recently processed, potentially leading to significant savings.

r/singularity - OpenAI DevDay: Vision Fine-Tuning, Realtime API, Model Distillation, and Prompt Caching

Vision Fine-Tuning: This allows developers to customize GPT-4o’s visual understanding capabilities using both images and text, with applications in fields like autonomous vehicles and medical imaging. For example, Grab improved its mapping services using this technology.

Realtime API: Now in public beta, this API enables low-latency, multimodal experiences, particularly in speech-to-speech applications. It allows for natural conversation and mid-sentence interruptions, opening up possibilities for voice-enabled applications in various industries.

Model Distillation: This workflow allows developers to use outputs from advanced models to improve the performance of more efficient models, making sophisticated AI capabilities more accessible and cost-effective.

OpenAI’s strategic shift towards ecosystem development over headline-grabbing product launches reflects a mature understanding of the AI industry’s current challenges and opportunities. By focusing on refining tools and reducing costs, OpenAI aims to foster a thriving developer ecosystem and ensure sustainable AI adoption across various industries.

  • Realtime API enables speech-to-speech application building using the same model that powers Advanced Voice, with the ability to choose from six voices. “Until right now, voice has been a second activity“, and that the Realtime API is going to make AI significantly more accessible because many people in the real world prefer to speak over reading or texting. Realtime API will have a “no-brainer” impact on customer support, education, and coaching. He also believes there will be many ‘non-obvious‘ use cases that are hard to predict now. For now, Realtime API only supports text and audio. However, Godement believes that image and video are the next milestones on the road to agents that can perceive the world just like a human. He also mentioned that image and video understanding specifically, will “turbocharge customer support” when the model has the ability to understand pixels on a screen in real-time. https://openai.com/index/introducing-the-realtime-api/

  • Model Distillation simplifies fine-tuning smaller models using outputs from larger ones, making training more accessible to developers. https://openai.com/index/api-model-distillation/

  • Prompt Caching reduces costs by nearly 50% across models and speeds up responses by up to 80% when reusing recent input tokens in API calls. https://openai.com/index/api-prompt-caching/

  • New prompt generator on https://playground.openai.com

  • Access to the o1 model is expanded to developers on usage tier 3, and rate limits are increased (to the same limits as GPT-4o)

🚀 Microsoft Copilot gets voice, vision upgrade

Microsoft just announced a slew of AI upgrades coming to its Copilot assistant for Windows PCs, including new vision and voice capabilities, personalization enhancements, a re-release of the controversial Recall feature, and more.

  • Copilot Voice allows users to interact with natural speech, adding conversational and intuitive communication similar to OpenAI’s Voice Mode.

  • Copilot Vision enables the AI to understand and interact with web content a user is viewing, offering context-aware help within the Microsoft Edge browser.

  • ‘Think Deeper’ gives Copilot new enhanced reasoning capabilities using chain-of-thought reasoning powered by OpenAI’s o1 model.

  • Microsoft’s ‘Recall’ feature is set to return, requiring an opt-in with upgraded privacy and security measures.

  • Microsoft AI CEO Mustafa Suleyman highlighted Copilot’s ability to ultimately ‘act on your behalf’ and adapt to user’s personal preferences and needs.

Microsoft is bringing the heat with these major Copilot upgrades, levelling up the assistant to align with the latest cutting-edge AI features across the industry — while bringing users one step closer to a truly agentic experience.

Source: https://www.theverge.com/2024/10/1/24259187/microsoft-copilot-redesign-vision-voice-features-inflection-ai

🧠Google is Working on Reasoning AI – Bloomberg News

 

Google is working on artificial intelligence software that resembles the human ability to reason, similar to OpenAI’s o1, marking a new front in the rivalry between the tech giant and the fast-growing startup.

In recent months, multiple teams at Alphabet Inc.’s Google have been making progress on AI reasoning software, according to people with knowledge of the matter, who asked not to be identified because the information is private.

AI researchers are pursuing reasoning models as they search for the next significant step forward in the technology. Like OpenAI, Google is trying to approximate human reasoning using a technique known as chain-of-thought prompting, according to two of the people. In this technique, which Google pioneered, the software pauses for a matter of seconds before responding to a written prompt while, behind the scenes and invisible to the user, it considers a number of related prompts and then summarizes what appears to be the best response.

Since OpenAI unveiled its o1 model, known internally as Strawberry, in mid-September, some in DeepMind have fretted that the company had fallen behind, according to another person with knowledge of the matter. But employees are no longer as concerned as they were following the launch of ChatGPT, now that Google has debuted some of its own work, the person said. In July, Google showcased AlphaProof, which specializes in math reasoning, and AlphaGeometry 2, an updated version of a model focused on geometry that the company debuted earlier this year.

Source: https://www.bnnbloomberg.ca/business/technology/2024/10/02/google-is-working-on-reasoning-ai-chasing-openais-efforts/

💰SoftBank Shares Surge as CEO Pushes AI Superintelligence Vision With OpenAI, who previously claimed that creating ASI was his “life’s purpose”

r/singularity - 'SoftBank Shares Surge as CEO Pushes AI Superintelligence Vision With OpenAI, who previously claimed that creating ASI was his "life’s purpose"

Source: https://www.ccn.com/news/technology/softbank-shares-surge-ceo-pushes-ai-superintelligence-vision-openai/

What Else is Happening in AI on October 02nd 2024!

OpenAI founding member Durk Kingma announced that he is joining Anthropic, reuniting with several former OpenAI employees and highlighting the company’s mission of responsible AI development in his X post.

Pika Labs unveiled Pika 1.5, a new video generation model upgrade featuring enhanced effects, realistic movement, longer clip creation, and cinematic capabilities.

Anyscale unveiled major upgrades to its AI platform at Ray Summit 2024, including a GPU-native Ray architecture, RayTurbo for enhanced performance, Ray Data for unstructured data processing, and more.

U.S. AI chipmaker Cerebras officially filed for an IPO, with the Sam Altman-backed Nvidia competitor expected to be valued at between $7-8B.

Meta released the open-source code and developer suite for its Segment Anything Model (SAM) 2.1, an upgraded version of its image and video segmentation tool.

Nvidia introduced NVLM 1.0, an open-source family of multimodal models that achieve SOTA performance on vision-language and text tasks.

Pinterest launched Performance+, a suite of new AI tools for advertisers that includes the ability to create background images for products and automation features for ad campaigns.

NotebookLM is too good

You can upload multiple books, hours long videos and audios into that thing and it processes everything so well. It’s so good at resuming, finding specific quotes, answering questions, explaining some stuff and the podcast feature too is mindblowing. It can even do the same for videos, texts and audios in foreign languages and translate, explain and resume it in order for you to understand. And it’s not super censored too. Can’t believe this thing is actually free and i’m just finding about it now.

A basic systems architecture for AI agents that do autonomous research

r/singularity - A basic systems architecture for AI agents that do autonomous research

Source: https://www.lesswrong.com/posts/6cWgaaxWqGYwJs3vj/a-basic-systems-architecture-for-ai-agents-that-do

OpenAI has released Whisper V3 Turbo model yesterday. The turbo model is an optimized version of large-v3 that offers 8x faster transcription speed with minimal degradation in accuracy

Source: https://huggingface.co/spaces/hf-audio/whisper-large-v3-turbo

Harvard students Build and show off AR glasses project that uses face detection, internet sleuthing, and AI to give you near instant dossiers (address, family info, name, etc) on people you see. Good proof of concept to raise awareness on what we may see in the future.

Source: https://x.com/AnhPhuNguyen1/status/1840786336992682409

https://x.com/i/status/1840786336992682409

Trending AI Tools on October 02nd 2024

🎥 Video SDK 3.0 – Build and integrate real-time multimodal AI characters: https://github.com/Xilinx/video-sdk/discussions/81

📭 Inbox Zero  An open-source, AI personal assistant for email: https://www.getinboxzero.com/ai-automation

👩🏻‍💻 Graphite – Your AI code review companion: https://graphite.dev/blog/graphite-reviewer-launch

📚 Ello – An AI reading companion for children offering personalized support: https://www.ello.com/

🗣️ VivaChat – FaceTime video chat with realistic AI personas: https://www.vivalabs.ai/

A Daily Chronicle of AI Innovations on October 01st  2024:

🔮 Microsoft gives Copilot a voice and vision

💻 Chromebooks are getting a dedicated AI key

👓 Microsoft is discontinuing its HoloLens headsets

🫠 Y Combinator faces backlash after funding an AI startup that admits it basically cloned another AI startup

❌ California’s controversial AI safety bill vetoed

💰 OpenAI secures SoftBank funding as Apple exits raise

💧 Liquid AI unveils efficient new LFM models

🔮 Microsoft gives Copilot a voice and vision 

  • Microsoft has unveiled a major overhaul to its Copilot experience, adding both voice and vision capabilities, transforming it into a more personalized AI assistant similar to OpenAI’s Advanced Voice Mode.
  • The redesign features a new card-based user interface inspired by Inflection AI’s Pi assistant, and Copilot now offers a virtual news presenter mode, tailored homepage and improved customization based on user interaction history.
  • Initial releases of Copilot Voice and Copilot Daily will be available in select regions, while Copilot Vision features are in a limited preview phase, focusing on enhancing user safety and privacy through restricted website interactions.
  • Source: https://www.theverge.com/2024/10/1/24259187/microsoft-copilot-redesign-vision-voice-features-inflection-ai

💻 Chromebooks are getting a dedicated AI key 

  • Chromebooks are getting a new keyboard layout with a “quick access” key for AI and other functions, providing easy access to features like text generation, emojis, and searching Google Drive.
  • The first Chromebooks to feature this new key are the Samsung Galaxy Chromebook Plus, which will replace the Launcher Key with the new Quick Insert key.
  • Although the new AI features will initially lack AI image generation, Google plans to add this and other AI capabilities, including real-time translation and transcription, to Chromebooks in October.
  • Source: https://gizmodo.com/chromebooks-are-getting-a-dedicated-ai-key-but-you-wont-use-it-for-ai-2000505155

 Microsoft is discontinuing its HoloLens headsets 

  • Microsoft has ceased production of its HoloLens 2 headsets and has no confirmed plans for a successor, although updates addressing security and software issues are promised until the end of 2027.
  • Former HoloLens head, Alex Kipman, left the company in 2022 amid misconduct allegations, and the hardware team faced significant layoffs in January 2023, impacting the development of the augmented reality devices.
  • Microsoft has partnered with Anduril Industries to enhance its IVAS mixed-reality headsets for the US Army, which plans to invest up to $21.9 billion over the next decade in this project.
  • Source: https://www.theverge.com/2024/10/1/24259369/microsoft-hololens-2-discontinuation-support

🫠 Y Combinator faces backlash after funding an AI startup that admits it basically cloned another AI startup 

❌ California’s controversial AI safety bill vetoed

California Governor Gavin Newsom just vetoed S.B. 1047, a groundbreaking AI safety bill that would have imposed stricter regulations on Silicon Valley AI firms and the release of new models in the state.

  • The bill would have required safety testing for AI models before their public release and held AI companies liable for any ‘severe harm’ (over $500M in damages) caused.

  • Tech giants, including OpenAI and Google, VCs, and politicians like Nancy Pelosi lobbied heavily against the bill, arguing it would stifle innovation.

  • The bill had notable support from Elon Musk, Anthropic, the ‘Godfather of AI’ Geoffrey Hinton, and over 120 Hollywood actors, directors, and workers.

  • Newsom said the bill was ‘well-intentioned’ but flawed, vowing to consult with AI experts to craft guardrails for future legislation efforts.

As the U.S. federal government continues to lag in AI regulation, states are stepping up to fill the void. While S.B. 1047 is shelved for now, the debate over AI governance is far from settled—and will likely continue to pit AI safety advocates against those pushing for rapid development throughout Silicon Valley.

Source: https://www.politico.com/news/2024/09/29/gavin-veto-ai-safety-bill-00181583

💰 OpenAI secures SoftBank funding as Apple exits raise

Despite Apple reportedly no longer participating in OpenAI’s upcoming funding round, the AI giant has secured billions of dollars from Japanese investment giant Softbank, Microsoft, and Thrive Capital.

  • OpenAI is rumored to be raising up to $6.5B via convertible notes, at an eye-popping $150B valuation.

  • Microsoft plans to participate with an additional $1B, adding to its previous $13B investment in the AI giant.

  • Investment firm Thrive Capital is also investing $1B, with a reported option to add an additional $1B the following year based on revenue goals.

  • The Wall Street Journal reported that Apple is no longer involved in the funding round, despite partnerships with OpenAI and its inclusion in Apple Intelligence.

  • The raise comes amid OpenAI’s controversial restructuring to a for-profit entity, with Sam Altman denying rumors that he will receive equity in the move.

OpenAI’s latest raise and for-profit turn is another saga in its convoluted and controversial business structure. Despite the recent high-profile departures and continued drama, the ChatGPT maker is still clearly seen as a top horse to bet on in the AI boom—and there is no shortage of major players who want in.

Source: https://www.theinformation.com/articles/softbank-to-invest-500-million-in-openai

💧 Liquid AI unveils efficient new LFM models

Liquid AI just introduced a new series of AI models called Liquid Foundation Models (LFMs), challenging the traditional transformer architecture while achieving state-of-the-art performance and enhanced memory efficiency at smaller model sizes.

  • The company released its LFMs in 1.3B, 3B, and 40B parameter sizes, based on a new architecture utilizing computational units rooted in dynamical systems rather than traditional transformers.

  • The models surpass transformer-based counterparts like Meta’s Llama 3.2 and Microsoft’s Phi-3.5 on major benchmarks like MMLU.

  • LFMs require significantly less memory for inference, particularly with long-context tasks — supporting up to 32k tokens while maintaining memory efficiency.

  • The models are not open-source and are only currently available via the company’s Lambda (Chat UI and API) and on Perplexity AI.

Liquid AI’s LFMs are a significant shakeup from the transformer architecture standard that has dominated models since 2017. The benchmarks show that there is more than one formula for achieving state-of-the-art AI performance—and could open new possibilities for more efficient and accessible AI systems.

Source: https://www.liquid.ai/liquid-foundation-models

What Else is Happening in AI on October 01st 2024!

Google agreed to invest $1B into Thailand to expand AI and cloud infrastructure in Southeast Asia, aiming to build new data centers amid increasing regional competition.

Source: https://www.cnbc.com/2024/09/30/google-to-invest-1-billion-in-thailand-data-center-and-ai-push.html

TikTok parent company ByteDance is reportedly planning to develop a new AI model primarily using Huawei chips, diversifying from U.S. suppliers like Nvidia to counteract export restrictions.

Source: https://www.reuters.com/technology/artificial-intelligence/bytedance-plans-new-ai-model-trained-with-huawei-chips-sources-say-2024-09-30

Artisan AI secured $7.3M in seed funding for its sales-focused AI virtual employees, with its first AI assistant Ava already assisting over 120 companies on the platform.

Source: https://www.artisan.co/blog/artisan-raises-7-3-seed-round

Luma Labs upgraded its Dream Machine AI video model speed, allowing for full-quality generations in under 20 seconds.

Source: https://x.com/LumaLabsAI/status/1840820602296320083

Qodo announced a $40M funding round for its AI-powered code testing software, with plans to expand services and target larger enterprise clients.

Source: https://www.bloomberg.com/news/articles/2024-09-30/ai-code-checker-qodo-raises-40-million-to-serve-bigger-clients

AI reading coach startup Ello launched ‘Storytime’, a new feature allowing kids to create personalized stories using AI.

Source: https://techcrunch.com/2024/09/30/ai-reading-coach-startup-ello-launches-custom-story-creation-feature-for-kids

Trending AI Tools on October 01st 2024

🎤 Udio Lyric Editor – Create and refine song lyrics based on melody: https://www.udio.com/

📷 Expression Editor – Easily edit facial expressions: https://huggingface.co/spaces/fffiloni/expression-editor

🚀 PandaETL – Automate document processes with AI and data: https://panda-etl.ai/

🤖 Gaia – Train and deploy neural machine translation models: https://gaia-ml.com/

🔍 Lumona – AI search engine leveraging social media insights: https://www.lumona.ai/

Read Aloud For Me: AI Dashboard – AI Tools Recommender – Safe AI

Welcome to Read Aloud For Me, the pioneering AI dashboard designed for the whole family! Our platform is the first of its kind, uniquely crafted to cater not only to adults but also to kids.

iOs: https://apps.apple.com/ca/app/read-aloud-for-me-ai-dashboard/id1598647453

Web/Android/PWA: https://readaloudforme.com

AI Innovations in September 2024

Ace the 2023 AWS Solutions Architect Associate SAA-C03 Exam with Confidence Pass the 2023 AWS Certified Machine Learning Specialty MLS-C01 Exam with Flying Colors

List of Freely available programming books - What is the single most influential book every Programmers should read



#BlackOwned #BlackEntrepreneurs #BlackBuniness #AWSCertified #AWSCloudPractitioner #AWSCertification #AWSCLFC02 #CloudComputing #AWSStudyGuide #AWSTraining #AWSCareer #AWSExamPrep #AWSCommunity #AWSEducation #AWSBasics #AWSCertified #AWSMachineLearning #AWSCertification #AWSSpecialty #MachineLearning #AWSStudyGuide #CloudComputing #DataScience #AWSCertified #AWSSolutionsArchitect #AWSArchitectAssociate #AWSCertification #AWSStudyGuide #CloudComputing #AWSArchitecture #AWSTraining #AWSCareer #AWSExamPrep #AWSCommunity #AWSEducation #AzureFundamentals #AZ900 #MicrosoftAzure #ITCertification #CertificationPrep #StudyMaterials #TechLearning #MicrosoftCertified #AzureCertification #TechBooks

Top 1000 Canada Quiz and trivia: CANADA CITIZENSHIP TEST- HISTORY - GEOGRAPHY - GOVERNMENT- CULTURE - PEOPLE - LANGUAGES - TRAVEL - WILDLIFE - HOCKEY - TOURISM - SCENERIES - ARTS - DATA VISUALIZATION
zCanadian Quiz and Trivia, Canadian History, Citizenship Test, Geography, Wildlife, Secenries, Banff, Tourism

Top 1000 Africa Quiz and trivia: HISTORY - GEOGRAPHY - WILDLIFE - CULTURE - PEOPLE - LANGUAGES - TRAVEL - TOURISM - SCENERIES - ARTS - DATA VISUALIZATION
Africa Quiz, Africa Trivia, Quiz, African History, Geography, Wildlife, Culture

Exploring the Pros and Cons of Visiting All Provinces and Territories in Canada.
Exploring the Pros and Cons of Visiting All Provinces and Territories in Canada

Exploring the Advantages and Disadvantages of Visiting All 50 States in the USA
Exploring the Advantages and Disadvantages of Visiting All 50 States in the USA


Today I Learned (TIL) You learn something new every day; what did you learn today? Submit interesting and specific facts about something that you just found out here.

Reddit Science This community is a place to share and discuss new scientific research. Read about the latest advances in astronomy, biology, medicine, physics, social science, and more. Find and submit new publications and popular science coverage of current research.

Reddit Sports Sports News and Highlights from the NFL, NBA, NHL, MLB, MLS, and leagues around the world.

Turn your dream into reality with Google Workspace: It’s free for the first 14 days.
Get 20% off Google Google Workspace (Google Meet) Standard Plan with  the following codes:
Get 20% off Google Google Workspace (Google Meet) Standard Plan with  the following codes: 96DRHDRA9J7GTN6 96DRHDRA9J7GTN6
63F733CLLY7R7MM
63F7D7CPD9XXUVT
63FLKQHWV3AEEE6
63JGLWWK36CP7WM
63KKR9EULQRR7VE
63KNY4N7VHCUA9R
63LDXXFYU6VXDG9
63MGNRCKXURAYWC
63NGNDVVXJP4N99
63P4G3ELRPADKQU
With Google Workspace, Get custom email @yourcompany, Work from anywhere; Easily scale up or down
Google gives you the tools you need to run your business like a pro. Set up custom email, share files securely online, video chat from any device, and more.
Google Workspace provides a platform, a common ground, for all our internal teams and operations to collaboratively support our primary business goal, which is to deliver quality information to our readers quickly.
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE
C37HCAQRVR7JTFK
C3AE76E7WATCTL9
C3C3RGUF9VW6LXE
C3D9LD4L736CALC
C3EQXV674DQ6PXP
C3G9M3JEHXM3XC7
C3GGR3H4TRHUD7L
C3LVUVC3LHKUEQK
C3PVGM4CHHPMWLE
C3QHQ763LWGTW4C
Even if you’re small, you want people to see you as a professional business. If you’re still growing, you need the building blocks to get you where you want to be. I’ve learned so much about business through Google Workspace—I can’t imagine working without it.
(Email us for more codes)