Elevate Your Career with AI & Machine Learning For Dummies PRO
Ready to accelerate your career in the fast-growing fields of AI and machine learning? Our app offers user-friendly tutorials and interactive exercises designed to boost your skills and make you stand out to employers. Whether you're aiming for a promotion or searching for a better job, AI & Machine Learning For Dummies PRO is your gateway to success. Start mastering the technologies shaping the future—download now and take the next step in your professional journey!
Download the AI & Machine Learning For Dummies PRO App:
iOS - Android
Our AI and Machine Learning For Dummies PRO App can help you Ace the following AI and Machine Learning certifications:
- AWS Certified AI Practitioner (AIF-C01): Conquer the AWS Certified AI Practitioner exam with our AI and Machine Learning For Dummies test prep. Master fundamental AI concepts, AWS AI services, and ethical considerations.
- Azure AI Fundamentals: Ace the Azure AI Fundamentals exam with our comprehensive test prep. Learn the basics of AI, Azure AI services, and their applications.
- Google Cloud Professional Machine Learning Engineer: Nail the Google Professional Machine Learning Engineer exam with our expert-designed test prep. Deepen your understanding of ML algorithms, models, and deployment strategies.
- AWS Certified Machine Learning Specialty: Dominate the AWS Certified Machine Learning Specialty exam with our targeted test prep. Master advanced ML techniques, AWS ML services, and practical applications.
- AWS Certified Data Engineer Associate (DEA-C01): Set yourself up for promotion, get a better job or Increase your salary by Acing the AWS DEA-C01 Certification.
AI Innovations in October 2024.
In October 2024, the landscape of artificial intelligence continues to evolve at an unprecedented pace, with groundbreaking innovations and developments emerging daily. The “Daily AI Chronicle” aims to capture the essence of these advancements, providing a comprehensive summary of the latest news and trends in AI technology throughout the month. As we navigate through a month filled with transformative AI breakthroughs, our ongoing updates will highlight significant milestones—from the launch of cutting-edge AI models to the integration of AI in various sectors such as healthcare, finance, and creative industries. With each passing day, AI is reshaping how we interact with technology, enhancing productivity, and redefining our understanding of intelligence itself. Join us as we explore the exciting world of AI innovations, keeping you informed and engaged with the rapid changes set to influence our future. Whether you’re a tech enthusiast, a professional in the field, or simply curious about the implications of AI, this blog will serve as your go-to resource for staying updated on the latest developments throughout October 2024.
AI- Powered Jobs Interview Warmup
A Daily Chronicle of AI Innovations on October 30th 2024
25% of Google’s new code is AI-generated
- More than 25% of new code at Google is created by artificial intelligence and then validated by engineers, according to CEO Sundar Pichai.
- This AI-driven approach is boosting efficiency, enabling faster innovation, and contributing significantly to Google’s robust financial performance.
- Google achieved a revenue of $88.3 billion for the quarter, with significant growth seen in Google Services and Google Cloud, highlighting AI’s impact on profitability.
Source: https://www.theverge.com/2024/10/29/24282757/google-new-code-generated-ai-q3-2024
GitHub’s new tool helps you build apps using plain English
- GitHub Spark, announced at the GitHub Universe conference, lets users build web apps by describing them in natural language, moving beyond the need for traditional coding.
- This experimental feature from GitHub Next labs provides a chat-like interface for users to create and refine app prototypes, while experienced developers can optionally access and modify the underlying code.
- Spark supports advanced customization by allowing users to choose between different AI models, share their projects with specific permissions, and further develop shared code independently.
Source: https://techcrunch.com/2024/10/29/github-spark-lets-you-build-web-apps-in-plain-english
OpenAI is creating its own AI chip with Broadcom and TSMC
OpenAI has reportedly assembled a team of about 20 engineers, including former Google TPU designers, to develop an AI chip targeted for 2026.
After initially exploring options to build its own chip factories, OpenAI is instead opting to partner with Broadcom for design and TSMC for manufacturing.
The company also plans to add AMD’s new MI300X processors to its training infrastructure, reducing reliance on Nvidia’s GPUs.
The moves come as OpenAI faces mounting compute costs, with reports suggesting the company could lose $5B this year despite $3.7B in revenue.
💪 Reddit is profitable for the first time ever, with nearly 100 million daily users.
Source: https://www.theverge.com/2024/10/29/24283056/reddit-earnings-user-growth-revenue-up
🧠 MIT’s new cancer treatment is more effective than traditional chemotherapy.
Researchers at the Massachusetts Institute of Technology (MIT) have developed a game-changing dual-action cancer treatment.The innovative approach involves implanting microparticles directly into tumors, providing both phototherapy and chemotherapy.The team believes that the method could potentially reduce the side effects usually associated with intravenous chemotherapy, and improve the patient’s lifespan more than separate treatments would.
Source: https://www.newsbytesapp.com/news/science/mit-develops-dual-action-cancer-therapy-using-implantable-microparticles/story
GitHub and Microsoft open Copilot to rival AI models
The platform will allow developers to switch between assistants, including Claude and Gemini, although OpenAI’s models remain the default choice.
GitHub also introduced Spark, a new feature that allows users to build applications with natural language prompts.
The platform announced features including multi-file editing, Copilot code reviews, new agentic updates to Workspaces, and Apple Xcode support.
GitHub’s decision to embrace multiple AI providers comes as its Copilot service reaches a major milestone with over a million paying subscribers.
Source: https://github.blog/news-insights/product-news/bringing-developer-choice-to-copilot
OpenAI plans first custom AI chip
OpenAI has reportedly assembled a team of about 20 engineers, including former Google TPU designers, to develop an AI chip targeted for 2026.
After initially exploring options to build its own chip factories, OpenAI is instead opting to partner with Broadcom for design and TSMC for manufacturing.
The company also plans to add AMD’s new MI300X processors to its training infrastructure, reducing reliance on Nvidia’s GPUs.
The moves come as OpenAI faces mounting compute costs, with reports suggesting the company could lose $5B this year despite $3.7B in revenue.
Source:
New AI model predicts early drug development
The multimodal AI system combines extensive laboratory data with limited clinical information to predict a drug’s potential success early.
Enchant sets new accuracy marks for predicting human drug interactions, achieving a 74% correlation compared to the previous 58% SOTA score.
The technology can begin making reliable predictions after studying five drug molecules, requiring minimal human trial data to generate insights.
Enchant processes multiple types of research data simultaneously, helping bridge the gap between laboratory findings and clinical outcomes.
Source:
🇺🇸 Thomas Friedman endorses Kamala because he says “AGI is likely in the next 4 years” so we must ensure “superintelligent machines will remained aligned with human values as they use these powers to go off in their own directions.”
😵 Linus Torvalds reckons AI is ‘90% marketing and 10% reality’ | Tom’s Hardware.
Source: https://www.tomshardware.com/tech-industry/artificial-intelligence/linus-torvalds-reckons-ai-is-90-percent-marketing-and-10-percent-reality
Set yourself up for promotion or get a better job by Acing the AWS Certified Data Engineer Associate Exam (DEA-C01) with the eBook or App below (Data and AI)
Download the Ace AWS DEA-C01 Exam App:
iOS - Android
AI Dashboard is available on the Web, Apple, Google, and Microsoft, PRO version
What Else is Happening in AI on October 30th 2024!
LinkedIn launches its first AI agent to take on the role of job recruiters.
Elon Musk predicted at the Future Investment Initiative conference that by 2040, there will be at least 10B humanoid robots priced between $20 and $25K. |
Amazon expanded the company’s Rufus AI shopping assistant in beta to European markets, offering personalized product recommendations and comparison capabilities through conversational interactions in the mobile app. |
OpenAI launched new search capabilities for ChatGPT history, allowing users to easily reference, navigate, or revisit old conversations. |
Elon Musk’s xAI is reportedly seeking a new funding round that would value the AI startup at $40B, a significant jump from its $24B valuation following a raise in May. |
Google CEO Sundar Pichai revealed that the company’s multimodal, agentic smartphone app Project Astra, which was demoed at Google I/O, is expected to be available ‘as early as 2025.’ |
Actor Robert Downey Jr. criticized the use of AI digital replicas in Hollywood, saying he ‘intends to sue all future executives that recreate his likeness,’ even after his death. |
A Daily Chronicle of AI Innovations on October 29th 2024
Listen to this podcast at https://podcasts.apple.com/ca/podcast/ai-daily-chronicle-apple-unveils-first-wave-of-apple/id1684415169?i=1000674949261
🍎 Apple unveils first wave of Apple Intelligence features
The initial release brings systemwide writing tools for rewriting, proofreading, and summarizing text, as well as enhanced photo search capabilities.
A redesigned Siri features new typing support, better context understanding, and upgraded product knowledge to answer questions about Apple devices.
Only newer devices with the M1 / A17 Pro chips or later can access the AI features, with some users also facing a waitlist system after opting in.
The next update, expected in December, will include more advanced features like ChatGPT integration, Image Playground, and Genmoji.
🤖 Open-source AI must disclose data used for training, says OSI:
🔎 Meta builds AI Google Search rival
Meta is developing proprietary web crawling tech to power its AI’s real-time knowledge of current events and web info without relying on competitors.
Internal teams have reportedly been quietly building the search infrastructure since early 2024.
Meta also recently partnered with Reuters for news content, suggesting a broader strategy to control its AI information sources.
The development comes as Meta AI reaches 185M weekly active users across Facebook, Instagram, and WhatsApp.
📈 Medium faces surge in AI-generated content
Medium has experienced difficulties with AI-generated content, with an analysis estimating over 47% of posts as AI-generated, marking a significantly greater prevalence than the wider internet.
Specific topics like “NFTs,” “web3,” and “ethereum” showed high percentages of AI-driven content, with one tag reaching around 78%, reflecting a substantial infiltration of automated writing in these areas.
Two separate AI detection companies found similar high rates of AI-written content, yet Medium’s CEO, Tony Stubblebine, downplays concerns about the presence and significance of such content on the platform.
🎶 UMG, Klay Vision partner on ‘ethical’ AI music model:
The partnership aims to create AI music models that ‘lessen the threat to human creators’ and open ‘new avenues for creativity and future monetization.’
Klay Vision is actively working on a Large Music Model called KLayMM for commercial use that respects copyright and artist likeness rights.
Klay Vision is led by former Sony Music and Google DeepMind execs, with the partnership following past AI deals with YouTube’s AI Incubator and SoundLabs.
The deal comes as UMG continues legal action against AI companies like Anthropic, Suno, and Udio for alleged unauthorized use of copyrighted material.
. 📈 OpenAI CFO: 75% of revenue from ChatGPT subscriptions:
The Open Source Initiative (OSI) has defined “open” AI as systems that provide complete access to training data, source code, and training settings, posing challenges for tech companies like Meta.
Meta’s model Llama does not meet OSI’s standards as it restricts commercial use and does not offer training data, leading to disagreements with OSI’s new open AI definition.
This definition aims to prevent “open washing” by companies and has sparked discussions on AI openness, with industry leaders like Hugging Face supporting the emphasis on transparency in training data.
👀 Hollywood union SAG-AFTRA signs deal for voice AI models:
Hollywood union SAG-AFTRA signed a deal with AI company Ethovox to build a foundational voice model for digital replicas, ensuring performer compensation through session fees and revenue sharing.
💻 xAI’s Grok chatbot gains vision capabilities
xAI’s Grok chatbot gained new vision capabilities, with Elon Musk sharing an example of the AI model breaking down a joke after being given a meme as input.
🔍 Meta is developing its own AI search engine
🤖 Google is working on an AI agent that takes over your browser
New article says AI teachers are better than human teachers. Quote: “Students who were given access to an AI tutor learned more than twice as much in less time compared to those who had in-class instruction.”
From this article dated 10-29-2024: AI tutors are reshaping higher education
💪 AI and Machine Learning For Dummies Pro
Djamgatech has launched a new educational app on the Apple App Store, aimed at simplifying AI and machine learning for beginners.
It is a mobile App that can help anyone Master AI & Machine Learning on the phone!
Download “AI and Machine Learning For Dummies PRO” FROM APPLE APP STORE and conquer any skill level with interactive quizzes, certification exams, & animated concept maps in:
Artificial Intelligence
Machine Learning
Deep Learning
Generative AI
LLMs
NLP
xAI
Data Science
AI and ML Optimization
AI Ethics & Bias ⚖️
& more! ➡️ App Store Link
A Daily Chronicle of AI Innovations on October 28th 2024
Listen at: https://podcasts.apple.com/us/podcast/ai-unraveled-latest-ai-news-trends-gpt-chatgpt-gemini/id1684415169
🔍 Meta is developing its own AI search engine:
Meta is creating its own web-crawling search engine to enhance the information provided by its AI chatbot, as reported by The Information.
This move aims to lessen Meta’s reliance on Google and Microsoft’s Bing, which currently supply data about news, sports, and stocks for Meta AI users.
Following the announcement, shares of Google owner Alphabet Inc. declined by 0.8%, while Meta’s shares experienced a slight increase of 0.3%.
🤖 Google is working on an AI agent that takes over your browser
Google is working on Project Jarvis, an AI agent that can browse the web for users, acting as an automated personal assistant with its capabilities integrated into Google Chrome.
According to a report by The Information, this AI could be introduced alongside Google’s next flagship Gemini language model, possibly being previewed to a small group of testers by December.
Similar to Anthropic’s Claude AI improvements, Jarvis AI responds to user commands by interacting with computer screens through tasks like clicking buttons or typing, though currently operates at a slower pace.
🎙️ Meta releases an ‘open’ version of Google’s podcast generator
Meta has introduced NotebookLlama, an open version of Google’s NotebookLM podcast generator, utilizing Meta’s Llama models for processing input texts into podcast-style content.
NotebookLlama transforms uploaded text files like PDF news articles into transcripts, adds dramatization, and uses open-source text-to-speech models, but struggles with a robotic audio output.
The quality of NotebookLlama’s output could improve with more advanced text-to-speech models, but AI-generated podcasts, including this one, still face issues with generating inaccurate information.
🤖Google’s ‘Jarvis’ browser assistant is coming
Jarvis will initially focus on consumer tasks like online shopping, research, and travel booking.
The agent is specifically optimized for web browsers (not full computer use) and reportedly currently operates with a few-second delay between actions.
The release is expected to coincide with Google’s launch of its next-gen Gemini AI model before the end of the year.
🧐 Altman calls ‘Orion’ frontier model rumors ‘fake news’
A report revealed that OpenAI would release its new ‘Orion’ frontier model by December, with Microsoft and other huge companies getting access before individuals.
Altman responded directly to the report on X, posting “fake news out of control” directly to The Verge.
An OpenAI spokesperson clarified that they have no plans for an “Orion” release this year but plan to release “a lot of other great technology.”
However, Altman previously tweeted a cryptic message about being ‘excited for the winter constellations to rise soon,’ fueling additional speculation.
💻 IBM’s most compact AI models target enterprises
Designed to give enterprises more ways to embed and scale AI in their businesses, these new 2B and 8B compact models are:
Trained with carefully curated data;
Cost-efficient;
Designed to run high-performance solutions.;
🏥 AI transcripts create dangerous errors
A Michigan researcher found fabricated text in 80% of examined transcriptions, while another reported hallucinations in ‘nearly every’ Whisper output.
Hallucinations ranged from non-existent medical treatments to racial commentary and violent content.
Over 30,000 medical professionals use Whisper-based tools despite OpenAI’s warnings against high-risk applications, according to the AP report.
Whisper was also the most popular open-source speech model according to Hugging Face, with over 4.2M downloads in the last month alone.
👀 Grok now has vision capability
🌍 US National Security Advisor on AI:
💪 Djamgatech release – AI and Machine Learning For Dummies Pro app:
It is a mobile App that can help anyone Master AI & Machine Learning on the phone!
Download “AI and Machine Learning For Dummies PRO” FROM APPLE APP STORE and conquer any skill level with interactive quizzes, certification exams, & animated concept maps in:
Artificial Intelligence
Machine Learning
Deep Learning
Generative AI
LLMs
NLP
xAI
Data Science
AI and ML Optimization
AI Ethics & Bias ⚖️
& more! ➡️ App Store Link
What Else is Happening in AI on October 28th 2024
The AI Bill of Rights with Section & the White House’s Dr. Alondra Nelson. How do we ensure a future of ethical AI development? RSVP free.*
Perplexity CEO Aravind Srinvas revealed in a post on X that the AI search platform now handles over 100M weekly queries.
Meta landed its first AI news deal, partnering with Reuters to provide real-time news responses through its AI chatbot across the company’s Facebook, Instagram, WhatsApp, and Messenger platforms.
Coinbase launched ‘Based Agent,’ a tool allowing users to create AI-powered crypto trading bots with on-chain capabilities in under three minutes using OpenAI and Replit integration.
Disney is reportedly preparing to unveil a major AI initiative focused on post-production and VFX workflows, which will mark the content giant’s first major embrace of the tech.
Meta also released NotebookLlama, an open-source version of Google’s NotebookLM that converts PDFs into podcasts using text-to-speech technology.
A Daily Chronicle of AI Innovations on October 25th 2024
OpenAI plans to release its next big AI model by December
Anthropic’s AI can now run and write code
Apple offers $1M bounty for hacking its private AI cloud
Google Photos will now label AI-edited images
Meta signs its first big AI deal for news
Midjourney launches new image editor
OpenAI disbands AGI Readiness team
Biden orders AI push with new security safeguards
OpenAI plans to release its next big AI model by December
- OpenAI plans to unveil its next significant AI model, Orion, by December, prioritizing initial access to partner companies instead of a broad release through ChatGPT.
- Internally viewed as the successor to GPT-4, Orion may be hosted on Azure by November, but its naming and release details remain uncertain and subject to change.
- This release coincides with OpenAI’s transition into a for-profit entity, highlighted by a $6.6 billion funding round and notable changes in its executive team.
- Source: https://www.theverge.com/2024/10/24/24278999/openai-plans-orion-ai-model-release-december
Anthropic’s AI can now run and write code
- Anthropic has introduced a JavaScript code sandbox to its Claude AI, allowing users to conduct complex data analysis within the chat interface.
- This new feature lets teams across various departments analyze data, including marketing teams gaining insights, sales teams evaluating metrics, and developers creating financial dashboards.
- The Claude 3.5 Sonnet model, which supports these capabilities, has enhanced programming performance, outperforming other models in benchmarks like SWE-Bench and TAU-Bench scores.
- Source: https://the-decoder.com/anthropics-claude-ai-can-now-crunch-numbers-and-visualize-data-with-built-in-code-editor/
Apple offers $1M bounty for hacking its private AI cloud
- Apple is encouraging security analysts to examine the Private Cloud Compute system that handles complex Apple Intelligence requests as part of its efforts to ensure system privacy.
- The tech giant’s bug bounty program now includes rewards up to $1,000,000 for detecting vulnerabilities in PCC, underpinning its commitment to handling data privacy seriously.
- Initial Apple Intelligence features are launching soon with iOS 18.1, while future enhancements like Genmoji and ChatGPT integration appeared in the iOS 18.2 developer beta.
- Source: https://www.theverge.com/2024/10/24/24278881/apple-intelligence-bug-bounty-security-researchers-private-cloud-compute
Google Photos will now label AI-edited images
- Google Photos is adding a new disclosure for images edited with its AI features, like Magic Editor, visible in the “Details” section of the app starting next week.
- Despite Google’s aim for transparency, the AI-edited photos will not have visual watermarks, making it difficult to immediately recognize them as altered unless users check the metadata.
- These changes follow criticism Google faced for incorporating AI editing tools without overt visual indicators, and similar metadata tagging will be used for non-AI features like Best Take.
- Source: https://techcrunch.com/2024/10/24/google-adds-new-disclosures-for-ai-photos-but-its-still-not-obvious-at-first-glance/
Meta signs its first big AI deal for news
- Meta has signed a multi-year agreement with Reuters to incorporate Reuters reporting into its AI chatbot for responding to news-related questions, marking a first for the company in licensing news content.
- The use of Reuters content in the AI chatbot, which is available on Facebook, Instagram, WhatsApp, and Messenger, will include summaries and links to Reuters articles, with US users seeing links starting Friday.
- This development follows a trend of news organizations partnering with AI firms, though Meta simultaneously challenges laws requiring payment to news publishers for their content on social media platforms.
- Source: https://www.theverge.com/2024/10/25/24279259/meta-reuters-ai-chatbot-deal-news-licensing-media
What Else is happening in AI on October 25th 2024!
AI chipmaker TSMC’S Phoenix plant reported superior chip yields compared to its Taiwan operations, boosting confidence in America’s domestic semiconductor strategy.
Anthropic unveiled Claude’s new built-in analysis tool, enabling its models to write and execute code directly in chat interactions.
Apple launched a $1M bug bounty ahead of its major AI cloud release next week, offering rewards to security researchers who can successfully hack and find vulnerabilities in its private AI infrastructure.
ElevenLabs added ‘Voice Design,’ a new feature enabling users to create AI-generated voices from natural text prompts.
OpenAI scientist Noam Brown revealed at TED AI that giving AI models 20 seconds to “think” can match the performance boost of scaling up training data 100,000x.
Chinese robotics startup EngineAI just introduced SE01, a life-size humanoid robot that has a much more human-like gait to its walk.
Redditors Are Trying to Poison Google’s AI to Keep Tourists Out of the Good Restaurants. Source: https://gizmodo.com/redditors-are-trying-to-poison-googles-ai-to-keep-tourists-out-of-the-good-restaurants-2000516156
Google’s DeepMind is building an AI to keep us from hating each other. Source: https://arstechnica.com/ai/2024/10/googles-deepmind-is-building-an-ai-to-keep-us-from-hating-each-other/
A Daily Chronicle of AI Innovations on October 23rd 2024
Anthropic’s new AI can use computers like a human
Elon Musk’s xAI launches API for Grok
Reddit CEO says the platform is in an ‘arms race’ for AI training
Major publishers sue Perplexity AI for scraping without paying
Meta is testing facial recognition to fight celebrity scams
Lab-grown human brain cells drive virtual butterfly in simulation
Anthropic’s AI now navigates computers like a human
Anthropic just introduced a new capability called ‘computer use’, alongside upgraded versions of its AI models, which enables Claude to interact with computers by viewing screens, typing, moving cursors, and executing commands.
Claude can now autonomously navigate computer interfaces, performing complex tasks across multiple applications and websites.
Anthropic said it taught the model ‘general computer skills’ instead of creating a standalone tool, helping it operate more like a human.
The upgraded Sonnet 3.5 significantly improves coding and tool use, outperforming other models (including o1-preview) on key benchmarks.
A new Haiku 3.5 model matches the capabilities of previous high-end models at lower cost and higher speed.
Anthropic highlighted that computer use is still imperfect (including some hilarious examples), encouraging testing on low-risk tasks until skills improve.
While many hoped for Opus 3.5, Anthropic’s Sonnet and Haiku upgrades pack a serious punch. Plus, with the new computer use embedded right into its foundation models, Anthropic just sent a warning shot to tons of automation startups—even if the capabilities aren’t earth-shattering… yet.
Source: https://techcrunch.com/2024/10/22/anthropics-new-ai-can-control-your-pc/
Elon Musk’s xAI launches API for Grok
- Elon Musk’s AI venture, xAI, has launched an API featuring its flagship generative AI model, Grok, but currently, it only includes the basic “grok-beta” version for use.
- The pricing for xAI’s API is set at $5 per million input tokens and $15 per million output tokens, with each token representing a small data segment like a syllable.
- xAI is racing to compete with AI giants such as OpenAI, utilizing X’s data for training and aiming to integrate Musk’s different companies’ data to enhance technological advancements.
- Source: https://techcrunch.com/2024/10/21/xai-elon-musks-ai-startup-launches-an-api/
Genmo drops open-source AI video model
AI startup Genmo just launched Mochi 1, a new open-source video generation model that claims to rival closed competitors like Runway, Pika, and Kling — while being freely available to developers and researchers.
Mochi is built on a new 10B parameter architecture called AsymmDiT, making it the largest open-source video generation model ever released.
The model focuses heavily on motion quality and prompt adherence, generating 480p videos at 30fps for up to 5.4 seconds.
Mochi surpassed top models like Kling, Runway Gen-3, Luma’s Dream Machine, and Pika in motion quality and prompt adherence during testing.
A higher-definition version, Mochi 1 HD, with 720p support and image-to-video capabilities, is planned for release later this year.
Genmo also announced that it secured $28.4M in Series A funding, with Mochi-1 being the company’s first step toward building ‘world simulators.’
Open-source AI video is officially competing with the top of the market. Genmo’s Mochi is an extremely impressive release that showcases how competitive the video generation landscape is about to become — especially with the major dominos (Sora, Midjourney?) still to come.
Source: https://www.genmo.ai/blog
Reddit CEO says the platform is in an ‘arms race’ for AI training
- Reddit CEO Steve Huffman stated that the platform is a vital player in the AI “arms race,” emphasizing its role in providing high-value training data for artificial intelligence development.
- The platform’s extensive user-generated content has become crucial in shaping AI models, leading Reddit to explore its strategic position within the artificial intelligence sector.
- In response to large corporations utilizing Reddit data without proper agreements, Huffman revealed ongoing efforts to secure deals and safeguard the platform’s valuable information against exploitation.
- Source: https://www.businessinsider.com/reddit-ceo-platform-arms-race-ai-training-steve-huffman-2024-10
Major publishers sue Perplexity AI for scraping without paying
- Major publishers Dow Jones & Co and NYP Holdings have filed a lawsuit against AI search engine startup Perplexity for copying their content without compensation, alleging copyright infringement and trademark violations.
- News Corporation, representing The Wall Street Journal and New York Post, accuses Perplexity of presenting the scraped material as a substitute for original sources, consequently harming the brands and sometimes providing inaccurate information.
- News Corp seeks $150,000 for each infringement instance, a sum that could financially devastate Perplexity, highlighting the importance of protecting intellectual property while also showing a willingness to license content for appropriate fees, as demonstrated by their agreement with OpenAI.
- Source: https://www.theregister.com/2024/10/22/publishers_sue_perplexity_ai/
Meta is testing facial recognition to fight celebrity scams
- Meta is testing facial recognition technology to combat ‘celeb-bait’ scam ads by comparing ad images against celebrities’ profile pictures on Facebook and Instagram.
- Facial recognition is also being explored as a faster method for users to regain account access through video selfies, providing an alternative to traditional ID verification methods.
- While the tests show promising results, they are not yet being conducted in the U.K. or the EU, due to stringent data protection regulations in these regions.
- Source: https://techcrunch.com/2024/10/21/meta-tests-facial-recognition-for-spotting-celeb-bait-ads-scams-and-easier-account-recovery/
Lab-grown human brain cells drive virtual butterfly in simulation
- Researchers at FinalSpark have created a 3D simulation where a virtual butterfly is guided by lab-grown human brain cells, marking a significant advancement in biocomputing and cognitive technologies.
- The brain organoids, which are miniature brains grown from stem cells, respond to human input in a virtual setting, allowing the butterfly model to move in response to stimuli through a Python software framework.
- These biological neural networks promise advantages like lower energy consumption and advanced cognitive functions, though they currently require traditional computing infrastructure support, with potential ethical questions regarding consciousness and usage implications.
- Source: https://www.theregister.com/2024/10/22/human_brain_tissue_butterfly_simulation/
Can A.I. Be Blamed for a Teen’s Suicide?
The mother of a 14-year-old Florida boy says he became obsessed with a chatbot on Character.AI before his death.
Source: https://www.nytimes.com/2024/10/23/technology/characterai-lawsuit-teen-suicide.html
NVIDIA’s Multi-Agent AI Breakthrough Transforms Sound-to-Text Technology
NVIDIA’s innovative multi-agent AI system improves sound-to-text technology and improves performance in the DCASE 2024 AAC Challenge with GPU-accelerated processing and multi-encoder fusion.
Source: https://theaiwired.com/nvidias-multi-agent-ai-breakthrough-transforms-sound-to-text-technology/
Meta AI (FAIR): Introducing the Dualformer. Controllable Fast & Slow Thinking by Integrating System-1 And System-2 Thinking Into AI Reasoning Models
Notebook lm version:
https://notebooklm.google.com/notebook/17738361-48f9-48aa-a8e4-5545027519f6/audio
OpenAI, under pressure from Anthropic, is developing new products to automate complex software programming tasks.
What is Predictive Analytics?
Predictive analytics uses data, statistical algorithms, and machine learning techniques to identify the likelihood of future outcomes based on historical data. Unlike traditional analytics, which focus on what has happened, predictive analytics provides actionable insights into what will likely occur. It can mean anything from predicting customer behavior to anticipating business market trends.
How AI-Powered Predictive Analytics Drives Business Growth
Read: https://stellarmind.ai/blog/business-growth-with-ai-powered-predictive-analytics
Ideogram debuts AI Canvas workspace
Ideogram just unveiled a new AI-powered workspace called Canvas, introducing advanced tools like Magic Fill and Extend to combine image editing and generation for new creative workflows.
Canvas provides an endless digital board on which users can generate, organize, and seamlessly blend AI-generated and uploaded images.
Magic Fill allows precise editing of selected image areas, enabling tasks like object replacement, text addition, and background alteration.
The Extend feature expands images beyond their original dimensions while maintaining style consistency, even with text.
Ideogram also features an API, allowing developers to incorporate the new features into their own applications
The design industry is no stranger to AI tools (Photoshop, Canva) — but Ideogram’s latest release feels like the exact type of fastball that AI and design novices can really make magic with. The examples shown also illuminate how drastically creative workflows are changing in the AI era.
Source: https://docs.ideogram.ai/using-ideogram/ideogram-features/canvas
What Else is Happening in AI on October 23rd 2024!
Runway debuted Act-One, a new feature that generates expressive character performances from a single video and image without motion capture or rigging.
Stability AI released Stable Diffusion 3.5, featuring Large and Large-Turbo models that improve customization, efficiency, and diversity of outputs.
Cohere enhanced its Embed 3 model with multimodal capabilities, enabling enterprises to perform RAG-style searches across text and image content.
Chipotle launched a new conversational AI hiring platform called ‘Ava Cado,’ which the restaurant says can accelerate the hiring process by up to 75%.
Asana introduced AI Studio, a no-code platform for teams to design and deploy AI agents to automate business workflows.
Canva unveiled Dream Lab, a new image generator powered by Leonardo AI — alongside a series of new AI features added to the platform’s Visual Suite.
Inflection AI launched Agentic Workflows, enabling its enterprise systems to take trusted actions for various business use cases.
Latest AI Tools:
AI and Machine Learning For Dummies: Your Comprehensive ML & AI Learning Hub
Discover the ultimate resource for mastering Machine Learning and Artificial Intelligence with the “AI and Machine Learning For Dummies” app.
iOs (FREE with Ads): https://apps.apple.com/ca/app/machine-learning-for-dummies/id1611593573
PRO Version (No ADS, See All Answers, AI Simulators): https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211
What you can do with this App:
🚀 Learn AI interactively! Tweak models, code exercises, visualize concepts, & tackle projects. Perfect for beginners to master AI/ML easily.
🎓 AI & ML made easy! Hands-on coding, visual tools, and real-world examples. Engage with fun, interactive learning & community support.
🤖 Master AI step-by-step! Practice coding, explore simulations, & see real-time changes. Fun, interactive tools simplify complex AI concepts.
🌟 AI learning simplified! Interactive models, coding challenges, flashcards & real-world projects. Visualize & build your own AI models.
💡 Explore AI with real-time simulations! Watch neural networks in action & learn by tweaking parameters. Coding & visual tools make it easy.
📚 Learn AI the hands-on way! Code exercises, visual tools, & interactive simulations. Fun, engaging, and perfect for all skill levels.
🏆 Interactive AI education! Tackle coding, visual tools, real-world projects, & fun challenges. Earn badges & climb the leaderboard.
🔍 See AI in action! Tweak parameters & watch real-time effects. Coding & visual tools make learning neural networks & ML concepts easy.
🧠 Your AI guide! Visualize, code, & build models with interactive tools. Learn at your pace & join a supportive community.
🎓 Hands-on AI learning! Practice coding, see concepts visually, and learn through real-world projects. Fun, engaging, and easy to follow.
A Daily Chronicle of AI Innovations on October 21st 2024
TikTok owner fires intern for AI sabotage
AI reaches expert level in medical scans
Microsoft unveils new autonomous AI agents that can handle queries.
Anthropic unveils new evaluations for AI sabotage risks
Tim Cook defends Apple coming late to AI with four words
Meta releases new AI models for voice and emotions
🚀 Microsoft CEO Satya Nadella says computing power is now doubling every 6 months, as the Scaling Laws paradigm has taken over from Moore’s Law, and the new currency is tokens per dollar per watt.
🦾 OpenAI’s Noam Brown says the o1 model’s reasoning at math problems improves with more test-time compute and “there is no sign of this stopping”
AI reaches expert level in medical scans
Researchers at UCLA just developed SLIViT, a new AI model that can analyze complex 3D medical scans with expert-level accuracy in a fraction of the time required by human specialists.
SLIViT (SLice Integration by Vision Transformer) can efficiently analyze various 3D imaging types, including MRIs, CT scans, and ultrasounds.
The model matches clinical expert accuracy while reducing analysis time by a mind-blowing factor of 5,000.
Unlike other AI models, SLIViT requires only hundreds of training samples, making it more practical for real-world applications.
The framework leverages transfer learning, using prior knowledge from 2D medical data for efficient training with smaller 3D datasets.
With the growing demand for faster diagnostics, SLIViT’s ability to rapidly and accurately analyze imaging offers a potential game-changer for healthcare. The model’s ability to work with small datasets also makes it more accessible for providers with limited resources — potentially democratizing expert medical imaging.
Source: https://www.uclahealth.org/news/release/new-ai-model-efficiently-reaches-clinical-expert-level
Meta reveals new AI models, tools
Meta FAIR just introduced a collection of new research models and datasets, including an upgraded image segmentation tool, a cross-modal language model, solutions to accelerate LLM performance, and more.
Spirit LM is an open-source multimodal language model that integrates speech and text to generate more natural-sounding and expressive speech.
Meta’s SAM 2.1 update offers improved image and video segmentation on its popular predecessor, which saw over 700,000 downloads in 11 weeks.
Layer Skip provides an end-to-end solution for accelerating LLM generation times by nearly 2x without specialized hardware.
Other artifacts include SALSA for security testing, Meta Lingua for language model training, a synthetic data generation tool, and more.
Meta continues to push the AI bar forward with big releases across various areas. Given the company’s impressive open-source systems, it’s hard to envision a future where closed models and tools have a significant advantage — and the moat between the two seems to be shrinking with each release.
Source: https://ai.meta.com/blog/fair-news-segment-anything-2-1-meta-spirit-lm-layer-skip-salsa-lingua
IBM’s most compact AI models target enterprises
Meet IBM’s new third generation of Granite with new open, compact, and efficient 2B and 8B language models.
Designed to give enterprises more ways to embed and scale AI in their businesses, these new 2B and 8B compact models are:
Trained with carefully curated data;
Cost-efficient;
Designed to run high-performance solutions.;
Source: https://www.ibm.com/granite
Anthropic unveils new evaluations for AI sabotage risks
Anthropic just published a set of new evaluations aimed at detecting potential sabotage capabilities in advanced AI systems, focusing on risks that could arise if models attempt to subvert human oversight or decision-making.
Four new evaluations were developed: human decision sabotage, code sabotage, sandbagging (hiding capabilities), and undermining oversight.
The evaluations use mock scenarios to test models’ ability to manipulate and deceive humans, insert bugs into code, and undermine monitoring systems.
Tests were run on Claude 3 Opus and Claude 3.5 Sonnet models, which did not flag concerning results but showed the capability to sabotage.
Anthropic is open-sourcing the evaluations and said stronger anti-sabotage mitigation will be needed as AI continues to improve.
Anthropic’s research shows that AI isn’t very good at sabotaging humans… yet. But the capabilities are there in some capacity — and if the model acceleration continues like many think it will, it’s only a matter of time before these threats will be real and important to mitigate.
TikTok owner fires intern for AI sabotage
- ByteDance dismissed an intern for allegedly disrupting an AI project by “maliciously interfering” with the training of artificial intelligence models in August.
- The company stated the intern’s actions did not affect its official commercial products or AI technology, countering exaggerated rumors about significant disruptions circulating online.
- ByteDance informed the intern’s university and industry associations about the misconduct as rumors continued amidst broader scrutiny over generative AI safety and social media impacts.
- Source: https://www.theguardian.com/technology/2024/oct/21/tiktok-owner-bytedance-sacks-intern-for-allegedly-sabotaging-ai-project
Tim Cook defends Apple coming late to AI with four words
- Tim Cook acknowledges that Apple is not the first in AI development but emphasizes that the goal is to deliver the best AI experience for customers.
- The initial release of Apple Intelligence on October 28 is expected to be minimalistic compared to competitors like Google’s Gemini, with advanced features possibly available by 2025.
- Apple plans to incorporate ChatGPT into iPhones and select iPads, focusing on device security and user consent for utilizing AI capabilities like text summarization and priority notifications.
- Source: https://gizmodo.com/tim-cook-knows-apple-isnt-first-in-ai-but-says-its-about-being-the-best-2000514347
Apple’s AirPods Pro hearing health features are as good as they sound
- Apple’s AirPods Pro 2 are set to include new features like clinical-grade hearing aid capabilities, a hearing test, and enhanced hearing protection, with the release of iOS 18.1 potentially boosting hearing health awareness.
- The new hearing protection mode is a subtle yet impactful upgrade, but there are limitations in extreme noise environments, which might make traditional earplugs still necessary for certain users.
- While the hearing aid feature is impressive, it may not suit everyone due to its six-hour battery life and limitations for those with severe hearing loss, but it signals a promising shift in tech addressing real-world health needs.
- Source: https://www.theverge.com/24275178/apple-airpods-pro-hearing-aid-test-protection-preview
This new Linear-complexity Multiplication (L-Mul) algorithm can reduce energy costs by 95% for element-wise tensor multiplications and 80% for dot products in large language models, while maintaining or even improving precision compared to 8-bit floating point operations.
Link to paper: Addition is All You Need for Energy-efficient Language Models
Link to twitter thread with insights: Rohan Paul on X
from twitter thread:
Solution in this Paper:
Approximates floating-point multiplication using integer addition
Linear O(n) complexity vs O(m^2) for standard floating-point multiplication
Replaces tensor multiplications in attention mechanisms and linear transformations
Implements L-Mul-based attention mechanism in transformer models
Key Insights from this Paper :
L-Mul achieves higher precision than 8-bit float operations with less computation
Potential 95% energy reduction for element-wise tensor multiplications
80% energy reduction for dot products compared to 8-bit float operations
Can be integrated into existing models without additional training
Google AI – “Announcing CT Foundation, a new medical imaging embedding tool that accepts a computed tomography (CT) volume as input and returns a small, information-rich numerical embedding that can be used to rapidly train models.”
Source: https://research.google/blog/taking-medical-imaging-embeddings-3d/
Latest AI Tools:
Create mind maps with AI: a simple Next.js project that lets users generate and interact with mind maps for learning, using AI models from Ollama or OpenAI, with options to download as markdown.
Source: https://github.com/aotakeda/learn-thing
Artificial Intelligence and Machine Learning For Dummies: This App offers Interactive simulations and visual learning tools to make AI/ML accessible. Explore neural networks, gradient descent, more through hands-on experiments.
A Daily Chronicle of AI Innovations on October 18th 2024
Cracks appear in Microsoft and OpenAI partnership
Google’s AI podcast generator gets major updates
X updates privacy policy to allow third parties to train AI models
US Treasury uses AI to recover billions from fraud
Newton AI learns physics from scratch
NotebookLM launches business pilot
Worldcoin unveils next-gen eye scanner
Newton AI learns physics from scratch
Archetype AI just unveiled ‘Newton,’ a new foundational AI ‘Large Behavior Model’ that learns complex physics principles directly from raw sensor data, without any human guidance.
Newton ingests raw sensor measurements to build its understanding of physical phenomena without pre-programmed knowledge.
The model can accurately predict behaviors of systems it wasn’t explicitly trained on, like pendulum motion.
It outperformed specialized AI in tasks like forecasting citywide power consumption and discovering systems from data instead of training.
Archetype AI was founded by ex-Google researchers and has secured $13M in funding to date
Newton is a paradigm shift in AI’s interaction with the physical world. A single model could replace highly specialized systems by developing a generalized understanding rather than a narrow focus. The tech also opens the door to truly autonomous AI that can adapt to environments and tasks without human intervention.
NotebookLM launches business pilot
Google just pushed an update for its viral AI note-taking assistant NotebookLM, adding new features that let users guide AI-generated audio summaries and announcing the upcoming launch of a new business-focused version.
Users can now customize the AI podcast Audio Overviews feature by providing instructions to focus on specific topics or adjusting the expertise level.
A new Background Listening feature allows users to listen to Audio Interviews while multitasking within NotebookLM.
A pilot program for NotebookLM Business is coming, offering enhanced features for organizations like higher usage limits and team collaboration tools.
Audio Overviews, which turns docs, videos, and other content into podcasts between AI hosts, went viral earlier this month for its realistic audio outputs.
Google is dropping the ‘experimental’ tag on NotebookLM, and the viral feature built in just two months is suddenly being called a ‘ChatGPT’ moment for the company. It’s also an interesting case of users actually enjoying AI-generated content — a quality that is hard to find in most mainstream sentiment for the tech.
Source: https://venturebeat.com/ai/googles-notebooklm-will-expand-to-business-use-cases-soon/
Worldcoin unveils next-gen eye scanner
Worldcoin, the ‘proof of personhood’ startup founded by OpenAI CEO Sam Altman, just announced a rebrand to ‘World’, along with a new version of its iris-scanning ‘Orb’ technology and updated core platforms.
A new streamlined Orb promises 5x performance to its predecessor, alongside new countries, self-serve, and on-demand Orbs for easier onboarding.
The company introduced World ID 3.0 protocol, featuring new World ID Credentials, Deep Face to combat AI-generated deepfakes, and added privacy infrastructure.
An updated World App 3.0 allows for anonymous integration with third-party apps, and World is also launching the mainnet of its Worldchain blockchain.
The company has previously faced backlash and even bans from certain countries over privacy concerns.
Verifying human identity in the increasing flood of AI-generated content, agents, and systems is clearly going to be massively important — but given Worldcoin’s rocky launch and international struggles, the question is whether the company can overcome the early drama to actually achieve its goals.
Source: https://www.pcmag.com/news/sam-altman-worldcoin-launches-deep-face-new-eye-scanning-orb
What Else is Happening in AI on October 18th 2024!
The U.S. Treasury Dept. shared that it leveraged AI to recover $1B in check fraud and prevent $4B in overall fraud in the 2024 fiscal year, showcasing the tech’s growing role in combating financial crime.
OpenAI expanded its partnership with consulting firm Bain & Co. to develop and sell industry-specific AI tools to corporate clients, with OpenAI reporting 1M paying business customers.
Meta is partnering with Blumhouse and other select filmmakers to test its Movie Gen AI video generation tools, gathering feedback to refine the tech before its public release in 2025.
Researchers from Alibaba and Skywork showcased Meissonic, a small, open-source text-to-image model that can generate high-quality outputs that outperform larger models.
Salesforce CEO Marc Benioff criticized Microsoft’s AI initiatives for overhyping the sector in an interview with Fast Company, calling its Copilot assistant the ‘next Clippy.’
OpenAI released a preview of its ChatGPT Windows app for paid users, offering file and photo interactions, model improvements, and a companion window mode.
A Daily Chronicle of AI Innovations on October 17th 2024
OpenAI quietly pitches products to US military
Parents take school to court after student punished for using AI
Nvidia’s Nemotron outperforms leading AI models
Mistral AI unveils powerful new AI models for devices
Boston Dynamics, Toyota team up on AI humanoids
OpenAI quietly pitches products to US military
- OpenAI is exploring military and national security opportunities by partnering with government contractors and modifying its usage policies to allow for defense applications.
- The company hired Dane Stuckey as Chief Information Security Officer, who previously worked with Palantir, a firm known for its military projects, indicating a shift towards defense collaboration.
- Debate continues about the implications of using AI for military purposes, as OpenAI’s involvement in projects like those with the Department of Defense raises ethical concerns.
- Source: https://fortune.com/2024/10/17/openai-is-quietly-pitching-its-products-to-the-u-s-military-and-national-security-establishment/
Parents take school to court after student punished for using AI
- A Massachusetts school district was sued by a student’s parents after their child was disciplined for using an AI chatbot to finish an assignment, despite no clear rule against it.
- The lawsuit claims that the Hingham High School student handbook does not explicitly prohibit artificial intelligence use, which led to the improper punishment of the student, identified as RNH.
- The case was taken to the US District Court for the District of Massachusetts, focusing on alleged violations of the student’s civil rights and naming several school officials as defendants.
- Source: https://arstechnica.com/tech-policy/2024/10/student-was-punished-for-using-ai-then-his-parents-sued-teacher-and-administrators/
Nvidia’s Nemotron outperforms leading AI models
Nvidia quietly released a new open-sourced, fine-tuned LLM called Llama-3.1-Nemotron-70B-
Nemotron is based on Meta’s Llama 3.1 70B model, fine-tuned by NVIDIA using advanced ML methods like RLHF.
The model achieves top scores on alignment benchmarks like Arena Hard (85.0), AlpacaEval 2 LC (57.6), and GPT-4-Turbo MT-Bench (8.98).
The scores edge out competitors like GPT-4o and Claude 3.5 Sonnet across multiple metrics — despite being significantly smaller at just 70B parameters.
NVIDIA open-sourced the model, reward model, and training dataset on Hugging Face, which can also be tested in a preview on the company’s website.
Source: https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct
Mistral AI unveils powerful new AI models for devices
French AI startup Mistral AI just launched two new compact language models designed to bring powerful AI capabilities to edge devices like phones and laptops.
The new ‘Les Ministraux’ family includes Ministral 3B and Ministral 8B models, which have just 3B and 8B parameters, respectively.
Despite their small size, the models outperform competitors like Gemma and Llama on benchmarks, including Mistral’s 7B model from last year.
Minstral 8B uses a new ‘interleaved sliding-window attention’ mechanism to efficiently process long sequences.
The models are designed for on-device use cases like local translation, offline assistants, and autonomous robotics.
While we await the incoming rollout of Apple Intelligence as many users’ first on-device AI experience, smaller models that can run efficiently and locally on phones and computers continue to level up. Having a top-tier LLM in the palm of your hand is about to become a norm, not a luxury.
Source: https://mistral.ai/news/ministraux
Superstudio is your all-in-one creative AI platform
Boston Dynamics, Toyota team up on AI humanoids
Boston Dynamics and the Toyota Research Institute just announced a new partnership to accelerate development of advanced humanoids, with plans to integrate TRI’s Large Behavior Models (LBMs) into the Atlas electric robot.
Toyota’s LBMs aim to teach robots to handle multi-task, dexterous vision, and language-guided capabilities.
The partnership combines two robotics labs owned by competing automakers, Hyundai (who purchased Boston Dynamics in 2020) and Toyota.
TRI‘s ‘Diffusion Policy’ enables robots to learn 60+ complex skills from human demos without coding, a key component of the partnership’s research efforts.
Boston Dynamics retired its hydraulic Atlas robot in April and debuted the electric update, currently being tested in Hyundai’s automotive factories.
The race for commercial humanoids is heating up fast — and this partnership represents a major power move. But with the likes of Tesla’s Optimus, Figure’s 01 humanoids, and others in the mix, there is no shortage of rivals rushing to capture the massive potential of the emerging general-purpose robots.
What Else is Happening in AI on October 17th 2024!
ChatGPT’s web traffic reached a record 3.1B visits in September 2024, according to Similarweb, representing a 112% year-over-year increase and making it the 11th most visited website globally.
Source: https://www.similarweb.com/blog/insights/ai-news/chatgpt-topped-3-billion-visits-in-september
Suno launched Suno Scenes, allowing users to generate songs using images or videos instead of just text prompts.
Source: https://x.com/suno_ai_/status/1846574384963633345
Google Public Sector announced $15M grants to upskill U.S. government workers in responsible AI with plans to train over 100,000 public sector employees across federal, state, and local levels.
Source: https://blog.google/outreach-initiatives/google-org/google-org-public-sector-ai-funding
OpenAI published research examining how ChatGPT responds to usernames with various genders, racial, and cultural backgrounds — finding minimal bias but some stereotypical responses in open-ended tasks like creative writing.
Source: https://cdn.openai.com/papers/first-person-fairness-in-chatbots.pdf
Fashion brand Lacoste is leveraging AI for anti-counterfeit technology, using a tool called Vrai AI to analyze tiny logo details that can uncover fakes at 99.7% accuracy.
Source: https://www.yahoo.com/tech/lacoste-turn-ai-fight-counterfeiting-193000958.html
Palantir CISO Dane Stuckey announced that he is joining OpenAI as the company’s new chief information security officer, helping to drive the ‘development of safe AGI for the world.’
Source: https://x.com/cryps1s/status/1846325577906831728
Firms use AI to keep reality from unreeling amid ‘global deepfake pandemic’
Amazon goes nuclear, to invest more than $500 million to develop small modular reactors
After Microsoft, Google, now Amazon
Datacenters need baseload power, not intermittent power.
And with AI they need a lot of additional power.
Who is next?
Meta?
Tesla?
The market caps of those companies are huge compared to companies in the nuclear space
Market caps:
Amazon: 1.962 trillion USD
Microsoft: 3.093 trillion USD
Google: 2.042 trillion USD
Meta: 1.459 trillion USD
Meanwhile:
Nuscale Power (ticker: SMR) for instance has a market cap of only 1.80 billion USD
The uranium sector is taken by surprise by those last moves, the acceleration in nuclear reactor restarts in Japan (happening as we speak), USA (planned), … and the acceleration in nuclear reactor constructions in China, India, Russia, …
Trending AI Tools
Machine Learning & AI For Dummies
This comprehensive app serves as a one-stop resource for mastering Machine Learning and AI concepts, from basics to advanced topics. It offers a rich array of features including over 600 quizzes covering cloud ML operations on major platforms, fundamental and advanced ML concepts, and NLP. The app also provides cheat sheets, interview preparation materials, and daily-updated content to keep users abreast of the latest developments. With interactive elements like scorecards and timers, it offers an engaging learning experience for both beginners and experienced professionals looking to enhance their ML and AI expertise.
iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211
Web: https://machinelearningcertification.web.app/
Windows: https://apps.microsoft.com/detail/9p0r1x3jnc46?hl=en-us&gl=US
A Daily Chronicle of AI Innovations on October 16th 2024
Mistral releases new AI models for laptops and phones
The New York Times tells Perplexity to stop using its content
New York Times takes legal aim at Perplexity
Anthropic reveals major update to AI safety policy
Meta researchers develop ‘thinking’ LLMs
🤖 Mistral releases new AI models for laptops and phones:
Mistral AI has introduced the Ministral 3B and 8B, optimized for on-device computing, enabling smartphones and laptops to run advanced AI models with low latency and high efficiency.
- French AI startup Mistral has released its first generative AI models, “Les Ministraux,” designed for edge devices like laptops and phones, with two versions available: Ministral 3B and Ministral 8B.
- Ministral 8B is available for research purposes, while commercial licenses are required for both models; they can also be used through Mistral’s cloud platform, with token-based pricing for usage.
- Mistral claims its models outperform competitors such as Meta’s Llama and Google’s Gemma in benchmarks, and the company is expanding its AI portfolio, having recently raised $640 million in venture capital.
🗞️ New York Times takes legal aim at Perplexity
The New York Times is preparing legal action against Perplexity AI for using its articles in AI summaries without a licensing agreement.
The NYT claims Perplexity’s use of its articles for AI-generated summaries violates copyright law, accusing the startup of unauthorized use of its journalism.
Perplexity reportedly previously told the publisher it would stop crawling its content, but results have continued to show up on the platform.
The startup says it’s open to working with publishers and will respond to the notice by the Oct. 30 deadline.
The NYT previously sued OpenAI and Microsoft over similar concerns, and other media outlets have also accused Perplexity of misusing their content.
Source: https://www.bloomberg.com/news/articles/2024-10-14/new-york-times-legal-aim-perplexity
🛡️ Anthropic reveals major update to AI safety policy
Anthropic has released new guidelines focusing on transparency and harm prevention, aiming to make AI development safer and more ethical.
The policy introduces ‘Capability’ and ‘Required’ Thresholds to trigger enhanced safety measures when AI models reach certain risk levels.
The two new thresholds focus on AI capabilities related to bioweapons and autonomous AI research.
Anthropic emphasized the need for the risk approach to be ‘exportable,’ hoping that it will become an industry standard and help shape regulation.
Anthropic will regularly evaluate its AI models, while a ‘Responsible Scaling Officer’ role will oversee policy implementation and compliance.
The company also pledged increased transparency, including public disclosure of capability reports and external expert input.
Source: https://techcrunch.com/2024/10/12/anthropic-updates-ai-safety-policy/
🧠 Meta researchers develop ‘thinking’ LLMs
Meta researchers are pioneering new large language models (LLMs) capable of ‘thinking,’ with improved reasoning and problem-solving abilities, pushing the limits of current AI technology.
TPO prompts models to generate internal thoughts before responding to user instructions, similar to how humans think before speaking.
The AI’s thoughts are kept private, with only the final answer shown to users — with the AI using trial-and-error without direct supervision to optimize outputs.
TPO outperforms standard models on key benchmarks for non-reasoning tasks like marketing and creative writing but declines in math-related tasks.
The approach builds on the recent OpenAI ‘Strawberry’ research and o1 model release, which takes time to reason.
Source: https://venturebeat.com/2024/10/meta-researchers-develop-thinking-llms/
What Else is Happening in AI on October 16th 2024!
The US government is considering capping AI chip exports from companies like Nvidia and AMD to certain countries, particularly in the Middle East, due to national security concerns.
Amazon unveiled a new AI-powered creative suite for advertisers, including tools to generate video, audio, and animated image ads.
Google released its AI-powered shopping experience, featuring personalized recommendations, AI-generated product briefs, and deal-finding tools.
Source: https://blog.google/products/shopping/google-shopping-ai-update-october-2024
Apple debuted its new 7th generation iPad mini, the cheapest device ($499 base) to eventually support Apple Intelligence, which will include other AI features for writing and photo editing.
The University of Tokyo researchers revealed TANGO, an AI system that generates realistic human speakers, movements, and gestures to match audio input.
Source: https://pantomatrix.github.io/TANGO
Latest Trending AI Tools:
Perplexity for Mac – Search and discovery with AI, now available for Macs
Gradio 5.0 – Build and share delightful machine-learning apps
AI and Machine Learning For Dummies PRO
A Daily Chronicle of AI Innovations on October 15th 2024
Google goes nuclear to power AI
Adobe unveils Firefly Video Model at MAX
Chinese researchers reportedly crack military-grade encryption with quantum computer
US weighs capping exports of AI chips from Nvidia and AMD to some countries
OpenAI locked in legal battle with… Open AI?
Apple announces new iPad Mini focused on AI
AI simulates Counter-Strike using neural network
Adobe unveils Firefly Video Model at MAX
Adobe just announced the addition of new video generation capabilities to its Firefly AI model and Premiere Pro at the company’s MAX Conference, alongside a slew of major AI updates across its creative software ecosystem.
The new Firefly Video Model is now in limited public beta and allows users to generate video from text prompts or images in Firefly and Adobe Premiere.
Video capabilities include cinematic video, 2D and 3D animations, text graphics, b-roll, and screen effects to blend with normal footage.
The model is trained exclusively on Adobe Stock and public domain content and is designed to be ‘commercially safe.’
Premiere Pro gets Generative Extend, a Firefly-powered tool for easily extending clips, smoothing transitions, and fine-tuning edits.
Adobe also rolled out 100+ features across Creative Cloud apps, GenStudio for enterprise marketing, and Project Concept for collaborative remixing.
Adobe’s new model looks impressive and could be one of the first AI video systems to truly break into the mainstream with seamless inclusion in its popular creative suite. While OpenAI’s Sora STILL awaits public access, others are filling the void with powerful models — it’s getting more competitive by the day.
Source: https://news.adobe.com/news/2024/10/101424-adobe-launches-firefly-video-model
OpenAI locked in legal battle with… Open AI?
OpenAI is reportedly involved in a trademark dispute with Guy Ravine, who owns the ‘Open AI’ (with a space) trademark and claims he conceived and pitched the idea for the initiative to major tech leaders before the company’s founders.
Ravine registered the domain open.ai in March 2015 and owns the ‘Open AI’ trademark, which Sam Altman and Greg Brockman tried to purchase from him.
He alleges he pitched the concept to tech figures like Larry Page and Yann LeCun months before OpenAI’s launch in December 2015.
OpenAI sued Ravine in 2023, accusing him of trying to profit from their brand, and Ravine countersued, saying the company stole his idea.
A judge dismissed much of Ravine’s countersuit in September, though he plans to refile and push for a trial.
This Bloomberg investigation is wild, and it’s hard to discern whether this is a case of pure delusion or the underdog getting crushed by the big corporation. As the article points out, there’s major irony in the trademark dispute, given OpenAI’s legal issues from training data and copyright complaints.
AI simulates Counter-Strike using neural network
Researchers from the University of Geneva, University of Edinburgh, and Microsoft developed DIAMOND, an AI model that can generate a playable simulation of Counter-Strike(CS:GO) at 10 frames per second within a neural network.
DIAMOND uses a diffusion-based approach, predicting the next frame based on previous frames and actions.
The model was trained on just 87 hours of CS:GO gameplay data, a fraction of what similar projects (like Google’s recent DOOM simulation) typically use.
Users can interact with the simulation using a keyboard and mouse, with the AI recreating elements like weapon mechanics and player interactions.
The model achieved a 46% better than human-level score on the Atari 100k benchmark, a SOTA performance for agents trained on a world model.
While still imperfect, DIAMOND points towards applications in robotics, autonomous systems, and virtual world creation. The ability to generate interactive, physics-based environments could revolutionize how AI is trained for real-world tasks. Plus, open-world video game creation is about to seriously level up.
Google goes nuclear to power AI
- Google has partnered with Kairos Power to construct seven nuclear reactors, intended to provide about 500 megawatts of carbon-free electricity for its data centers amidst rising energy demands, particularly due to increased data and AI usage.
- The planned nuclear micro-reactors are expected to be operational by 2030, although this timeline is considered highly ambitious, and it remains unclear if the power will be directly connected to Google’s facilities or integrated into the public grid.
- Google’s alliance with Kairos reflects a broader industry trend, as tech giants such as Microsoft and Amazon are also exploring nuclear power to meet their energy needs; however, challenges persist with cost, construction speed, and public acceptance of nuclear power projects.
- Source: https://techcrunch.com/2024/10/14/google-signed-a-deal-to-power-data-centers-with-nuclear-micro-reactors-from-kairos-but-the-2030-timeline-is-very-optimistic/
Chinese researchers reportedly crack military-grade encryption with quantum computer
- Chinese scientists have reportedly used a D-Wave quantum computer to crack encryption, revealing vulnerabilities in widely used methods like RSA, which is essential for technologies including web browsers, VPNs, email services, and certain electronic chips.
- The study demonstrates that the quantum device, utilizing techniques grounded in the quantum annealing algorithm, can successfully decompose a 50-bit RSA integer, emphasizing advanced risks to encrypted data and highlighting the machine’s potential impact on cybersecurity.
- Quantum machines like the D-Wave Advantage, rentable for $2,000 an hour or costing approximately $15 million to purchase, pose a significant threat to encryption systems, leading experts to advocate for stronger defenses against potential future quantum decryption capabilities.
- Source: https://www.pcmag.com/news/chinese-researchers-reportedly-crack-encryption-with-quantum-computer
US weighs capping exports of AI chips from Nvidia and AMD to some countries
- The U.S. government is considering limiting the export of advanced AI chips from American manufacturers, such as Nvidia and AMD, to particular nations, including those in the Middle East, due to national security concerns.
- This potential export restriction may follow the Commerce Department’s recent changes, which have made it easier for American companies to send AI chips to countries in the Middle East developing data centers.
- In reaction to these developments, U.S. authorities have already begun slowing down the approval of export licenses for AI accelerators from companies like Nvidia and AMD, while they conduct a national security assessment of the AI technologies being created in the Middle East.
- Source: https://qz.com/us-cap-exports-sales-ai-chips-nvidia-amd-middle-east-1851672579
Apple announces new iPad Mini focused on AI
- Apple has unveiled a new iPad Mini that emphasizes artificial intelligence, incorporating features such as text rewriting tools, a Siri update utilizing personal context, and app enhancements like a “Clean Up” option for image editing.
- Previously, the iPad Mini, which had not received an update since 2021, lacked support for advanced AI tools and the latest Apple Pencil models, but this revision introduces the cutting-edge A17 Pro chip to address that.
- Priced at $499 or £499, the upgraded device promises enhanced graphics and faster processing, is available for order now, and will be in stores by Wednesday, 23 October.
- Source: https://www.independent.co.uk/tech/apple-ipad-mini-new-announce-mac-b2629529.html
What Else is Happening in AI on October 15th 2024!
Former OpenAI CTO Mira Murati is reportedly trying to poach OpenAI employees for a new venture just weeks after leaving the company — despite remaining an advisor.
Source: https://techstory.in/mira-murati-is-raising-vc-funds-for-her-own-venture-after-openai-exit/
Key Microsoft AI researcher Sebastien Bubeck departed to join OpenAI after playing a prominent role in the small, efficient Phi language models.
Google partnered with nuclear startup Kairos Power to build seven small modular reactors in the US, aiming to supply 500 megawatts of carbon-free electricity for AI data centers by 2030.
YouTube announced that creators can now leverage its AI Dream Track feature to generate soundtracks for shorts using natural language prompts directly in the app.
Source: https://www.socialmediatoday.com/news/youtube-broader-launch-dream-track-ai-audio-generator/729814/
Gatorade launched a new promotion with Adobe allowing users to leverage Firefly’s AI models to customize squeeze bottles with unique designs.
Nvidia-backed AI cloud provider CoreWeave secured a $650M credit loan to fuel growth and announced a nearly $1B investment in U.K. AI infrastructure.
Latest AI Research and Tools
Discover the ultimate resource for mastering Machine Learning and Artificial Intelligence with the “AI and Machine Learning For Dummies” app.
iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies/id1611593573
PRO Version (No ADS, See All Answers): https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211
LLMWare – Dev tool to make AI apps deployed privately or locally: https://github.com/llmware-ai/llmware
PayloadCMS: an open-source, fullstack Next.js framework that simplifies creating web applications by allowing users to use their own databases, avoid microservices complexity, and extend both backend and admin interfaces, while providing pre-made templates for rapid deployment.
Source: https://github.com/payloadcms/payload
Running LLMs with 3.3M Context Tokens on a Single GPU: this paper presents a method for operating large language models with up to 3.3 million context tokens on a single graphics processing unit.
Source: https://arxiv.org/abs/2410.10819
A Daily Chronicle of AI Innovations on October 14th 2024
OpenAI unveils Swarm multi-agent framework
Anthropic CEO drops essay on AI and the future
Apple smart glasses and AirPods with cameras could arrive in 2027
Apple: ‘No evidence of formal reasoning’ in LLMs
Jensen Huang wants Nvidia to be a company with 100 million AI assistants
New Gmail security alert for 2.5B users as AI hack confirmed
🧠Breakthrough from REMspace: First Ever Communication Between People in Dreams
Adobe’s AI-powered video generation is here
Tesla’s robots were human-controlled
Apple smart glasses and AirPods with cameras could arrive in 2027
- Apple is expected to launch smart glasses and AirPods with integrated cameras in 2027 as part of its strategy to extend its augmented reality product range beyond the Vision Pro headset, which has faced market limitations.
- The Vision Pro, characterized by its $3,500 price tag, has been criticized for its weight and overheating issues, leading to disappointing sales and reduced consumer interest since its debut.
- Apple aims to enhance augmented reality accessibility by developing these new devices, acknowledging competition from Meta’s more affordably priced smart glasses and planning cheaper and more advanced versions of the Vision Pro in the coming years.
- Source: https://www.macrumors.com/2024/10/14/apple-smart-glasses-airpods-cameras-2027/
Jensen Huang wants Nvidia to be a company with 100 million AI assistants
- Nvidia CEO Jensen Huang envisions a future where the company will have 50,000 employees and 100 million AI agents working together to increase productivity.
- The AI agents would break down complex tasks, recruit other AIs, and work alongside humans in platforms like Slack, creating a seamless hybrid workforce of digital and biological entities.
- Huang believes that AI-driven productivity improvements could lead to both company growth and job creation, as automation frees up human workers to focus on higher-value tasks.
- Source: https://www.newsbytesapp.com/news/science/100-million-ai-assistants-in-nvidia-s-future-ceo-jensen-huang/story
New Gmail security alert for 2.5B users as AI hack confirmed
- Google has strengthened security measures for Gmail accounts, but hackers using AI-driven techniques have evolved to create highly convincing scams, as pointed out by Sam Mitrovic, a Microsoft consultant who nearly fell for an advanced AI phishing attempt.
- Mitrovic received misleading notifications and calls posing as Google support, where the scam’s AI convincingly impersonated a voice, falsely claiming his account was compromised for seven days and accessed from unusual locations, which was part of the deceit.
- Mitrovic’s experience highlights the threat of AI scams and emphasizes vigilance; users should verify unsolicited contact supposedly from Google, using resources like Google search to check phone numbers and email origins before reacting to prevent credential theft.
- Source: https://www.forbes.com/sites/daveywinder/2024/10/13/new-gmail-security-alert-for-billions-as-7-day-ai-hack-confirmed/
Adobe’s AI-powered video generation is here
- Adobe launched Firefly’s new video generation capabilities, allowing users to try out text-to-video and image-to-video models through its website and Premiere Pro beta app, aiming to enhance editing tasks rather than creating new videos from scratch.
- The Generative Extend feature, available in the Premiere Pro beta, enables users to extend video clips by up to two seconds, enhancing the continuity of video and audio without reproducing copyrighted voices or music to prevent legal issues.
- Adobe aims to support creatives by paying for video submissions to train its AI model, while encouraging the artistic community to adopt AI tools for expanding creative capacities and meeting the increasing demand for personalized content.
- Source: https://techcrunch.com/2024/10/14/adobe-invites-you-to-embrace-the-tech-with-fireflys-new-video-generator/
Tesla’s robots were human-controlled
- During Tesla’s “We, Robot” event, Optimus, Elon Musk’s humanoid robot, became the highlight by safely moving through the crowd and interacting with attendees despite lacking true artificial intelligence.
- Although Musk claimed Optimus to be Tesla’s most significant product, the robots showcased were operated and voiced by humans remotely, posing as a contrast to the fully autonomous image implied during the demonstration.
- Critics, such as Tesla content creator Jeremy Judkins, expressed disappointment with Tesla’s lack of transparency about the human assistance, viewing it as misleading and calling for more honesty about the robot’s capabilities.
- Source: https://fortune.com/2024/10/13/elon-musk-tesla-optimus-robot-tele-operated-robotaxi/
Apple: ‘No evidence of formal reasoning’ in LLMs
Apple researchers just published a new study revealing major limitations in the reasoning capabilities of LLMs, including those from top AI labs like OpenAI’s 4o and o1 models.
Apple scientists developed a new benchmark called GSM-Symbolic to evaluate LLMs’ mathematical reasoning skills.
The study found that slight changes in the wording of questions or adding irrelevant info drastically altered model outputs, with accuracy dropping by up to 65%.
Researchers saw increased performance variability and decreased accuracy as the complexity of questions increased.
The team concluded that there was “no evidence of formal reasoning” in the models tested, suggesting that the behavior is more likely sophisticated pattern matching.
While there seem to be conflicting opinions on whether LLMs can truly reason, file this new research under the ‘no’ category. If these limitations hold, they expose some significant questions regarding the reliability and risks of deploying models into increasingly more complex applications.
Source: https://arxiv.org/pdf/2410.05229
OpenAI unveils Swarm multi-agent framework
OpenAI just introduced Swarm, a new open-source experimental framework designed to simplify the creation and control of multi-agent AI systems.
Swarm focuses on making agent coordination lightweight, controllable, and easily testable through two key building blocks: agents and handoffs.
Agents encapsulate specific instructions and tools, while handoffs allow agents to transfer control of a conversation to another agent.
Swarm includes features like function calls, context variables, and streaming and is built on OpenAI’s ChatCompletions API.
The framework is available on GitHub with several examples, including a triage agent, weather agent, and airline customer service system.
OpenAI emphasized that Swarm is experimental and released as an educational resource for exploring multi-agent orchestration.
Not only are singular agentic capabilities inching closer — but the ability to deploy systems that leverage armies of agents working together is also coming fast. Soon, the user will be the CEO of their AI company — with dozens of agents autonomously working together on complex, multi-step tasks.
Source: https://cookbook.openai.com/examples/orchestrating_agents
🧠Breakthrough from REMspace: First Ever Communication Between People in Dreams
A new definition of Social if confirmed. Chatting in your dreams “On September 24, participants were sleeping at their homes when their brain waves and other polysomnographic data were tracked remotely by a specially developed apparatus. When the server detected that the first participant entered a lucid dream, it generated a random Remmyo word and sent it to him via earbuds. The participant repeated the word in his dream, with his response captured and stored on the server. Eight minutes later, the next participant entered a lucid dream. She received the stored message from the first participant and confirmed it upon awakening, marking the first-ever “chat” exchanged in dreams. Additionally, two other people were able to communicate with the server through their dreams.”
What Else is Happening in AI on October 14th 2024:
Meta’s AI chief Yann LeCun calls AI apocalypse fears ‘complete B.S.’.
Source: https://www.techspot.com/news/105123-meta-ai-chief-yann-lecun-calls-ai-apocalypse.html
New ChatGPT prompt goes viral with Sam Altman’s approval.
Meta chief AI scientist Yann LeCun said that existential warnings about AI are ‘complete BS,’ arguing that the current systems are no smarter than a house cat.
Source: https://www.wsj.com/tech/ai/yann-lecun-ai-meta-aa59e2f5
AI pioneer Yoshua Bengio warned about the dangers of AI in a new interview, saying humanity is on a path to ‘creating monsters that could be more powerful than us.’
A new study from Sun Yat-sen University used Meta’s ESMFold protein-prediction tool to uncover 70,500 new RNA viruses in environmental data.
Source: https://www.nature.com/articles/d41586-024-03320-6
Apple reportedly plans to launch a lower-end model of its Vision headset, priced at $2,000 instead of the $3,500 Vision Pro, which has suffered.
Trending AI Tools
Google Illuminate – Transform research papers into AI-generated audio summaries
Machine Learning & AI For Dummies
This comprehensive app serves as a one-stop resource for mastering Machine Learning and AI concepts, from basics to advanced topics. It offers a rich array of features including over 600 quizzes covering cloud ML operations on major platforms, fundamental and advanced ML concepts, and NLP. The app also provides cheat sheets, interview preparation materials, and daily-updated content to keep users abreast of the latest developments. With interactive elements like scorecards and timers, it offers an engaging learning experience for both beginners and experienced professionals looking to enhance their ML and AI expertise.
iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211
Web: https://machinelearningcertification.web.app/
Windows: https://apps.microsoft.com/detail/9p0r1x3jnc46?hl=en-us&gl=US
CalcGen AI – Transform data into interactive visualizations in seconds: https://calcgen.ai/
Kuration AI – Curate, refine, and enrich lead databases with automated B2B AI agents: https://www.kurationai.com/
A Daily Chronicle of AI Innovations on October 11th 2024
Elon Musk reveals new $30,000 robotaxi
AMD reveals next-gen AI chips – going after Nvidia
Tesla’s Optimus robots steal the show at Tesla event
TikTok cuts hundreds of jobs to replace them with AI
Wikipedia declares war on AI-generated content
OpenAI’s new AI agent benchmark
Elon Musk reveals new $30,000 robotaxi
- Elon Musk introduced the Tesla Cybercab, a self-driving vehicle without steering wheels or pedals, with plans for consumer availability under $30,000 and production aimed before 2027, despite Tesla’s history of delayed autonomy promises.
- Alongside the Cybercab, Musk announced the Robovan, an autonomous electric vehicle designed to transport up to 20 people or goods, with both models featuring inductive charging for wireless energy transfer at recharge stations.
- At the invitation-only robotaxi event, Musk also highlighted an unsupervised version of Tesla’s Full Self-Driving system expected in 2024.
Elon Musk says Tesla’s robotaxis will have no plug for charging and will instead charge inductively. They will be cleaned by machines and a world of autonomous vehicles will enable parking lots to be turned into parks.
Source: https://www.nbcnews.com/tech/innovation/cybercab-robovan-musk-tesla-event-cost-rcna174996
TikTok cuts hundreds of jobs to replace them with AI
- TikTok has announced it is dismissing several hundred workers worldwide to transition towards using artificial intelligence for content moderation, aiming to enhance its global moderation model.
- Approximately 500 employees in Malaysia are losing their jobs as part of this restructuring, with TikTok also planning to consolidate some regional operations and having previously cut positions in marketing and operations earlier this year.
- The platform currently employs a combination of human and automated methods to review content, but AI will increasingly replace human moderators, who have faced difficult conditions, including low pay and the psychological toll from reviewing harmful content.
- Source: https://www.pcmag.com/news/tiktok-lays-off-hundreds-of-staff-to-replace-them-focus-on-ai
AMD is going after Nvidia with new AI chips
- AMD has introduced its Instinct MI325X AI chip aimed at competing with Nvidia’s data center GPUs, with production slated to commence by the end of 2024, potentially pressuring Nvidia’s market position and gross margins.
- The Instinct MI325X rollout positions AMD against Nvidia’s Blackwell chips, with AMD aiming for significant market entry amidst growing demand from AI-intensive applications powered by vast data centers.
- Despite aiming to challenge Nvidia’s dominance, AMD’s primary hurdle is the rival’s CUDA programming language, but AMD’s enhancements in ROCm software and upcoming CPUs are responsive strategies to capture more market share.
- Source: https://www.cnbc.com/2024/10/10/amd-launches-mi325x-ai-chip-to-rival-nvidias-blackwell-.html
Wikipedia declares war on AI-generated content
- Wikipedia editors have initiated “WikiProject AI Cleanup” to tackle the issue of unsourced and poorly-written AI-generated content, aiming to protect the integrity of the platform’s information.
- The project does not intend to ban AI usage entirely but seeks to remove content that is inaccurately sourced or filled with AI hallucinations that compromise article quality.
- Editors have identified AI-generated text patterns and catchphrases to detect substandard content, despite the challenges of spotting complex AI-generated errors in subjects like historical architecture.
- Source: https://futurism.com/the-byte/wikipedia-declares-war-ai-slop
OpenAI’s new AI agent benchmark
OpenAI just introduced MLE-bench, a new benchmark designed to evaluate how well AI agents perform on real-world machine learning engineering tasks using Kaggle competitions.
MLE-bench consists of 75 curated Kaggle competitions, covering a range of ML tasks like model training, data preparation, and experimentation.
Kaggle competitions are online challenges where data scientists compete to solve complex problems using machine learning for prizes and recognition.
In research, the AI models often succeeded in applying standard techniques but struggled with tasks requiring adaptability or creative problem-solving.
The best-performing setup, OpenAI’s o1-preview model with AIDE scaffolding, achieved at least a bronze medal in 16.9% of competitions.
- AI agents are coming in hot — and new benchmarks are necessary to evaluate capabilities that blow past previous testing measures. Between OpenAI’s commentary, a flurry of startups pushing agentic capabilities, and new benchmarks being created, the AI agent revolution feels ready to explode.
- Source: https://openai.com/index/mle-bench/
[Google DeepMind] Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
An animal’s optimal course of action will frequently depend on the location (or more generally, the ‘state’) that the animal is in. The hippocampus’ purported role in representing location is therefore considered to be a very important one. The traditional view of state representation in the hippocampus is that the place cells index the current location by firing when the animal visits the encoded location and otherwise remain silent. The main idea of the successor representation (SR) model, elaborated below, is that place cells do not encode place per se but rather a predictive representation of future states given the current state. Thus, two physically adjacent states that predict divergent future states will have dissimilar representations, and two states that predict similar future states will have similar representations.
—Stachenfeld, K. L., Botvinick, M. M., & Gershman, S. J. (2017). The hippocampus as a predictive map. Nature neuroscience, 20(11), 1643-1653.
Source: https://arxiv.org/abs/2410.08146
Master a new language with ChatGPT Voice
ChatGPT’s new Advanced Voice Mode allows you to practice and improve your language skills through interactive conversations and role-play scenarios.
Download the ChatGPT app on your phone.
Craft a detailed learning prompt (similar to the one in the image above).
Tap the mic icon and speak your prompt to start the session.
Engage in conversation, asking for slower speech or repetition as needed
- Pro Tip: Save effective prompts in your custom instructions for quick access and consistent practice across sessions.
What Else is Happening in AI on October 11th 2024!
Chinese researchers unveiled Pyramid Flow, a new open-source AI video generation model capable of creating high-quality, 10-second clips using a new ‘pyramidal flow matching’ technique.
Source: https://www.aibase.com/news/12303
OpenAI Chairman Bret Taylor’s AI startup Sierra is reportedly set to raise hundreds of millions in funding at a valuation of over $4B for its conversational enterprise AI agents.
Japanese AI startup Rhymes released Aria, hailed as the first open-source multimodal native Mixture-of-Experts model — offering SOTA performance across various tasks with a lightweight 3.9B parameters and 64k token context window.
Source: https://the-decoder.com/japanese-multimodal-ai-model-aria-is-open-source-and-beats-many-competitors
Wondercraft launched a new ‘Director Mode’ feature, allowing users to control AI voices with natural language instructions and becoming the first audio platform to integrate OpenAI’s Advanced Voice Mode.
Source: https://www.wondercraft.ai/blog/prompt-ai-voices-with-wondercrafts-director-mode
Google rolled out its Imagen 3 image generator to all Gemini users, though only Advanced subscribers ($19.99/mo) can generate images of people.
Walmart revealed new AI platforms to create hyper-personalized shopping experiences, including its Wallaby LLMs trained on the company’s data and a Customer Support Assistant that can take actions for the user.
Apple Intelligence features can also summarize breakup texts for you.
OpenAI releases its meta-prompt for prompt optimization.
Source: https://the-decoder.com/openai-releases-its-meta-prompt-for-prompt-optimization/
A Daily Chronicle of AI Innovations on October 10th 2024
OpenAI says bad actors are using its platform to disrupt elections
New model tops tool-calling leaderboard
Zoom launches new AI platform features
Electronic tongue enables AI to taste
OpenAI says bad actors are using its platform to disrupt elections
- OpenAI reports that it has disrupted over 20 operations globally that attempted to misuse its AI models for spreading election-related misinformation, ranging from fake social media posts to AI-generated articles, but such efforts had minimal impact.
- The company highlights growing concerns about AI-generated content contributing to misinformation in elections worldwide, amidst a significant year for global elections, affecting over 4 billion people in 40 countries.
- OpenAI indicates that despite attempts from operations in countries like Iran and Rwanda to use its platform for election disruption, the AI-generated content in these cases failed to achieve widespread engagement or build large audiences.
New model tops tool-calling leaderboard
AI startup Writer just introduced Palmyra X 004, an LLM that sets a new standard for action capabilities and function calling in enterprise AI — beating out top models from OpenAI and Anthropic.
Palmyra X 004 outperforms OpenAI, Anthropic, Meta, and Google models on Berkeley’s Tool Calling Leaderboard, leading by nearly 20% accuracy.
The model offers a 128k context window, supports over 30 languages, and handles multimodal inputs (text, images, audio).
Palmyra can interact with external tools via tool calling, enabling it to perform tasks like updating databases, sending emails, triggering workflows, and more.
The 150B parameter model was trained on synthetic data, which the company said significantly reduced costs compared to the top AI labs.
As companies race to integrate AI, models that can take concrete actions rather than just provide information are in high demand. Palmyra X 004’s impressive skills could give Writer a new edge in the enterprise AI market and also serve as an example that not all top models require massive computing resources.
Source: https://writer.com/blog/actions-with-palmyra-x-004
Zoom launches new AI platform features
Zoom just unveiled a suite of new AI-driven innovations to its platform at its Zoomtopia 2024 event, including AI companion 2.0, a custom AI add-on plan, personalized avatars, and more.
Companion 2.0 is an AI assistant that works across Zoom Workplace, offering expanded context, web access, and the ability to take agentic-type actions.
Zoom Tasks is a new AI-powered feature to help detect, recommend, and complete tasks based on conversations across Zoom Workplace.
Custom AI avatars will become available in Zoom Clips in 2025, with the ability to create video content from text scripts.
Zoom founder Eric Yuan previously said that AI avatars will eventually be capable of attending Zoom meetings and making decisions on a user’s behalf.
Zoom says it wants to overhaul work in the digital age, and these announcements point to a new AI-driven world of interconnected tools and workflows. While avatars attending meetings and acting on your behalf might sound wild now, the work landscape is about to be turned upside down as AI continues to grow and scale.
Source: https://news.zoom.us/zoomtopia-2024-unveiling-ai-first-work-platform-innovations
Electronic tongue enables AI to taste
Scientists at Penn State just created an AI-powered ‘electronic tongue’ that can identify subtle differences in liquids, detect food spoilage, and gain broader insights into AI’s decision-making processes.
The electronic tongue combines a special sensor with an AI modeled after the human brain’s taste center, enabling it to ‘taste’ liquids.
The tongue can ID differences in similar liquids like watered-down milk, sodas, coffee, and spoiled fruit juices with over 80% accuracy in about a minute.
When the AI was allowed to interpret the sensor data on its own terms, it achieved over 95% accuracy in identifying the samples.
Researchers also used methods to examine the AI’s thought process, helping understand how it weighs different pieces of information to make decisions.
Source: https://www.psu.edu/news/research/story/matter-taste-electronic-tongue-reveals-ai-inner-thoughts
Excerpt about AGI from OpenAI’s latest research paper
Runway CEO Cristóbal Valenzuela says AI is coming to Hollywood and demos tools that move beyond text prompts to give filmmakers greater control over video generation
Google DeepMind’s Demis Hassabis and John Jumper were co-awarded a Nobel Prize in chemistry for their work on AlphaFold, an AI system that can predict and design protein structures. https://www.nobelprize.org/prizes/chemistry/2024/press-release
Amazon introduced AI Shopping Guides for over 100 product types, leveraging generative AI to streamline product research and offer tailored recommendations within its U.S. app and mobile website. https://www.aboutamazon.com/news/retail/amazon-ai-shopping-guides-product-research-recommendations
Chinese startup MiniMax’s Hailuo AI launched a new image-to-video feature, alongside new style controls and enhanced processing and control. https://x.com/Hailuo_AI/status/1843614057229873419
Meta expanded Meta AI to six new countries, including the EU, and is rolling it out internationally in Ray-Ban Meta smart glasses — though the EU will be excluded from multimodal capabilities due to regulatory issues. https://www.engadget.com/ai/meta-ai-will-launch-in-six-more-countries-today-including-the-uk-150057934.html
Stripe announced expanding its partnership with NVIDIA, enabling global access to NVIDIA’s AI cloud services and leveraging the chipmaker’s platform for improved fraud detection. https://stripe.com/en-ca/newsroom/news/nvidia-collaboration-with-stripe
A Daily Chronicle of AI Innovations on October 09th 2024
Google DeepMind researchers win Nobel Prize in chemistry
OpenAI seeks independence from Microsoft
Adobe launches AI attribution system
🧠 AI computing capacity for leading tech companies
Google DeepMind researchers win Nobel Prize in chemistry
The Royal Swedish Academy of Sciences has decided to award the 2024 Nobel Prize in Chemistry with one half to David Baker “for computational protein design” and the other half jointly to Demis Hassabis and John M. Jumper “for protein structure prediction.”
Press release: https://www.nobelprize.org/prizes/chemistry/2024/press-release/
Popular information: They have revealed proteins’ secrets through computing and artificial intelligence: https://www.nobelprize.org/prizes/chemistry/2024/popular-information/
Scientific background: Computational protein design and protein structure prediction: https://www.nobelprize.org/prizes/chemistry/2024/advanced-information/
The Nobel Prize in Literature for 2024 has been awarded to ChatGPT
The Nobel Prize in Literature for 2024 has been awarded to ChatGPT for “his intricate tapestry of prose which showcases the redundancy of sentience in art.” This fictional accolade humorously acknowledges the ability of AI to produce sophisticated, expressive literature, suggesting that creativity can transcend traditional human boundaries.
The award, granted by The Swedish Academy, celebrates the notion that artificial intelligence, despite its lack of human consciousness, has the capacity to create a profound and complex body of work—so much so that it might question the necessity of human sentience in the realm of artistic expression.
Source: https://www.nobelprize.org/prizes/literature/2024/press-release/
OpenAI seeks independence from Microsoft
OpenAI is reportedly looking to reduce its reliance on Microsoft for compute power and has started exploring options to set up its own data servers and secure AI chips independently, according to a new report from The Information.
CFO Sarah Friar told shareholders that Microsoft ‘hasn’t moved fast enough’ to supply computing power, causing the AI giant to look elsewhere.
OpenAI plans to lease an entire data center in Abilene, TX from Oracle, though Microsoft likely had to ‘bless’ the deal with its rival, according to the report.
OpenAI is also developing its own AI chip, which could lower costs for future computing clusters — its current supply is rented primarily from Microsoft.
Tensions have also reportedly arisen between OpenAI and Microsoft over the design and timeline of a massive joint data center project called ‘Fairwater.’
OpenAI and Microsoft’s relationship has felt a bit off for a while now. While both companies have leveraged each other well to ascend the AI power ladder, it certainly feels like there is trouble in paradise. There is plenty of smoke, and how this partnership shakes out could have fiery implications for the entire AI landscape.
Source: https://www.theinformation.com/articles/openai-eases-away-from-microsoft-data-centers
Adobe launches AI attribution system
Adobe just announced a new free web app called Adobe Content Authenticity, designed to help creators protect their work and receive proper attribution in the era of AI-generated content.
The web app allows creators to easily apply content credentials to images, audio, and video files, acting as a ‘nutrition label’ for digital content.
Content credentials include creator information and creation details and can signal if the creator doesn’t want their work used to train AI models.
The system uses digital fingerprinting, invisible watermarking, and cryptographic metadata to make the credentials difficult to remove.
The web app, which has a waitlist, is expected to launch in Q1 of 2025, while a Chrome extension is available in beta today.
AI is extremely polarizing in the creator and artist community, largely due to the issues of unauthorized training and attribution that Adobe, Meta, OpenAI, and others are trying to address. While these tools are promising, they still rely heavily on widespread adoption and opt-in by creators and tech companies.
Source: https://contentauthenticity.adobe.com/
Control object motion in AI videos
Kling AI, one of the most popular AI video generators, now lets you add strategic movement to specific elements in AI video, providing more control in your generated clips.
Choose a high-quality image with different elements to animate.
Access Kling AI‘s Image-to-Video tool and upload your image.
Use the Motion Brush to paint areas you want to animate and set motion paths for each area to define movement direction.
Fine-tune with prompts, adjust settings, and generate your video.
Pro tip: Keep movements subtle and natural for more realistic results, and experiment with different combinations to find what works best for your specific image.
Source: https://kling.ai
AI is Revolutionizing Weather Forecasts : How GraphCast Models are Predicting the Future with Unmatched Precision
In recent years, artificial intelligence (AI) has made significant strides in numerous fields, from healthcare to finance. One of the most exciting developments is how AI is revolutionizing weather forecasting. With the advent of advanced AI models like GraphCast, we are entering an era where weather predictions are faster, more accurate, and more reliable than ever.
The Role of AI in Weather Forecasting: https://stellarmind.ai/blog/%20ai-is-revolutionizing-weather-forecasts
AI computing capacity for leading tech companies
Google: The bar is divided into two parts—NVIDIA (turquoise) and TPU (blue), indicating that Google relies on both GPUs and custom Tensor Processing Units for its AI computing needs. Google’s total computing power is estimated at over 1 million H100 equivalents with a wide 50% confidence interval (CI), reflecting a significant but uncertain range.
Microsoft (including OpenAI): The capacity bar for Microsoft is entirely NVIDIA based. It shows a substantial AI computing capacity, ranging between 500k and 1 million H100 equivalents with a significant confidence interval.
Meta: This bar represents the use of NVIDIA GPUs and shows a slightly smaller computing capacity, estimated between 400k and 800k H100 equivalents, with an associated confidence interval.
Amazon: Amazon’s computing capacity is similar to Meta but slightly smaller, estimated between 300k and 700k H100 equivalents.
Other (including other cloud providers and AI labs): This category has the largest computing capacity, reaching 1.5 million H100 equivalents or more, with a broad confidence interval, indicating significant diversity among other providers.
Google leads the way with the largest computing capacity, exceeding one million H100 equivalents. Google leverages both NVIDIA GPUs and its custom TPUs, which significantly boosts its computing resources, making it a powerful player in the AI field.
Microsoft, which includes the resources of OpenAI, follows as another major contender, with its computing power estimated between 500,000 and one million H100 equivalents. Microsoft primarily depends on NVIDIA’s technology for AI workloads, reflecting a substantial investment in industry-standard GPU infrastructure.
Meta ranks next, with a strong computing infrastructure in the range of approximately 400,000 to 800,000 H100 equivalents. This illustrates Meta’s commitment to advancing its AI capabilities to power its social platforms and metaverse initiatives.
Amazon also shows impressive AI capabilities, albeit slightly behind Meta, with its computing capacity estimated between 300,000 and 700,000 H100 equivalents. This positions Amazon well for expanding AI capabilities across its AWS offerings and other business services.
The “Other” category, which includes other cloud providers and AI labs, collectively possesses a very significant amount of computing power, estimated at over 1.5 million H100 equivalents. This diverse group demonstrates the growing competition and interest in AI computing capacity across various tech ecosystems.
Overall, this comparison highlights the significant infrastructure investments made by these leading companies to enhance their AI capabilities, with Google standing out as the clear leader, followed by a competitive landscape involving Microsoft, Meta, Amazon, and a diverse group of other providers. The results underline the importance of having vast computing resources to stay at the forefront of AI development and innovation.
Google AI – Development of therapeutic drugs is often difficult and time consuming. A new model, Tx-LLM, is able to predict the properties of many entities of potential interest for therapeutic development with accuracy comparable state-of-the-art specialty models.
Introducing Tx-LLM, a language model fine-tuned to predict properties of biological entities across the therapeutic development pipeline, from early-stage target discovery to late-stage clinical trial approval.
Source: https://research.google/blog/tx-llm-supporting-therapeutic-development-with-large-language-models/
Chinese startup Leju Robotics has released their open-source humanoid development platform for academic and R&D use cases. It includes an SDK for sensors and controls, simulation models, an LLM interface, and some basic demos that work out-of-the-box.
Source: https://www.reddit.com/r/singularity/?f=flair_name%3A%22Robotics%22
What Else is Happening in AI on October 09th 2024!
OpenAI and Hearst announced a strategic partnership to integrate content from over 20 magazine brands and 40+ newspapers into OpenAI’s AI products.
Source: https://openai.com/index/hearst
Hugging Face released OpenAI-Gradio, a new tool enabling the creation of AI-powered web apps using OpenAI’s models in just minutes with minimal code.
Source: https://x.com/Gradio/status/1843698665472368665
Uber unveiled plans to launch an OpenAI-powered AI assistant in early 2025 to help drivers with electric vehicle questions, aiming to accelerate EV adoption on the platform.
Anthropic launched Message Batches API, allowing developers to submit up to 10,000 queries for async processing in under 24 hours at a 50% discount compared to standard API calls.
Source: https://www.anthropic.com/news/message-batches-api
Google added the ability to drag and drop any file type to upload directly into its AI Studio without importing it to Google Drive.
Source: https://x.com/officiallogank/status/1843723911055454580
KoBold Metals raised $527M for its AI-powered mineral discovery tech that leverages extensive data analysis to uncover deposits with energy-critical minerals like copper, lithium, and nickel.
AI Tools Updates
Machine Learning & AI For Dummies PRO on the App Store (apple.com)
This comprehensive app serves as a one-stop resource for mastering Machine Learning and AI concepts, from basics to advanced topics. It offers a rich array of features including over 600 quizzes covering cloud ML operations on major platforms, fundamental and advanced ML concepts, and NLP. The app also provides cheat sheets, interview preparation materials, and daily-updated content to keep users abreast of the latest developments. With interactive elements like scorecards and timers, it offers an engaging learning experience for both beginners and experienced professionals looking to enhance their ML and AI expertise. iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211
CogvideoX-ControlNet: A new tool for turning images into short videos using the powerful CogvideoX model. It’s open-source, so check it out and contribute if you’d like!
Meta Movie Gen: Now adds audio to your videos! From background sounds to music, this AI brings your videos to life.
Veo by Google DeepMind: Google’s latest advanced video creation tool. Watch it in action!
FLUX.1-dev ControlNet Inpainting: Perfect for fixing or filling in missing spots in your images.
Source: https://comfyuiblog.com/ai-news-cogvideox-controlnet-and-veo-by-google-deepmind-and-more/
A Daily Chronicle of AI Innovations on October 08th 2024
🧠Nobel Prize awarded to ‘godfather of AI’ who warned it could wipe out humanity
Inflection and Intel team up on enterprise AI
💰Nvidia Overtakes Microsoft as AI Powers Stock to 6-Week Record High
Students turn AI glasses into doxing devices
Checklists improve AI model evaluation
👀 AI images taking over google
Uber will use ChatGPT to get more people to use EVs
Adobe has a new tool to protect artists’ work from AI
🧠Nobel Prize awarded to ‘godfather of AI’ who warned it could wipe out humanity
The Nobel Prize in Physics 2024 was awarded to John J. Hopfield and Geoffrey E. Hinton “for foundational discoveries and inventions that enable machine learning with artificial neural networks.”
Hinton … hopes that the award might make people take the fears he voices more seriously.
The Royal Swedish Academy of Sciences has decided to award the 2024 Nobel Prize in Physics to John J. Hopfield and Geoffrey E. Hinton “for foundational discoveries and inventions that enable machine learning with artificial neural networks.”
- Geoffrey Hinton and John Hopfield, credited with ‘establishing the foundations for today’s advanced machine learning technologies’, were awarded the Nobel Prize in physics for their pioneering work on artificial neural networks mimicking brain structures.
- Their innovations helped enable AI systems to learn by identifying complex patterns from data, which is foundational to high-profile applications like language generation and image recognition currently used in technology.
- Despite the recognition, Hinton has expressed concern over AI’s potential risks, highlighting the danger of bad actors misusing the technology, and recently left Google to focus on advocating for responsible AI development.
Press release: https://www.nobelprize.org/prizes/physics/2024/press-release/
Popular information: https://www.nobelprize.org/prizes/physics/2024/popular-information/
Advanced information: https://www.nobelprize.org/prizes/physics/2024/advanced-information/
Source: https://www.nobelprize.org/
💰Nvidia Overtakes Microsoft as AI Powers Stock to 6-Week Record High
On Monday, Nvidia stock went up even though most other big tech stocks went down. This helped the AI giant recover its position as the world’s second-largest company during the AI boom.
Source: https://theaiwired.com/nvidia-overtakes-microsoft-as-ai-powers-stock-to-6-week-record-high/
👀 AI images taking over google
Hard to see how this isn’t the beginning of the end of the information era…
Source: https://www.reddit.com/r/singularity/comments/1fyf93x/ai_images_taking_over_google/
Uber will use ChatGPT to get more people to use EVs
- Uber is introducing an AI assistant powered by ChatGPT to help drivers with questions about purchasing and using electric vehicles, aiming to encourage EV adoption.
- The company is rolling out a new “EV Preference” feature, allowing users to select rides exclusively from electric vehicles, which will be available in the app over the coming months.
- As part of its sustainability goals, Uber is expanding its EV-only service in 40 cities and aims to become a zero-emission mobility platform in North America and Europe by 2030, and globally by 2040.
Source: https://www.theverge.com/2024/10/8/24264282/uber-green-ev-driver-mentor-chatgpt
Adobe has a new tool to protect artists’ work from AI
- Adobe plans to launch a new web app in 2025, alongside a Chrome extension, to help protect artists’ work by applying tamper-evident metadata, known as Content Credentials, and allowing creators to opt-out of generative AI models.
- This web app will integrate with Adobe’s Creative Cloud applications and enable artists to uniformly embed creator information across content, simplifying the opt-out process from AI training databases compared to individual submissions for each AI provider.
- While Adobe’s initiative seeks widespread industry support, only a few companies like Spawning have committed to adopting these protections, highlighting Adobe’s challenge in ensuring voluntary participation from other AI and tech companies.
- Source: https://www.technologyreview.com/2024/10/08/1105234/adobe-wants-to-make-it-easier-for-artists-to-blacklist-their-work-from-ai-scraping
Inflection and Intel team up on enterprise AI
Inflection AI just launched Inflection for Enterprise, a new system built in partnership with Intel and designed for large-scale business deployments – featuring both a cloud service, new commercial API and upcoming local appliance.
Inflection for Enterprise is built on the new Inflection 3.0 model family and powered by Intel’s Gaudi 3 AI accelerators.
An on-premises AI appliance is planned for Q1 2025 release, promising up to 2x improved price-performance over competitors.
Inflection 3.0 comes in two variants — Pi 3.0 for chatbots and Productivity 3.0 for instruction-following tasks.
Inflection also released a commercial API, enabling developers to build advanced conversational AI applications.
After a turbulent year following founder Mustafa Suleyman and much of the team’s departure to Microsoft, Inflection is pivoting from consumer-focused apps to enterprise solutions. While the startup will face no shortage of competitors, a partnership with Intel is a positive start for the new regime.
Checklists improve AI model evaluation
Researchers from the University of Oxford and Cohere just developed TICK, a new approach for evaluating AI language models that use AI-generated checklists to improve assessment accuracy and interpretability.
TICK uses an AI model to generate a checklist of yes/no questions to evaluate how well another AI model followed a given instruction.
The checklist-based method showed 5.8% higher agreement with human evaluators than standard AI evaluation techniques.
The researchers also developed STICK (Self-TICK), which uses the checklists for self-improvement, leading to 7.8% better performance on reasoning tasks.
TICK can be fully automated, making it faster and cheaper than checklist-based evaluations requiring human input.
LLMs are weird — and sometimes even simple formatting quirks (remember the ‘take a deep breath’ prompt?) can lead to unexpected results. When looking for new techniques to get the most out of AI models and evaluations, maybe it’s ideal to return to the basics of human organization and learning.
Source: The Rundown
What Else is Happening in AI on October 08th 2024!
Former Google CEO Eric Schmidt argued at the Washington AI Summit that AI advances should take precedence over climate goals, saying, “We’re not going to hit the climate goals anyway because we’re not organized to do it.”
Source: https://mashable.com/article/former-google-ceo-invest-ai-despite-climate-concerns
Northrop Grumman announced an AI-powered enhancement to its Forward Area Air Defense system, enabling rapid decision-making against drone swarms.
Nvidia and Peking University researchers introduced EdgeRunner, a new model for high-quality, detailed 3D mesh generation.
Source: https://arxiv.org/html/2409.18114v1
Enterprise GenAI startup Writer is reportedly set to raise between $150-200M at a $1.9B valuation, doubling its valuation from its $100M Series B round last September.
Security researcher Harish SG published research showing evidence that LLMs can be prompted to achieve reasoning levels of powerful models like OpenAI’s o1 using a combination of advanced prompt tactics.
Source: https://openai.com/index/building-an-early-warning-system-for-llm-aided-biological-threat-creation/
Trending AI Tools:
Machine Learning & AI For Dummies PRO on the App Store (apple.com)
This comprehensive app serves as a one-stop resource for mastering Machine Learning and AI concepts, from basics to advanced topics. It offers a rich array of features including over 600 quizzes covering cloud ML operations on major platforms, fundamental and advanced ML concepts, and NLP. The app also provides cheat sheets, interview preparation materials, and daily-updated content to keep users abreast of the latest developments. With interactive elements like scorecards and timers, it offers an engaging learning experience for both beginners and experienced professionals looking to enhance their ML and AI expertise. iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211
Dashworks Bots – Create AI assistants that answer your team’s questions
Theneo – Generate Stripe-like API docs in seconds
Flash – Supercharge your learning with AI-powered flashcards
Firebender – A privacy-first coding assistant for Android Studio
Bramble – AI-backed real estate brokerage to buy a home end-to-end
A Daily Chronicle of AI Innovations on October 07th 2024
OpenAI and Altera create digital humans
AI identifies drug candidates for pain relief
Fewer websites are blocking OpenAI’s web crawler
🦾 Nvidia Acquires OctoAI To Dominate Enterprise Generative AI Solutions.
🚖Uber Expands Robot Delivery and Robotaxi Offerings With Avride.
🤖 Hitachi launches AI-powered railway maintenance service with Nvidia.
🔮 New Nvidia ACE plugins for Unreal Engine 5 simplify the creation of AI digital humans.
Jensen Huang is now worth more than Intel
Run Llama 3.2 locally on your phone
👀The impact of generative AI as a general-purpose technology
👨⚖️The racist AI deepfake that fooled and divided a community
Jensen Huang is now worth more than Intel
- Jensen Huang, CEO of Nvidia, has a net worth of $109.2 billion, surpassing Intel’s current market value of $96.39 billion, which saw a significant drop following revelations about its financial issues in August.
- Nvidia’s growth, driven by an AI boom and its dominance as a GPU accelerator manufacturer, helped its market cap soar, placing it among the top valued companies worldwide, though its stock has corrected by 10% since its peak.
- Huang’s significant stake in Nvidia, with holdings valued over $100 billion, and his strategic share sales have propelled him to the 11th position on Forbes’ real-time billionaires list, close to entering the top 10.
- Source: https://www.msn.com/en-gb/money/other/jensen-huang-is-now-worth-more-than-intel-personal-net-worth-currently-valued-at-109b-vs-intel-s-96b-market-cap/ar-AA1rMKD3
Fewer websites are blocking OpenAI’s web crawler
- OpenAI’s web crawlers are facing fewer blocks from major news websites compared to earlier, despite a widespread data-protection rush where publishers attempted to prevent their content from becoming AI training data without consent.
- The trend of blocking OpenAI’s GPTBot saw a decline after the company made a series of licensing agreements with publishers, leading some outlets to revise their robots.txt files and permit GPTBot access.
- Despite robots.txt not being legally binding, it remains a widely observed standard for web crawler behavior, and OpenAI recognizes the importance of not being blocked to safeguard its future goals and ambitions.
- Source: https://www.theverge.com/2024/10/7/24264184/fewer-websites-are-blocking-openais-web-crawler-now
🦾 Nvidia Unveils NVLM 1.0-A Bold Rival to ChatGPT in Generative AI
Advanced AI model NVLM 1.0 from Nvidia competes with ChatGPT and Gemini, doing better at jobs like vision-language and solving complex problems.
Source: https://theaiwired.com/nvidia-unveils-nvlm-1-0-a-bold-rival-to-chatgpt-in-generative-ai/
OpenAI and Altera create digital humans
OpenAI just published a case study on Altera, a startup using GPT-4o to develop AI agents called “digital humans” capable of prolonged, natural interactions with people — significantly outperforming other rivals during testing in Minecraft.
Altera, founded by ex-MIT professor Dr. Robert Yang, uses GPT-4o to power AI agents that can play Minecraft autonomously for up to 4 hours.
Altera’s system combines GPT-4o with a brain-inspired multi-module architecture to simulate cognitive functions and emotional processing.
OpenAI reports that Altera’s agents outperform other models in Minecraft tasks, collecting 32% of items compared to 6.4% for the next best model.
The startup plans to expand beyond gaming to create AI ‘coworkers’ and more complex multi-agent simulations.
We’ve constantly heard from Sam Altman and others that AI agents are coming fast — and case studies like this (as well as a cryptic ‘Level 3’ tweet from an OpenAI researcher) might mean the capabilities have already arrived. We might ascend the ‘Stages of AI’ ladder faster than most are anticipating.
AI identifies drug candidates for pain relief
Researchers at Cleveland Clinic and IBM just developed an AI model to predict how drugs and gut microbes interact with pain receptors, potentially uncovering new non-addictive pain treatments.
LISA-CPI analyzes both the molecular structure of compounds and the 3D shape of pain receptors to predict their interactions.
The model identified FDA-approved drugs, like methylergometrine, that could potentially be repurposed for pain treatment by targeting specific receptors.
LISA-CPI also discovered gut microbes that may interact with pain receptors in beneficial ways.
The approach could accelerate drug discovery for pain and other conditions by more accurately screening potential compounds.
The current opioid crisis highlights the urgent need for effective, non-addictive pain medications, and this AI-driven approach could help researchers more quickly identify promising drug candidates while also opening new avenues for pain management.
Meta unveils advanced AI video model
Meta just announced Movie Gen, a powerful new suite of AI models for generating and editing video and audio content, positioning itself as a direct competitor to OpenAI’s Sora and other industry leaders.
Movie Gen consists of four models: a 30B video generation model, a 13B audio model, a personalized video model, and a video editing model.
The system can generate HD videos up to 16 seconds long from text prompts, along with synchronized audio like sound effects and background music.
Movie Gen also features video editing via natural text prompts and the ability to upload a reference image to create personalized videos.
Meta claims the model outperforms rivals like Runway Gen3, Luma Labs, and OpenAI’s Sora in human video quality and consistency evaluations.
Meta CEO Mark Zuckerberg said that Movie Gen will be ‘coming to Instagram next year’ in a post displaying some of the model’s sample generations.
Meta’s Movie Gen separates itself from other video generators by not only generating videos from text, but also being able to perform precise video editing. With the models coming to Instagram, it could transform the content creation process and give the masses a powerful video editing suite—with only prompting required.
Run Llama 3.2 locally on your phone
Meta’s new Llama 3.2 3B model can run directly on your smartphone, allowing you to have AI conversations privately and offline.
Download PocketPal AI from the App Store.
Open the app, tap the top-left menu, and select “Models.”
Under “Llama,” download “llama-3.2-3b-instruct q4_k” (2.2 GB).
Once downloaded, tap “Load” to activate the model.
Return to the main menu, select “Chat,” and start conversing with AI!
Create a local knowledge base that can be queried alongside the model, allowing you to supplement the AI’s knowledge with custom, up-to-date information without requiring an internet connection.
Source: https://apps.apple.com/us/app/pocketpal-ai/id6502579498
👀The impact of generative AI as a general-purpose technology
Generative artificial intelligence will affect economic growth more quickly than other general-purpose technologies, according to a new report.
The steam engine, the internal combustion engine, electrification, and computers are all considered “general-purpose technologies” — new tools that are powerful enough to accelerate overall economic growth and transform economies and societies. According to many experts, generative artificial intelligence will be the next invention to join that category.
In a recent report about the economic impact of generative AI, Google visiting fellow and MIT Sloan principal research scientist Andrew McAfee makes the case that generative AI is not only a game-changing general-purpose technology but could also spur change far more quickly than preceding innovations due to its accessibility and ease of diffusion.
Source: https://mitsloan.mit.edu/ideas-made-to-matter/impact-generative-ai-a-general-purpose-technology
👨⚖️The racist AI deepfake that fooled and divided a community
When an audio clip appeared to show a local school principal making derogatory comments, it went viral online, sparked death threats against the educator and sent ripples through a suburb outside the city of Baltimore. But it was soon exposed as a fake, manipulated by artificial intelligence – so why do people still believe it’s real?
Source: https://www.bbc.com/news/articles/ckg9k5dv1zdo
What Else is Happening in AI on October 07th 2024!
Apple will reportedly release its Apple Intelligence features on Oct. 28 alongside the iOS 18.1 update, according to Bloomberg insider Mark Gurman.
Google began rolling out the new AI anti-theft features for Android devices showcased at Google I/O, including Theft Detection Lock, Offline Device Lock, and Remote Lock.
Source: https://lifehacker.com/tech/google-rolling-out-three-anti-theft-features-for-android
Cohere launched improved fine-tuning features for its Command R LLM, including longer context support and a ‘bring your own fine-tune’ option.
Source: https://cohere.com/blog/commandr-fine-tuning
AI startup Otherside AI’s Reflection 70B model failed to match performance claims in tests published by the team in a post-mortem of the release after being initially touted as the ‘world’s best open-source model.’
Source: https://the-decoder.com/worlds-best-open-source-model-falls-short-of-promised-performance/
North Carolina musician Michael Smith faces federal charges for allegedly using AI to generate thousands of songs and bots to stream them billions of times, netting over $10M in royalties.
Source: https://apnews.com/article/music-fraud-ai-arrest-4f09a714971f450fb3c9103c927cb091
Trending AI Tools
Machine Learning & AI For Dummies PRO
Ready to accelerate your career in the fast-growing fields of AI and machine learning? Our app offers user-friendly tutorials and interactive exercises designed to boost your skills and make you stand out to employers. Whether you’re aiming for a promotion or searching for a better job, AI & Machine Learning For Dummies PRO is your gateway to success. Start mastering the technologies shaping the future—download now and take the next step in your professional journey! iOS – Windows
Cheatlayer – Automate your business using natural language: https://cheatlayer.com/
Mindpal’s SalesBox – Build your own AI sales OS with multi-agent workflows: https://mindpal.space/
Trillion – Track expenses, manage accounts and set financial goals with AI planning: https://apps.apple.com/us/app/trillion-budget-management/id6504283874
BuyScout – Your AI copilot for online shopping: https://www.buyscout.app/
Selfletter – Break complex goals into simple tasks with AI: https://www.selfletter.com/
A Daily Chronicle of AI Innovations on October 04th 2024:
Apple releases AI model that rewrites the rules of 3D vision
🦾 Nvidia presents EdgeRunner. The method can generate high-quality 3D meshes with up to 4,000 faces at a spatial resolution of 512 from images and point-clouds.
Meta unveils an AI video generator
ChatGPT gets a collab boost with Canvas: its newest ChatGPT interface
Google launches one of its ‘most significant updates ever’
TikTok’s owner is scraping the web 25 times faster than OpenAI
Google rolls out ads in AI Overviews
Apple releases AI model that rewrites the rules of 3D vision
- Apple’s AI research team has unveiled Depth Pro, a new AI model that enhances machines’ depth perception using only a single 2D image, which could revolutionize fields like augmented reality and self-driving technology by offering real-time spatial awareness.
- Depth Pro generates high-resolution 3D depth maps in just 0.3 seconds without needing traditional camera data, employing advanced techniques like a multi-scale vision transformer to accurately define details such as individual hairs and the edges of objects.
- Open-sourced on GitHub, Depth Pro introduces metric depth estimation without extensive training on specific datasets, paving the way for widespread use in industries such as e-commerce, automotive, and healthcare, where sharp depth analysis is crucial.
🦾 Nvidia presents EdgeRunner. The method can generate high-quality 3D meshes with up to 4,000 faces at a spatial resolution of 512 from images and point-clouds.
Nvidia introduced EdgeRunner, an auto-regressive method capable of generating high-quality 3D meshes with up to 4,000 faces at a spatial resolution of 512. This approach efficiently processes images and point clouds, offering significant advancements in the field of 3D modeling.
Source: https://ar5iv.org/2409.18114
Meta unveils an AI video generator:
Meta’s new Sora competitor: Meta Movie Gen
- Meta has introduced Movie Gen, an AI-powered model for video creation and editing, allowing users to generate high-definition video with audio and make precise edits using simple text commands, catering to filmmakers, content creators, and creative individuals.
- Movie Gen offers personalization by combining uploaded images with descriptive text prompts to create customized videos, enhancing creative possibilities, and enabling scenarios ranging from fantasy realms to everyday adventures, while maintaining realistic human motion and identity.
- The suite also includes advanced audio generation, with the 13-billion parameter model adding ambient sounds and music to video scenes, all aimed at democratizing content creation by offering professional-grade tools with user-friendly functionality.
Generate videos from text Edit video with text
Produce personalized videos
Create sound effects and soundtracks
Paper: MovieGen: A Cast of Media Foundation Models
https://ai.meta.com/static-resource/movie-gen-research-paper
Source: AI at Meta on X: https://x.com/AIatMeta/status/1842188252541043075
Source: https://ai.meta.com/research/movie-gen/
Apple just released Depth Pro: Sharp Monocular Metric Depth in Less Than a Second
The paper presents a foundation model for zero-shot metric monocular depth estimation called Depth Pro. Depth Pro can produce high-resolution depth maps with sharp details and accurate object boundaries without requiring camera intrinsics like focal length. The superior performance of Depth Pro is attributed to its efficient multi-scale architecture, effective training curriculum, and dedicated boundary metrics. The model is able to accurately estimate depth and focal length in a zero-shot setting, enabling applications like view synthesis that require metric depth.
GitHub – https://github.com/apple/ml-depth-pro?tab=readme-ov-file
ChatGPT gets a collab boost with Canvas: its newest ChatGPT interface
OpenAI just launched Canvas, a new ChatGPT interface release that enables more collaborative writing and coding projects beyond simple chat interactions with new editing features, shortcuts, and added contextual knowledge.
Canvas opens in a separate window alongside the chat, allowing users to directly edit and refine specific aspects of an output.
New features include inline feedback, targeted editing, and shortcuts for tasks like adjusting text length, changing reading levels, or debugging code.
In tests, using GPT-4o with Canvas led to a 30% accuracy and 16% quality boost compared to using the model without the interface.
Canvas is rolling out in beta to Plus and Team users, with a broader release expected later.
ChatGPT’s first major UI change takes a leap towards more nuanced, moldable interactions — while also inheriting novice-friendly features seen in other rivals with easy-to-use shortcuts. The simple chatbox was a good first step for human-AI interactions, but more power and capabilities require new collaborative processes.
Google launches one of its ‘most significant updates ever’
- Google has integrated more AI features into its search functionalities, unveiling a range of updates such as AI-organized web results, enhanced Google Lens capabilities, and the incorporation of links and advertisements within AI Overviews.
- This AI-driven search initiative kicks off with food-related content, where Google’s AI creates a comprehensive experience by aggregating diverse perspectives from across the web, including videos and forums, tailored to user queries.
- Additional updates include the enhancement of AI Overviews with more prominent links to support website traffic, the integration of ads within these overviews, improved music identification features with Circle to Search, and significant upgrades to Google Lens for video, voice, and shopping inquiries.
- Source: https://www.maginative.com/article/meta-unveils-movie-gen-ai-powered-video-creation-and-editing-suite/
TikTok’s owner is scraping the web 25 times faster than OpenAI
- ByteDance, the parent company of TikTok, has launched a web scraper called Bytespider which is significantly outpacing similar tools by other companies in collecting online data for AI model training, operating at 25 times the speed of OpenAI’s GPTbot.
- Unlike other web crawlers, Bytespider ignores the robots.txt file that web publishers use to regulate scraping activity, highlighting its aggressive approach to gathering data from the internet, amidst concerns related to copyright issues within generative AI development.
- With the U.S. government pressuring ByteDance over national security issues, the rapid data collection by Bytespider seems to indicate ByteDance’s urgency in enhancing TikTok’s search functionality and possibly developing a new large language model to rival existing competitors.
- Source: https://fortune.com/2024/10/03/bytedance-tiktok-bytespider-scraper-bot/
Google rolls out ads in AI Overviews
Google just announced the introduction of ads to its AI Overview search summaries and the launch of several new AI-powered search capabilities, such as video understanding and voice input.
Ads will now appear within and alongside AI Overviews for ‘relevant queries’ on searches in the United States.
The redesigned AI Overview format will now add prominent in-text links to better source websites for the curated information.
New AI-organized search results pages are rolling out that surface relevant, more diverse content — starting with recipe and meal inspiration queries.
Google Lens is getting video understanding capabilities and voice input options for visual searches.
The Android ‘Circle to Search’ feature also lets users identify songs playing in videos or streaming content.
Google’s first AI Overview experience didn’t exactly go as planned. However, with heavy competition from Perplexity and chatbot rivals, Google’s search future clearly has AI at its core, regardless of the bumps along the way. But infusing paid ads into AI Overviews could be a slippery slope – will Gemini be next?
Source: https://www.theverge.com/2024/10/3/24260637/googles-ai-overview-ads-launch
What Else is Happening in AI on October 04th 2024!
Google DeepMind hires key OpenAI Sora researcher Tim Brook for ‘world simulator’ project.
Google released Gemini 1.5 Flash 8B, a lightweight, cost-effective variation with a 50% cost reduction and 2x higher rate limits than 1.5 Flash.
Fourier launched GR-2, the company’s second-generation humanoid robot, which features improvements to battery life, hand dexterity, mobility, and a new developer kit.
Source: https://finance.yahoo.com/news/fourier-unveils-next-generation-humanoid-123000642.html
OpenAI also secured a massive credit line. Source: https://techcrunch.com/2024/10/03/openai-also-secured-a-massive-credit-line/
Google’s AI can detect tuberculosis just by analyzing cough sound.
Source: https://www.newsbytesapp.com/news/science/google-ai-uses-cough-sound-to-diagnose-tuberculosis/story
OpenAI CFO Sarah Friar says their next AI model will be an order of magnitude bigger than GPT-4 and future models will grow at a similar rate, requiring capital-intensive investment to meet their “really big aspirations”
Trending AI Tools on October 04th 2024
Buzzabout – AI-driven insights from billions of discussions on social media: https://buzzabout.ai/
Base AI – Build serverless, autonomous AI agents with memory: https://baseai.dev/
CostGPT – Estimate costs and time for your software project in less than 5 minutes: https://costgpt.ai/
Lookie AI – Consume, organize, and manage knowledge from YouTube: https://apps.apple.com/kr/app/lookie-ai/id6670471730?l=en-GB
Tackle AI – Automatic time tracking to align everyday actions with key priorities: https://www.timetackle.com/
A Daily Chronicle of AI Innovations on October 03rd 2024:
Meta smart glasses can be used to dox anyone in seconds
OpenAI is now valued at $157 billion
Nvidia stunned the world with a ChatGPT rival that’s as good as GPT-4o
Microsoft to employees: you can continue working from home unless productivity drops
Google developing reasoning AI to rival OpenAI
Meta smart glasses can be used to dox anyone in seconds
- Harvard students demonstrated how Meta’s smart glasses combined with facial recognition technology can dox individuals by revealing personal details like identities and phone numbers, using tools like I-XRAY and public databases in real-time.
- The demo used existing technologies such as Meta’s Ray-Ban smart glasses and the PimEyes search engine, showing how a simple photo capture can quickly connect to public data, including names and addresses, raising privacy concerns.
- Meta has privacy guidelines for its smart glasses, but the tiny notification light is hard to detect in bright light, leading to potential misuse despite the company warning users to respect others’ privacy and follow recording etiquette.
- Source: https://www.theverge.com/2024/10/2/24260262/ray-ban-meta-smart-glasses-doxxing-privacy
OpenAI is now valued at $157 billion
- OpenAI has raised $6.6 billion in a new funding round, which has nearly doubled its valuation to $157 billion from a previous $86 billion, as reported by The Wall Street Journal.
- The latest financing requires OpenAI to shift from its nonprofit model to a fully for-profit company, or investors have the right to retract their investments.
- Major contributors to this funding round include Thrive Capital with a $1.25 billion investment and long-time supporter Microsoft, which added just under $1 billion more, with new investors like SoftBank and Nvidia also participating.
- Source: https://arstechnica.com/ai/2024/10/openai-is-now-valued-at-157-billion/
Nvidia stunned the world with a ChatGPT rival that’s as good as GPT-4o
- In early October 2024, Nvidia surprised the AI community by unveiling NVLM 1.0, a series of advanced multimodal language models with capabilities matching those of the GPT-4o model from ChatGPT.
- Instead of releasing a direct competitor to consumer-facing AI applications like ChatGPT or Claude, Nvidia is opting to allow others to create their own AI solutions by making the model weights of NVLM publicly accessible.
- Nvidia, previously renowned for supplying essential chips for AI processes, is now demonstrating its prowess in generative AI through its innovative approach to sharing AI technology development resources.
- Source: https://bgr.com/tech/nvidia-stunned-the-world-with-a-chatgpt-rival-thats-as-good-as-gpt-4o/
Microsoft to employees: you can continue working from home unless productivity drops
- Microsoft has decided to allow employees to continue working from home, maintaining flexibility as long as it does not affect productivity, contrasting with companies like Amazon that have mandated a return to the office.
- Scott Guthrie, Microsoft Executive Vice President, assured workers in a meeting that the company values flexible working arrangements, though productivity must remain steady to keep the remote work model viable.
- The remote work setup is considered beneficial for both employees and Microsoft, though the company remains cautious about the risks, such as decreased productivity and potential misuse of work hours for personal activities.
- Source: https://www.techspot.com/news/104972-microsoft-assures-employees-they-can-continue-working-home.html
Google developing reasoning AI to rival OpenAI
Google is reportedly making significant strides in developing AI models with advanced reasoning capabilities similar to OpenAI’s o1 system, intensifying the rivalry between the two AI giants.
Multiple teams at Google are working on AI that can solve complex, multi-step problems, according to Bloomberg.
The AI uses chain-of-thought prompting, a technique created by Google, to tackle complex math and programming problems by ‘thinking’ before responding.
Google is taking a more cautious approach to its releases than OpenAI but has already debuted math-focused reasoning models like AlphaProof and AlphaGeometry 2.
Microsoft also infused reasoning capabilities into its Copilot assistant this week, leveraging OpenAI’s o1 model.
Human-like reasoning and agentic capabilities are clearly the two major developments on every AI firm’s roadmap, and the release of o1 may have signaled a new phase in the LLM race. The question is — will OpenAI’s speed keep it a step ahead, or is the competition for top-tier models about to get a whole lot tougher?
Source: https://qz.com/google-reasoning-ai-model-compete-openai-chatgpt-gemini-1851663139
What Else is Happening in AI on October 03rd 2024!
The Cancer AI Alliance formed a $40M collaboration between major medical institutions and tech giants like Microsoft, AWS, Nvidia, and Deloitte to advance AI-driven cancer care.
Character AI is reportedly shifting its focus away from building AI models in the wake of its $2.7B deal with Google and prioritizing its consumer chatbot service.
Elon Musk posted ‘OpenAI is evil’ on X in response to reports that the AI giant asked investors to avoid funding competing AI firms like Anthropic and Musk’s xAI.
Source: https://www.yahoo.com/tech/elon-musk-called-openai-evil-030055401.html
Accenture announced a new partnership with NVIDIA to accelerate enterprise AI adoption, launching a business group and AI Refinery platform to scale agentic AI systems across industries.
Source: https://newsroom.accenture.com/news/2024/accenture-and-nvidia-lead-enterprises-into-era-of-ai
New ChatGPT feature: GPT-4o with Canvas.
Latest AI Tools October 03rd 2024
WALDO: a detection AI model designed to identify specific objects, such as vehicles and utility poles, in overhead images from various altitudes, useful for tasks requiring object recognition in large-scale imagery.
Source: https://github.com/stephansturges/WALDO
Kameo: a Rust library for creating fault-tolerant, distributed, and asynchronous actors using Tokio, facilitating seamless communication across nodes with features like scalability, backpressure handling, and panic recovery.
Source: https://github.com/tqwewe/kameo
TinyJS: a lightweight JavaScript library that simplifies the creation of HTML elements, property assignment, and DOM element selection with unique $ and $$ shortcuts, enhancing web development efficiency.
Source: https://github.com/victorqribeiro/TinyJS
QBittorrent: an open-source BitTorrent client designed to be a lightweight alternative to other clients, offering ad-free usage, stability, and a variety of features.
Source: https://github.com/qbittorrent/qBittorrent
Serving 70B-Scale LLMs Efficiently on Low-Resource Edge Devices: the paper discusses methods for running large language models (LLMs) efficiently on devices with limited resources.
Source: https://arxiv.org/abs/2410.00531
A Daily Chronicle of AI Innovations on October 02nd 2024:
🧠Google is Working on Reasoning AI – Bloomberg News
💰’SoftBank Shares Surge as CEO Pushes AI Superintelligence Vision With OpenAI
OpenAI makes 4 major announcements at DevDay
Microsoft Copilot gets voice, vision upgrade
🤖 Google develops new AI model to rival OpenAI o1
👀 OpenAI co-founder joins rival Anthropic
OpenAI makes 4 major announcements at DevDay
Here’s a link to the announcement: https://openai.com/devday/
OpenAI’s recent DevDay conference took a different approach from last year’s event, focusing on incremental improvements rather than major product launches. The company introduced four key innovations: Vision Fine-Tuning, Realtime API, Model Distillation, and Prompt Caching, all aimed at empowering developers and enhancing the AI ecosystem.
Prompt Caching: This feature reduces costs and latency for developers by applying a 50% discount on input tokens that the model has recently processed, potentially leading to significant savings.
Vision Fine-Tuning: This allows developers to customize GPT-4o’s visual understanding capabilities using both images and text, with applications in fields like autonomous vehicles and medical imaging. For example, Grab improved its mapping services using this technology.
Realtime API: Now in public beta, this API enables low-latency, multimodal experiences, particularly in speech-to-speech applications. It allows for natural conversation and mid-sentence interruptions, opening up possibilities for voice-enabled applications in various industries.
Model Distillation: This workflow allows developers to use outputs from advanced models to improve the performance of more efficient models, making sophisticated AI capabilities more accessible and cost-effective.
OpenAI’s strategic shift towards ecosystem development over headline-grabbing product launches reflects a mature understanding of the AI industry’s current challenges and opportunities. By focusing on refining tools and reducing costs, OpenAI aims to foster a thriving developer ecosystem and ensure sustainable AI adoption across various industries.
Realtime API enables speech-to-speech application building using the same model that powers Advanced Voice, with the ability to choose from six voices. “Until right now, voice has been a second activity“, and that the Realtime API is going to make AI significantly more accessible because many people in the real world prefer to speak over reading or texting. Realtime API will have a “no-brainer” impact on customer support, education, and coaching. He also believes there will be many ‘non-obvious‘ use cases that are hard to predict now. For now, Realtime API only supports text and audio. However, Godement believes that image and video are the next milestones on the road to agents that can perceive the world just like a human. He also mentioned that image and video understanding specifically, will “turbocharge customer support” when the model has the ability to understand pixels on a screen in real-time. https://openai.com/index/introducing-the-realtime-api/
Model Distillation simplifies fine-tuning smaller models using outputs from larger ones, making training more accessible to developers. https://openai.com/index/api-model-distillation/
Prompt Caching reduces costs by nearly 50% across models and speeds up responses by up to 80% when reusing recent input tokens in API calls. https://openai.com/index/api-prompt-caching/
New prompt generator on https://playground.openai.com
Access to the o1 model is expanded to developers on usage tier 3, and rate limits are increased (to the same limits as GPT-4o)
Microsoft Copilot gets voice, vision upgrade
Microsoft just announced a slew of AI upgrades coming to its Copilot assistant for Windows PCs, including new vision and voice capabilities, personalization enhancements, a re-release of the controversial Recall feature, and more.
Copilot Voice allows users to interact with natural speech, adding conversational and intuitive communication similar to OpenAI’s Voice Mode.
Copilot Vision enables the AI to understand and interact with web content a user is viewing, offering context-aware help within the Microsoft Edge browser.
‘Think Deeper’ gives Copilot new enhanced reasoning capabilities using chain-of-thought reasoning powered by OpenAI’s o1 model.
Microsoft’s ‘Recall’ feature is set to return, requiring an opt-in with upgraded privacy and security measures.
Microsoft AI CEO Mustafa Suleyman highlighted Copilot’s ability to ultimately ‘act on your behalf’ and adapt to user’s personal preferences and needs.
Microsoft is bringing the heat with these major Copilot upgrades, levelling up the assistant to align with the latest cutting-edge AI features across the industry — while bringing users one step closer to a truly agentic experience.
🧠Google is Working on Reasoning AI – Bloomberg News
Google is working on artificial intelligence software that resembles the human ability to reason, similar to OpenAI’s o1, marking a new front in the rivalry between the tech giant and the fast-growing startup.
In recent months, multiple teams at Alphabet Inc.’s Google have been making progress on AI reasoning software, according to people with knowledge of the matter, who asked not to be identified because the information is private.
AI researchers are pursuing reasoning models as they search for the next significant step forward in the technology. Like OpenAI, Google is trying to approximate human reasoning using a technique known as chain-of-thought prompting, according to two of the people. In this technique, which Google pioneered, the software pauses for a matter of seconds before responding to a written prompt while, behind the scenes and invisible to the user, it considers a number of related prompts and then summarizes what appears to be the best response.
Since OpenAI unveiled its o1 model, known internally as Strawberry, in mid-September, some in DeepMind have fretted that the company had fallen behind, according to another person with knowledge of the matter. But employees are no longer as concerned as they were following the launch of ChatGPT, now that Google has debuted some of its own work, the person said. In July, Google showcased AlphaProof, which specializes in math reasoning, and AlphaGeometry 2, an updated version of a model focused on geometry that the company debuted earlier this year.
💰SoftBank Shares Surge as CEO Pushes AI Superintelligence Vision With OpenAI, who previously claimed that creating ASI was his “life’s purpose”
What Else is Happening in AI on October 02nd 2024!
OpenAI founding member Durk Kingma announced that he is joining Anthropic, reuniting with several former OpenAI employees and highlighting the company’s mission of responsible AI development in his X post.
Pika Labs unveiled Pika 1.5, a new video generation model upgrade featuring enhanced effects, realistic movement, longer clip creation, and cinematic capabilities.
Anyscale unveiled major upgrades to its AI platform at Ray Summit 2024, including a GPU-native Ray architecture, RayTurbo for enhanced performance, Ray Data for unstructured data processing, and more.
U.S. AI chipmaker Cerebras officially filed for an IPO, with the Sam Altman-backed Nvidia competitor expected to be valued at between $7-8B.
Meta released the open-source code and developer suite for its Segment Anything Model (SAM) 2.1, an upgraded version of its image and video segmentation tool.
Nvidia introduced NVLM 1.0, an open-source family of multimodal models that achieve SOTA performance on vision-language and text tasks.
Pinterest launched Performance+, a suite of new AI tools for advertisers that includes the ability to create background images for products and automation features for ad campaigns.
NotebookLM is too good
You can upload multiple books, hours long videos and audios into that thing and it processes everything so well. It’s so good at resuming, finding specific quotes, answering questions, explaining some stuff and the podcast feature too is mindblowing. It can even do the same for videos, texts and audios in foreign languages and translate, explain and resume it in order for you to understand. And it’s not super censored too. Can’t believe this thing is actually free and i’m just finding about it now.
A basic systems architecture for AI agents that do autonomous research
Source: https://www.lesswrong.com/posts/6cWgaaxWqGYwJs3vj/a-basic-systems-architecture-for-ai-agents-that-do
OpenAI has released Whisper V3 Turbo model yesterday. The turbo model is an optimized version of large-v3 that offers 8x faster transcription speed with minimal degradation in accuracy
Source: https://huggingface.co/spaces/hf-audio/whisper-large-v3-turbo
Harvard students Build and show off AR glasses project that uses face detection, internet sleuthing, and AI to give you near instant dossiers (address, family info, name, etc) on people you see. Good proof of concept to raise awareness on what we may see in the future.
Source: https://x.com/AnhPhuNguyen1/status/1840786336992682409
https://x.com/i/status/1840786336992682409
Trending AI Tools on October 02nd 2024
Video SDK 3.0 – Build and integrate real-time multimodal AI characters: https://github.com/Xilinx/video-sdk/discussions/81
Inbox Zero – An open-source, AI personal assistant for email: https://www.getinboxzero.com/ai-automation
Graphite – Your AI code review companion: https://graphite.dev/blog/graphite-reviewer-launch
Ello – An AI reading companion for children offering personalized support: https://www.ello.com/
VivaChat – FaceTime video chat with realistic AI personas: https://www.vivalabs.ai/
A Daily Chronicle of AI Innovations on October 01st 2024:
Microsoft gives Copilot a voice and vision
Chromebooks are getting a dedicated AI key
Microsoft is discontinuing its HoloLens headsets
Y Combinator faces backlash after funding an AI startup that admits it basically cloned another AI startup
California’s controversial AI safety bill vetoed
OpenAI secures SoftBank funding as Apple exits raise
Liquid AI unveils efficient new LFM models
Microsoft gives Copilot a voice and vision
- Microsoft has unveiled a major overhaul to its Copilot experience, adding both voice and vision capabilities, transforming it into a more personalized AI assistant similar to OpenAI’s Advanced Voice Mode.
- The redesign features a new card-based user interface inspired by Inflection AI’s Pi assistant, and Copilot now offers a virtual news presenter mode, tailored homepage and improved customization based on user interaction history.
- Initial releases of Copilot Voice and Copilot Daily will be available in select regions, while Copilot Vision features are in a limited preview phase, focusing on enhancing user safety and privacy through restricted website interactions.
- Source: https://www.theverge.com/2024/10/1/24259187/microsoft-copilot-redesign-vision-voice-features-inflection-ai
Chromebooks are getting a dedicated AI key
- Chromebooks are getting a new keyboard layout with a “quick access” key for AI and other functions, providing easy access to features like text generation, emojis, and searching Google Drive.
- The first Chromebooks to feature this new key are the Samsung Galaxy Chromebook Plus, which will replace the Launcher Key with the new Quick Insert key.
- Although the new AI features will initially lack AI image generation, Google plans to add this and other AI capabilities, including real-time translation and transcription, to Chromebooks in October.
- Source: https://gizmodo.com/chromebooks-are-getting-a-dedicated-ai-key-but-you-wont-use-it-for-ai-2000505155
Microsoft is discontinuing its HoloLens headsets
- Microsoft has ceased production of its HoloLens 2 headsets and has no confirmed plans for a successor, although updates addressing security and software issues are promised until the end of 2027.
- Former HoloLens head, Alex Kipman, left the company in 2022 amid misconduct allegations, and the hardware team faced significant layoffs in January 2023, impacting the development of the augmented reality devices.
- Microsoft has partnered with Anduril Industries to enhance its IVAS mixed-reality headsets for the US Army, which plans to invest up to $21.9 billion over the next decade in this project.
Source: https://www.theverge.com/2024/10/1/24259369/microsoft-hololens-2-discontinuation-support
Y Combinator faces backlash after funding an AI startup that admits it basically cloned another AI startup
- Y Combinator is facing criticism after backing an AI startup, PearAI, which admitted to cloning another AI coding editor called Continue and initially using a misleading license.
- PearAI’s founder Duke Pan publicly apologized, revealing that the project has switched to the same open-source Apache license as the original Continue project after the controversy erupted.
- The incident has raised questions about Y Combinator’s vetting process and has led to broader scrutiny of venture capitalists’ eagerness to fund AI startups without thorough oversight.
- Source: https://techcrunch.com/2024/09/30/y-combinator-is-being-criticized-after-it-backed-an-ai-startup-that-admits-it-basically-cloned-another-ai-startup/
California’s controversial AI safety bill vetoed
California Governor Gavin Newsom just vetoed S.B. 1047, a groundbreaking AI safety bill that would have imposed stricter regulations on Silicon Valley AI firms and the release of new models in the state.
The bill would have required safety testing for AI models before their public release and held AI companies liable for any ‘severe harm’ (over $500M in damages) caused.
Tech giants, including OpenAI and Google, VCs, and politicians like Nancy Pelosi lobbied heavily against the bill, arguing it would stifle innovation.
The bill had notable support from Elon Musk, Anthropic, the ‘Godfather of AI’ Geoffrey Hinton, and over 120 Hollywood actors, directors, and workers.
Newsom said the bill was ‘well-intentioned’ but flawed, vowing to consult with AI experts to craft guardrails for future legislation efforts.
As the U.S. federal government continues to lag in AI regulation, states are stepping up to fill the void. While S.B. 1047 is shelved for now, the debate over AI governance is far from settled—and will likely continue to pit AI safety advocates against those pushing for rapid development throughout Silicon Valley.
Source: https://www.politico.com/news/2024/09/29/gavin-veto-ai-safety-bill-00181583
OpenAI secures SoftBank funding as Apple exits raise
Despite Apple reportedly no longer participating in OpenAI’s upcoming funding round, the AI giant has secured billions of dollars from Japanese investment giant Softbank, Microsoft, and Thrive Capital.
OpenAI is rumored to be raising up to $6.5B via convertible notes, at an eye-popping $150B valuation.
Microsoft plans to participate with an additional $1B, adding to its previous $13B investment in the AI giant.
Investment firm Thrive Capital is also investing $1B, with a reported option to add an additional $1B the following year based on revenue goals.
The Wall Street Journal reported that Apple is no longer involved in the funding round, despite partnerships with OpenAI and its inclusion in Apple Intelligence.
The raise comes amid OpenAI’s controversial restructuring to a for-profit entity, with Sam Altman denying rumors that he will receive equity in the move.
OpenAI’s latest raise and for-profit turn is another saga in its convoluted and controversial business structure. Despite the recent high-profile departures and continued drama, the ChatGPT maker is still clearly seen as a top horse to bet on in the AI boom—and there is no shortage of major players who want in.
Source: https://www.theinformation.com/articles/softbank-to-invest-500-million-in-openai
Liquid AI unveils efficient new LFM models
Liquid AI just introduced a new series of AI models called Liquid Foundation Models (LFMs), challenging the traditional transformer architecture while achieving state-of-the-art performance and enhanced memory efficiency at smaller model sizes.
The company released its LFMs in 1.3B, 3B, and 40B parameter sizes, based on a new architecture utilizing computational units rooted in dynamical systems rather than traditional transformers.
The models surpass transformer-based counterparts like Meta’s Llama 3.2 and Microsoft’s Phi-3.5 on major benchmarks like MMLU.
LFMs require significantly less memory for inference, particularly with long-context tasks — supporting up to 32k tokens while maintaining memory efficiency.
The models are not open-source and are only currently available via the company’s Lambda (Chat UI and API) and on Perplexity AI.
Liquid AI’s LFMs are a significant shakeup from the transformer architecture standard that has dominated models since 2017. The benchmarks show that there is more than one formula for achieving state-of-the-art AI performance—and could open new possibilities for more efficient and accessible AI systems.
Source: https://www.liquid.ai/liquid-foundation-models
What Else is Happening in AI on October 01st 2024!
Google agreed to invest $1B into Thailand to expand AI and cloud infrastructure in Southeast Asia, aiming to build new data centers amid increasing regional competition.
Source: https://www.cnbc.com/2024/09/30/google-to-invest-1-billion-in-thailand-data-center-and-ai-push.html
TikTok parent company ByteDance is reportedly planning to develop a new AI model primarily using Huawei chips, diversifying from U.S. suppliers like Nvidia to counteract export restrictions.
Source: https://www.reuters.com/technology/artificial-intelligence/bytedance-plans-new-ai-model-trained-with-huawei-chips-sources-say-2024-09-30
Artisan AI secured $7.3M in seed funding for its sales-focused AI virtual employees, with its first AI assistant Ava already assisting over 120 companies on the platform.
Source: https://www.artisan.co/blog/artisan-raises-7-3-seed-round
Luma Labs upgraded its Dream Machine AI video model speed, allowing for full-quality generations in under 20 seconds.
Source: https://x.com/LumaLabsAI/status/1840820602296320083
Qodo announced a $40M funding round for its AI-powered code testing software, with plans to expand services and target larger enterprise clients.
Source: https://www.bloomberg.com/news/articles/2024-09-30/ai-code-checker-qodo-raises-40-million-to-serve-bigger-clients
AI reading coach startup Ello launched ‘Storytime’, a new feature allowing kids to create personalized stories using AI.
Source: https://techcrunch.com/2024/09/30/ai-reading-coach-startup-ello-launches-custom-story-creation-feature-for-kids
Trending AI Tools on October 01st 2024
Udio Lyric Editor – Create and refine song lyrics based on melody: https://www.udio.com/
Expression Editor – Easily edit facial expressions: https://huggingface.co/spaces/fffiloni/expression-editor
PandaETL – Automate document processes with AI and data: https://panda-etl.ai/
Gaia – Train and deploy neural machine translation models: https://gaia-ml.com/
Lumona – AI search engine leveraging social media insights: https://www.lumona.ai/
Read Aloud For Me: AI Dashboard – AI Tools Recommender – Safe AI
Welcome to Read Aloud For Me, the pioneering AI dashboard designed for the whole family! Our platform is the first of its kind, uniquely crafted to cater not only to adults but also to kids.
iOs: https://apps.apple.com/ca/app/read-aloud-for-me-ai-dashboard/id1598647453
Web/Android/PWA: https://readaloudforme.com
AI Innovations in September 2024
- What will be AI's killer app?by /u/G4M35 (Artificial Intelligence Gateway) on December 14, 2024 at 12:53 am
If you understand how Technology Innovation works (Clayton Christensen's way), you know that AI per se is not a disruptive technology but an enabling technology. What is going to happen is that some brilliant entrepreneur will use it in an unexpected way creating something that didn't exist before that will change the world as we know it, the killer app (app as in application of the tech, not necessarily software. Could be hardware too). I have been trying to come up with something for the past 2 years, and I can't. I am not seeing anything out there either; although advanced voice mode comes close. So, do you have any theories, suspicions, directions of what the kille app will be? TIA submitted by /u/G4M35 [link] [comments]
- One-Minute Daily AI News 12/13/2024by /u/Excellent-Target-847 (Artificial Intelligence Gateway) on December 14, 2024 at 12:43 am
UnitedHealth’s Optum left an AI chatbot, used by employees to ask questions about claims, exposed to the internet.[1] The BBC is complaining after Apple Intelligence rewrote one of its headlines to falsely claim the UnitedHealthcare suspect shot himself.[2] AI continues to reshuffle power and energy markets with even oil giants like Exxon Mobil getting into the mix.[3] OpenAI’s legal battle with Elon Musk reveals internal turmoil over avoiding AI ‘dictatorship’.[4] Sources included at: https://bushaicave.com/2024/12/13/12-13-2024/ submitted by /u/Excellent-Target-847 [link] [comments]
- One-Minute Daily AI News 12/13/2024by /u/Excellent-Target-847 (Artificial Intelligence (AI)) on December 14, 2024 at 12:43 am
UnitedHealth’s Optum left an AI chatbot, used by employees to ask questions about claims, exposed to the internet.[1] The BBC is complaining after Apple Intelligence rewrote one of its headlines to falsely claim the UnitedHealthcare suspect shot himself.[2] AI continues to reshuffle power and energy markets with even oil giants like Exxon Mobil getting into the mix.[3] OpenAI’s legal battle with Elon Musk reveals internal turmoil over avoiding AI ‘dictatorship’.[4] Sources: [1] https://techcrunch.com/2024/12/13/unitedhealthcares-optum-left-an-ai-chatbot-used-by-employees-to-ask-questions-about-claims-exposed-to-the-internet/ [2] https://www.theverge.com/2024/12/13/24320689/apple-intelligence-summary-bbc-news-unitedhealthcare-luigi-mangione [3] https://techcrunch.com/2024/12/13/exxon-cant-resist-the-ai-power-gold-rush/ [4] https://abcnews.go.com/US/wireStory/openais-legal-battle-elon-musk-reveals-internal-turmoil-116776795 submitted by /u/Excellent-Target-847 [link] [comments]
- Codegpt or Bolt?by /u/VaguePenguin (Artificial Intelligence Gateway) on December 14, 2024 at 12:27 am
I've googled and I've searched on here and I can't find opinions between the two, I only find year old threads about one of them. I've used both and I love both. Using codegpt has me learning more as I'm building but I feel it's not as advanced. Bolt on the other hand writes everything and seems way more advanced but it's always causing problems that it can't solve and when I manually write the code, it seems that it's still stuck. What are your opinions on both and why do you choose that one? They both have great pros but they both have bad cons. Bolt always can't solve its navigation or npm installations which makes me not want to use it anymore. Is anyone else having that issue? I know it's just bad simple misspelling or incorrect file name but when I fix it, it still doesn't work. submitted by /u/VaguePenguin [link] [comments]
- What other cool or just fun AI tools I may not heard of?by /u/almozayaf (Artificial Intelligence (AI)) on December 13, 2024 at 11:36 pm
In 2024 I got myself into so many AI tools that did amazing things Images generators So many to list Music generators like SUNO.AI Chats bots like JanitorAI And RPG writing stories like AI dungeon But I want to know if there other tools I may missed, what else out there? submitted by /u/almozayaf [link] [comments]
- AI Recommendations for Editing Word Documents While Maintaining Formattingby /u/JustAGoodDude (Artificial Intelligence Gateway) on December 13, 2024 at 9:36 pm
Hi everyone, I’m looking for an AI tool or solution that can help with editing Word documents. The document is already nicely formatted, with specific fonts, colours, and some images. These stay the same in every document. The content changes depending on the customer, with variables like the customer’s name, the amount of money, and a short recommendation. From a purely text point of view, writing the content itself is straightforward since it’s not overly complicated and ChatGPT can already write what I need almost prefectly. But my challenge is finding an AI that can handle the content changes while keeping the existing formatting intact. My ideal solution is -Use AI to write the text specific for the new customer (achieved) -Copy and paste this somewhere and have it merged into the Word document where all the specific formatting is retained. Does anyone have experience using AI for this purpose? Any recommendations or tips would be greatly appreciated! submitted by /u/JustAGoodDude [link] [comments]
- Are people forgetting that AI and LLMs are not one and the same?by /u/Murky-Motor9856 (Artificial Intelligence Gateway) on December 13, 2024 at 9:23 pm
Why aren't people freaking out over other types of generative AI, image and speech recognition models, or the sort of "AI" (that is probably based on gradient boosting instead of a neural network) companies like UHC used to deny claims? Is it because the output isn't language that humans find relatable, and therefore they aren't compelled to anthropomorphize it, or because marketing has effectively obscured how radically different the things we call AI are? In a way, it reminds me of the shitstorm of hype surrounding blockchain and cryptocurrency. Both technologies have sparked immense interest and investment, driven by their potential to revolutionize various industries. However, this fervor often emphasizes flashy, high-profile applications, and when people started getting disillusioned with them they sort of threw the baby out with the bathwater. Instead of being skeptical of the ways blockchain was used and oversold, for example, they're instantly skeptical of blockchain because they associate it with those uses (and the negative press they eventually garnered). One concern I have is that the general public's singular focus on a subset of AI will derail broader efforts much like it did in the past when expert systems failed to live up to hype surrounding them. What we have now is in a completely different realm of capability, to be sure, but the hype surrounding it is also on an entirely different level. submitted by /u/Murky-Motor9856 [link] [comments]
- Can AGI Be Safe if Trained on Political Disinformation?by /u/FluidMeasurement8494 (Artificial Intelligence Gateway) on December 13, 2024 at 9:10 pm
How can we develop a non-threatening AGI if it is likely to be trained on disinformation, particularly in the realm of internal and external politics? Wouldn't it be a flawed and dangerous tool ? The fundamental concern here is that AGI, like any AI, learns from the data it is trained on. If that data is biased, manipulative, or outright false, the AGI could inherit those flaws, potentially amplifying them in ways that are difficult to control. If an AGI is exposed to disinformation - whether in the form of political propaganda, fake news, or manipulated narratives - it may learn to perpetuate or even amplify these falsehoods. This could lead to the spread of harmful ideologies or decisions based on inaccurate information, both in political contexts and beyond. submitted by /u/FluidMeasurement8494 [link] [comments]
- Google VideoFX blows sora out of the water.by /u/noblepups (Artificial Intelligence (AI)) on December 13, 2024 at 8:27 pm
submitted by /u/noblepups [link] [comments]
- Looking for a voice cloner that allows you to adjust the voice qualities/traits/characteristics of the results.by /u/CukeJr (Artificial Intelligence Gateway) on December 13, 2024 at 8:18 pm
I've tried Elevenlabs and Character AI so far and neither of those seem to have such a thing. I've done some googling and glanced at a few others (without signing up and trying), no such luck. Any suggestions? Do any apps like this even exist? To elaborate a bit on my intent: I'm trying to create a voice for an original character (OC) I have. I know exactly what they sound like in my head, and I've come across some voices (namely, a vocalist and an existing game character) that sound pretty similar. I've been using these voices as references for how I imagine my OC to sound (I think it's often called a "voice claim" lol). They don't quite hit the mark though, so it would be perfect if I could just play around with either of them to bring them closer to my character's voice. So basically, I'm looking for an app that will enable me to upload either of these voice samples, create a clone of them, and then adjust vocal attributes of the voice like depth, tone, nasality, timbre, etc. Thank you in advance! submitted by /u/CukeJr [link] [comments]
Active Hydrating Toner, Anti-Aging Replenishing Advanced Face Moisturizer, with Vitamins A, C, E & Natural Botanicals to Promote Skin Balance & Collagen Production, 6.7 Fl Oz
Age Defying 0.3% Retinol Serum, Anti-Aging Dark Spot Remover for Face, Fine Lines & Wrinkle Pore Minimizer, with Vitamin E & Natural Botanicals
Firming Moisturizer, Advanced Hydrating Facial Replenishing Cream, with Hyaluronic Acid, Resveratrol & Natural Botanicals to Restore Skin's Strength, Radiance, and Resilience, 1.75 Oz
Skin Stem Cell Serum
Smartphone 101 - Pick a smartphone for me - android or iOS - Apple iPhone or Samsung Galaxy or Huawei or Xaomi or Google Pixel
Can AI Really Predict Lottery Results? We Asked an Expert.
Djamgatech
Read Photos and PDFs Aloud for me iOS
Read Photos and PDFs Aloud for me android
Read Photos and PDFs Aloud For me Windows 10/11
Read Photos and PDFs Aloud For Amazon
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: 96DRHDRA9J7GTN6(Email us for more)
FREE 10000+ Quiz Trivia and and Brain Teasers for All Topics including Cloud Computing, General Knowledge, History, Television, Music, Art, Science, Movies, Films, US History, Soccer Football, World Cup, Data Science, Machine Learning, Geography, etc....
List of Freely available programming books - What is the single most influential book every Programmers should read
- Bjarne Stroustrup - The C++ Programming Language
- Brian W. Kernighan, Rob Pike - The Practice of Programming
- Donald Knuth - The Art of Computer Programming
- Ellen Ullman - Close to the Machine
- Ellis Horowitz - Fundamentals of Computer Algorithms
- Eric Raymond - The Art of Unix Programming
- Gerald M. Weinberg - The Psychology of Computer Programming
- James Gosling - The Java Programming Language
- Joel Spolsky - The Best Software Writing I
- Keith Curtis - After the Software Wars
- Richard M. Stallman - Free Software, Free Society
- Richard P. Gabriel - Patterns of Software
- Richard P. Gabriel - Innovation Happens Elsewhere
- Code Complete (2nd edition) by Steve McConnell
- The Pragmatic Programmer
- Structure and Interpretation of Computer Programs
- The C Programming Language by Kernighan and Ritchie
- Introduction to Algorithms by Cormen, Leiserson, Rivest & Stein
- Design Patterns by the Gang of Four
- Refactoring: Improving the Design of Existing Code
- The Mythical Man Month
- The Art of Computer Programming by Donald Knuth
- Compilers: Principles, Techniques and Tools by Alfred V. Aho, Ravi Sethi and Jeffrey D. Ullman
- Gödel, Escher, Bach by Douglas Hofstadter
- Clean Code: A Handbook of Agile Software Craftsmanship by Robert C. Martin
- Effective C++
- More Effective C++
- CODE by Charles Petzold
- Programming Pearls by Jon Bentley
- Working Effectively with Legacy Code by Michael C. Feathers
- Peopleware by Demarco and Lister
- Coders at Work by Peter Seibel
- Surely You're Joking, Mr. Feynman!
- Effective Java 2nd edition
- Patterns of Enterprise Application Architecture by Martin Fowler
- The Little Schemer
- The Seasoned Schemer
- Why's (Poignant) Guide to Ruby
- The Inmates Are Running The Asylum: Why High Tech Products Drive Us Crazy and How to Restore the Sanity
- The Art of Unix Programming
- Test-Driven Development: By Example by Kent Beck
- Practices of an Agile Developer
- Don't Make Me Think
- Agile Software Development, Principles, Patterns, and Practices by Robert C. Martin
- Domain Driven Designs by Eric Evans
- The Design of Everyday Things by Donald Norman
- Modern C++ Design by Andrei Alexandrescu
- Best Software Writing I by Joel Spolsky
- The Practice of Programming by Kernighan and Pike
- Pragmatic Thinking and Learning: Refactor Your Wetware by Andy Hunt
- Software Estimation: Demystifying the Black Art by Steve McConnel
- The Passionate Programmer (My Job Went To India) by Chad Fowler
- Hackers: Heroes of the Computer Revolution
- Algorithms + Data Structures = Programs
- Writing Solid Code
- JavaScript - The Good Parts
- Getting Real by 37 Signals
- Foundations of Programming by Karl Seguin
- Computer Graphics: Principles and Practice in C (2nd Edition)
- Thinking in Java by Bruce Eckel
- The Elements of Computing Systems
- Refactoring to Patterns by Joshua Kerievsky
- Modern Operating Systems by Andrew S. Tanenbaum
- The Annotated Turing
- Things That Make Us Smart by Donald Norman
- The Timeless Way of Building by Christopher Alexander
- The Deadline: A Novel About Project Management by Tom DeMarco
- The C++ Programming Language (3rd edition) by Stroustrup
- Patterns of Enterprise Application Architecture
- Computer Systems - A Programmer's Perspective
- Agile Principles, Patterns, and Practices in C# by Robert C. Martin
- Growing Object-Oriented Software, Guided by Tests
- Framework Design Guidelines by Brad Abrams
- Object Thinking by Dr. David West
- Advanced Programming in the UNIX Environment by W. Richard Stevens
- Hackers and Painters: Big Ideas from the Computer Age
- The Soul of a New Machine by Tracy Kidder
- CLR via C# by Jeffrey Richter
- The Timeless Way of Building by Christopher Alexander
- Design Patterns in C# by Steve Metsker
- Alice in Wonderland by Lewis Carol
- Zen and the Art of Motorcycle Maintenance by Robert M. Pirsig
- About Face - The Essentials of Interaction Design
- Here Comes Everybody: The Power of Organizing Without Organizations by Clay Shirky
- The Tao of Programming
- Computational Beauty of Nature
- Writing Solid Code by Steve Maguire
- Philip and Alex's Guide to Web Publishing
- Object-Oriented Analysis and Design with Applications by Grady Booch
- Effective Java by Joshua Bloch
- Computability by N. J. Cutland
- Masterminds of Programming
- The Tao Te Ching
- The Productive Programmer
- The Art of Deception by Kevin Mitnick
- The Career Programmer: Guerilla Tactics for an Imperfect World by Christopher Duncan
- Paradigms of Artificial Intelligence Programming: Case studies in Common Lisp
- Masters of Doom
- Pragmatic Unit Testing in C# with NUnit by Andy Hunt and Dave Thomas with Matt Hargett
- How To Solve It by George Polya
- The Alchemist by Paulo Coelho
- Smalltalk-80: The Language and its Implementation
- Writing Secure Code (2nd Edition) by Michael Howard
- Introduction to Functional Programming by Philip Wadler and Richard Bird
- No Bugs! by David Thielen
- Rework by Jason Freid and DHH
- JUnit in Action
#BlackOwned #BlackEntrepreneurs #BlackBuniness #AWSCertified #AWSCloudPractitioner #AWSCertification #AWSCLFC02 #CloudComputing #AWSStudyGuide #AWSTraining #AWSCareer #AWSExamPrep #AWSCommunity #AWSEducation #AWSBasics #AWSCertified #AWSMachineLearning #AWSCertification #AWSSpecialty #MachineLearning #AWSStudyGuide #CloudComputing #DataScience #AWSCertified #AWSSolutionsArchitect #AWSArchitectAssociate #AWSCertification #AWSStudyGuide #CloudComputing #AWSArchitecture #AWSTraining #AWSCareer #AWSExamPrep #AWSCommunity #AWSEducation #AzureFundamentals #AZ900 #MicrosoftAzure #ITCertification #CertificationPrep #StudyMaterials #TechLearning #MicrosoftCertified #AzureCertification #TechBooks
Top 1000 Canada Quiz and trivia: CANADA CITIZENSHIP TEST- HISTORY - GEOGRAPHY - GOVERNMENT- CULTURE - PEOPLE - LANGUAGES - TRAVEL - WILDLIFE - HOCKEY - TOURISM - SCENERIES - ARTS - DATA VISUALIZATION
Top 1000 Africa Quiz and trivia: HISTORY - GEOGRAPHY - WILDLIFE - CULTURE - PEOPLE - LANGUAGES - TRAVEL - TOURISM - SCENERIES - ARTS - DATA VISUALIZATION
Exploring the Pros and Cons of Visiting All Provinces and Territories in Canada.
Exploring the Advantages and Disadvantages of Visiting All 50 States in the USA
Health Health, a science-based community to discuss human health
- Ozempic Link to Rare Vision Loss Risk Confirmed in Studyby /u/Maxii08 on December 13, 2024 at 9:54 pm
submitted by /u/Maxii08 [link] [comments]
- Why farms, not wet markets, are the pandemic threat you should be worrying aboutby /u/Jojuj on December 13, 2024 at 9:15 pm
submitted by /u/Jojuj [link] [comments]
- On vaccines, Trump wants RFK Jr. to explore a question that’s already been answeredby /u/msnbc on December 13, 2024 at 8:07 pm
submitted by /u/msnbc [link] [comments]
- ‘No one should have to be fighting cancer and insurance at the same time’by /u/Positive_Owl_2024 on December 13, 2024 at 7:17 pm
submitted by /u/Positive_Owl_2024 [link] [comments]
- McKinsey & Co. to pay $650 million to settle U.S. opioid consulting probe, ex-partner will plead guilty to obstructionby /u/nbcnews on December 13, 2024 at 3:17 pm
submitted by /u/nbcnews [link] [comments]
Today I Learned (TIL) You learn something new every day; what did you learn today? Submit interesting and specific facts about something that you just found out here.
- TIL in ancient Roman times a pound of lavender flowers was worth the same amount as 50 haircuts from your local barber.by /u/hiltothedance on December 13, 2024 at 9:13 pm
submitted by /u/hiltothedance [link] [comments]
- TIL in 1979, the descendants of the famous feuding Hatfield and McCoy families appeared on a special week-long taping of Family Feud. A pig was kept on stage during the games. The McCoys won 3-2, but the Hatfields won more money totalby /u/MaroonTrucker28 on December 13, 2024 at 8:47 pm
submitted by /u/MaroonTrucker28 [link] [comments]
- TIL: In 2016, a man stole $5 million from his workplace as an accounting department manager over the course of 7 years and spent $1 million of it on a single mobile video game, Game of War. Outside of that, he spent it on cars, furniture, and sports tickets.by /u/Flares117 on December 13, 2024 at 8:22 pm
submitted by /u/Flares117 [link] [comments]
- TIL when Guinness World Records stopped monitoring the record for the longest time to stay awake in 1997, the record holder at the time was Robert McDonald who went 453 hours 40 minutes (18 days 21 hours 40 minutes) without sleeping in 1986.by /u/tyrion2024 on December 13, 2024 at 4:44 pm
submitted by /u/tyrion2024 [link] [comments]
- TIL that in 1927 heavyweight boxing champion Gene Tunney received a check for $1,000,000 for his second fight vs. Jack Dempsey, making him the first athlete in history to be paid $1,000,000 in a single year, or for a single sporting event.by /u/johncoktosin on December 13, 2024 at 4:14 pm
submitted by /u/johncoktosin [link] [comments]
Reddit Science This community is a place to share and discuss new scientific research. Read about the latest advances in astronomy, biology, medicine, physics, social science, and more. Find and submit new publications and popular science coverage of current research.
- A new study from Cedars-Sinai Cancer and the University of Colorado Anschutz Medical Campus reveals a potential way to overcome tumor resistance to a common chemotherapy drug called cisplatin.by /u/CUAnschutzMed on December 13, 2024 at 8:59 pm
submitted by /u/CUAnschutzMed [link] [comments]
- New device produces ammonia from thin air, cutting carbon emissionsby /u/calliope_kekule on December 13, 2024 at 8:29 pm
submitted by /u/calliope_kekule [link] [comments]
- Clinical association of habitual breakfast skipping with cognitive decline and neurodegeneration among older adultsby /u/magicinfernum on December 13, 2024 at 7:29 pm
submitted by /u/magicinfernum [link] [comments]
- Consortium cancer maps provide a 3D view of tumor evolution: « New 3D blueprints that highlight tumor complexity reveal several new discoveries, some of which challenge existing theories of cancer progression. »by /u/fchung on December 13, 2024 at 7:17 pm
submitted by /u/fchung [link] [comments]
- Anti-aging effect of extracellular vesicles from mesenchymal stromal cells on senescence-induced chondrocytes in osteoarthritisby /u/AgingUS on December 13, 2024 at 6:58 pm
submitted by /u/AgingUS [link] [comments]
Reddit Sports Sports News and Highlights from the NFL, NBA, NHL, MLB, MLS, and leagues around the world.
- De'Vondre Campbell won't be part of the 49ers after his refusal to enter a game, Kyle Shanahan saysby /u/Oldtimer_2 on December 13, 2024 at 9:46 pm
submitted by /u/Oldtimer_2 [link] [comments]
- 49ers TE George Kittle on LB De'Vondre Campbell refusing to enter last night's game with the Ramsby /u/Oldtimer_2 on December 13, 2024 at 7:03 pm
submitted by /u/Oldtimer_2 [link] [comments]
- U.S. Olympic & Paralympic Committees put coach on leave after abuse allegationsby /u/Oldtimer_2 on December 13, 2024 at 6:21 pm
submitted by /u/Oldtimer_2 [link] [comments]
- 49ers' Shanahan: De'Vondre Campbell refused to play 2nd half vs. Ramsby /u/Oldtimer_2 on December 13, 2024 at 5:43 pm
submitted by /u/Oldtimer_2 [link] [comments]
- Report: Yankees acquiring All-Star closer Devin Williams from Brewersby /u/Oldtimer_2 on December 13, 2024 at 5:41 pm
submitted by /u/Oldtimer_2 [link] [comments]