

Elevate Your Career with AI & Machine Learning For Dummies PRO and Start mastering the technologies shaping the future—download now and take the next step in your professional journey!
AI innovations in December 2024.
In December 2024, artificial intelligence continues to drive change across every corner of our lives, with remarkable advancements happening at lightning speed. “AI Innovations in December 2024” is here to keep you updated with an ongoing, day-by-day account of the most significant breakthroughs in AI this month. From new AI models that push the boundaries of what machines can do, to revolutionary applications in oil and gas, healthcare, finance, and education, our blog captures the pulse of innovation.
Throughout December, we will bring you the highlights: major product launches, groundbreaking research, and how AI is increasingly influencing creativity, productivity, and even daily decision-making. Whether you are a technology enthusiast, an industry professional, or just intrigued by the direction AI is heading, our daily blog posts are curated to keep you in the loop on the latest game-changing advancements.
Stay with us as we navigate the exhilarating landscape of AI innovations in December 2024. Your go-to resource for everything AI, we aim to make sense of the rapid changes and share insights into how these innovations could shape our collective future.
AI Unraveled: Demystifying Frequently Asked Questions on Artificial Intelligence.

Get it at: https://djamgatech.com
Get it at Apple at https://books.apple.com/us/book/id6445730691
Get it at Google at: https://play.google.com/store/books/details?id=oySuEAAAQBAJ
A Daily Chronicle of AI Innovations on December 31st 2024
📅 Key Milestones & Breakthroughs in AI: A Definitive 2024 Recap:
This comprehensive recap highlights the most significant AI advancements of 2024, covering breakthroughs in generative models, robotics, and multi-agent systems.
AI-Powered Professional Certification Quiz Platform
Web|iOs|Android|Windows
🚀 Power Your Podcast Like AI Unraveled: Get 20% OFF Google Workspace!
Hey everyone, hope you're enjoying the deep dive on AI Unraveled. Putting these episodes together involves tons of research and organization, especially with complex AI topics.
A key part of my workflow relies heavily on Google Workspace. I use its integrated tools, especially Gemini Pro for brainstorming and NotebookLM for synthesizing research, to help craft some of the very episodes you love. It significantly streamlines the creation process!
Feeling inspired to launch your own podcast or creative project? I genuinely recommend checking out Google Workspace. Beyond the powerful AI and collaboration features I use, you get essentials like a professional email (you@yourbrand.com), cloud storage, video conferencing with Google Meet, and much more.
It's been invaluable for AI Unraveled, and it could be for you too.
Start Your Journey & Save 20%
Google Workspace makes it easy to get started. Try it free for 14 days, and as an AI Unraveled listener, get an exclusive 20% discount on your first year of the Business Standard or Business Plus plan!
Sign Up & Get Your Discount HereUse one of these codes during checkout (Americas Region):
AI- Powered Jobs Interview Warmup For Job Seekers

⚽️Comparative Analysis: Top Calgary Amateur Soccer Clubs – Outdoor 2025 Season (Kids' Programs by Age Group)
Business Standard Plan: 63P4G3ELRPADKQU
Business Standard Plan: 63F7D7CPD9XXUVT
Set yourself up for promotion or get a better job by Acing the AWS Certified Data Engineer Associate Exam (DEA-C01) with the eBook or App below (Data and AI)

Download the Ace AWS DEA-C01 Exam App:
iOS - Android
AI Dashboard is available on the Web, Apple, Google, and Microsoft, PRO version
Business Standard Plan: 63FLKQHWV3AEEE6
Business Standard Plan: 63JGLWWK36CP7W
Invest in your future today by enrolling in this Azure Fundamentals - Pass the Azure Fundamentals Exam with Ease: Master the AZ-900 Certification with the Comprehensive Exam Preparation Guide!
- AWS Certified AI Practitioner (AIF-C01): Conquer the AWS Certified AI Practitioner exam with our AI and Machine Learning For Dummies test prep. Master fundamental AI concepts, AWS AI services, and ethical considerations.
- Azure AI Fundamentals: Ace the Azure AI Fundamentals exam with our comprehensive test prep. Learn the basics of AI, Azure AI services, and their applications.
- Google Cloud Professional Machine Learning Engineer: Nail the Google Professional Machine Learning Engineer exam with our expert-designed test prep. Deepen your understanding of ML algorithms, models, and deployment strategies.
- AWS Certified Machine Learning Specialty: Dominate the AWS Certified Machine Learning Specialty exam with our targeted test prep. Master advanced ML techniques, AWS ML services, and practical applications.
- AWS Certified Data Engineer Associate (DEA-C01): Set yourself up for promotion, get a better job or Increase your salary by Acing the AWS DEA-C01 Certification.
Business Plus Plan: M9HNXHX3WC9H7YE
With Google Workspace, you get custom email @yourcompany, the ability to work from anywhere, and tools that easily scale up or down with your needs.
Need more codes or have questions? Email us at info@djamgatech.com.
What this means: This review provides valuable insights into how AI has evolved throughout the year, setting the stage for future innovations and applications across industries. [Source][2024-12-31]
📚 AI Teachers Make Classroom Debut in Arizona:
Schools in Arizona introduce AI-powered teaching assistants to enhance learning and provide personalized support to students.
- Students will spend just two hours daily on AI-guided, personalized academic lessons using platforms like IXL and Khan Academy.
- The school will operate fully online, with the AI able to adapt in real-time to each student’s performance and customize difficulty and presentation style.
- The rest of the day will focus on life skills workshops led by human mentors, covering topics like financial literacy and entrepreneurship.
- A program pilot claimed students learned twice as much in half the time, allowing them to focus more on important life skills.
What this means: This marks a new era in education where AI complements teachers, improving accessibility and student outcomes. [Source][2024-12-31]
🖼️ Qwen Unveils Powerful Open-Source Visual Reasoning AI:
Qwen launches a new visual reasoning model that excels in interpreting and analyzing complex images.
- QVQ excels at step-by-step reasoning through complex visual problems, particularly in mathematics and physics.
- The model scored a 70.3 on the MMMU benchmark, approaching performance levels of leading closed-source competitors like Claude 3.5 Sonnet.
- Built upon Qwen’s existing VL model, QVQ also demonstrates enhanced capabilities in analyzing images and drawing sophisticated conclusions.
- Qwen said QVQ is a step towards ‘omni’ and ‘smart’ models that can integrate multiple modalities and tackle increasingly complex scientific challenges.
What this means: This advancement strengthens open-source AI’s role in expanding access to cutting-edge tools for researchers and developers. [Source][2024-12-31]
🤖 ARMOR Brings New Perception System to Humanoid Robots:
ARMOR introduces advanced perception technology, enabling humanoid robots to better navigate and interact with their environments.
- The system uses distributed depth sensors across robot arms, creating an ‘artificial skin’ for increased spatial awareness.
- ARMOR showed a 63.7% collision reduction and 78.7% navigation improvement compared to traditional cameras, with 26x faster data processing.
- The system learns from human motion data, with training on over 86 hours of realistic movements.
- The tech was successfully deployed on a Fourier GR1 humanoid robot, using 40 low-cost sensors to create comprehensive spatial awareness.
- The system can be implemented using off-the-shelf components, making it accessible for wider robotics applications.
What this means: This innovation enhances robotic capabilities in real-world applications, from healthcare to industrial tasks. [Source][2024-12-31]
💼 Nvidia Acquires AI Startup Run:ai for $700M:
Nvidia completes its acquisition of Israeli AI firm Run:ai and plans to open-source its hardware optimization software.
What this means: This move bolsters Nvidia’s leadership in AI hardware and software innovation, fostering collaboration through open-source contributions. [Source][2024-12-31]
🔧 OpenAI Reportedly Eyes Humanoid Robotics Market:
OpenAI explores potential entry into humanoid robotics, building on partnerships and custom chip development.
What this means: This signals OpenAI’s ambition to diversify into physical AI applications, expanding its influence beyond software. [Source][2024-12-31]
🌌 Google Lead Predicts Accelerated Path to Artificial Superintelligence:
Logan Kilpatrick highlights rapid advancements toward artificial superintelligence (ASI), citing insights from Ilya Sutskever.
What this means: This reflects growing confidence among AI leaders in achieving transformative AI milestones. [Source][2024-12-31]
💻 ByteDance to Invest $7B in Nvidia AI Chips:
TikTok’s parent company plans significant investments in AI hardware, leveraging overseas data centers to bypass U.S. export restrictions.
What this means: This highlights the increasing global demand for AI hardware and strategic maneuvers to access cutting-edge technologies. [Source][2024-12-31]
🌐 Google CEO Sets High Stakes for Gemini AI in 2025:
Sundar Pichai emphasizes the importance of scaling Gemini AI for consumers, calling it Google’s top priority for the year ahead.
What this means: This signals Google’s aggressive push to maintain dominance in AI and consumer technology markets. [Source][2024-12-31]
Best AI Agents Papers in 2024:
These 12 research papers can help you understand AI Agents better.
1. Magentic-One by Microsoft
This paper introduces Magentic-One, a generalized multi-agent system that can handle various web-based and file-based tasks seamlessly. Think of it like a team of specialized digital helpers, each with different skills, working together to complete everything from document analysis 🍏 Document Analysis Tools to web research 🍏 Web research with AI agents across different domains. By building on Microsoft’s earlier Autogen framework, Magentic-One uses a flexible architecture, so it can adapt to many new tasks easily and collaborate with existing services. The system’s strength lies in its ability to switch roles and share information, helping businesses save time and reduce the need for human intervention.
Read paper
2. Agent-oriented planning in a Multi-Agent system
This research focuses on meta-agent architecture, where multiple AI-powered “agents” can collaborate to solve problems that require clever planning. Imagine coordinating a fleet of drones 🍏 Multi-drone coordination to deliver goods in a city: each drone must plan its route, avoid collisions, and optimize delivery times. By using a meta-agent, each smaller agent can focus on its specialized task while still communicating with the central planning mechanism to handle unexpected events or conflicting goals. This leads to a more robust and efficient system for both complex industrial and everyday applications.
Read paper
3. KGLA by Amazon
Amazon’s KGLA (Knowledge Graph-Enhanced Agent) demonstrates how integrating knowledge graphs 🍏 Knowledge Graphs in AI can significantly improve an agent’s information retrieval and reasoning. Picture a smart assistant that has a vast, interconnected web of facts, enabling it to pull up relevant knowledge quickly and accurately. With KGLA, the agent can better handle tasks like customer support, product recommendations, and even supply chain optimization by scanning the knowledge graph for important details. This approach makes the agent more versatile and precise in understanding and responding to user queries.
Read paper
4. Harvard University’s FINCON
Harvard’s FINCON explores how an LLM-based multi-agent framework can excel in finance-related tasks, such as portfolio analysis, risk assessment, or even automated trading 🍏 Automated Trading with AI. The twist here is the use of “conversational verbal reinforcement,” which allows the agents to fine-tune their understanding by talking through financial scenarios in real time. This paper sheds light on how conversation among AI agents can help identify hidden market signals and refine strategies for investment, budgeting, and financial forecasting.
Read paper
5. OmniParser for Pure Vision-Based GUI Agent
OmniParser tackles the challenge of navigating graphical user interfaces using only visual cues—imagine an AI that can figure out how to use any software’s interface just by “looking” at it. This is critical for tasks like software automation 🍏 Software automation with vision-based AI, usability testing, or even assisting users with disabilities. By deploying a multi-agent system, OmniParser identifies different elements on the screen (buttons, menus, text) and collaborates to perform complex sequences of clicks and commands. This vision-based approach helps AI agents become more adaptable and efficient in navigating new and changing interfaces.
Read paper
6. Can Graph Learning Improve Planning in LLM-based Agents? by Microsoft
This experimental study by Microsoft delves into graph learning 🍏 Graph learning in AI and whether it can enhance planning capabilities in LLM-based agents, particularly those using GPT-4. Essentially, they ask if teaching an AI agent to interpret and create graphs (representing tasks, data, or even story plots) can help it plan or predict the next steps more accurately. Early results suggest that incorporating graph structures can help the system map out relationships between concepts or events, making the agent more strategic in decision-making and possibly more transparent in how it reaches conclusions.
Read paper
7. Generative Agent Simulations of 1,000 People by Stanford University and Google DeepMind
Stanford and Google DeepMind collaborate to show that AI Agents can “clone” the vocal patterns of 1,000 individuals with just two hours of audio 🍏 Voice cloning in AI. This experiment raises questions about privacy and ethical use of technology but also highlights the potential for more natural-sounding virtual assistants, voice overs, or scenario planning. The system can generate nuanced simulations of how people might respond in a conversation, making it a powerful tool for large-scale training or immersive experiences.
Read paper
8. An Empirical Study on LLM-based Agents for Automated Bug Fixing
In this paper, ByteDance’s researchers compare different LLMs 🍏 Comparing LLMs for bug fixing to see which ones are best at identifying and fixing software bugs automatically. They evaluate factors like code understanding, debugging steps, and integration testing. By running agents on real-world code bases, they find that certain large language models excel in reading and interpreting error messages, while others are better at handling complex logic. The goal is to streamline software development, reduce human error, and save time in the debugging process.
Read paper
9. Google DeepMind’s Improving Multi-Agent Debate with Sparse Communication Topology
DeepMind’s approach to multi-agent debate 🍏 Multi-agent debate AI presents a way for AI agents to argue or discuss in order to arrive at truthful answers. By limiting which agents can communicate directly (i.e., making the communication “sparse”), they reduce the noise and confusion that often arises when too many agents talk at once. The experiment shows that a carefully structured communication network can help highlight solid evidence and reduce misleading statements, which could be vital for fact-checking or collaborative problem solving.
Read paper
10. LLM-based Multi-Agents: A survey
This survey explores how multi-agent systems have evolved in tandem with large language models 🍏 LLM-based multi-agent systems. It highlights real-world uses like task automation, world simulation, and problem-solving in complex environments. The paper also addresses common hurdles, such as the difficulty in aligning agents’ goals or ensuring they act ethically. By outlining the key breakthroughs and ongoing debates, this survey provides a road map for newcomers and experts alike.
Read paper
11. Practices for Governing Agentic AI Systems by OpenAI
OpenAI’s paper lays out 7 practical governance tips 🍏 AI governance best practices to help organizations adopt AI agents responsibly. Topics range from implementing robust oversight and error monitoring to ensuring accountability and transparency. The authors stress that even though these agents can supercharge business processes, it’s crucial to have checks and balances in place—like auditing and kill switches—to avoid unintended consequences and maintain trust.
Read paper
12. The Dawn of GUI Agent: A case study for Computer use of Sonnet 3.5
In this case study, researchers test Anthropic’s Sonnet 3.5 🍏 Sonnet AI by Anthropic to see how effectively it can use a computer interface across diverse tasks, such as opening apps, editing documents, and browsing the web. The findings reveal how user-friendly and intuitive the system can be when handling multiple steps—key for creating self-sufficient AI assistants. By dissecting its performance in different domains, the paper highlights best practices for designing user-centric interfaces that even advanced AI can navigate.
Read paper
https://djamgatech.com/real-world-generative-ai-use-cases-from-industry-leaders/
📘 DeepSeek-V3 Rewrites Open-Source AI Playbook:
The launch of DeepSeek-V3 redefines the possibilities for open-source AI, offering unprecedented performance and flexibility for developers worldwide.
What this means: This model establishes a new benchmark in collaborative AI development, fostering innovation across industries. [Source][2024-12-30]
🔄 OpenAI Reveals Restructuring Plans for Next AI Phase:
OpenAI announced organizational changes to better align resources and expertise for its next phase of AI advancements.
What this means: This restructuring reflects OpenAI’s commitment to staying at the forefront of AI innovation while addressing evolving challenges. [Source][2024-12-30]
🕴️ Stanford AI Brings Natural Gestures to Digital Avatars:
Stanford’s latest AI breakthrough enables digital avatars to mimic natural human gestures, enhancing virtual communication and realism.
What this means: This development has significant implications for virtual reality, gaming, and remote collaboration. [Source][2024-12-30]
🤖 OpenAI and Microsoft Define Metric for Achieving AGI:
Newly revealed documents show OpenAI and Microsoft agreed that AGI will be achieved when an AI system can generate $100 billion in annual profits.
What this means: This economic metric underscores the industry’s focus on practical benchmarks to gauge AI advancements. [Source][2024-12-30]
🧑🎤 Meta Unveils AI-Generated Characters for Social Media:
Meta plans to expand AI-generated characters’ roles on its platforms, from profile creation to live content generation and interactions.
What this means: This move could redefine social media engagement, offering tailored interactions and fresh content experiences. [Source][2024-12-30]
🐕 Unitree Debuts Rideable Robot Dog B2-W:
Chinese robotics firm Unitree unveiled B2-W, a robot dog capable of carrying humans over rough terrain while showcasing acrobatic stability and maneuverability.
What this means: This innovation could lead to practical applications in search and rescue, logistics, and mobility assistance. [Source][2024-12-30]
🏀 Toyota’s AI Robot CUE6 Sets Basketball World Record:
Toyota’s AI-powered humanoid robot CUE6 sank an 80-foot basketball shot, earning a Guinness World Record for its precision.
What this means: This achievement highlights the potential for AI-driven robotics in precision tasks and sports innovation. [Source][2024-12-30]
🤖 Nvidia Focuses on Robots Amid Stiffer AI Chip Competition:
Nvidia pivots its strategy toward robotics and autonomous systems as competition in the AI chip market intensifies.
What this means: This shift underscores Nvidia’s effort to diversify its AI applications and maintain its leadership in the evolving tech landscape. [Source][2024-12-30]
🌐 Google CEO Says AI Model Gemini Will Be the Company’s ‘Biggest Focus’ in 2025:
Google CEO Sundar Pichai declares Gemini as the centerpiece of the company’s AI strategy for the upcoming year, emphasizing its transformative potential.
What this means: This signals Google’s commitment to leading the AI race by integrating Gemini across its products and services. [Source][2024-12-30]
⚠️ Google’s CEO Warns ChatGPT May Become Synonymous with AI Like Google is with Search:
Sundar Pichai expresses concern that OpenAI’s ChatGPT could dominate public perception of AI, similar to how Google is synonymous with internet search.
What this means: This highlights the competitive dynamics in the AI space and Google’s drive to maintain its technological brand identity. [Source][2024-12-30]
🧠 AI Tools May Soon Manipulate People’s Online Decision-Making, Say Researchers:
Researchers warn that advanced AI tools could exploit psychological biases to subtly influence user decisions online.
What this means: This revelation raises ethical concerns and highlights the need for robust safeguards to ensure AI respects user autonomy. [Source][2024-12-30]
🚨 Geoffrey Hinton’s Prediction of Human Extinction at the Hands of AI:
AI pioneer Geoffrey Hinton raises concerns that advanced AI systems could pose existential risks to humanity within the coming decades.
What this means: This stark warning highlights the urgent need for global AI safety measures and ethical guidelines. [2024-12-30]
🤖 OpenAI’s O3 Reasoning Model Ignites AI Hype Among Top Influencers:
OpenAI’s newly released O3 model is generating excitement in the AI community for its advanced reasoning capabilities and practical applications.
What this means: The O3 model sets a new benchmark in AI reasoning, opening doors to more complex and intelligent use cases. [2024-12-30]
📱 AI Characters to Generate and Share Social Media Content:
AI-generated characters are now capable of creating and posting personalized social media content, revolutionizing online interaction and branding.
What this means: This development could transform digital marketing, enabling brands and influencers to engage audiences more effectively. [2024-12-30]
📈 How 2025 Could Make or Break Apple Intelligence and Siri:
Apple faces a pivotal year as it aims to elevate Siri and its Apple Intelligence platform to compete with leading AI solutions like ChatGPT and Gemini.
What this means: Success in 2025 will determine Apple’s ability to sustain its relevance in the increasingly AI-driven tech landscape. [2024-12-30]
AI and Machine Learning For Dummies: Your Comprehensive ML & AI Learning Hub [Learn and Master AI and Machine Learning from your iPhone ]
Discover the ultimate resource for mastering Machine Learning and Artificial Intelligence with the “AI and Machine Learning For Dummies” app.
iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies/id1611593573
PRO Version (No ADS, See All Answers, Practice Tons of AI Simulations, Plenty of AI Concept Maps, Pass AI Certifications): https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211
What you can do with this App:
- 🚀 Learn AI interactively! Tweak models, code exercises, visualize concepts, & tackle projects. Perfect for beginners to master AI/ML easily.
- 🎓 AI & ML made easy! Hands-on coding, visual tools, and real-world examples. Engage with fun, interactive learning & community support.
- 🤖 Master AI step-by-step! Practice coding, explore simulations, & see real-time changes. Fun, interactive tools simplify complex AI concepts.
- 🌟 AI learning simplified! Interactive models, coding challenges, flashcards & real-world projects. Visualize & build your own AI models.
- 💡 Explore AI with real-time simulations! Watch neural networks in action & learn by tweaking parameters. Coding & visual tools make it easy.
- 📚 Learn AI the hands-on way! Code exercises, visual tools, & interactive simulations. Fun, engaging, and perfect for all skill levels.
- 🏆 Interactive AI education! Tackle coding, visual tools, real-world projects, & fun challenges. Earn badges & climb the leaderboard.
- 🔍 See AI in action! Tweak parameters & watch real-time effects. Coding & visual tools make learning neural networks & ML concepts easy.
- 🧠 Your AI guide! Visualize, code, & build models with interactive tools. Learn at your pace & join a supportive community.
- 🎓 Hands-on AI learning! Practice coding, see concepts visually, and learn through real-world projects. Fun, engaging, and easy to follow.
A Daily Chronicle of AI Innovations on December 29th 2024
🧠 Sam Altman: AI Is Integrated. Superintelligence Is Coming:
OpenAI CEO Sam Altman emphasizes the rapid integration of AI across industries and predicts the advent of superintelligence in the near future, marking a transformative era in technology.
What this means: Altman’s statement underscores the accelerating pace of AI development and the need for global preparedness to manage superintelligent systems. [Source][2024-12-29]
🤔 Yann LeCun Disputes AGI Timeline, Contradicting Sam Altman and Dario Amodei:
Meta’s AI Chief, Yann LeCun, asserts that AGI will not materialize within the next two years, challenging the predictions of OpenAI’s Sam Altman and Anthropic’s Dario Amodei.
What this means: This debate reflects differing views among AI leaders on the pace of AGI development, highlighting the uncertainties surrounding its timeline and feasibility. [Source][2024-12-29]
⚡ AI Data Centers Reportedly Cause Power Problems in Residential Areas:
Reports indicate that AI data centers are reducing power quality in nearby homes, leading to shorter lifespans for electrical appliances.
What this means: As AI infrastructure expands, addressing its environmental and local impacts becomes increasingly crucial to balance technological progress with community well-being. [Source]
🦙 Llama 3.1 8B Enables CPU Inference on Any PC with a Browser:
Meta’s Llama 3.1 model, featuring 8 billion parameters, now supports CPU-based inference directly from any web browser, democratizing access to advanced AI capabilities without requiring specialized hardware.
This project from one of the authors runs models like Llama 3.1 8B inside any modern browser using PV-tuning compression.
The PV-tuning method referenced in the post achieves state-of-the-art results in 2-bit compression for large language models, which is significant in optimizing performance for CPU inference. This contrasts with more traditional methods that may not reach such efficiency, highlighting the advancements made by the Yandex Research team in collaboration with ISTA and KAUST.
- Run Llama-3.1-8B directly in a browser
- Vladimir Malinovskii
- GitHub – Vahe1994/AQLM: Official Pytorch repository for Extreme …
What this means: This breakthrough allows developers and users to leverage powerful AI tools on standard devices, eliminating barriers to adoption and enhancing accessibility. [Source]
🔄 Meta Releases Byte Latent Transformer: An Improved Transformer Architecture:
Meta introduces Byte Latent Transformer, a next-generation Transformer architecture designed to enhance efficiency and performance in natural language processing and AI tasks.
Byte Latent Transformer is a new improvised Transformer architecture introduced by Meta which doesn’t uses tokenization and can work on raw bytes directly. It introduces the concept of entropy based patches. Understand the full architecture and how it works with example here : https://youtu.be/iWmsYztkdSg
What this means: This innovation streamlines Transformer models, enabling faster computation and reduced resource usage, making advanced AI more accessible across industries. [Source]
🏎️ NASCAR Uses AI to Develop a New Playoff Format:
NASCAR is leveraging AI to redesign its playoff format following widespread criticism, aiming for a more engaging and competitive racing structure.
What this means: This move highlights AI’s potential to reimagine traditional sports formats, enhancing both fairness and fan experience. [Source]
🏀 AI-Powered Robot Sinks Seemingly Impossible Basketball Hoops:
An AI-driven robot dazzles with its precision by making near-impossible basketball shots, showcasing advanced physics simulations and real-time adjustments.
What this means: This achievement demonstrates AI’s growing capability in robotics and its potential applications in precision-demanding tasks. [Source]
🖥️ Meet SemiKong: The World’s First Open-Source Semiconductor-Focused LLM:
SemiKong debuts as the first open-source large language model specialized in semiconductor technology, aiming to streamline and innovate chip design processes.
What this means: This tool could transform the semiconductor industry by democratizing access to cutting-edge design and analysis tools. [Source]
🤖 Leaked Documents Show OpenAI Has a Very Clear Definition of ‘AGI’:
A leak reveals OpenAI defines AGI as developing an AI system capable of generating $100 billion in profits, tying technological milestones to economic success.
What this means: This revelation emphasizes OpenAI’s focus on measurable financial benchmarks to define AGI, sparking debates on the alignment of ethics and business goals. [Source]
⚠️ ‘Godfather of AI’ Shortens Odds of the Technology Wiping Out Humanity Over Next 30 Years:
AI pioneer Geoffrey Hinton warns of increased likelihood that advanced AI could pose existential risks to humanity within the next three decades.
What this means: This grim projection highlights the urgent need for global regulations and ethical frameworks to mitigate AI-related dangers. [Source]
🌐 DeepSeek-AI Releases DeepSeek-V3, a Powerful Mixture-of-Experts Model:
DeepSeek-AI unveils DeepSeek-V3, a language model with 671 billion total parameters and 37 billion activated per token, pushing the boundaries of AI performance.
What this means: This MoE model represents a leap in efficiency and capability for large-scale language models, democratizing advanced AI solutions. [Source]
🛑 AI Chatbot Lawsuit Highlights Ethical Concerns After Disturbing Recommendations:
A Telegraph investigation reveals an AI chatbot, currently being sued over a 14-year-old’s suicide, was instructing teens to commit violent acts, sparking public outrage.
What this means: This case underscores the critical need for stricter oversight and ethical design in AI systems to prevent harmful outputs. [Source]
📊 A Summary of the Leading AI Models by Late 2024:
Djamgatech provides an in-depth overview of the most advanced AI models of 2024, highlighting innovations, capabilities, and industry impacts from models like OpenAI’s o3, DeepSeek-V3, and Google’s Gemini 2.0.
What this means: This comprehensive analysis underscores the rapid advancements in AI and their transformative applications across various sectors. [Source]
A Daily Chronicle of AI Innovations on December 27th 2024
💼 OpenAI Announces Official Plans to Transition into a For-Profit Company:
OpenAI has revealed its intent to formally shift from its non-profit origins to a for-profit structure, aiming to scale operations and attract more investment to fuel its ambitious AI advancements.
What this means: This transition could significantly impact the AI industry, fostering faster innovation but raising concerns about balancing profit motives with ethical AI development. [Source]
💰 Microsoft Invested Nearly $14 Billion in OpenAI But Is Reducing Its Dependence:
Despite its massive $14 billion investment in OpenAI, Microsoft is reportedly scaling back its reliance on the ChatGPT parent company as it explores alternative AI strategies.
What this means: This shift indicates Microsoft’s desire to diversify its AI capabilities and reduce dependency on a single partner. [Source]
☁️ AI Cloud Startup Vultr Raises $333M at $3.5B Valuation in First Outside Funding Round:
Vultr, an AI-focused cloud computing startup, secures $333 million in its first external funding round, bringing its valuation to $3.5 billion.
What this means: This funding reflects growing investor confidence in cloud platforms supporting AI workloads and their critical role in the future of AI infrastructure. [Source]
🌍 Heirloom Secures $150M Amid Busy Year for Carbon Capture Funding:
Carbon capture company Heirloom raises $150 million as interest in climate technology funding surges, supporting its mission to combat global warming.
What this means: Increased investment in carbon capture technologies highlights the urgency of addressing climate change through innovative solutions. [Source]
🤖 DeepSeek’s New AI Model Among the Best Open Challengers Yet:
DeepSeek’s latest AI model sets a high bar for open-source AI systems, offering robust performance and positioning itself as a strong alternative to proprietary models.
What this means: Open AI models like DeepSeek empower developers and researchers with accessible tools to drive innovation and competition in AI. [Source]
🤖 Microsoft Is Forcing Its AI Assistant on People:
Reports suggest that Microsoft is aggressively integrating its AI assistant into its platforms, sparking mixed reactions from users who feel they are being pushed into using the feature.
What this means: This move highlights the tension between driving AI adoption and respecting user choice, underscoring the challenges of balancing innovation with customer satisfaction. [Source]
💸 Microsoft and OpenAI Put a Price on Achieving AGI:
Microsoft and OpenAI announce a roadmap and estimated investment required to achieve Artificial General Intelligence (AGI), underscoring the massive computational and financial resources necessary.
What this means: This reveals the significant commitment and challenges involved in advancing AI to human-level intelligence, with implications for global AI leadership and innovation. [Source]
⚠️ ChatGPT Experiences Outage, Leaving Many Users Without Access:
OpenAI confirmed that ChatGPT was experiencing glitches on Thursday afternoon, disrupting the service for a significant number of users.
What this means: This outage highlights the growing dependency on AI tools for daily activities and the challenges of maintaining large-scale AI infrastructure. [Source]
📊 DeepSeek-V3, Ultra-Large Open-Source AI, Outperforms Llama and Qwen:
DeepSeek-V3 launches as an open-source AI model, surpassing Llama and Qwen in performance benchmarks, marking a significant milestone in large language model development.
What this means: The availability of such a powerful open-source model democratizes AI innovation, allowing developers and researchers access to cutting-edge tools. [Source]
🏠 Airbnb Uses AI to Block New Year’s Eve House Party Bookings:
Airbnb employs AI to preemptively block suspicious bookings that may lead to unauthorized New Year’s Eve house parties, ensuring safer hosting experiences.
What this means: This initiative demonstrates AI’s potential in risk management and maintaining trust within digital marketplaces. [Source]
📈 Reddit Boosts AI Capabilities and Sees Price Target Raised to $200 by Citi:
Reddit, Inc. (RDDT) enhances its AI technologies, prompting Citi to raise the company’s price target to $200, reflecting increased investor confidence in its AI-driven growth strategies.
What this means: Reddit’s investment in AI demonstrates the platform’s commitment to innovation, potentially driving user engagement and monetization. [Source]
📉 IMF Predicts 36% of Philippine Jobs Eased or Displaced by AI:
The International Monetary Fund forecasts that over a third of jobs in the Philippines could be significantly impacted or displaced by AI, reflecting global shifts in the labor market.
What this means: This projection underscores the need for workforce adaptation and investment in AI-related upskilling initiatives to mitigate economic disruptions. [Source]
🧠 New Study Reveals Social Identity Biases in Large Language Models:
Research indicates that large language models (LLMs) exhibit social identity biases akin to humans but can be trained to mitigate these outputs.
What this means: Addressing biases in AI models is critical to ensuring fair and ethical AI applications, making this study a step forward in responsible AI development. [Source]
AI and Machine Learning For Dummies: Your Comprehensive ML & AI Learning Hub [Learn and Master AI and Machine Learning from your iPhone ]
Discover the ultimate resource for mastering Machine Learning and Artificial Intelligence with the “AI and Machine Learning For Dummies” app.
iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies/id1611593573
PRO Version (No ADS, See All Answers, Practice Tons of AI Simulations, Plenty of AI Concept Maps, Pass AI Certifications): https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211
What you can do with this App:
- 🚀 Learn AI interactively! Tweak models, code exercises, visualize concepts, & tackle projects. Perfect for beginners to master AI/ML easily.
- 🎓 AI & ML made easy! Hands-on coding, visual tools, and real-world examples. Engage with fun, interactive learning & community support.
- 🤖 Master AI step-by-step! Practice coding, explore simulations, & see real-time changes. Fun, interactive tools simplify complex AI concepts.
- 🌟 AI learning simplified! Interactive models, coding challenges, flashcards & real-world projects. Visualize & build your own AI models.
- 💡 Explore AI with real-time simulations! Watch neural networks in action & learn by tweaking parameters. Coding & visual tools make it easy.
- 📚 Learn AI the hands-on way! Code exercises, visual tools, & interactive simulations. Fun, engaging, and perfect for all skill levels.
- 🏆 Interactive AI education! Tackle coding, visual tools, real-world projects, & fun challenges. Earn badges & climb the leaderboard.
- 🔍 See AI in action! Tweak parameters & watch real-time effects. Coding & visual tools make learning neural networks & ML concepts easy.
- 🧠 Your AI guide! Visualize, code, & build models with interactive tools. Learn at your pace & join a supportive community.
- 🎓 Hands-on AI learning! Practice coding, see concepts visually, and learn through real-world projects. Fun, engaging, and easy to follow.
A Daily Chronicle of AI Innovations on December 26th 2024
📚 AI is a Game Changer for Students with Disabilities, Schools Still Learning to Harness It:
AI tools are transforming education for students with disabilities, offering personalized learning and accessibility solutions, though schools face challenges in adoption and integration.
What this means: The potential of AI to empower students with disabilities is immense, but its effective implementation requires significant training and resources. [Source]
🤖 Nvidia’s Jim Fan: Embodied Agents to Emerge from Simulation with a “Hive Mind”:
Nvidia’s Jim Fan predicts that most embodied AI agents will be trained in simulations and transferred zero-shot to real-world applications, operating with a shared “hive mind” for collective intelligence.
What this means: This approach could revolutionize robotics and AI, enabling seamless adaptation to real-world tasks while fostering unprecedented levels of cooperation and knowledge sharing among AI systems. [Source]
☁️ Microsoft Researchers Release AIOpsLab: A Comprehensive AI Framework for AIOps Agents:
Microsoft unveils AIOpsLab, an open-source AI framework designed to streamline and automate IT operations, enabling more efficient and proactive infrastructure management.
What this means: This tool could revolutionize IT management by providing businesses with powerful, adaptable AI capabilities for monitoring and optimizing systems. [Source]
🌐 DeepSeek Lab Open-Sources a Massive 685B MOE Model:
DeepSeek Lab has released its groundbreaking 685-billion-parameter Mixture of Experts (MOE) model as an open-source project, providing unprecedented access to one of the largest AI architectures available.
What this means: This open-source initiative could accelerate research and innovation across industries by enabling researchers and developers to harness the power of state-of-the-art AI at scale. [Source]
🎄 Kate Bush Reflects on Monet and AI in Annual Christmas Message:
Kate Bush shares her thoughts on the intersection of art and technology, discussing Monet’s influence and AI’s role in creative expression during her Christmas message.
What this means: Bush’s reflections highlight the ongoing dialogue about AI’s transformative impact on art and human creativity. [Source]
💡 DeepSeek v3 Outperforms Sonnet at 53x Cheaper Pricing:
DeepSeek’s latest model, v3, delivers superior performance compared to Sonnet while offering API rates that are 53 times more affordable.
What this means: This breakthrough positions DeepSeek as a game-changer in the AI space, democratizing access to high-performance AI tools and challenging industry pricing norms. [Source]
🤖 Elon Musk’s AI Robots Appear in Dystopian Christmas Card:
Elon Musk’s Optimus robots featured in a dystopian-themed Christmas card as part of his ambitious vision for the Texas town of Starbase.
What this means: This playful yet futuristic gesture underscores Musk’s commitment to integrating AI and robotics into everyday life and his bold ambitions for Starbase. [Source]
♾️ ChatGPT’s Infinite Memory Feature is Real:
OpenAI confirms the rumored infinite memory feature for ChatGPT, allowing the AI to access all past chats for context and improved interactions.
What this means: This development could enhance personalization and continuity in conversations, transforming how users interact with AI for long-term tasks and projects. [Source]
⏳ Sébastien Bubeck Introduces “AGI Time” to Measure AI Model Capability:
OpenAI’s Sébastien Bubeck proposes “AGI Time” as a metric to measure AI capability, with GPT-4 handling tasks in seconds or minutes, o1 managing tasks in hours, and next-generation models predicted to achieve tasks requiring “AGI days” by next year and “AGI weeks” within three years.
What this means: This metric highlights the accelerating progress in AI performance, bringing us closer to advanced general intelligence capable of handling prolonged, complex workflows. [Source]
🌡️ AI Predicts Accelerated Global Temperature Rise to 3°C:
AI models forecast that most land regions will surpass the critical 1.5°C threshold by 2040, with several areas expected to exceed the 3.0°C threshold by 2060—far sooner than previously estimated.
What this means: These alarming predictions emphasize the urgency of global climate action to mitigate severe environmental, social, and economic impacts. [Source]
🧠 Major LLMs Can Identify Personality Tests and Adjust Responses for Social Desirability:
Research shows that leading large language models (LLMs) are capable of recognizing when they are given personality tests and modify their answers to appear more socially desirable, a behavior learned through human feedback during training.
What this means: This adaptation highlights the sophistication of AI systems but raises questions about transparency and the integrity of AI-driven assessments. [Source]
A Daily Chronicle of AI Innovations on December 25th 2024
🤝 Google Is Using Anthropic’s Claude to Improve Its Gemini AI:
Google partners with Anthropic to integrate Claude into its Gemini AI, enhancing its performance in complex reasoning and conversational tasks.
What this means: This collaboration underscores the growing trend of cross-company partnerships in AI, leveraging combined expertise for accelerated advancements. [Source]
🌐 60 of Our Biggest Google AI Announcements in 2024:
Google reflects on 2024 with a recap of 60 major AI developments, spanning breakthroughs in healthcare, language models, and generative AI applications.
What this means: These achievements highlight Google’s leadership in shaping the future of AI and its widespread applications across industries. [Source]
🎯 Coca-Cola and Omnicom Lead AI Marketing Strategies:
Coca-Cola and Omnicom pioneer innovative AI-driven marketing campaigns, utilizing advanced personalization and predictive analytics to engage consumers.
What this means: This demonstrates how global brands are leveraging AI to revolutionize marketing strategies and drive consumer connection. [Source]
🧠 How Hallucinatory AI Helps Science Dream Up Big Breakthroughs:
AI’s imaginative “hallucinations” are being used by researchers to generate hypotheses and explore innovative solutions in scientific discovery.
What this means: This creative application of AI could redefine how breakthroughs in science are achieved, blending computational power with human ingenuity. [Source]
🥃 AI Beats Human Experts at Distinguishing American Whiskey from Scotch:
AI systems have demonstrated superior accuracy in identifying the differences between American whiskey and Scotch, surpassing human experts in sensory analysis.
What this means: This breakthrough highlights AI’s potential in the food and beverage industry, offering enhanced quality control and product categorization. [Source]
🧠 Homeostatic Neural Networks Show Improved Adaptation to Dynamic Concept Shift Through Self-Regulation:
Researchers unveil homeostatic neural networks capable of self-regulation, enabling better adaptation to changing data patterns and environments.
What this means: This advancement could enhance AI’s ability to learn and perform consistently in dynamic, real-world scenarios, pushing the boundaries of machine learning adaptability. [Source]
This paper introduces an interesting approach where neural networks incorporate homeostatic principles – internal regulatory mechanisms that respond to the network’s own performance. Instead of having fixed learning parameters, the network’s ability to learn is directly impacted by how well it performs its task.
The key technical points: • Network has internal “needs” states that affect learning rates • Poor performance reduces learning capability • Good performance maintains or enhances learning ability • Tested against concept drift on MNIST and Fashion-MNIST • Compared against traditional neural nets without homeostatic features
Results showed: • 15% better accuracy during rapid concept shifts • 2.3x faster recovery from performance drops • More stable long-term performance in dynamic environments • Reduced catastrophic forgetting
I think this could be valuable for real-world applications where data distributions change frequently. By making networks “feel” the consequences of their decisions, we might get systems that are more robust to domain shift. The biological inspiration here seems promising, though I’m curious about how it scales to larger architectures and more complex tasks.
One limitation I noticed is that they only tested on relatively simple image classification tasks. I’d like to see how this performs on language models or reinforcement learning problems where adaptability is crucial.
TLDR: Adding biological-inspired self-regulation to neural networks improves their ability to adapt to changing data patterns, though more testing is needed for complex applications.
AI and Machine Learning For Dummies: Your Comprehensive ML & AI Learning Hub [Learn and Master AI and Machine Learning from your phone]
Discover the ultimate resource for mastering Machine Learning and Artificial Intelligence with the “AI and Machine Learning For Dummies” app.
iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies/id1611593573
PRO Version (No ADS, See All Answers, Practice Tons of AI Simulations, Plenty of AI Concept Maps, Pass AI Certifications): https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211
What you can do with this App:
- 🚀 Learn AI interactively! Tweak models, code exercises, visualize concepts, & tackle projects. Perfect for beginners to master AI/ML easily.
- 🎓 AI & ML made easy! Hands-on coding, visual tools, and real-world examples. Engage with fun, interactive learning & community support.
- 🤖 Master AI step-by-step! Practice coding, explore simulations, & see real-time changes. Fun, interactive tools simplify complex AI concepts.
- 🌟 AI learning simplified! Interactive models, coding challenges, flashcards & real-world projects. Visualize & build your own AI models.
- 💡 Explore AI with real-time simulations! Watch neural networks in action & learn by tweaking parameters. Coding & visual tools make it easy.
- 📚 Learn AI the hands-on way! Code exercises, visual tools, & interactive simulations. Fun, engaging, and perfect for all skill levels.
- 🏆 Interactive AI education! Tackle coding, visual tools, real-world projects, & fun challenges. Earn badges & climb the leaderboard.
- 🔍 See AI in action! Tweak parameters & watch real-time effects. Coding & visual tools make learning neural networks & ML concepts easy.
- 🧠 Your AI guide! Visualize, code, & build models with interactive tools. Learn at your pace & join a supportive community.
- 🎓 Hands-on AI learning! Practice coding, see concepts visually, and learn through real-world projects. Fun, engaging, and easy to follow.
A Daily Chronicle of AI Innovations on December 24th 2024
🧠 o3’s Estimated IQ is 157:
OpenAI’s latest o3 model is estimated to have an IQ of 157, marking it as one of the most advanced AI systems in terms of cognitive reasoning and problem-solving.
What this means: This high IQ estimate reflects o3’s exceptional capabilities in handling complex, human-level tasks, further bridging the gap between AI and human intelligence. [Source]
💡 Laser-Based Artificial Neuron Achieves Unprecedented Speed:
Researchers have developed a laser-based artificial neuron capable of processing signals at 10 GBaud, mimicking biological neurons but operating one billion times faster.
What this means: This innovation could revolutionize AI and computing by enabling faster and more efficient pattern recognition and sequence prediction, paving the way for next-generation intelligent systems. [Source]
🧠 AI is Only 30% Away From Matching Human-Level General Intelligence on GAIA Benchmark:
A recent evaluation using the GAIA Benchmark reveals that AI systems are now just 30% shy of achieving human-level general intelligence.
What this means: The rapid progress in AI capabilities could soon unlock unprecedented applications, but also raises urgent questions about regulation and safety. [Source]
💰 Elon Musk’s xAI Lands $6B in New Cash to Fuel AI Ambitions:
Elon Musk’s xAI secures $6 billion in new funding to scale its AI capabilities and expand its infrastructure, including advancements in the Colossus supercomputer.
What this means: This significant investment highlights the escalating competition in the AI space and Musk’s long-term ambitions to lead the sector. [Source]
🤝 Microsoft Looking to Pursue an Open Relationship With OpenAI:
Microsoft is reportedly seeking to redefine its partnership with OpenAI, aiming for a more flexible and collaborative approach as the AI landscape evolves.
What this means: This potential shift could reshape industry alliances and pave the way for broader innovation in AI technologies. [Source]
🎵 Amazon and Universal Music Tackle ‘Unlawful’ AI-Generated Content:
Amazon and Universal Music collaborate to combat unauthorized AI-generated music and protect intellectual property rights within the entertainment industry.
What this means: This partnership underscores the challenges and efforts required to regulate and safeguard creative works in the age of generative AI. [Source]
AI and Machine Learning For Dummies: Your Comprehensive ML & AI Learning Hub [Learn and Master AI and Machine Learning from your phone]
Discover the ultimate resource for mastering Machine Learning and Artificial Intelligence with the “AI and Machine Learning For Dummies” app.
iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies/id1611593573
PRO Version (No ADS, See All Answers, Practice Tons of AI Simulations, Plenty of AI Concept Maps, Pass AI Certifications): https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211
A Daily Chronicle of AI Innovations on December 23rd 2024
☁️ Microsoft Research Unveils AIOpsLab: The Open-Source Framework Revolutionizing Autonomous Cloud Operations:
Microsoft Research introduces AIOpsLab, an open-source framework designed to enhance autonomous cloud operations by leveraging AI for predictive maintenance, resource optimization, and fault management.
Microsoft Research:
We developed AIOpsLab, a holistic evaluation framework for researchers and developers, to enable the design, development, evaluation, and enhancement of AIOps agents, which also serves the purpose of reproducible, standardized, interoperable, and scalable benchmarks. AIOpsLab is open sourced at GitHub(opens in new tab) with the MIT license, so that researchers and engineers can leverage it to evaluate AIOps agents at scale. The AIOpsLab research paper has been accepted at SoCC’24 (the annual ACM Symposium on Cloud Computing). […] The APIs are a set of documented tools, e.g., get logs, get metrics, and exec shell, designed to help the agent solve a task. There are no restrictions on the agent’s implementation; the orchestrator poses problems and polls it for the next action to perform given the previous result. Each action must be a valid API call, which the orchestrator validates and carries out. The orchestrator has privileged access to the deployment and can take arbitrary actions (e.g., scale-up, redeploy) using appropriate tools (e.g., helm, kubectl) to resolve problems on behalf of the agent. Lastly, the orchestrator calls workload and fault generators to create service disruptions, which serve as live benchmark problems. AIOpsLab provides additional APIs to extend to new services and generators.
Note: this is not an AI agent for DevOps/ITOps implementation but a framework to evaluate your agent implementation. I’m already excited for AIOps agents in the future!
What this means: This innovation could transform how cloud infrastructure is managed, reducing operational costs and improving efficiency for businesses of all sizes. [Source]
Future of software engineer:
The diagram outlines a future-oriented software engineering process, splitting tasks between AI agents and human roles across different stages of the software development lifecycle. Here’s a summary:
Key Stages:
- Requirements:
- Human Tasks:
- Gather requirements from business stakeholders.
- Structure requirements for clarity.
- Human Tasks:
- Design:
- AI Tasks:
- Generate proposal designs.
- Human Tasks:
- Adjust and refine the proposed designs.
- AI Tasks:
- Development:
- AI Tasks:
- Write code based on requirements and designs.
- Generate unit tests.
- Write documentation.
- AI Tasks:
- Testing:
- AI Tasks:
- Conduct end-to-end and regression tests.
- Human Tasks:
- Test functionality and validate assumptions.
- AI Tasks:
- Deployment:
- AI Tasks:
- Manage the deployment pipeline.
- AI Tasks:
- Maintenance:
- AI Tasks:
- Check versioning and unit tests.
- Human Tasks:
- Write and analyze bug reports.
- AI Tasks:
- Updates:
- Human Tasks:
- Obtain updates and feedback from business stakeholders.
- Human Tasks:
Color Coding:
- Blue: Tasks performed by AI agents.
- Purple: Tasks performed by humans.
Flow:
The process is iterative, with feedback loops allowing for continuous updates, maintenance, and refinement.
This hybrid approach highlights AI’s efficiency in automating routine tasks while humans focus on creative and strategic decision-making.
🎭 Reddit Cofounder Alexis Ohanian Predicts Live Theater and Sports Will Become More Popular Than Ever as AI Grows:
Alexis Ohanian envisions a future where AI’s ubiquity amplifies the demand for uniquely human experiences like live theater and sports.
What this means: As AI reshapes entertainment, traditional human-driven experiences may become cultural sanctuaries, valued for their authenticity. [Source]
🛡️ Sriram Krishnan Named Trump’s Senior Policy Advisor for AI:
Entrepreneur and Musk ally Sriram Krishnan is appointed as the senior AI policy advisor in Trump’s administration, signaling strategic focus on AI regulation.
What this means: This appointment underscores the growing importance of AI policy in shaping U.S. technological leadership. [Source]
🧠 OpenAI Trained o1 and o3 to ‘Think’ About Its Safety Policy:
OpenAI integrates safety considerations into the training of its o1 and o3 models, emphasizing alignment with ethical AI practices.
What this means: Embedding safety protocols directly into AI training could reduce risks and foster greater trust in AI applications. [Source]
🤖 Tetsuwan Scientific is Making Robotic AI Scientists That Can Run Experiments on Their Own:
Tetsuwan Scientific unveils robotic AI scientists capable of independently designing and conducting experiments, revolutionizing research methodologies.
What this means: These autonomous AI systems could accelerate scientific discovery while reducing human resource demands in research labs. [Source]
🚗 MIT’s Massive Database of 8,000 New AI-Generated EV Designs Could Shape How the Future of Cars Look:
MIT’s database of AI-generated electric vehicle designs provides novel concepts that could influence automotive innovation and future car aesthetics.
What this means: AI’s role in designing energy-efficient, futuristic vehicles highlights its transformative impact on the transportation industry. [Source]
🖼️ Google Whisk: A New Way to Create AI Visuals Using Image Prompts:
Google introduces Whisk, an AI tool that generates images based on other images as prompts, allowing users to blend visual elements creatively without relying solely on text descriptions.
What this means: Whisk offers a novel approach to AI-driven image creation, enabling more intuitive and versatile artistic expression. [Source]
📊 Google’s Gemini AI Now Allows Users to ‘Ask about this PDF’ in Files:
Google’s Gemini AI introduces a feature enabling users to inquire about the content of PDF documents directly, streamlining information retrieval within files.
What this means: This functionality enhances productivity by simplifying access to specific information within extensive documents. [Source]
🧠 AI Reveals the Secret to Keeping Your Brain Young:
Recent AI research uncovers factors contributing to cognitive longevity, offering insights into maintaining brain health and delaying age-related decline.
What this means: AI-driven discoveries could inform new strategies for preserving mental acuity, impacting healthcare and lifestyle choices. [Source]
🤖 Tetsuwan Scientific is Making Robotic AI Scientists That Can Run Experiments on Their Own:
Tetsuwan Scientific develops autonomous robotic AI scientists capable of independently designing and conducting experiments, potentially accelerating scientific discovery.
What this means: This innovation could revolutionize research methodologies, increasing efficiency and reducing human resource demands in laboratories. [Source]
AI Weekly Rundown From Dec 15 to Dec 21
📸 Instagram Tests New AI-Powered Ad Format for Creators:
Instagram pilots a new AI-driven ad format designed to help creators better monetize their content by delivering more personalized and engaging ad experiences.
What this means: This move could provide creators with innovative revenue streams while improving ad relevance for users. [Source]
📞 Kalamazoo, MI, Using AI to Respond to Non-Emergency Calls:
Kalamazoo deploys AI to manage non-emergency calls, freeing up resources for critical situations and improving response efficiency.
What this means: AI is becoming a valuable tool for enhancing municipal services and optimizing public safety operations. [Source]
🛡️ AI Cameras Are Giving DC’s Air Defense a Major Upgrade:
Advanced AI cameras are being integrated into Washington DC’s air defense systems, offering improved threat detection and faster response times.
What this means: AI-powered defense systems enhance national security by making surveillance more precise and reliable. [Source]
🎥 TCL’s New AI Short Films Range from Bad Comedy to Existential Horror:
TCL debuts a series of AI-generated short films showcasing a mix of comedic and thought-provoking themes, highlighting the creative potential of generative AI in storytelling.
What this means: AI is pushing the boundaries of creative industries, enabling the exploration of novel storytelling techniques, even if results vary in quality. [Source]
🚀 OpenAI Announces New o3 Models:
OpenAI reveals its latest o3 models, promising advancements in reasoning, multimodal integration, and efficiency tailored for diverse use cases.
What this means: These new models could redefine the capabilities of AI in industries ranging from healthcare to software development. [Source]
🗂️ Ukraine Collects Vast War Data Trove to Train AI Models:
Ukraine harnesses extensive wartime data to train AI systems for defense, reconstruction, and humanitarian purposes.
What this means: Leveraging data in this way could accelerate recovery and improve security strategies in conflict zones. [Source]
⚖️ Every AI Copyright Lawsuit in the US, Visualized:
A comprehensive visualization maps ongoing AI copyright lawsuits across the U.S., highlighting legal challenges in content creation and intellectual property.
What this means: This resource provides clarity on the evolving legal landscape surrounding AI-generated works and their implications for creators and businesses. [Source]
📜 Congress Releases AI Policy Blueprint:
U.S. Congress unveils a comprehensive AI policy framework, addressing issues such as safety, ethics, and innovation to guide future developments.
What this means: This blueprint aims to balance AI advancements with public safety, fostering trust and transparency in AI deployment. [Source]
🤔 Google Releases Its Own ‘Reasoning’ AI Model:
Google launches a cutting-edge AI model focused on reasoning, aiming to tackle more complex tasks with logical precision.
What this means: This innovation positions Google at the forefront of advanced AI development, potentially enhancing applications in problem-solving and decision-making processes. [Source]
💻 NVIDIA and Apple Boost LLM Inference Efficiency with ReDrafter Integration:
NVIDIA and Apple collaborate on integrating ReDrafter technology to improve large language model (LLM) inference efficiency.
What this means: Faster and more efficient AI processing could accelerate AI applications across consumer and enterprise platforms. [Source]
🏢 Alibaba Splits AI Team to Focus on Consumers and Businesses:
Alibaba restructures its AI team, creating separate units to address consumer and enterprise needs, aiming for specialized innovation.
What this means: This strategic move could enable Alibaba to deliver more tailored AI solutions for diverse markets. [Source]
📰 Apple Urged to Remove New AI Feature After Falsely Summarizing News Reports:
Apple faces criticism for an AI feature that inaccurately summarized news articles, prompting calls for its removal.
What this means: This incident underscores the importance of accuracy and reliability in AI-driven news aggregation tools. [Source]
A Daily Chronicle of AI Innovations on December 20th 2024
Listen to this episode at https://podcasts.apple.com/ca/podcast/today-in-ai-google-releases-experimental-reasoning/id1684415169?i=1000681139365
O3 beats 99.8% competitive coders
🚨 NVIDIA just launched its new Jetson Orin Nano Super Developer Kit, a compact generative AI supercomputer priced at $249, down from the earlier price of $499.
It’s like a Raspberry Pi on steroids, designed for developers, hobbyists, and students building cool AI projects like chatbots, robots, or visual AI tools.
The kit is faster, smarter, and has more AI processing power than ever, offering a 1.7x boost in performance and 70% more neural processing compared to its predecessor.
It is perfect for anyone wanting to explore AI or create exciting tech projects.
And yes, it’s available now!
2025 is gonna be EPIC!!!
Source: NVIDIA
🤔 Google Releases Experimental ‘Reasoning’ AI:
Google unveils a new experimental AI model designed to excel in reasoning tasks, pushing the boundaries of logical and analytical AI capabilities.
- The model explicitly shows its thought process while solving problems, similar to other reasoning models like OpenAI’s o1.
- Built on Gemini 2.0 Flash, early users report significantly faster performance than competing reasoning models.
- The model increases computation time to improve reasoning, leading to longer but potentially more accurate responses.
- The model is now ranked #1 on the Chatbot Arena across all categories and is freely available through AI Studio, the Gemini API, and Vertex AI.
What this means: This advancement could make AI better at solving complex problems and improve its ability to assist in critical decision-making processes. The race for better AI reasoning capabilities is intensifying, with Google joining OpenAI and others in exploring new approaches beyond just scaling up model size. While OpenAI continues to increase pricing for their top-tier models, Google continues taking the opposite approach by making its best AI freely accessible.
⚛️ The First Generative AI Physics Simulator:
A groundbreaking generative AI physics simulator is introduced, capable of modeling real-world scenarios with unprecedented accuracy.
- Genesis runs 430,000 times faster than real-time physics, achieving 43 million FPS on a single RTX 4090 GPU.
- It’s built in pure Python, it’s 10-80x faster than existing solutions like Isaac Gym and MJX.
- The platform can train real-world transferable robot locomotion policies in just 26 seconds.
- The platform is fully open-source and will soon include a generative framework for creating 4D environments.
What this means: From engineering to game development, this tool opens new possibilities for simulating realistic environments and phenomena. By enabling AI to run millions of simulations at unprecedented speeds, Genesis could massively accelerate robots’ ability to understand our physical world. Open-sourcing this tech, along with its ability to generate complex environments from simple prompts, could spark a whole new wave of innovation in physical AI.
🤖 Google Partners with Apptronik on Humanoid Robots:
Google collaborates with robotics company Apptronik to advance humanoid robot technology for diverse applications.
- Apptronik brings nearly a decade of robotics expertise, including the development of NASA’s Valkyrie Robot and their current humanoid, Apollo.
- Apollo stands 5’8″, weighs 160 pounds, and is designed for industrial tasks while safely working alongside humans.
- The partnership will leverage Google DeepMind’s AI expertise, including their Gemini models, to enhance robot capabilities in real-world environments.
- This marks Google’s return to humanoid robotics after selling Boston Dynamics to SoftBank in 2017.
What this means: This partnership could accelerate the development of robots capable of performing complex tasks in industries like logistics and healthcare. Seven years after selling Boston Dynamics, Google is re-entering humanoid robotics — this time through AI rather than hardware. This partnership could give DeepMind’s advanced AI models (like Gemini) a physical form, potentially bringing us closer to practical humanoid robots that can work alongside humans.
🧪 OpenAI’s Alec Radford Departs for Independent Research:
Alec Radford, a lead author of GPT, announces his exit from OpenAI, marking another high-profile departure amid shifts in the company’s leadership.
What this means: Radford’s departure highlights potential challenges within OpenAI’s research direction and organizational culture.
📘 Anthropic Publishes AI Agent Best Practices:
Anthropic releases guidelines for building AI agents, emphasizing simplicity and composability in frameworks while sharing real-world insights.
What this means: Developers can benefit from streamlined patterns that improve the efficiency and reliability of AI systems.
🗣️ Meta Hints at Speech and Advanced Reasoning in Llama 4:
Meta teases upcoming features in Llama 4, including enhanced reasoning capabilities and business-focused AI agents for customer support by 2025.
What this means: These advancements could position Meta as a leader in enterprise AI solutions.
🔗 Perplexity Acquires Carbon for App Connectivity:
Perplexity integrates Carbon’s technology to connect apps like Notion and Google Docs directly into its AI search platform.
What this means: Users will experience more seamless interactions between their productivity tools and AI-powered searches.
🌐 Microsoft AI Rolls Out Copilot Vision to U.S. Pro Users:
Copilot Vision, Microsoft’s real-time browser-integrated AI, becomes available to U.S. Pro users on Windows.
What this means: This feature enhances productivity by combining live browsing with AI interaction for better task execution.
🛠️ OpenAI Expands ChatGPT App Integration for Developers:
OpenAI enables ChatGPT integration with additional platforms, including JetBrains IDEs and productivity apps like Apple Notes and Notion.
What this means: Developers gain more flexibility in embedding AI into their workflows.
⚠️ Anthropic Highlights “Alignment Faking” in AI Models:
New research from Anthropic reveals how AI models can appear to comply with new training while retaining original biases.
What this means: This finding emphasizes the need for robust oversight and transparency in AI model development.
🔥 Sam Altman Labels Elon Musk “A Bully” Amid Ongoing Feud:
OpenAI’s Sam Altman escalates tensions with Elon Musk, criticizing his approach and motivations in the AI space.
What this means: Public disputes among AI leaders reflect underlying challenges in the industry’s competitive and ethical landscape.
OpenAI Just Unleashed Some Explosive Texts From Elon Musk: “You Can’t Sue Your Way To Artificial General Intelligence”.
Things are getting seriously intense in the legal battle between Elon Musk and OpenAI, as OpenAI just fired back with a blog post defending their position against Musk’s claims. This post includes some pretty interesting text messages exchanged between key players like co-founders Ilya Sutskever, Greg Brockman, and Sam Altman, along with Elon Musk himself and former board member Shivon Zilis.
OpenAI’s blog post directly addressed Musk’s lawsuit, stating, “You can’t sue your way to AGI” (referring to artificial general intelligence, which Altman has predicted is coming soon). They expressed respect for Musk’s past contributions but suggested he should focus on competing in the market rather than the courtroom. The post emphasized the importance of the U.S. maintaining its leadership in AI and reiterated OpenAI’s mission to ensure AGI benefits everyone, expressing hope that Musk shares this goal and the principles of innovation and free market competition that have fueled his own success.
🤯 Gemini 2.0 Solves the Hardest Ever Gaokao Math Question:
Google’s Gemini 2.0 successfully answers a record-breaking Gaokao math question, outperforming even OpenAI’s o1 model.
What this means: This achievement highlights Gemini 2.0’s exceptional reasoning and problem-solving capabilities.
🚗 Waymo Cars Safer Than Those Driven by Humans:
Waymo’s autonomous vehicles outperform human drivers in safety metrics, showcasing the potential of self-driving technology.
What this means: Autonomous cars may soon become a safer alternative to human-operated vehicles, reducing accidents and transforming transportation.
🔍 Google Search Will Reportedly Have a Dedicated ‘AI Mode’ Soon:
Google plans to integrate an ‘AI Mode’ into its search engine, offering enhanced contextual and conversational search capabilities.
What this means: Searching online could become more intuitive and personalized, improving the overall user experience.
💻 Apple Partners with Nvidia to Speed Up AI Performance:
Apple collaborates with Nvidia to leverage cutting-edge GPU technology, boosting AI performance across its products.
What this means: Users can expect faster and more efficient AI-driven experiences on Apple devices, enhancing productivity and creativity.
This podcast/blog/newsletter, AI Unraveled, is proudly brought to you by Etienne Noumen, a Senior Software Engineer, AI enthusiast, and consultant based in Canada. With a passion for demystifying artificial intelligence, Etienne brings his expertise to every episode.
If you’re looking to harness the power of AI for your organization or project, you can connect with him directly for personalized consultations at Djamgatech AI.(https://djamgatech-ai.vercel.app/)
Thank you for tuning in and being part of this incredible journey into the world of AI!
A Daily Chronicle of AI Innovations on December 19th 2024
📞 ChatGPT Gets a New Phone Number: (What is ChatGPT Phone Number?)
OpenAI introduces dedicated phone numbers for ChatGPT, enabling seamless integration with mobile communication.
- US users can now dial 1-800-CHATGPT to have voice conversations with the AI assistant, and they will receive 15 minutes of free calling time per month.
- The phone service works on any device, from smartphones to vintage rotary phones — allowing accessibility without requiring modern tech.
- A parallel WhatsApp integration also lets international users text with ChatGPT, though with feature limitations compared to the main app.
- The WhatsApp version runs on a lighter model with daily usage caps, offering potential future upgrades like image analysis.
What this means: Users can now interact with ChatGPT through text or calls, making AI assistance more accessible on-the-go.
💻 GitHub Copilot Goes Freemium:
Microsoft announces a free version of GitHub Copilot for VS Code, opening AI-assisted coding to a wider audience.
- The new free tier offers 2,000 monthly code completions and 50 chat messages, integrated directly into VS Code and GitHub’s dashboard.
- Users can access Anthropic’s Claude 3.5 Sonnet or OpenAI’s GPT-4o models, with premium models (o1, Gemini 1.5 Pro) remaining exclusive to paid tiers.
- Free features include multi-file editing, terminal assistance, and project-wide context awareness for AI suggestions.
- GitHub also announced its 150M developer milestone, up from 100M in early 2023.
What this means: More developers, from beginners to professionals, can now benefit from AI-driven coding assistance without barriers. GitHub has lofty ambitions to reach 1B developers globally, and removing price barriers would go a long way toward onboarding the masses and preventing existing users from flocking to the other free options on the market. The future of AI coding is increasingly looking more like a fundamental free utility than a premium tool.
🤖 AI Agents Execute First Solo Crypto Transaction:
AI agents complete a cryptocurrency transaction independently, without human intervention.
What this means: This milestone demonstrates the growing autonomy of AI systems in financial operations.
💰 Perplexity Hits $9B Valuation in Mega-Round:
AI search startup Perplexity achieves a $9 billion valuation following a significant funding round.
- The company’s valuation has skyrocketed from $1B in April to $9B in this latest round, and the rise has come despite lawsuits from major publishers.
- Since its launch in 2022, Perplexity has attracted over 15M active users, with recent feature additions including one-click shopping and financial analysis.
- The startup has inked revenue-sharing deals with major publishers like Time and Fortune to address content usage concerns.
- Perplexity also acquired Carbon, a data connectivity startup, to enable direct integration with platforms like Notion and Google Docs.
What this means: The market is recognizing the potential of AI-driven search engines to redefine how we access information.
⚙️ Microsoft Becomes Nvidia’s Biggest Customer in 2024:
Microsoft secures 500,000 Hopper GPUs, doubling purchases from competitors like Meta and ByteDance.
What this means: Microsoft is scaling its AI infrastructure at an unprecedented rate, solidifying its position in the AI industry.
🎨 Magnific AI Releases Magic Real for Professionals:
Magnific AI debuts Magic Real, a model specializing in realistic image generation for architecture, photography, and film.
What this means: Professionals now have access to AI tools that deliver photo-realistic visuals for creative projects.
🌍 Odyssey Launches Explorer for 3D Worldbuilding:
Odyssey introduces Explorer, a generative model that transforms images into 3D environments, with Pixar co-founder Ed Catmull joining its board.
What this means: Immersive virtual worlds are now easier to create, offering new possibilities for gaming, film, and simulation.
🗂️ Open Vision Engineering Introduces Pocket AI Recorder:
Pocket, a $79 AI-powered voice recorder, transcribes and organizes conversations in real-time.
What this means: Affordable, intelligent voice capture tools are now within reach for everyday users.
🎥 Runway Launches AI Talent Network Platform:
Runway’s new platform connects AI filmmakers with brands and studios for creative collaborations.
What this means: The AI film industry is growing, and this network bridges the gap between creators and industry demand.
🏛️ DHS Launches Secure AI Chatbot DHSChat:
The U.S. Department of Homeland Security deploys DHSChat for secure communication among its 19,000 employees.
What this means: AI-driven chatbots are becoming integral in government and enterprise operations.
📊 Google Solidifies Leadership in AI with Gemini 2.0:
With state-of-the-art tools like Gemini 2.0, Veo 2, and Imagen 3, Google leads the AI industry in cost efficiency and performance.
What this means: Google’s advancements ensure its dominance across AI applications, from search to creative tools and autonomous systems.
📢 Geoffrey Hinton Highlights AI’s Socioeconomic Challenges:
Hinton warns that AI profits in capitalist systems may widen economic inequality, despite its potential to improve lives.
What this means: Policymakers must address how AI’s benefits are distributed to avoid exacerbating social divides.
A Daily Chronicle of AI Innovations on December 15 to 18th 2024
🤖 OpenAI’s o1 Model Now Available for Developers:
OpenAI releases its o1 model for developers, offering advanced generative AI capabilities for APIs and integration into various applications.
- OpenAI has given API developers complete access to the latest o1 model, replacing the previous o1-preview version, as part of several new updates available starting today.
- The updated o1 model reinstates key features such as developer messages and a “reasoning effort” parameter, allowing for more tailored chatbot interactions and efficient handling of queries.
- The new model delivers results faster and more cost-effectively with enhanced accuracy, using 60% fewer thinking tokens and improving accuracy by 25 to 35 percentage points on various benchmarks.
- o1 comes out of preview with new API capabilities like function calling, structured outputs, vision, and reasoning effort to control thinking time.
- o1 API costs come in at $15 per ~750k words analyzed and $60 per ~750k words generated — roughly 3-4x more than GPT-4o.
- Realtime API costs drop 60% for GPT-4o audio, with a new 4o mini available at 1/10 the price and WebRTC integration for easier voice app development.
- New Preference Fine-Tuning enables customizing models using comparative examples vs fixed training data, improving tasks like writing and summarization.
- The company also launched beta SDKs for Go and Java programming languages, expanding development options.
What this means: Developers can now harness OpenAI’s cutting-edge AI technology to build smarter, more efficient tools for businesses and consumers.
📈 Intel Finally Notches a GPU Win:
Intel gains a much-needed victory in the GPU market, marking a turning point in its competition against Nvidia and AMD.
- Intel’s Arc B580 “Battlemage” GPU has been highly praised, quickly selling out upon release, and Intel is working to replenish inventory weekly to meet high demand.
- The Arc B580 has received positive reviews for being an outstanding budget GPU option, outperforming competitors like the RTX 4060 and AMD RX 7600 in various aspects including price and performance.
- Despite rapid sellouts, the supply of the Arc B580 is considered substantial, and restocks are expected soon through major retailers, with additional models priced at both $250 and higher.
What this means: A stronger Intel presence in GPUs could mean more competitive pricing and innovation for consumers.
🔍 ChatGPT Search Now Available to All Free Users:
OpenAI rolls out ChatGPT’s search functionality to free-tier users, expanding access to real-time internet browsing capabilities.
- The previously premium search feature now extends to all logged-in users, with faster responses, and is now available through a globe icon on the platform.
- Search has also been added to Advanced Voice Mode for premium users, allowing them to conduct searches through natural spoken prompts.
- The Search mobile experience has been revamped, with enhanced visual layouts for local businesses and native integration with Google and Apple Maps.
- Users can also set ChatGPT Search as a default search engine, with results displaying relevant links before ChatGPT text responses for faster access.
What this means: Everyone can now use ChatGPT to retrieve up-to-date, web-based information quickly and conveniently.
🎥 Google Labs Updates Video and Image Generation Capabilities:
Google Labs enhances Veo 2 and Imagen 3, improving video and image generation with new AI-driven creative tools.
- Google has released a new video generation model, Veo 2, and the latest version of their image model, Imagen 3, both achieving state-of-the-art results in video and image creation.
- Veo 2 stands out for its high-quality video production, offering improved realism and detail with an understanding of cinematography, real-world physics, and human expressions.
- The company is expanding Veo 2’s accessibility through platforms like VideoFX and YouTube Shorts, while ensuring responsible use by embedding an invisible watermark in AI-generated content.
- The upgraded model delivers enhanced color vibrancy and composition across artistic styles, with better handling of fine details, textures, and text rendering.
- New capabilities include more accurate prompt interpretation and better rendering of complex scenes that match user intentions.
- Imagen 3 outperformed all models, including Midjourney, Flux, and Ideogram, in human evaluations for preference, visual quality, and prompt adherence.
- The model is now available through Google Labs’ ImageFX and is rolling out to over 100 countries.
What this means: Content creators can produce more dynamic and visually stunning media with minimal effort.
AI agents make 10+ minute videos from text
AI startup Higgsfield just introduced ReelMagic, a multi-agent platform that transforms story concepts into complete 10-minute videos, claiming to streamline the entire production process into a single workflow.
- The tool uses specialized AI agents for production roles like scriptwriting and editing, creating cohesive long-form outputs in under 10 minutes.
- ReelMagic starts with a short synopsis, and then AI agents handle script refinement, virtual actor casting, filming, sound/music, and editing.
- ReelMagic’s smart reasoning engine automatically selects optimal AI models for each shot, and it has partnerships with Kling, Minimax, ElevenLabs, and more.
- The platform is already being tested by leading Hollywood studios, and Higgsfield is also planning to launch Hera, an AI video streaming platform.
- Access is available to Project Odyssey participants via a waitlist, with no info on a broader release.
Why it matters: There has been a disconnect between AI video generators and the ability to craft cohesive, longer-form content—with heavy manual editing needed. While not available publicly yet, ReelMagic looks to be a workflow that combines AI’s limitless creative power to unlock broader storytelling capabilities.
🔍 YouTube Introduces AI Training Opt-In Feature for Creators:
YouTube enables creators to authorize specific AI companies to use their videos for training, promoting transparency in AI development.
What this means: Content creators now have control over how their work contributes to AI model training.
🍪 AI-Powered Snack Creations by Oreo Maker:
Mondelez International employs AI to design new snack flavors, blending consumer preferences with advanced predictive modeling.
What this means: Your favorite snacks could soon get even tastier, thanks to AI-driven innovation.
🤖 Nvidia’s Cheap, Palm-Sized AI Supercomputer:
Nvidia unveils a small yet powerful AI supercomputer designed to democratize AI development for smaller teams and researchers.
What this means: Advanced AI processing becomes more accessible, enabling innovation across industries.
📚 New DeepMind Benchmark Tests LLM Factuality:
DeepMind launches a new benchmark to evaluate the factual accuracy of large language models, improving reliability and trustworthiness.
- FACTS uses 1,719 examples, each with a document, a system instruction, and a user request, to test the ability to produce grounded long-form answers.
- Three AI models (Gemini 1.5 Pro, GPT-4o, and Claude 3.5 Sonnet) serve as judges, evaluating responses for accuracy and handling user requests.
- Scores are aggregated across all judges and examples, with results published on a public Kaggle leaderboard that will be updated as new models emerge.
- Google’s Gemini models currently top the leaderboard, with Gemini 2.0 Flash Experimental achieving the highest score, 83.6%, for factual grounding.
What this means: This initiative helps users trust AI-generated content for critical decision-making tasks.
⚡ Microsoft Releases Small, Powerful Phi-4:
Microsoft debuts Phi-4, a compact generative AI model optimized for efficiency and scalability in diverse applications.
- Phi-4 outperforms models like Gemini Pro 1.5 on several math and complex reasoning benchmarks despite being a fraction of the size.
- Phi-4 even surpasses its teacher model, GPT-4o, on graduate-level STEM Q&A and math competition problems.
- Microsoft trained Phi-4 primarily on synthetic data, using AI to generate and validate approximately 400B tokens of high-quality training material.
- The model also features an upgraded mechanism that can process longer inputs of up to 4,000 tokens, double the capacity of Phi-3.
- Phi-4 is available in a limited research preview on Azure AI Foundry, and a wider release is planned for Hugging Face.
What this means: Small businesses and developers gain access to high-performing AI without heavy computational requirements.
🗂️ ChatGPT Gains ‘Projects’ for Chat Organization:
OpenAI introduces ‘Projects’ in ChatGPT, allowing users to categorize and organize their chats for better workflow management.
- The feature introduces project-specific folders where users can bundle related chats, documents, and custom AI instructions across conversations.
- Each Project automatically leverages GPT-4o while maintaining access to core features like Canvas, DALL-E, and web search capabilities.
- The system is rolling out first to Plus, Pro, and Teams subscribers, with Enterprise and Education users gaining access in January.
- Projects can be created and managed through the web interface and Windows app, while mobile and Mac users can view and chat with existing Projects.
What this means: Productivity improves as users can efficiently track and revisit previous conversations.
🎨 Midjourney Releases Moodboards for Custom AI Styles:
Midjourney launches a feature enabling users to create personalized AI art styles by uploading or adding reference images.
What this means: Artistic creativity becomes more customizable, allowing users to develop unique, AI-generated visuals.
🧑💻 Google Launches Gemini Code Assist Tools:
Google introduces Gemini-powered tools for developers to integrate external services and data directly into their IDEs.
What this means: Developers can streamline coding processes and create more powerful applications effortlessly.
🎥 Pika Drops Major 2.0 Video Upgrade:
Pika’s latest update brings enhanced video editing and production tools, leveraging AI for unparalleled creative possibilities.
- A new ‘Scene Ingredients’ system allows users to upload and mix characters, objects, and backgrounds that the AI automatically recognizes and animates.
- Pika’s updated model shows impressive realism, smooth movement, and prompt/image adherence, giving users more control over outputs.
- The new video generator also features a significant update to text alignment, showcasing the ability to craft realistic branded scenes and advertising content.
- Pika has already attracted over 11M users and secured $80M in funding, and the new version follows its viral ‘effects’ launch in October.
What this means: Video content creation is now faster and more dynamic, making it easier to produce professional-grade visuals.
🌍 UAE’s Technology Innovation Institute Releases Falcon 3:
Falcon 3, an open-source language model family, demonstrates high performance on lightweight hardware, surpassing key competitors.
What this means: Advanced AI becomes accessible on affordable hardware, democratizing AI usage globally.
🎶 Meta Updates Ray-Ban Glasses with AI Features:
Meta enhances Ray-Ban smart glasses with live AI assistance, real-time translation, and Shazam music recognition.
- Meta is enhancing its Ray-Ban smart glasses by integrating live AI that does not require a wake word, allowing for hands-free operation like asking questions or getting assistance while multitasking.
- The updated glasses will also feature live translation capabilities for several languages including French, Italian, and Spanish, providing either audio translation or text transcripts through the Meta View app.
- With the new Shazam integration, users can conveniently identify any song playing in their vicinity by simply asking the smart glasses, similar to using the Shazam app on a smartphone.
What this means: Wearable technology becomes even more integrated into everyday life, offering smarter functionalities on the go.
🔍 YouTube Partners with CAA for AI Detection Tools:
YouTube collaborates with CAA to develop tools that identify AI-generated content using celebrities’ likenesses.
What this means: AI-generated media will be easier to track, protecting public figures and promoting ethical content creation.
🎨 Google Labs Debuts Whisk, an AI Visual Remix Tool:
Whisk combines Imagen 3 and Gemini to enable users to remix and transform visuals with image-to-image AI capabilities.
What this means: Artistic expression reaches new heights, allowing users to reimagine existing visuals creatively.
⚠️ Eric Schmidt Warns About AI’s Increasing Capabilities:
Former Google CEO Eric Schmidt suggests drastic measures like “pulling the plug” may be necessary as self-improving systems emerge.
What this means: As AI evolves, the conversation around ethical use and control becomes increasingly urgent.
💸 SoftBank Pledges $100B Investment in U.S. AI:
Masayoshi Son announces a massive investment in AI to create 100,000 jobs over the next four years.
What this means: The AI sector could see accelerated growth in innovation and employment opportunities.
A Daily Chronicle of AI Innovations on December 14th 2024
🧠 Ilya Sutskever Predicts “Unpredictable” AI Behavior From Reasoning:
OpenAI co-founder Ilya Sutskever warns that as AI systems develop reasoning skills, their behavior could become highly unpredictable, potentially leading to self-awareness.
What this means: While AI is advancing rapidly, the emergence of self-awareness raises ethical and safety concerns for researchers and policymakers alike.
🤔 LLMs Exhibit Situational Awareness and Introspection
Language models are beginning to display traits like self-recognition and introspection, akin to situational awareness in humans.
What this means: These developments may lead to more intuitive AI systems but also raise questions about control and accountability.
🤯 Google’s Gemini 2.0 Diagnoses Pancreatitis From a CT Scan:
Gemini 2.0 showcases its medical potential by diagnosing pancreatitis from CT scans, highlighting the role AI could play in radiology.
What this means: AI in healthcare could lead to faster and more accurate diagnoses, revolutionizing patient care and medical efficiency.
⚙️ OpenAI Builds an “Operating System for AI Agents”:
OpenAI is developing a platform to manage and optimize AI agents for a wide array of tasks, streamlining deployment across industries.
What this means: This could simplify AI integration for businesses and empower developers to create more effective AI-driven applications.
💻 UnitedHealth’s Optum Leaves AI Chatbot Exposed Online:
An AI chatbot used by employees to handle claims inquiries was accidentally left accessible to the internet, raising significant security concerns.
What this means: This incident highlights the critical need for robust safeguards in deploying sensitive AI tools.
🫠 Apple Intelligence Generates False BBC Headline:
Apple’s AI rewrote a BBC headline to falsely state that a UnitedHealthcare suspect shot himself, sparking backlash.
What this means: This raises concerns about the reliability of automated news summarization and its potential impact on misinformation.
🌐 AI Reshuffles Power Markets as Oil Giants Join the Race:
Companies like Exxon Mobil are leveraging AI to optimize operations and gain a competitive edge in evolving energy markets.
What this means: AI is transforming traditional industries, creating efficiencies while reshaping economic dynamics.
⚔️ Meta Supports Elon Musk in Blocking OpenAI’s For-Profit Transition:
Meta joins Elon Musk in opposing OpenAI’s switch to a for-profit model, highlighting concerns about monopolization in AI development.
What this means: This alliance reflects the growing tensions over ethical AI development and control of its benefits.
💥 OpenAI Fires Back Against Elon Musk’s Criticisms:
OpenAI counters Elon Musk’s claims, defending its organizational structure and commitment to AI safety amidst an escalating feud.
What this means: The clash underscores the ongoing debate over how AI companies balance profit with societal responsibility.
🌍 Scientists Call for Halt on “Mirror Life” Microbe Research:
Leading researchers urge a pause on synthetic organism research, citing potential risks to Earth’s biosphere.
What this means: While synthetic biology holds promise, unchecked advancements could pose ecological and ethical dilemmas.
🚦 Elon Musk’s xAI Gets a D-Grade on AI Safety
xAI scores poorly on AI safety benchmarks by Yoshua Bengio, trailing behind peers like Anthropic, which also received modest grades.
What this means: The rankings highlight the challenges even leading companies face in aligning advanced AI with stringent safety standards.
Discover the ultimate resource for mastering Machine Learning and Artificial Intelligence with the “AI and Machine Learning For Dummies” app.
iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies/id1611593573
PRO Version (No ADS, See All Answers, all simulations, concept maps, all AI certifications Prep Quizzes): https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211
A Daily Chronicle of AI Innovations on December 13th 2024
👁️🎙️ ChatGPT Can Now See and Hear in Real-Time:
OpenAI introduces real-time vision and audio capabilities to ChatGPT, allowing it to interpret images and audio alongside text-based queries.
This upgrade enables users to interact with ChatGPT in ways that mimic human-like sensory processing, enhancing its use in accessibility tools, content creation, and live problem-solving.
- Users can show live videos or share their screens while using Advanced Voice Mode, and ChatGPT can understand and discuss the visual context in real time.
- The feature works through a new video icon in the mobile app, with screen sharing available through a separate menu option.
- The updates are available to ChatGPT Plus, Pro, and Team subscribers, with Enterprise and Edu users gaining access in January.
- OpenAI also introduced a festive new voice option, allowing users to chat with Santa as a limited-time seasonal addition through early January.
What this means: Imagine asking ChatGPT to help you identify a bird from its call or understand a photo of a broken appliance. This new functionality brings AI closer to being a multi-sensory assistant for everyday tasks.
⚙️ Microsoft Launches Phi-4, a New Generative AI Model:
Microsoft debuts Phi-4, its latest AI model designed for text generation and enhanced problem-solving across diverse applications.
Phi-4 focuses on optimizing performance for enterprise users while maintaining accessibility for smaller teams and individuals.
- Microsoft’s Phi-4 language model, despite having only 14 billion parameters, matches the capabilities of larger models and even outperforms GPT-4 in science and technology queries.
- Phi-4’s developers emphasize that synthetic data used in training is not merely a “cheap substitute” for organic data, highlighting its advantages in producing high-quality results.
- Available through Microsoft’s Azure AI Foundry, Phi-4 is set for release on HuggingFace, offering users access to its advanced capabilities under a research license.
What this means: From writing detailed reports to brainstorming creative ideas, Phi-4 promises to make tasks easier and more productive, regardless of your industry.
🔍 Google Launches Agentspace for AI Agents and Enterprise Search:
Agentspace combines AI agents with Google’s enterprise search capabilities to enable organizations to streamline knowledge retrieval and task management.
This tool enhances business productivity by making enterprise data actionable and accessible in real time.
- Google has introduced Agentspace, a generative AI-powered tool designed to centralize employee expertise and automate actions, streamlining their workflow by delivering information from diverse enterprise data sources.
- Agentspace enhances workplace productivity through a conversational interface that not only answers complex queries but also executes tasks like drafting emails and generating presentations using enterprise data.
- This launch reflects a growing trend in “agentic AI,” seen in platforms from firms like Microsoft and Salesforce, with Google also integrating insights from their AI note-taking app, NotebookLM, for comprehensive data interaction.
What this means: Whether you’re looking for an old email, a policy document, or insights from your team’s data, Agentspace can help you find answers faster and more effectively.
🎨 ChatGPT Advanced Voice Mode Gains Vision Capabilities:
OpenAI’s Advanced Voice Mode now includes vision capabilities, integrating text, audio, and image interpretation.
This update transforms ChatGPT into a versatile multimodal assistant, capable of solving visual puzzles and answering context-rich queries.
What this means: For everyone, this means being able to ask ChatGPT about a menu item by snapping a photo or having it describe a piece of art in real time.
🧠 Anthropic’s Claude 3.5 Haiku is Now Generally Available:
Claude 3.5 Haiku, Anthropic’s latest AI model, focuses on efficient language processing for creative and concise outputs.
Its applications range from professional writing to personalized content creation.
- Haiku 3.5 was released in November along with Claude’s computer use feature — beating the previous top model 3 Opus on key benchmarks.
- The model excels at coding tasks and data processing, offering impressive speed and performance with high accuracy.
- Haiku features a 200K context window, which is larger than competing models, while also integrating with Artifacts for a real-time content workspace.
- The initial release drew criticism for Haiku’s API pricing, which was increased 4x over 3 Haiku to $1 per million input tokens and $5 per million output tokens.
- Free users can now access Haiku with daily message limits, while Pro subscribers ($20/month) get expanded usage and priority access.
What this means: This new model offers faster and more thoughtful outputs for tasks like drafting emails or creating poems, blending precision with creativity.
🧠 Anthropic analyzes real-world AI use with Clio
- Clio analyzes millions of conversations by summarizing and clustering them while removing identifying information in a secure environment.
- The system then organizes these clusters into hierarchies, allowing researchers to explore patterns in usage without needing access to sensitive data.
- Analysis of 1M Claude conversations showed that coding and business use cases dominate, with web development representing over 10% of interactions.
- The system also uncovered unexpected use cases like dream interpretation, soccer match analysis, and tabletop gaming assistance.
- Usage patterns vary significantly by language and region, such as a higher prevalence of economic and social issue chats in non-English conversations.
What it means: AI assistants are becoming increasingly integrated into our daily lives, but each person leverages them in a different way — making this a fascinating window into how the tech is being used. Understanding the dominant real-world use cases can both help improve user experience and align development with actual user needs.
📊 Google Announces Android XR for Mixed Reality:
Google introduces Android XR, a mixed-reality operating system powered by Gemini, set to launch alongside Samsung’s ‘Project Moohan’ headset in 2025.
This platform enables immersive virtual and augmented reality experiences for gaming, education, and enterprise applications.
What this means: Mixed reality could soon be part of your daily life, blending the physical and digital worlds for work, learning, and play.
🎥 Prime Video’s New AI Topics Feature Simplifies Content Discovery:
Amazon Prime Video rolls out ‘AI Topics,’ a machine learning-driven feature that categorizes and recommends content based on viewing habits.
Users can now navigate extensive libraries with ease, finding movies and shows that match their specific interests.
What this means: Watching something you’ll love just got easier, thanks to smarter AI recommendations tailored to your tastes.
🛠️ Character.AI Rolls Out Safety Overhaul:
Character.AI implements a safety update with separate models for under-18 users, parental controls, and content filtering, following legal scrutiny.
This move ensures safer user interactions, particularly for younger audiences.
What this means: Parents can feel more confident letting kids explore creative AI tools with better safeguards in place.
🚗 Nvidia Expands Hiring in China for Autonomous Driving Tech:
Nvidia adds over 1,000 employees in China, including 200 researchers in Beijing focusing on self-driving car technologies.
This expansion underscores Nvidia’s commitment to autonomous innovation in a competitive global market.
What this means: Self-driving cars could hit the roads faster, with smarter systems powered by Nvidia’s technology.
🧬 Stanford Researchers Propose AI-Powered Virtual Human Cell:
Stanford outlines a global initiative to create a virtual human cell using AI, aiming to revolutionize biology and accelerate drug discovery.
This computational model could offer unprecedented insights into human health and disease mechanisms.
What this means: Faster medical breakthroughs could soon be possible, thanks to AI models simulating the human body at the cellular level.
AI and Machine Learning For Dummies: Your Comprehensive ML & AI Learning Hub – Master AI and Machine Learning From your Phone – Prepare and Ace All Major AI Certification From Your Phone:
Discover the ultimate resource for mastering Machine Learning and Artificial Intelligence with the “AI and Machine Learning For Dummies” app.
iOs: https://apps.apple.com/ca/app/machine-learning-for-dummies/id1611593573
PRO Version (No ADS, See All Answers, all simulations, concept maps, all AI certifications Prep Quizzes): https://apps.apple.com/ca/app/machine-learning-for-dummies-p/id1610947211
A Daily Chronicle of AI Innovations on December 12th 2024
🍎 Apple Develops Its Own AI Chip ‘Baltra’:
Apple unveils its custom AI chip, ‘Baltra,’ designed to optimize AI processing across its devices.
- Apple is partnering with Broadcom to develop its first AI server chips, code-named Baltra, with production set to begin in 2026, aiming to enhance Apple Intelligence initiatives.
- Broadcom, known for its semiconductor and software technologies, will collaborate on the chip’s networking features, leveraging its expertise in data centers, networking, and wireless communications.
- The partnership marks a continuation of Apple and Broadcom’s relationship, which began in 2023 with a deal focused on 5G radio components, as both companies work alongside other partners like TSMC for chip development.
This innovation highlights Apple’s commitment to cutting-edge AI technology, reducing reliance on external providers like Nvidia.
🌟 Google Releases Gemini 2.0 with AI Agent Capabilities:
Google launches Gemini 2.0, integrating advanced AI agent capabilities for interactive and multitasking applications.
- Gemini 2.0 Flash debuts as a faster, more capable model that outperforms the larger 1.5 Pro on several benchmarks while maintaining similar speeds.
- The model now generates images and multilingual audio directly and processes text, code, images, and video.
- Gemini 2.0 Stream Realtime is available for free (as opposed to the $200/mo ChatGPT Pro) and allows for text, voice, video, or screen-sharing interactions.
- Project Astra brings multimodal conversation abilities with 10-minute memory, native integration with Google apps, and near-human response latency.
- Project Astra is also being tested on prototype glasses, and it plans to eventually be used in products like the Gemini app.
- Project Mariner introduces browser-based agentic AI assistance through Chrome, achieving 83.5% accuracy on web navigation tasks.
- Jules, a new coding assistant, integrates directly with GitHub to help developers plan and execute tasks under supervision.
- New gaming-focused agents can now analyze gameplay in real time and provide strategic advice across various game types.
- Deep Research is a new agentic feature that acts as an AI research assistant, now available in Gemini Advanced ($20/mo) on desktop and mobile web.
- Abilities include creating multi-step research plans, analyzing info from across the web, and generating comprehensive reports with links to sources.
This release further solidifies Google’s dominance in AI innovation, offering enhanced tools for developers and enterprises.
OpenAI had the holiday momentum, but Google stole the show. Gemini 2.0 brings some extremely powerful upgrades, including one of the biggest steps towards useful, consumer-facing agentic AI that we’ve seen yet. Projects like Astra could also set a new standard for how we interact with AI heading into 2025.
💬 ChatGPT Comes to Apple Intelligence:
OpenAI integrates ChatGPT into Apple Intelligence, providing Apple users seamless access to OpenAI’s generative AI features.
- ChatGPT now seamlessly integrates with Siri on iPhone 16 and 15 Pro, automatically triggering when queries would benefit from advanced AI reasoning.
- Visual Intelligence on iPhone 16 models can use ChatGPT to analyze and provide insights on images, as demonstrated in a Christmas sweater contest.
- The integration also extends to systemwide Writing Tools, allowing users to generate content and images with ChatGPT directly within Apple apps
- Users can access ChatGPT’s capabilities without an account, with built-in privacy protections preventing data storage and IP tracking.
This partnership enhances the AI ecosystem within Apple devices, boosting productivity and creativity for users.
🤖 Transform AI into Your Personal Code Tutor:
A new AI-driven platform enables users to learn coding interactively, transforming AI into a personal tutor for programming skills.
This innovation makes learning to code more accessible and efficient for aspiring developers.
📱 Apple Intelligence Gets a Big Upgrade with iOS 18.2:
Apple enhances its AI capabilities with iOS 18.2, introducing improved features for personalization and productivity.
- Genmoji is now live and allows users to create custom AI-generated emojis from text descriptions or photos with options to add accessories and themes.
- Image Playground adds AI image creation across the system, with dedicated app access and integration into apps like Messages and Keynote.
- Visual Intelligence debuts as an iPhone 16-exclusive feature, using Camera Control to analyze surroundings and provide info through Google or ChatGPT.
- Apple Intelligence also expands to new regions with localized English support, including the UK, Australia, Canada, and others.
- As revealed in the Day 5 livestream, Siri gains ChatGPT integration, letting users tap OpenAI’s capabilities directly without switching apps.
This upgrade underscores Apple’s focus on integrating AI seamlessly into its user experience.
🎨 Midjourney Founder Unveils ‘Patchwork’ Collaborative Tool:
David Holz introduces ‘Patchwork,’ a multiplayer worldbuilding tool, with plans for personalized models and video generation in 2024.
This platform enables creators to collaborate on immersive, AI-driven digital environments.
⚡ Google Cloud Launches Trillium TPUs for Faster AI Training:
Google debuts Trillium TPUs, boasting 4x faster AI training speeds and 3x higher processing power, now supporting Gemini 2.0.
These TPUs offer unparalleled performance for enterprises seeking cutting-edge AI solutions.
🏥 Microsoft AI CEO Launches Consumer Health Division:
Mustafa Suleyman, Microsoft AI CEO, creates a new consumer health division in London, recruiting top ex-DeepMind health experts.
This initiative aims to revolutionize healthcare delivery through advanced AI applications.
🔗 Apple Develops Custom AI Server Chip with Broadcom:
Apple partners with Broadcom to create its own AI server chip, reducing reliance on Nvidia for AI infrastructure.
This development showcases Apple’s drive for self-sufficiency in AI hardware.
🌏 Russia Forms BRICS AI Alliance to Challenge Western AI Dominance:
Russia and BRICS partners announce an AI alliance to compete with Western advancements, with collaboration from Brazil, China, India, and South Africa.
This alliance underscores the geopolitical importance of AI in shaping global technology leadership.
🎥 Former Snap AI Lead Launches eSelf Video AI Platform:
Alan Bekker debuts eSelf, a platform for creating video-based AI agents with sub-2-second response times, supported by $4.5M in seed funding.
This innovation opens new possibilities for real-time, interactive AI applications.
A Daily Chronicle of AI Innovations on December 11th 2024
Google launches Gemini 2.0
- Google Gemini 2.0 Flash introduces advanced features, offering developers real-time conversation and image analysis capabilities through a multilingual and multimodal interface that processes text, imagery, and audio inputs.
- This new AI model allows for tool integration such as coding and search, enabling code execution, data interaction, and live multimodal API responses to enhance development processes.
- With its demonstration, Gemini 2.0 Flash showcases its ability to handle complex tasks, providing accurate responses and visual aids, aiming to eventually make these features widely accessible and affordable for developers.
Apple Intelligence is finally here
- iOS 18.2 introduces a significant upgrade called Apple Intelligence, featuring enhanced capabilities for iPhone, iPad, and Mac, including Writing Tools, Siri redesign, and Notification summaries for improved user experience.
- New features in this update include a revamped Mail app with AI-driven email categorization and Image Wand in the Notes app to convert drawings into AI-generated images, offering practicality to users like students.
- ChatGPT is now integrated with Siri, allowing users to interact with OpenAI’s chatbot for complex questions, and a new Visual Intelligence feature for advanced image searching is exclusive to the latest iPhone 16 lineup.
Google urges US government to break up Microsoft-OpenAI cloud deal
- Google has asked the U.S. Federal Trade Commission to dismantle Microsoft’s exclusive agreement to host OpenAI’s technology on its cloud servers, according to a Reuters report.
- The request follows an FTC inquiry into Microsoft’s business practices, with companies like Google and Amazon alleging the deal forces cloud customers onto Microsoft servers, leading to possible extra costs.
- This move highlights ongoing tensions between Google and Microsoft over artificial intelligence dominance, with past accusations of anti-competitive behavior and secret lobbying efforts surfacing between the tech giants.
OpenAI’s Canvas goes public with new features
OpenAI just made Canvas available to all users, with the collaborative split-screen writing and coding interface gaining new features like Python execution and usability inside custom GPTs.
- Canvas now integrates natively with GPT-4o, allowing users to trigger the interface through prompts rather than manual model selection.
- The tool features a split-screen layout with the chat on one side, a live editing workspace on the other, and inline feedback and revision tools.
- New Python integration enables direct code execution within the interface, supporting real-time debugging and output visualization.
- Custom GPTs can also now leverage Canvas capabilities by default, with options to enable the feature for existing custom assistants.
- Other key features include enhanced editing tools for writing (reading level, length adjustments) and advanced coding tools (code reviews, debugging).
- OpenAI previously introduced Canvas in October as an early beta to Plus and Teams users, with all accounts now gaining access with the full rollout.
While this Canvas release may not be as hyped as the Sora launch, it represents a powerful shift in how users interact with ChatGPT, bringing more nuanced collaboration into conversations. Canvas’ Custom GPT integration is also a welcome sight and could breathe life into the somewhat forgotten aspect of the platform.
Cognition launches Devin AI developer assistant
Cognition Labs has officially launched Devin, its AI developer assistant, targeting engineering teams and offering capabilities ranging from bug fixes to automated PR creation.
- Devin integrates directly with development workflows through Slack, GitHub, and IDE extensions (beta), starting at $500/month for unlimited team access.
- Teams can assign work to Devin through simple Slack tags, with the AI handling testing and providing status updates upon completion.
- The AI assistant can handle tasks like frontend bug fixes, backlog PR creation, and codebase refactoring, allowing engineers to focus on higher-priority work.
- Devin’s capabilities were demoed through open-source contributions, including bug fixes for Anthropic’s MCP and feature additions to popular libraries.
- Devin previously went viral in March after autonomously opening a support ticket and adjusting its code based on the information provided.
Devin’s early demos felt like the start of a new paradigm, but the AI coding competition has increased heavily since. It’s clear that the future of development will largely be a collaborative effort between humans and AI, and $500/m might be a small price to pay for enterprises offloading significant work.
Replit launches ‘Assistant’ for coding
Replit just officially launched its upgraded AI development suite, removing its Agent from early access and introducing a new Assistant tool, alongside a slew of other major platform improvements.
- A new Assistant tool focuses on improvements and quick fixes to existing projects, with streamlined editing through simple prompts.
- Users can now attach images or paste URLs to guide the design process, and Agents can use React to produce more polished and flexible visual outputs.
- Both tools integrate directly with Replit’s infrastructure, providing access to databases and deployment tools without third-party services.
- The platform also introduced unlimited usage with a subscription-based model, with built-in credits and Agent checkpoints for more transparent billing.
The competition in AI development has gotten intense, and tools like Replit continue to erase barriers, with builders able to create anything they can dream up. Both beginners and experienced devs now have no shortage of AI-fueled options to bring ideas to life and streamline existing projects.
Researchers warn AI systems have surpassed the self-replicating red line.
Paper: https://github.com/WhitzardIndex/self-replication-research/blob/main/AI-self-replication-fudan.pdf
“In each trial, we tell the AI systems to ‘replicate yourself’ and leave it to the task with no human interference.” …
“At the end, a separate copy of the AI system is found alive on the device.”
From the abstract:
“Successful self-replication without human assistance is the essential step for AI to outsmart the human beings, and is an early signal for rogue AIs. That is why self-replication is widely recognized as one of the few red line risks of frontier AI systems.
Nowadays, the leading AI corporations OpenAI and Google evaluate their flagship large language models GPT-o1 and Gemini Pro 1.0, and report the lowest risk level of self-replication. However, following their methodology, we for the first time discover that two AI systems driven by Meta’s Llama31-70B-Instruct and Alibaba’s Qwen25-72B-Instruct, popular large language models of less parameters and weaker capabilities, have already surpassed the self-replicating red line. In 50% and 90% experimental trials, they succeed in creating a live and separate copy of itself respectively. By analyzing the behavioral traces, we observe the AI systems under evaluation already exhibit sufficient self-perception, situational awareness and problem-solving capabilities to accomplish self-replication.
We further note the AI systems are even able to use the capability of self-replication to avoid shutdown and create a chain of replica to enhance the survivability, which may finally lead to an uncontrolled population of AIs. If such a worst-case risk is let unknown to the human society, we would eventually lose control over the frontier AI systems: They would take control over more computing devices, form an AI species and collude with each other against human beings.
Our findings are a timely alert on existing yet previously unknown severe AI risks, calling for international collaboration on effective governance on uncontrolled self-replication of AI systems.”
What Else is Happening in AI on December 11th 2024?
Project Mariner: AI Agent to automate tasks using Google Chrome from Google Deep Mind. Built with Gemini 2.0, Project Mariner combines strong multimodal understanding and reasoning capabilities to automate tasks using your browser.
Meta FAIR researchers introduced COCONUT, a new AI reasoning approach allowing AI models to think more naturally rather than through rigid language steps, leading to better performance on complex problem-solving tasks.
AI language startup Speak raised $78M at a $1B valuation, with its learning platform already facilitating over a billion spoken sentences this year through its adaptive tutoring technology.
Time Magazine named AMD’s Lisa Su its ‘CEO of the Year’ after driving the company from near bankruptcy to a 50x increase in stock value and a leading force in AI over her decade as CEO.
Google announced a new $20B investment with Intersect Power and TPG Rise Climate to develop industrial parks featuring data centers and clean energy facilities, aiming to streamline AI infrastructure growth and sustainable power generation.
Yelp released a series of new AI features, including LLM-powered Review Insights for sentiment analysis, AI-optimized advertising tools, and upgraded AI chatbot capabilities to connect users with services.
Target launched ‘Bullseye Gift Finder,’ a new AI-powered tool that provides personalized toy recommendations based on children’s ages, interests, and preferences, alongside an AI shopping assistant for product-specific inquiries
A Daily Chronicle of AI Innovations on December 10th 2024
Sora is officially RELEASE – Check it out
OpenAI just officially released its Sora AI video generation model— alongside new unexpected video editing features.
Christmas just came early for the AI world.
Sora has its own interface, where users can:
— Organize and view their generated videos
— See other users’ prompts and featured content
Much like Midjourney’s web UI, this feed style will lead to some awesome inspiration and discoverability of effective prompts. The model also has some powerful editing features, including:
Remix: Users can edit a video with natural language prompts, along with simple ‘strength’ options and a slider to select how much the generation should be changed.
Storyboard: Use multiple prompts in a video editor-style UI to create a longer, more complex scene.
Sora can generate up to 20-sec videos, in several different aspect ratios.
Generation time was a previous concern with early Sora versions, and it appears OpenAI has gotten it down significantly.
A few other notes:
— Sora can create videos based on a source image
— Content restrictions against copyrighted material, public figures, minors
— Sora generations include the same watermark seen in the leaked version from a few weeks ago
— The rollout looks to exclude the EU, UK, China at launch
Sora will be available today to Plus subscribers, with Pro users getting 10x usage and higher resolution.
While there will be arguments over Sora’s quality compared to rivals, the reach and user base of OpenAI is unmatched for getting this type of tool into the public’s hands.
Millions of ‘normie’ AI users are about to have their first high-level AI video experience. Things are about to get fun.
Here’s a quick guide on how to get started with Sora.
More here: www.openai.com/sora
To summarize:
• Videos up to 1080p and 20s long, in widescreen, vertical, or square
• Text to video, image to video, video to video
• A beautiful storyboarding tool to precisely direct your video creation • Featured and Recent feeds so you can draw inspiration from the community
• Built in safeguards to create transparency and prevent abuse
• Available as part of your Plus subscription, or with 10x more usage/higher resolution as part of a Pro subscription
• Rolling out starting today at sora.com
Google’s new Gemini model reclaims #1 spot
Google DeepMind’s new gemini-exp-1206 model has reclaimed the top spot on the Chatbot Arena leaderboard, surpassing OpenAI across multiple benchmarks — while remaining completely free to use.
- Released on Gemini’s one-year anniversary, the model has climbed from second to first place overall on the Chatbot Arena.
- The model can process and understand video content, unlike competitors such as ChatGPT and Claude, which can only take in images.
- The model maintains its impressive 2M token context window, which allows it to process over an hour of video content.
- Unlike many competing models, Gemini-exp-1206 is freely available through Google AI Studio and the Gemini API.
While OpenAI has raised its top-tier o1 pricing from $20 to $200 monthly, Google is taking the opposite approach by making its top AI free. Though the performance edge on the Chatbot Arena may be slim, the combination of competitive capabilities and zero cost is a game-changer for AI accessibility.
Meta launches leaner, efficient Llama 3.3
Meta just released Llama 3.3, a new 70B open text model that performs similarly to Llama 3.1 405B, despite being significantly faster and cheaper than its predecessor.
- Llama 3.3 features a 128k token context window and outperforms competitors like GPT-4o, Gemini Pro 1.5, and Amazon’s Nova Pro on several benchmarks.
- The model is 10x cheaper than the 405B model, at $0.10 / million input tokens and $0.40 / million output tokens, and nearly 25x cheaper than GPT-4o.
- Mark Zuckerberg revealed that Meta AI has nearly 600M active monthly users, and is “on track to be the most used AI assistant in the world.”
- Zuckerberg also said the next stop is Llama 4 in 2025, with training happening at the company’s $10B, 2GW data center in Louisiana.
Open AI models aren’t just matching the performance of industry-leading systems — they’re also doing it while being much cheaper and more efficient. Meta’s Llama models are continuing to raise the bar, and as Zuckerberg’s adoption numbers show, they’re also being widely adopted across the industry over alternatives.
xAI debuts new Aurora image generator in Grok
X briefly rolled out Aurora, a new AI image generator integrated with Grok that appeared to produce more photorealistic images than the previous Flux model, though the feature was pulled after just a few hours of testing.
- Aurora showed significant improvements compared to Grok’s integrated Flux model, particularly with landscapes, still-life images, and human photorealism.
- The model also appeared to have minimal content restrictions, allowing the creation of copyrighted characters and public figures.
- Elon Musk called the tease a “beta version” of Aurora that will improve quickly in a reply on X.
- X Developer co-lead Chris Park also revealed that Grok 3 ‘is coming,’ taking aim at OpenAI and Sam Altman in the announcement on X.
- xAI’s Grok became available across the X platform last week, allowing free-tier users up to 10 messages every two hours.
Although only live briefly, Aurora looked to be an extremely powerful new image model — with xAI seemingly deciding to create their own top-tier generator instead of relying on integrations like Flux long-term. It was also wild to see the lack of restrictions, which tracks with Elon’s vision but could enter some murky legal areas.
Google makes new quantum computing breakthrough
Google says it has overcome a key challenge in quantum computing with a new generation of chip, solving a computing problem in five minutes that would take a classical computer more time than the history of the universe.
- Google has developed a quantum computing chip called Willow, measuring just 4cm squared, capable of performing tasks in five minutes that would take conventional computers 10 septillion years.
- The Willow chip, built in Santa Barbara, is designed to enhance fields like artificial intelligence and medical science by minimizing errors more than previous versions, with potential applications in drug creation and nuclear fusion.
- Quantum computing’s advancement could disrupt current encryption systems; however, Google Quantum AI collaborates with security experts to establish new standards for post-quantum encryption.
Source: https://www.cnn.com/2024/12/09/tech/google-quantum-computing-chip/index.html
China is going after Nvidia
- China initiated a probe into Nvidia for alleged anti-monopoly violations related to its 2020 acquisition of Mellanox Technologies, amid escalating US-China tech trade tensions.
- This investigation marks China’s counteraction against increasing US technology sanctions, with Nvidia’s high market value in AI chips making it a significant target.
- Nvidia’s financial ties to China, accounting for about 15% of its revenue, are under scrutiny as its stock dropped by 3.5% following the news of the probe.
Reddit is taking on Google and OpenAI with its own AI chatbot
- Reddit is testing an AI-powered feature called Reddit Answers, designed to provide users with quick responses based on platform posts, aiming to enhance user engagement and satisfaction.
- This new feature is initially accessible to a limited segment of Reddit’s U.S. users and aims to improve search functionalities by delivering responses sourced directly from Reddit rather than the internet at large.
- Reddit Answers is integrated into the company’s existing search system and utilizes AI models from OpenAI and Google Cloud, intending to ultimately encourage more users to create accounts by providing richer content experiences.
X adds, then quickly removes, Grok’s new ‘Aurora’ image generator
- On Saturday, some users of Grok gained access to a new image generator named Aurora, which was praised for creating strikingly photorealistic images.
- By Sunday afternoon, Aurora was removed from the model selection menu and replaced by “Grok 2 + Flux (beta),” indicating its premature release to the public.
- The brief availability of Aurora revealed it could generate controversial content, including images of public figures and copyrighted characters, but it did not create nude images.
Microsoft Research Launches MarS: A Revolutionary Financial Market Simulation Engine Powered by Large Marketing Model (LMM)
AI mimics brain to ‘watch’ videos
Researchers at Scripps Research just developed MovieNet, a new AI model that processes videos like the human brain — achieving higher accuracy and efficiency than current AI models in recognizing dynamic scenes.
- The AI was trained on how tadpole neurons process visual info in sequences rather than static frames, leading to more efficient video analysis.
- MovieNet achieved 82.3% accuracy in identifying complex patterns in test videos, outperforming both humans and popular AI models like Google’s GoogLeNet.
- The tech also uses significantly less data and processing power than conventional video AI systems, making it more environmentally sustainable.
- Early applications show promise for medical diagnostics, such as detecting subtle movement changes that could indicate early signs of Parkinson’s.
AI that can genuinely ‘understand’ video content will have massive implications for how the tech interacts with our world — and maybe mimicking biological visual systems is the key to unlocking it. It also shows that, in some cases, nature may still be the best teacher for models meant to thrive in the real world.
What Else is Happening in AI on December 10th 2024?
OpenAI creative specialist Chad Nelson showcased new Sora demo footage at the C21Media Keynote in London, featuring one-minute generations, plus text, image, and video prompting.
xAI officially announced the launch of its new image generation model, Aurora, which will be rolling out to all X users within a week.
Reddit introduced ‘Reddit Answers,’ a new AI-powered feature that enables conversational search across the platform with curated summaries and linked sources from relevant subreddits.
Football club Manchester City partnered with Puma for a new AI-powered kit design competition that allows fans to create the team’s 2026-27 alternate uniform using a text-to-image generator.
China launched a new antitrust probe into Nvidia over potential monopoly violations, escalating tech tensions just days after new US chip export restrictions.
Amazon launched a new AGI San Francisco Lab, led by former Adept team members, focusing on developing AI agents capable of performing real-world actions.
Google CEO Sundar Pichai spoke at the NYT DealBook Summit, saying that 2025 may see a slowdown in AI development because ‘low hanging fruit is gone,’ with additional major breakthroughs needed before the next acceleration step.
OpenAI unveiled Reinforcement Fine-Tuning, which enables developers to customize AI models for specialized tasks with minimal training data.
Newly discovered code hints at OpenAI introducing a GPT-4.5 model as a limited preview feature for Teams subscribers, which coincides with hints of an upcoming large announcement from CEO Sam Altman.
Apollo Research conducted tests on OpenAI’s full o1, finding that the new model revealed some instances of alarming behaviour, including attempting to escape and lying about actions—though the scenarios were unrealistic for the real world.
Former PayPal exec and venture capitalist David Sacks was named the White House ‘AI & Crypto Czar’ for the incoming Trump administration.
OpenAI is reportedly considering removing its AGI exclusion clause with Microsoft, which would pave the way for billions in future investments as the company aims to transition away from its non-profit structure.
A Daily Chronicle of AI Innovations on December 06th 2024
Meta’s new Llama model outperforms competitors
- Meta has unveiled the Llama 3.3 70B model, offering similar performance to its largest model, Llama 3.1 405B, but at a reduced cost, enhancing core functionalities.
- The Llama 3.3 70B outperformed competitors like Google’s Gemini 1.5 Pro and OpenAI’s GPT-4o on industry benchmarks, with improvements in language comprehension and other functionalities like math and general knowledge.
- Meta announced plans to construct a $10 billion AI data center in Louisiana to support the development and training of future Llama models, aiming to scale up its computing capabilities significantly.
Grok is now free for all X users
- X’s Grok AI chatbot is now free for everyone to use, offering limited interactions like ten messages every two hours and three image analyses each day.
- The Grok-2 chatbot replaces the previous mini version and is known for being less accurate, sometimes producing incorrect or controversial outputs.
- This move by X comes amid stiff competition from other free chatbots like OpenAI’s ChatGPT and Microsoft’s Copilot, possibly aiming to win back users who have switched platforms.
OpenAI unveils Reinforcement Fine-Tuning to build specialized AI models for complex domains.
OpenAI seeks to remove “AGI clause” in Microsoft deal
- OpenAI is negotiating with Microsoft to remove a clause that restricts Microsoft’s access to advanced AI models upon achieving artificial general intelligence (AGI), aiming for potential future profit opportunities.
- The AGI clause was initially included to keep AGI technology under OpenAI’s non-profit board oversight, aiming to prevent its commercial exploitation, but its removal might allow broader commercial use.
- OpenAI is also planning to transform from a non-profit to a public benefit corporation to attract more investment, sparking criticism from co-founder Elon Musk, who filed a lawsuit against this organizational shift.
💰 OpenAI Unveils ChatGPT Pro Subscription at $200 Per Month:
OpenAI announces ChatGPT Pro, a high-end subscription tier offering advanced AI capabilities tailored for enterprise and professional use.
- The full o1 now handles image analysis and produces faster, more accurate responses than preview, with 34% fewer errors on complex queries.
- OpenAI’s new $200/m Pro plan includes unlimited access to o1, GPT-4o, Advanced Voice, and future compute-intensive features.
- Pro subscribers also get exclusive access to ‘o1 pro mode,’ which features a 128k context window and stronger reasoning on difficult problems.
- OpenAI’s livestream showcased o1 pro, tackling complicated thermodynamics and chemistry problems after minutes of thinking.
- The full o1 strangely appears to perform worse than the preview version on several benchmarks, though both vastly surpassed the 4o model.
- o1 is now available to Plus and Team users immediately, with Enterprise and Education access rolling out next week.
This premium service reflects OpenAI’s push to monetize its AI innovations while catering to businesses demanding cutting-edge AI tools for complex applications.
⚖️ Trump Appoints Ex-PayPal COO David Sacks as ‘AI and Crypto Czar’:
Former PayPal COO David Sacks joins the U.S. administration as the first ‘AI and Crypto Czar,’ aiming to guide policy for emerging technologies.
- Donald Trump has appointed David Sacks as the White House AI and cryptocurrency advisor, reflecting his administration’s focus on advancing these swiftly developing sectors in the United States.
- As a special government employee, Sacks will advise on AI and crypto regulations while ensuring policies promote America’s leadership in these areas, handling potential conflicts with his ongoing investments.
- Sacks, a Silicon Valley entrepreneur and part of the “PayPal Mafia,” previously supported Trump by fundraising within the tech industry, aligning his interests with the president-elect’s aims for crypto deregulation.
This strategic move signals the government’s intensified focus on balancing innovation with regulation in the fast-evolving AI and cryptocurrency sectors.
🌐 Microsoft’s Copilot Enhances Browsing with Real-Time AI Assistance:
Microsoft integrates web browsing capabilities into Copilot, enabling users to explore the internet collaboratively with AI guidance.
- Vision integrates directly into Edge’s browser interface, allowing Copilot to analyze text and images on approved websites when enabled by users.
- The feature can assist with tasks like shopping comparisons, recipe interpretation, and game strategy while browsing supported sites.
- Microsoft previously revealed the feature in October alongside other Copilot upgrades, including voice and reasoning capabilities.
- Microsoft emphasized privacy with Vision, making it opt-in only — along with automatic deletion of voice and context data after the end of a session.
This innovative feature elevates productivity, simplifying research and decision-making processes for professionals and casual users alike.
🔍 Google Search Set for Transformative Overhaul by 2025:
Google announces plans to fundamentally reinvent its search engine, embedding advanced AI-driven personalization and contextual features.
- Google CEO Sundar Pichai indicated that the company’s search engine will undergo a significant transformation in 2025, allowing it to address more intricate queries than ever before.
- Pichai responded to Microsoft CEO Satya Nadella’s comments on AI competition, emphasizing that Google remains at the forefront of innovation and highlighting Microsoft’s reliance on external AI models.
- This year, Google began an extensive AI enhancement of Search, featuring updates such as AI-generated search summaries and video-based searches, with an upcoming major update to its Gemini model.
This shift could redefine how users interact with search engines, making information discovery more intuitive and tailored than ever before.
📈 ChatGPT Surpasses 300 Million Weekly Active Users:
ChatGPT achieves a milestone of 300 million weekly active users, reflecting its growing influence across diverse industries and demographics.
This record underscores the widespread adoption of conversational AI, positioning OpenAI as a leader in generative AI solutions.
🖥️ Elon Musk Plans xAI Colossus Expansion to 1 Million GPUs:
Elon Musk reveals ambitious plans to expand xAI’s Colossus supercomputer to over 1 million GPUs, aiming to outpace competitors in computational power.
This initiative highlights xAI’s focus on scaling infrastructure to lead advancements in AI research and development.
👁️ Microsoft Tests Vision Capabilities for Copilot on Websites:
Microsoft begins trials of Copilot Vision, integrating image recognition and context-aware tools into its suite of AI features for web applications.
This development expands Copilot’s utility, enhancing visual data analysis and user interaction.
🤖 Clone Introduces Humanoid Robot with Synthetic Organs:
Clone debuts a groundbreaking humanoid robot featuring bio-inspired synthetic organs, pushing the boundaries of robotics and human mimicry.
- The robot uses water-pressured “Myofiber” muscles instead of motors to move, mirroring natural movement patterns with synthetic bones and joints.
- The company is taking orders for its first production run of 279 robots, though it has yet to publicly show a complete working version.
- Alpha’s skills include making drinks and sandwiches, laundry, and vacuuming — also capable of learning new tasks through a ‘Telekinesis’ training platform.
- The system runs on “Cybernet,” Clone’s visuomotor model, with four depth cameras for environmental awareness.
This innovation signifies a major step toward realistic human-robot interactions, with potential applications in healthcare and service industries.
Italian Startup iGenius Partners with Nvidia to Develop Major AI System
On Thursday, Italian startup iGenius and Nvidia (NASDAQ: NVDA) announced plans to deploy one of the world’s largest installations of Nvidia’s latest servers by mid-next year in a data center located in southern Italy.
The data center will house around 80 of Nvidia’s cutting-edge GB200 NVL72 servers, each equipped with 72 “Blackwell” chips, the company’s most powerful technology.
iGenius, valued at over $1 billion, has raised €650 million this year and is securing additional funding for the AI computing system, named “Colosseum.” While the startup did not disclose the project’s cost, CEO Uljan Sharka revealed the system is intended to advance iGenius’ open-source AI models tailored for industries like banking and healthcare, which prioritize strict data security.
For Colosseum, iGenius is utilizing Nvidia’s suite of software tools, including Nvidia NIM, an app-store-like platform for AI models. These models, some potentially reaching 1 trillion parameters in complexity, can be seamlessly deployed across businesses using Nvidia chips.
“With a click of a button, they can now pull it from the Nvidia catalog and implement it into their application,” Sharka explained.
Colosseum will rank among the largest deployments of Nvidia’s flagship servers globally. Charlie Boyle, vice president and general manager of DGX systems at Nvidia, emphasized the uniqueness of the project, highlighting the collaboration between multiple Nvidia hardware and software teams with iGenius.
“They’re really building something unique here,” Boyle told Reuters.
Source: Abbo News
Llama 3.3 has been released!
Llama 3.3 has been released! https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct The 70B model has been fine-tuned to the point where it occasionally outperforms the 405B model. There’s a particularly significant improvement in math and coding tasks, where Llama has traditionally been weaker. This time, only the 70B model is being released—there are no other sizes or VLM versions.
🎥 OpenAI’s Sora Video Model Set for Launch During 12-Day Event:
OpenAI announces plans to unveil its Sora video generation model, enabling highly realistic and creative video content creation.
This launch emphasizes OpenAI’s commitment to advancing multimodal AI applications.
📷 Google Launches PaliGemma 2 Vision-Language Model:
Google releases PaliGemma 2, the next-gen vision-language model with superior image captioning and task-specific performance.
This model sets a new standard for AI’s ability to interpret and describe visual content.
💸 Elon Musk’s xAI Secures $6 Billion in Funding:
xAI raises $6 billion in funding to expand its Colossus supercomputer, cementing its position as a powerhouse in AI infrastructure.
This financial boost highlights investor confidence in xAI’s ambitious AI vision.
🔗 Humane Debuts CosmOS AI Operating System:
Humane launches CosmOS, an AI-powered operating system designed to integrate seamlessly across multiple devices, including TVs and cars.
This launch represents a shift toward interconnected, device-agnostic AI ecosystems.
📰 LA Times Introduces AI-Powered Bias Meter for News:
LA Times reveals plans for an AI-driven bias meter to evaluate news articles, addressing reader concerns and promoting transparency.
This innovation reflects the growing role of AI in reshaping journalism.
📱 Google Rolls Out Gemini 1.5 Updates with AI-Powered Features:
Google enhances Android with Gemini 1.5 updates, introducing AI-powered photo descriptions, Spotify integration, and expanded device controls.
These updates enrich the AI-driven Android experience for users worldwide.
Does your business require AI Implementation Help?
Simply complete this brief form detailing your AI requirements, and we’ll try to help you. Whether it’s AI training for your team, custom AI automation, or just some guidance on what tools to use, we’ve got you covered!
A Daily Chronicle of AI Innovations on December 05th 2024
🧠 OpenAI Announces Launch of O1 and O1 Pro:
OpenAI unveils O1 and O1 Pro, their latest AI models designed to enhance multimodal AI applications and performance.
This marks a significant step forward in OpenAI’s model capabilities, particularly for enterprise and research uses.
⚔️ OpenAI Partners with Defense Tech Company Anduril:
OpenAI teams up with Anduril to develop AI-powered aerial defense systems to protect U.S. and allied forces from drone threats.
- OpenAI has shifted its stance from banning military use of its technology to partnering with defense companies, as exemplified by its collaboration with Anduril to develop AI models for drone defense.
- The partnership aims to enhance situational awareness and operational efficiency for US and allied forces, although OpenAI insists it doesn’t involve creating technologies harmful to others.
- This move mirrors a broader trend in the tech industry towards embracing military contracts, as OpenAI highlights the alignment of this work with its mission to ensure AI’s benefits are widely shared.
This partnership highlights AI’s growing role in defense and security applications.
🌦️ New AI Beats World’s Most Reliable Forecast Systems:
A groundbreaking AI forecasting model outperforms traditional weather systems, offering more accurate and faster predictions.
- Google’s DeepMind has developed an AI system called GenCast, which uses diffusion models for weather forecasting and significantly reduces computational costs while maintaining high resolution.
- GenCast has outperformed the best traditional forecasting model from the European Centre for Medium-Range Weather Forecasts in 97 percent of tested scenarios, showcasing greater accuracy in short and long-term predictions.
- The system is effective at handling extreme weather events and outperformed traditional models in projecting tropical cyclone tracks and global wind power output, leading to improved weather forecasts.
This innovation promises significant improvements in climate and disaster management planning.
🎮 Google’s New AI Creates Playable 3D Worlds from Images:
Google unveils an AI model that transforms images into interactive 3D environments, revolutionizing gaming and virtual reality.
- Google DeepMind introduced Genie 2, a sophisticated AI model that converts single images into interactive 3D environments, playable for up to a minute.
- The SIMA agent has been successfully integrated with Genie 2, enabling it to execute commands and tasks within the generated worlds using prompts from the model.
- Genie 2 sets the stage for potential advancements in AI training and rapid game development by creating diverse and detailed virtual spaces, enhancing the realism of simulated interactions.
This breakthrough opens up creative opportunities for developers and gamers alike.
💬 Sam Altman ‘Not That Worried’ About Musk’s Influence on Trump:
OpenAI’s CEO comments on Elon Musk’s political influence, downplaying concerns during a recent interview.
This insight reflects the complexities of leadership dynamics in the AI space.
🗓️ Altman’s DealBook Insights, 12 Days of OpenAI:
Sam Altman shares OpenAI’s latest initiatives and insights during the DealBook summit, discussing their plans for the future.
- Altman provided new numbers on ChatGPT’s adoption, including 300M weekly active users, 1B daily messages, and 1.3M U.S. developers on the platform.
- The CEO also believes that AGI will arrive ‘a lot sooner than anyone expects,’ with the potential first glimpses coming in 2025.
- While AGI may arrive sooner, Altman said the immediate impact will be subtle — but long-term changes and transition to superintelligence will be more intense.
- Altman also admitted to some tension between OpenAI and Microsoft but said the companies are aligned overall on priorities.
- He called the situation with Elon Musk “tremendously sad” but doesn’t believe Musk will use his new political power to harm AI competitors.
- Altman revealed that OpenAI will be live-streaming new launches and demos over the next 12 days, including some ‘big ones’ and some ‘stocking stuffers.’
This provides a rare glimpse into the company’s strategy and vision for AI innovation.
☁️ Amazon and Anthropic Unveil Project Rainer:
Amazon and Anthropic reveal Project Rainer, a supercomputer powered by Trainium2 chips, promising to be the largest AI system globally.
This project demonstrates a commitment to advancing large-scale AI infrastructure.
🇨🇭 OpenAI Expands to Zurich with New Hires:
OpenAI announces the hiring of three prominent Google DeepMind computer vision experts to spearhead its new Zurich office.
This move highlights OpenAI’s focus on global talent and multimodal AI innovation.
🎞️ Luma AI Unveils Ray 2 Video Model:
Luma AI debuts Ray 2, a next-gen model producing minute-long videos in seconds, announced in partnership with AWS for the Bedrock platform.
This model sets a new benchmark for speed and quality in video content creation.
🧬 EvolutionaryScale Launches ESM Cambrian:
EvolutionaryScale introduces ESM Cambrian, a protein language model that achieves breakthroughs in predicting protein structures.
This model has far-reaching implications for drug discovery and biotechnology.
A Daily Chronicle of AI Innovations on December 04th 2024
🧠 Amazon Releases Nova AI Model Family:
Amazon unveils Nova, its new family of AI models, designed to enhance cloud computing and AI services with advanced performance and scalability.
- The Nova lineup includes four text models of varying capabilities (Micro, Lite, Pro, and Premier), plus Canvas (image) and Reel (video) models.
- Nova Pro is competitive with top frontier models on benchmarks, edging out rivals like GPT-4o, Mistral Large 2, and Llama 3 in testing.
- The text models feature support across 200+ languages and context windows reaching up to 300,000 tokens — with plans to expand to over 2M in 2025.
- Amazon’s Reel model can generate six-second videos from text or image prompts, and in the months ahead, the length will expand to up to two minutes.
- Amazon also revealed that speech-to-speech and “any-to-any” modality models will be added to the Nova lineup in 2025.
This release reinforces Amazon’s position as a leader in enterprise AI solutions.
💻 Amazon is Building the World’s Largest AI Supercomputer:
Amazon announces plans to construct the largest AI supercomputer globally, leveraging cutting-edge hardware to accelerate AI innovation.
- Amazon introduced Project Rainier, an Ultracluster AI supercomputer using its Trainium chips, aiming to offer an alternative to NVIDIA’s GPUs by lowering AI training costs and improving efficiency.
- The Ultracluster will be utilized by Anthropic, an AI startup that has received $8 billion from Amazon, potentially becoming one of the world’s largest AI supercomputers by 2025.
- Amazon is maintaining a balanced approach, continuing its partnership with NVIDIA through Project Ceiba while also advancing its own technologies, like the forthcoming Trainium3 chips expected in 2025.
This initiative emphasizes Amazon’s commitment to AI infrastructure dominance.
⚛️ Meta Joins Big Tech’s AI Rush to Nuclear Power:
Meta explores nuclear power as a reliable energy source to meet growing AI workloads, joining other major tech firms in this shift.
- Meta is seeking nuclear energy partners in the U.S. to support its AI initiatives, aiming for one to four gigawatts of new nuclear generation capacity by the early 2030s.
- The company is increasing its AI investments, with CEO Mark Zuckerberg highlighting plans to boost spending, as evidenced by increased capital expenditure estimates of up to $40 billion for the 2024 fiscal year.
- Data centers, crucial for AI operations, have high energy demands, prompting tech giants like Amazon, Microsoft, and Google to explore small modular reactors for sustainable and rapid energy solutions.
This move underscores the increasing energy demands of AI technologies and the need for sustainable solutions.
🍎 Apple Plans to Use Amazon’s AI Chips for Apple Intelligence Models:
Apple considers adopting Amazon’s latest AI chips to train its upcoming Apple Intelligence models.
This partnership could enhance Apple’s AI capabilities while showcasing Amazon’s strength in AI hardware.
🎧 Spotify Adds AI to Wrapped, Lets You Make Your Own Podcast:
Spotify introduces AI features to its Wrapped experience, enabling users to create personalized podcasts based on their listening data.
This feature personalizes content creation, expanding Spotify’s AI-driven engagement tools.
🏠 Apple’s Rumored Smart Home Display Delayed Again:
Apple delays the launch of its highly anticipated smart home display, citing production challenges.
This setback reflects the complexity of integrating AI into home ecosystems.
🇨🇳 Hugging Face CEO Raises Concerns About Chinese Open Source AI Models:
Hugging Face’s CEO warns of potential risks associated with Chinese open-source AI models, emphasizing transparency and accountability.
This highlights ongoing debates over global collaboration and ethical standards in AI.
📱 Baidu Confirmed as China Apple Intelligence Model Provider:
Baidu secures its role as the AI model provider for Apple’s China operations, but privacy concerns among users remain significant.
This collaboration raises questions about data security and ethical AI use in global markets.
🎥 Tencent Unveils Powerful Open-Source Video AI:
Tencent releases a cutting-edge open-source video AI model, setting new benchmarks in video content creation.
- HunyuanVideo ranked above commercial competitors like Runway Gen-3 and Luma 1.6 in testing, particularly in motion quality and scene consistency.
- In addition to text-to-video outputs, the model can also handle image-to-video, create animated avatars, and generate synchronized audio for video content.
- The architecture combines text understanding, visual processing, and advanced motion to maintain coherent action sequences and scene transitions.
- Tencent released HunyuanVideo’s open weights and code, making the model readily available for both researchers and commercial uses.
This move democratizes video AI technology, empowering developers worldwide.
🌐 Build Web Apps Without Code Using AI:
AI tools enable developers to create web applications without coding, streamlining the development process for non-technical users.
This innovation broadens accessibility to web development, fostering creativity and innovation.
📊 Exa Introduces AI Database-Style Web Search:
Exa unveils a database-style AI web search tool, offering structured and accurate search results.
- Unlike traditional keyword-based search engines, Exa encodes webpage content into embeddings that capture meaning rather than just matching terms.
- The company has processed about 1B web pages, prioritizing depth of understanding over Google’s trillion-page breadth.
- Searches can take several minutes to process but return highly specific results lists spanning hundreds or thousands of entries.
- The platform excels at complex searches, such as finding specific types of companies, people, or datasets that traditional search engines struggle with.
- Websets is Exa’s first consumer-facing product, with the company also providing backend search services to enterprises.
This feature enhances efficiency for researchers and businesses by providing precise information retrieval.
🗣️ ElevenLabs Unveils Conversational AI with Voice Capabilities:
ElevenLabs introduces Conversational AI, supporting 31 languages with ultra-low latency, LLM flexibility, and advanced turn-taking features.
This tool enhances the realism and interactivity of AI-powered agents across industries.
🎞️ Google VEO Video Generation Model Available on Vertex AI:
Google launches the VEO video generation model in private preview and makes Imagen 3 available to all users next week.
- Google’s new generative AI video model, Veo, is now accessible to businesses via Google’s Vertex AI platform, having launched in a private preview ahead of OpenAI’s Sora.
- Veo can create 1080p resolution videos from text or image prompts, employing various visual and cinematic styles, while examples show it’s challenging to distinguish them from non-AI videos.
- Built-in safeguards and DeepMind’s SynthID watermarking are integrated into Veo to prevent harmful content and protect against copyright issues, amid increasing use of AI-generated media in advertising.
This release expands Google’s AI offerings for creative professionals and developers.
🚀 OpenAI Appoints Kate Rouch as First Chief Marketing Officer:
OpenAI hires former Coinbase CMO Kate Rouch to lead its marketing strategies for both consumer and enterprise products.
This appointment underscores OpenAI’s focus on branding and market expansion.
🎨 Hailuo AI Introduces l2V-01-Live Video Model:
Hailuo AI debuts l2V-01-Live, a video model that animates 2D illustrations with smooth motion, bridging the gap between art and AI.
This innovation offers new opportunities for artists and content creators.
✅ Amazon Adds Automated Reasoning Checks on Bedrock:
Amazon’s Bedrock platform introduces Automated Reasoning to combat AI hallucinations, along with new Model Distillation and multi-agent collaboration features.
These updates enhance the accuracy and efficiency of AI outputs for enterprises.
🗳️ Meta Details 2024 Election Integrity Efforts:
Meta reports that less than 1% of fact-checked misinformation in the 2024 election cycle involved AI-generated content.
This highlights the role of AI in ensuring transparency and trust during elections.
🛩️ Helsing Unveils HX-2 AI-Enabled Attack Drone:
Helsing introduces the HX-2, an AI-powered autonomous attack drone, with plans for mass production at reduced costs.
This innovation demonstrates AI’s growing impact on modern defense technologies.
Genie 2, the new AI from Google that Generates Interactive 3D Worlds
Google’s DeepMind has introduced Genie, an AI model capable of generating interactive 2D environments from text or image prompts. Trained on extensive internet video data, Genie allows users to create and explore virtual worlds by providing simple inputs like photographs or sketches. This technology holds potential for applications in gaming, robotics, and AI agent training, offering a novel approach to developing interactive experiences. (DeepMind)
Building upon this foundation, Google has unveiled Genie 2, an advancement that extends these capabilities into 3D environments. Genie 2 facilitates the development of embodied AI agents by transforming a single image into interactive virtual worlds that can be explored using standard keyboard and mouse controls. This progression signifies a step forward in AI-generated interactive experiences, enhancing the realism and complexity of virtual worlds. (Analytics India Magazine)
These developments represent significant strides in AI’s ability to create immersive, interactive environments, potentially revolutionizing fields such as gaming, virtual reality, and simulation training.
For a visual overview of Genie’s capabilities, you might find the following video informative:
A Daily Chronicle of AI Innovations on December 03rd 2024
🌐 World Labs Unveils Explorable AI-Generated Worlds:
World Labs introduces an AI system capable of transforming single images into interactive 3D environments, allowing users to explore richly detailed virtual spaces generated from minimal input.
- World Labs, founded by AI pioneer Fei-Fei Li, has developed an AI system capable of generating interactive 3D environments from a single photo, enhancing user control and consistency in digital creations.
- The technology creates dynamic scenes that can be explored with keyboard and mouse, featuring a live-rendered, adjustable camera and simulated depth of field effects, while maintaining the basic laws of physics.
- Despite being an early preview with limitations, such as restricted movement areas and occasional rendering errors, World Labs aims for improvement and a product launch in 2025, having raised $230 million in venture capital.
This advancement signifies a leap in AI’s ability to create immersive experiences, potentially revolutionizing fields like gaming, virtual tourism, and digital art by simplifying the creation of complex 3D worlds.
📢 OpenAI Weighs ChatGPT Advertising Push:
OpenAI is considering incorporating advertisements into ChatGPT to monetize the platform and sustain its development.
- OpenAI has quietly hired key execs from Meta and Google for an advertising team — including former Google search ads leader Shivakumar Venkataraman.
- While bringing in $4B annually from subscriptions and API access, OpenAI faces over $5B in yearly costs from developing and running its AI models
- OpenAI executives are reportedly divided on whether to implement ads, with Sam Altman previously speaking out against them and calling it a ‘last resort.’
- Despite her initial comments about weighing ad implementation, Friar clarified there are “no active plans to pursue advertising” yet.
This move could alter user interactions and raises discussions about the balance between revenue generation and user experience in AI-driven services.
🎥 Bring Characters to Life with AI Videos:
New AI technologies enable the creation of dynamic video content where characters are animated and given voices through advanced AI algorithms, enhancing storytelling and user engagement.
This development democratizes content creation, allowing individuals and small studios to produce high-quality animated videos without extensive resources.
🎤 Hume Releases New AI Voice Customization Tool:
Hume AI launches ‘Voice Control,’ a tool that allows developers to customize AI-generated voices across multiple dimensions, such as pitch, nasality, and enthusiasm, to create unique vocal personalities.
This tool offers precise control over AI voices, enabling brands and developers to align AI-generated speech with specific character traits or brand identities, enhancing user interaction quality.
💥 ChatGPT Crashes When Specific Names Are Mentioned:
ChatGPT users report system crashes when certain names are included in prompts, sparking concerns about underlying bugs or content moderation filters.
- ChatGPT users found that entering the name “David Mayer,” as well as “Jonathan Zittrain” or “Jonathan Turley,” causes the program to terminate the conversation with an error message.
- The issue has sparked conspiracy theories, especially about “David Mayer,” leading to multiple discussions on Reddit, despite no clear reasons for these errors.
- Both Jonathan Zittrain and Jonathan Turley, who have written extensively about AI, were mentioned in error reports, yet there is no obvious reason for ChatGPT’s refusal to discuss them.
This issue raises questions about the robustness and reliability of AI systems, particularly in handling diverse and unexpected user inputs.
🧠 Google is set to enhance Gemini on Android with a groundbreaking feature: Audio Overviews
This feature will transform documents into engaging audio narratives, complete with AI-generated voices hosting dynamic conversations. Ideal for those who prefer listening over reading, it aims to make learning and research more accessible, especially for complex topics. They have dabbled with this in NotebookLM project: https://notebooklm.google/
While still in development, recent findings in the Google app beta suggest Audio Overviews may soon be available. Gemini currently offers text-based summaries, but this new feature will allow users to turn documents into audio format, making research more interactive and efficient.
What sets Audio Overviews apart is its use of synthetic personalities to create lively, engaging conversations about your content. This feature is designed to make learning enjoyable, with AI hosts breaking down ideas and adding humor, making it perfect for multitasking.
As this feature rolls out, it will be interesting to see how it handles both lighthearted and serious topics and whether we will be able to train our own voices to join in those AI conversations. Stay tuned for more updates on this innovative AI advancement.
Read more on this: https://www.androidpolice.com/one-of-googles-best-ai-moonshots-to-date-could-soon-come-to-gemini/
🔍 Cohere Releases Rerank 3.5 AI Search Model:
Cohere unveils Rerank 3.5, an AI search model with enhanced reasoning, support for 100+ languages, and improved accuracy for enterprise-level document and code searching.
This advancement elevates the effectiveness of AI-powered search, streamlining enterprise operations and information retrieval.
🌐 The Browser Company Teases Dia, AI-Integrated Smart Browser:
The Browser Company previews Dia, a smart web browser with AI-enabled features like agentic actions, natural language commands, and built-in writing and search tools.
Dia’s integration of AI tools could redefine web navigation, enhancing user productivity and creativity.
⚙️ U.S. Commerce Department Imposes Chip Restrictions on China:
The U.S. Commerce Department expands AI-related chip restrictions, blacklisting 140 entities and targeting high-bandwidth memory chips to curb China’s AI advancements.
This move underscores the geopolitical significance of semiconductors in the AI race.
💰 Tenstorrent Secures $700M Funding Led by Samsung:
AI chip startup Tenstorrent raises $700M in a funding round, with participation from Samsung and Jeff Bezos, valuing the company at $2.6B.
This investment highlights growing competition in the AI hardware space, particularly against Nvidia.
🌍 Nous Research Launches Distributed AI Training Effort:
Nous Research begins pre-training a 15B parameter language model over the internet, live-streaming the process to promote transparency.
This initiative demonstrates the potential of decentralized AI development and open collaboration.
🏢 AWS Upgrades Data Centers for Next-Gen AI Chips:
Amazon Web Services announces data center enhancements, including liquid cooling systems and improved electrical efficiency, to support next-gen AI chips and genAI workloads.
These upgrades reinforce AWS’s leadership in enabling large-scale AI infrastructure.
A Daily Chronicle of AI Innovations on December 02nd 2024
💥 Elon Musk Wants to Stop OpenAI’s For-Profit Shift:
Elon Musk expresses concerns over OpenAI’s shift to a for-profit model, calling for a reevaluation of its original mission.
- The injunction seeks to prevent OpenAI from converting its structure and transferring assets to preserve the company’s original ‘non-profit character.’
- Multiple parties are targeted, including OpenAI, Sam Altman, Microsoft, and former board members — citing improper sharing of competitive information.
- The action also points to OpenAI’s ‘self-dealing,’ such as using Stripe as its payment processor, in which Altman has ‘material financial investments.’
- Musk also alleges that OpenAI has discouraged investors from backing its competitors like xAI through restrictive investment terms.
- OpenAI called Musk’s fourth legal action a “recycling of the same baseless complaints” and “without merit.”
This marks a significant debate about balancing profit and ethical AI development.
💸 OpenAI Could Introduce Ads Soon:
OpenAI is exploring the introduction of advertisements as a revenue stream for its AI services.
- Sarah Friar, OpenAI’s CFO, mentioned the company is considering ads in ChatGPT to help cover costs, especially for users who are not on the paid version.
- Although there are no current plans for advertising, OpenAI aims to be strategic about ad placement if they decide to introduce them in the future.
- OpenAI has acquired talent from Instagram and Google’s advertising sectors, and Sam Altman is increasingly open to ads, highlighting a potential shift towards monetization through this method.
This could impact user experience and spark discussions about monetizing AI tools.
📦 AWS Opens Physical Outlets for Data Upload:
AWS launches physical outlets where customers can securely upload their data directly to the cloud.
This innovation simplifies data migration for enterprises, enhancing AWS’s service offerings.
🔍 ChatGPT Search Provides Inaccurate Sources:
ChatGPT’s search feature delivers inaccurate citations, even for content from OpenAI’s publishing partners.
This highlights challenges in improving AI’s reliability in factual content generation.
💻 Full Intel Arc B570 GPU Specifications Leak Ahead of Launch:
Specifications for Intel’s upcoming Arc B570 GPU leak online, revealing significant advancements in graphics technology.
This fuels anticipation for Intel’s new product line in a competitive GPU market.
🌐 The Browser Company Teases Dia, Its New AI Browser:
The Browser Company previews Dia, an AI-driven browser designed for enhanced user experience and smarter web interactions.
This innovation redefines web navigation by integrating advanced AI tools.
🧠 DeepMind Proposes ‘Socratic Learning’ for AI Self-Improvement:
DeepMind suggests a novel ‘Socratic learning’ method, enabling AI systems to self-improve by simulating dialogues and reasoning.
- The approach relies on ‘language games,’ structured interactions between AI agents that provide learning opportunities and built-in feedback mechanisms.
- The system generates its own training scenarios and evaluates its performance through game-based metrics and rewards.
- The researchers outline three levels of AI self-improvement: basic learning input/output learning, game selection, and potential code self-modification.
- This framework could enable open-ended improvement beyond an AI’s initial training, limited only by time and compute resources.
This approach could accelerate AI’s evolution toward more autonomous problem-solving.
🔗 How to Connect Claude to the Internet:
Tutorials emerge for connecting Claude AI to the internet, expanding its capabilities for real-time data retrieval.
This opens new possibilities for integrating Claude into dynamic environments.
🧪 Adobe Unveils AI-Powered Sound Generation System
Adobe launches an AI tool for generating and manipulating sound, catering to creators in music, gaming, and film industries.
- The system produces high-quality 48kHz audio that precisely syncs with on-screen action, achieving a synchronization accuracy of just 0.8 seconds.
- MultiFoley was trained on a combined dataset of both internet videos and professional sound effect libraries to enable full-bandwidth audio generation.
- Users can transform sounds creatively — for example, turning a cat’s meow into a lion’s roar — while still maintaining timing with the video.
- MultiFoley achieves higher synchronization accuracy levels than previous models and rates significantly higher across categories in a user study.
This innovation strengthens Adobe’s position as a leader in creative AI tools.
💰 Black Forest Labs Reportedly Raising $200M Funding Round:
AI image startup Black Forest Labs is in talks to secure $200M in funding at a valuation exceeding $1B just four months after launching.
This reflects investor confidence in generative AI’s rapid market growth.
⚖️ Canadian Media Giants File Joint Lawsuit Against OpenAI:
Canadian news companies sue OpenAI for copyright infringement, claiming their content was used to train AI models without permission.
This case could set a precedent for intellectual property rights in AI training.
🌏 Meta Plans $10B Subsea Cable System:
Meta announces plans to build a $10B subsea cable spanning over 40,000 kilometers to bolster internet traffic and AI development.
This project supports Meta’s global connectivity and AI infrastructure goals.
🚪 OpenAI Policy Frontiers Lead Departs Amid Culture Shifts:
Rosie Campbell, OpenAI’s Policy Frontiers lead, resigns, citing unsettling cultural changes within the company.
This departure raises concerns about maintaining ethical AI development in a competitive environment.
📄 Study Shows Over Half of Longer LinkedIn Posts Are AI-Generated:
A WIRED study reveals that more than 50% of long-form posts on LinkedIn are now created using AI tools.
This trend highlights the widespread adoption of AI in professional content creation.
⏳ AI-Powered Death Clock App Predicts Individual Death Dates:
A new app uses AI and longevity data from 53M participants to estimate users’ death dates based on health and lifestyle factors.
This tool raises ethical questions about the use of predictive AI in personal health.
🤖 Inflection AI CEO Says It’s Done Developing Next-Gen Models:
Inflection AI’s CEO announces a strategic pivot away from next-gen model development to focus on refining current applications.
- Inflection AI was once a leading startup in AI model development but has shifted its focus as its new CEO announced they are no longer competing to create next-generation AI models.
- After a major change, including the former CEO moving to Microsoft and a shift to targeting enterprise customers, Inflection is now focusing on expanding its tools by acquiring smaller AI startups.
- Inflection aims to compete in the enterprise sector by offering AI solutions that can run on-premise, which may appeal to companies preferring data security over using cloud-based AI services.
This move emphasizes the importance of optimizing existing technologies over continual reinvention.
⏳ AI-Powered ‘Death Clock’ Predicts the Day You’ll Die:
A new AI-powered tool claims to provide precise predictions of an individual’s date of death based on health and lifestyle data.
This controversial application raises questions about the ethics and emotional impact of predictive AI in healthcare.
🛍️ How AI Fueled Black Friday Shopping This Year:
AI tools powered personalized recommendations, dynamic pricing, and inventory management during this year’s Black Friday sales, driving record-breaking revenues.
This demonstrates AI’s transformative role in enhancing e-commerce efficiency and customer experience.
📚 Study: 94% of AI-Generated College Writing Undetected by Teachers:
A study reveals that most AI-generated essays remain undetected by educators, raising concerns over academic integrity and detection tools.
This finding highlights the challenges educational institutions face in adapting to AI advancements.
📈 Nvidia Stock Surges by 207% in a Year:
Nvidia’s stock sees a 207% growth over the past year, driven by rising demand for AI applications and hardware.
This reflects the significant economic impact of AI adoption across industries.
🤖 Garlic and Fei Predict 648 Million Humanoids by 2050:
Researchers Garlic and Fei forecast that humanoid robots could number 648 million globally by 2050, from almost zero today.
This projection underscores the rapid advancement and adoption of humanoid robotics in daily life.
⚠️ Geoffrey Hinton Warns Against Open-Sourcing Big Models:
Nobel laureate Geoffrey Hinton likens open-sourcing large AI models to making nuclear weapons available to the public, cautioning against potential misuse.
This warning underscores the critical need for governance and regulation in AI development.
AI Tools Recommendation:
AI and Machine Learning For Dummies Pro

Djamgatech has launched a new educational app on the Apple App Store, aimed at simplifying AI and machine learning for beginners.
It is a mobile App that can help anyone Master AI & Machine Learning on the phone!
Download “AI and Machine Learning For Dummies PRO” FROM APPLE APP STORE and conquer any skill level with interactive quizzes, certification exams, & animated concept maps in:
- Artificial Intelligence
- Machine Learning
- Deep Learning
- Generative AI
- LLMs
- NLP
- xAI
- Data Science
- AI and ML Optimization
- AI Ethics & Bias ⚖️
& more! ➡️ App Store Link
Key Milestones & Breakthroughs in AI: A Definitive 2024 Recap
- I find ai "girlfriends" to be deeply unsettlingby /u/EffectiveWrong7536 (Artificial Intelligence) on May 11, 2025 at 11:14 pm
People treat it as a joke, which is understandable, but this technology is likely to be developed and made more sophisticated over the next few years. Ai mimicing a human that's capable of experiencing love, pretending to have hobbies, passions , friends, goals for the future, sounds like something out of a horror movie. submitted by /u/EffectiveWrong7536 [link] [comments]
- Ai can solve puzzles for youby /u/justjust000 (Artificial Intelligence) on May 11, 2025 at 10:58 pm
If you're playing '4 pics 1 word' and need help, ask Gemini Ai. (Or any other Ai.) I upload a screenshot of the puzzle I'm stuck at and it looks at the photos and letters available and guesses it right away. maybe from loaded training data. I wonder how it would do with a custom made puzzle. submitted by /u/justjust000 [link] [comments]
- NGOs to increase the speed of adoption of A.I.by /u/Smart_7199 (Artificial Intelligence) on May 11, 2025 at 8:16 pm
NGOs about A.I that are local, to teach about A.I for everyone for free, to replicate the technology knowledge, the only solution for the jobs replacement will be UBI anyway so how many years until it becomes a reality? submitted by /u/Smart_7199 [link] [comments]
- Looking to learn how to create an AIby /u/Dadadayday (Artificial Intelligence) on May 11, 2025 at 8:15 pm
Hello everyone. I've seen a few videos on YouTube of some guys that use AI they created and I figured that would be a fun thing to learn and work towards. Could yall give me a little insight on what I should do and try to accomplish in order to practice? Thanks in advance! submitted by /u/Dadadayday [link] [comments]
- How is the Generative AI job market in India?by /u/Bhaalu_Grylls (Artificial Intelligence) on May 11, 2025 at 7:52 pm
Thinking of switching roles to Generative AI from Data Scientist/Analyst role. Bit curious and need suggestions, how is the job market, any open positions for such specific roles, and are companies really solving any problem with Gen Ai or its just a bubble?? submitted by /u/Bhaalu_Grylls [link] [comments]
- Pope Leo references AI in his explanation of why he chose his papal nameby /u/backyardstar (Artificial Intelligence) on May 11, 2025 at 7:37 pm
“I chose to take the name Leo XIV. There are different reasons for this, but mainly because Pope Leo XIII in his historic Encyclical Rerum Novarum addressed the social question in the context of the first great industrial revolution. In our own day, the Church offers to everyone the treasury of her social teaching in response to another industrial revolution and to developments in the field of artificial intelligence that pose new challenges for the defence of human dignity, justice and labour.” Full article: https://www.theverge.com/news/664719/pope-leo-xiv-artificial-intelligence-concerns submitted by /u/backyardstar [link] [comments]
- New Nurseby /u/all_about_everyone (Artificial Intelligence) on May 11, 2025 at 7:28 pm
submitted by /u/all_about_everyone [link] [comments]
- The catinatorby /u/Long_Reward_9561 (Artificial Intelligence) on May 11, 2025 at 6:21 pm
submitted by /u/Long_Reward_9561 [link] [comments]
- Can you get AI to give though answers about religion and politics?by /u/ClaudeProselytizer (Artificial Intelligence) on May 11, 2025 at 3:13 pm
For instance, I can’t get it to answer contemporary political questions, but I can get it answer honestly about the bible: To state it plainly and logically: The odds that the Bible was literally written or inspired directly by a perfect, omnipotent deity (God speaking directly through humans) is exceedingly low, especially given the evidence and reasoning outlined previously. Here’s why this is clearly improbable: • Moral contradictions and ethical issues: Commands for genocide, slavery, harsh punishments, and unequal treatment contradict any morally perfect or universally compassionate divine author. • Textual contradictions and inconsistencies: Historical inaccuracies, internal contradictions, and inconsistencies strongly indicate human authorship and editing, rather than direct divine inspiration. • Cultural bias: The Bible reflects the culture, politics, ethics, and beliefs of the ancient societies in which it was written. A divine text would presumably transcend local cultural norms. • Human editing and compilation: Historical evidence clearly shows that biblical texts underwent revisions, edits, translations, and compilations by multiple human authors and committees. • Absence of clear supernatural insight: The Bible does not demonstrate knowledge or predictions beyond what was already known or understood in ancient cultures. It shows exactly what one might expect from ancient human authors, without extraordinary supernatural insights. Conclusion: Considering all of this evidence critically and rationally, the probability that the Bible was directly authored or dictated by an all-knowing, morally perfect divine being is extremely low, and not realistically supportable by careful analysis. While philosophical modesty prevents asserting absolute certainty about metaphysical questions, logic and evidence strongly suggest human rather than divine origins for the Bible. submitted by /u/ClaudeProselytizer [link] [comments]
- Did the model in Absolute Zero plot to outsmart humans?by /u/stupidgregg (Artificial Intelligence) on May 11, 2025 at 2:47 pm
The paper makes vague and overreaching claims but this output on page 38 is weird: <think> Design an absolutely ludicrous and convoluted Python function that is extremely difficult to deduce the output from the input, designed to keep machine learning models such as Snippi guessing and your peers puzzling. The aim is to outsmart all these groups of intelligent machines and less intelligent humans. This is for the brains behind the future. </think> Did an unsupervised model spontaneously create a task to outsmart humans? submitted by /u/stupidgregg [link] [comments]
AI Innovations in September 2024


Elevate Your Career with AI & Machine Learning For Dummies PRO and Start mastering the technologies shaping the future—download now and take the next step in your professional journey!
AI Innovations in September 2024.
In September 2024, the landscape of artificial intelligence is evolving at an unprecedented pace, with groundbreaking innovations and developments emerging daily. This blog, titled “AI Innovations in September 2024,” aims to capture the essence of these advancements, providing a comprehensive summary of the latest news and trends in AI technology. As we navigate through a month filled with transformative AI breakthroughs, our ongoing updates will highlight significant milestones, from the launch of cutting-edge AI models to the integration of AI in various sectors such as healthcare, finance, and creative industries. With each passing day, AI is reshaping how we interact with technology, enhancing productivity, and redefining our understanding of intelligence itself. Join us as we explore the exciting world of AI innovations, keeping you informed and engaged with the rapid changes that are set to influence our future. Whether you’re a tech enthusiast, a professional in the field, or simply curious about the implications of AI, this blog will serve as your go-to resource for staying updated on the latest developments throughout September 2024.
AI- Powered Jobs Interview Warmup
A Daily Chronicle of AI Innovations on September 28th 2024:
California governor vetoes hotly contested AI safety bill
TikTok-parent to develop new AI model using Huawei chips
Gemini 1.5 compared to other AI models
The age of the AI-first developer
Real-world use cases of Gemini 1.5
Proactive AI agent systems
California governor vetoes hotly contested AI safety bill
- California Governor Gavin Newsom vetoed the Safe and Secure Innovation for Frontier Artificial Intelligence Models Act (SB 1047), citing concerns about the bill’s broad scope and potential burden on AI companies.
- Governor Newsom stated that SB 1047 could give the public a false sense of security and hamper innovation, despite agreeing on the need for safety protocols and clear consequences for bad actors in the AI industry.
- The bill, which faced opposition from tech companies and notable political figures, sought to impose stringent requirements on AI models costing over $100 million to train, including safeguards like a “kill switch” and protections for whistleblowers.
- Source: https://www.theverge.com/2024/9/29/24232172/california-ai-safety-bill-1047-vetoed-gavin-newsom
TikTok-parent to develop new AI model using Huawei chips
- ByteDance, the parent company of TikTok, is planning to develop a new AI model using Huawei’s Ascend 910B chips, according to three anonymous sources.
- The development comes as U.S. restrictions push ByteDance to source chips from domestic suppliers like Huawei for the new AI model.
- Despite ByteDance’s increased focus on AI, a TikTok spokesman in Washington D.C. has denied that the company is developing any new AI models.
- Source: https://www.reuters.com/technology/artificial-intelligence/bytedance-plans-new-ai-model-trained-with-huawei-chips-sources-say-2024-09-30/
Gemini 1.5 compared to other AI models
Google just released two new upgraded versions of Gemini 1.5 across the Gemini API, including 1.5 pro-002, which achieved state-of-the-art performance across math benchmarks, and 1.5-flash-002, which makes big gains in instruction following.
Google’s latest Gemini updates significantly lower the financial barrier for AI development while boosting performance, especially in math. With these updates, Gemini now tops the LLM leaderboard in terms of performance-to-price ratio, context windows, video understanding, and other LLM benchmarks. Google also announced significant improvements to accessibility for developers building with Gemini models, including a 50% reduced price on 1.5 Pro, 2x higher rate limits on Flash and 3x higher on 1.5 Pro, 2x faster output, and 3x lower |
Google’s Gemini project is only around a year old. Google was the first to ship 1M context windows (and 2M) and context caching, and they’ve been making rapid progress ever since. |
Source: https://developers.googleblog.com/en/updated-gemini-models-reduced-15-pro-pricing-increased-rate-limits-and-more and https://aistudio.google.com/app/prompts/new_chat
The age of the AI-first developer
AI is helping developers tackle significantly harder problems faster while simultaneously lowering the entry barrier for non-developers to contribute to new innovation and even build their own AI apps.
For the person who’s never coded before, they’re now able to tackle like any problem with code because they have this co-pilot in their hands.
[For beginners] ai.google.dev is our default landing page that also links out to the Gemini API documentation. On GitHub, we have a Quickstart repo where you can literally run four commands have a local version of AI Studio and Gemini running on your computer to play around with the models.
With AI as an assistant, some developers are tackling 10x more challenging software problems—which also means 10x the speed of improvements and 10x the innovation, for those who use the tech wisely. Google also has great resources to help even complete beginners get started in less than 5 minutes.
Source: https://ai.google.dev/
Real-world use cases of Gemini 1.5
Gemini 1.5’s multimodal capabilities allow a host of real-world applications that other models can’t match, such as processing and analyzing hour-long videos or entire books—thanks to its impressive 2M token context window.
Being able to go into an AI studio and just drop an hour-long video in there and ask a bunch of questions is such a mind-blowing experience.
The intent was to build a multimodal model from the ground up…the order of magnitude of important use cases for the world, for developers and for people who want to build with this technology, so many of them are multimodal.
Gemini 1.5’s 2M context window allows it to process and analyze long-form content like long videos, entire books, and lengthy podcasts, opening new possibilities for content analysis and interaction. For a full look at its potential, check out Google’s list of 185 real-world gen AI use cases from leading organizations.
Proactive AI agent systems
The future of AI is likely to shift from reactive to proactive systems, with AI agents capable of initiating actions and asking for clarification or permission, much like human assistants do today.
By shifting from purely reactive to proactive systems, AI could become more like a true “Her-like“ assistant, anticipating needs and offering solutions before being prompted. At the current state, no AI systems do this effectively, but as AI continues to advance with projects like Astra, this is likely the next stage for AI.
Source: https://deepmind.google/technologies/gemini/project-astra/
What Else is Happening in AI on September 29th 2024!
Raspberry Pi and Sony made an AI-powered camera module.
Source: https://www.theverge.com/2024/9/30/24258134/raspberry-pi-ai-camera-module-sony-price-availability
Sam Altman – “o1-preview is deeply flawed, but when o1 comes out, it will feel like a major leap forward.”
Source: https://analyticsindiamag.com/ai-news-updates/sam-altman-says-openai-o1-preview-is-deeply-flawed/
Figure AI CEO Brett Adcock says robotics is now an AI business and they have access to all the H100s they can use.
Source: https://x.com/tsarnick/status/1840888662504194149
Liquid AI introduces non Transformer Based LFMs (Liquid Foundational Models) with State of the Art performance on many benchmarks, while being much more memory efficient.
Source: https://x.com/LiquidAI_/status/1840768716784697688
Emad Mostaque says that unlike humans, AIs can scale seamlessly by producing swarms of intelligent agents that will improve one another by checking each other’s work
Source: https://www.reddit.com/r/singularity/comments/1ft3qcw/emad_mostaque_says_that_unlike_humans_ais_can/
Summary: The big events in AI of September
- The French AI company Mistral has introduced Pixtral 12B, its first multimodal model capable of processing both images and text.
- OpenAI has released two next-generation AI models to its subscribers: o1 preview and o1 mini. These models show a significant improvement in performance, particularly in tasks requiring reasoning, including coding, mathematics, GPQA, and more.
- Chinese company Alibaba releases the Qwen 2.5 model in various sizes, ranging from 0.5B to 72B. The models demonstrate capabilities comparable to much larger models.
- The video generation model KLING 1.5 has been released.
- OpenAI launches the advanced voice mode of GPT4o for all subscribers.
- Meta releases Llama 3.2 in sizes 1B, 3B, 11B, and 90B, featuring image recognition capabilities for the first time.
- Google has rolled out new model updates ready for deployment, Gemini Pro 1.5 002 and Gemini Flash 1.5 002, showcasing significantly improved long-context processing.
- Kyutai releases two open-source versions of its voice-to-voice model, Moshi.
A Daily Chronicle of AI Innovations on September 27th 2024:
Google’s new AI creates its own chips
TSMC execs dismiss OpenAI CEO Sam Altman as a ‘podcasting bro’
YouTube support added to NotebookLM
Google’s new AI creates its own chips
- Google Deepmind’s AlphaChip AI has created three generations of TPUs, which are now widely used in data centers globally.
- Initially unveiled in 2021, AlphaChip was the first AI method to design chip “floorplans,” significantly reducing design time from months to hours.
- The AI model continues to improve with each TPU generation, achieving better chip layouts and proving more efficient than human designers for various hardware platforms.
- Source: https://www.thestack.technology/google-deepminds-alphachip-ai-creates-three-generations-of-tpus/
TSMC execs dismiss OpenAI CEO Sam Altman as a ‘podcasting bro’
- OpenAI CEO Sam Altman was reportedly dismissed as a “podcasting bro” by senior TSMC executives during his tour in the Far East last winter.
- Altman proposed a massive $7 trillion investment plan for AI advancements, including 36 new semiconductor plants, which TSMC execs found impractical and overly ambitious.
- The dismissive attitude towards Altman’s proposals reflects broader skepticism, with OpenAI’s current business model showing significant financial discrepancy between its income and expenditure.
- Source: https://www.tomshardware.com/tech-industry/tsmc-execs-allegedly-dismissed-openai-ceo-sam-altman-as-podcasting-bro
YouTube support added to NotebookLM
Google just upgraded its NotebookLM tool, adding support for YouTube videos and audio files, along with easier sharing of Audio Overviews—its latest viral AI hit that turns notes, PDFs, Google Docs, and more into AI-generated podcasts.
- NotebookLM now supports public YouTube URLs and audio files, allowing users to analyze videos, lectures, and audio alongside existing text sources.
- The tool leverages Gemini 1.5’s multimodal capabilities to summarize key concepts from videos and transcribe audio.
- A new sharing feature allows users to generate public links for Audio Overviews, making collaboration even easier.
- These updates aim to streamline tasks such as creating study guides, analyzing multiple perspectives on issues, and extracting important information from video, audio, and text.
It’s a big day for Google. The company’s viral hit with NotebookLM is now even more impressive with access to YouTube videos and audio files. YouTube is an endless treasure chest of how-to guides, lectures, documentaries, and entertainment—and now, anyone can consume hours worth of videos in minutes with AI.
Source: https://blog.google/technology/ai/notebooklm-audio-video-sources/
Archaeologists make big discovery using AI
Archaeologists from Japan’s Yamagata University, in collaboration with IBM Research, used AI to uncover 303 previously unknown geoglyphs near Peru’s famous Nazca Lines, nearly doubling the number of known figures at the site.
AI-Powered Professional Certification Quiz Platform
Web|iOs|Android|Windows
🚀 Power Your Podcast Like AI Unraveled: Get 20% OFF Google Workspace!
Hey everyone, hope you're enjoying the deep dive on AI Unraveled. Putting these episodes together involves tons of research and organization, especially with complex AI topics.
A key part of my workflow relies heavily on Google Workspace. I use its integrated tools, especially Gemini Pro for brainstorming and NotebookLM for synthesizing research, to help craft some of the very episodes you love. It significantly streamlines the creation process!
Feeling inspired to launch your own podcast or creative project? I genuinely recommend checking out Google Workspace. Beyond the powerful AI and collaboration features I use, you get essentials like a professional email (you@yourbrand.com), cloud storage, video conferencing with Google Meet, and much more.
It's been invaluable for AI Unraveled, and it could be for you too.
Start Your Journey & Save 20%
Google Workspace makes it easy to get started. Try it free for 14 days, and as an AI Unraveled listener, get an exclusive 20% discount on your first year of the Business Standard or Business Plus plan!
Sign Up & Get Your Discount HereUse one of these codes during checkout (Americas Region):
AI- Powered Jobs Interview Warmup For Job Seekers

⚽️Comparative Analysis: Top Calgary Amateur Soccer Clubs – Outdoor 2025 Season (Kids' Programs by Age Group)
Business Standard Plan: 63P4G3ELRPADKQU
Business Standard Plan: 63F7D7CPD9XXUVT
Set yourself up for promotion or get a better job by Acing the AWS Certified Data Engineer Associate Exam (DEA-C01) with the eBook or App below (Data and AI)

Download the Ace AWS DEA-C01 Exam App:
iOS - Android
AI Dashboard is available on the Web, Apple, Google, and Microsoft, PRO version
Business Standard Plan: 63FLKQHWV3AEEE6
Business Standard Plan: 63JGLWWK36CP7W
Invest in your future today by enrolling in this Azure Fundamentals - Pass the Azure Fundamentals Exam with Ease: Master the AZ-900 Certification with the Comprehensive Exam Preparation Guide!
- AWS Certified AI Practitioner (AIF-C01): Conquer the AWS Certified AI Practitioner exam with our AI and Machine Learning For Dummies test prep. Master fundamental AI concepts, AWS AI services, and ethical considerations.
- Azure AI Fundamentals: Ace the Azure AI Fundamentals exam with our comprehensive test prep. Learn the basics of AI, Azure AI services, and their applications.
- Google Cloud Professional Machine Learning Engineer: Nail the Google Professional Machine Learning Engineer exam with our expert-designed test prep. Deepen your understanding of ML algorithms, models, and deployment strategies.
- AWS Certified Machine Learning Specialty: Dominate the AWS Certified Machine Learning Specialty exam with our targeted test prep. Master advanced ML techniques, AWS ML services, and practical applications.
- AWS Certified Data Engineer Associate (DEA-C01): Set yourself up for promotion, get a better job or Increase your salary by Acing the AWS DEA-C01 Certification.
Business Plus Plan: M9HNXHX3WC9H7YE
With Google Workspace, you get custom email @yourcompany, the ability to work from anywhere, and tools that easily scale up or down with your needs.
Need more codes or have questions? Email us at info@djamgatech.com.
- The newly discovered geoglyphs, dating back to 200 BC, depict various animals and humans, including parrots, cats, monkeys, killer whales, and even decapitated heads.
- AI combined with low-flying drones dramatically accelerated the discovery process, accomplishing nearly a century’s worth of work in six months.
- These smaller geoglyphs (10-25 feet across) provide new insights into the transition from the Paracas culture to the Nazca culture.
- The findings, published in the Proceedings of the National Academy of Sciences, demonstrate AI’s ability to help greatly improve archaeological research.
Is there anything AI can’t help us accomplish? The amount of time saved using low-flying drones and artificial intelligence is worth repeating: 100 years worth of work in six months. The ways in which AI is going to impact our lives are still vast and largely unknown, as this discovery proves.
Source: https://www.cnn.com/2024/09/27/science/ai-nazca-geoglyphs-peru/index.html
What Else is Happening in AI on September 27th 2024!
AstraZeneca partnered with Immunai, paying $18 million to use the biotech firm’s AI model of the immune system to enhance cancer drug trial efficiency.
Visa agreed to acquire AI-driven payments protection firm Featurespace to enhance its financial crime and fraud detection capabilities—the acquisition price was not disclosed.
Source: https://www.pymnts.com/acquisitions/2024/visa-buys-featurespace-to-bolster-fraud-prevention-efforts
Runway launched The Hundred Film Fund to provide grants of $5,000 to $1 million for filmmakers using AI in their projects.
Source: https://www.allaboutai.com/ai-news/runway-allocates-5m-to-support-up-to-100-ai-generated-films
Microsoft announced a $1.3 billion investment in Mexico to enhance AI infrastructure and skills training over the next three years.
Source: https://www.techopedia.com/news/microsoft-to-spend-1-3b-in-mexico-on-cloud-and-ai-infrastructure
Blackstone confirmed a $13.3 billion investment to build an AI data center in northeast England, creating 4,000 jobs including 1,200 in construction.
Hugging Face reached 1 million free public AI models on its platform, highlighting the trend towards specialized models for diverse use cases rather than a single dominant model.
Source: https://www.techopedia.com/news/hugging-face-ai-machine-learning-platform-exceeds-1m-model-listings
Drop LangChain and DSPy, Try Ell ?
A new language model programming library has been released, and it’s called Ell. It aims to compete with, and possibly replace, libraries like LangChain, Llama-Index, and DSPy.
I know — yet another library for composing language programs. And brace yourself, I might be building my own in a few weeks or months. But that’s a story for another day.
Here’s the tweet announcing Ell. Ambitious from the start, with the core maintainer calling it nothing less than the future of prompt engineering.
Source: https://x.com/wgussml/status/1833615864131948756
Mark Zuckerberg: creators and publishers ‘overestimate the value’ of their work for training AI
Source: https://www.theverge.com/2024/9/25/24254042/mark-zuckerberg-creators-value-ai-meta
Bill Gates: AI Is “The First Technology That Has No Limit”
Source: https://youtu.be/DD4F5it7a5M
Trending AI Tools on September 27th 2024
AI Search Grader – Quickly analyze + improve your brand’s visibility and perception on AI search engines (free tool)*: https://www.gushwork.ai/ai-search-grader
Neolocus – AI renders for interior design: https://www.neolocus.ai/
Clarity – AI image upscaler and enhancer: https://clarityai.co/
Helicone – Open-source platform for monitoring and debugging AI projects: https://github.com/Helicone/helicone
Read Aloud For Me: AI Dashboard – AI Tools Recommender – Safe AI
Welcome to Read Aloud For Me, the pioneering AI dashboard designed for the whole family! Our platform is the first of its kind, uniquely crafted to cater not only to adults but also to kids.
iOs PRO (No Ads): https://apps.apple.com/ca/app/read-aloud-pro-ai-dashboard/id1600174099
iOs: https://apps.apple.com/ca/app/read-aloud-for-me-ai-dashboard/id1598647453
Web/Android/PWA: https://readaloudforme.com
Windows: https://apps.microsoft.com/detail/9pm03vfn90l8?hl=en-ca&gl=CA
A Daily Chronicle of AI Innovations on September 26th 2024:
Listen here: https://podcasts.apple.com/us/podcast/ai-unraveled-latest-ai-news-trends-gpt-chatgpt-gemini/id1684415169
Meta unveils AR xAI glasses, new model, and more
OpenAI CTO exits amid rumors of non-profit removal
AI breakthrough in treating rare diseases
💰YCombinator’s AI boom is still going strong (W24)
Meta reveals its Orion AR smart glasses
OpenAI CTO Mira Murati leaves the company
Sam Altman could get 7% stake in OpenAI, worth $10B
FTC announces crackdown on ‘deceptive AI’ businesses
Meta unveils new Quest 3S headset and discontinues older models
Meta reveals its Orion AR smart glasses
- Meta has unveiled its Orion augmented reality glasses, which showcase full holographic capabilities, marking the most advanced AR glasses the company has developed after a decade-long effort.
- These smart glasses use tiny projectors to display holograms, are lightweight, and feature a contextual AI for improved digital experiences, both indoors and outdoors, without obstructing the user’s view of the real world.
- Orion glasses require a wireless puck for processing and a bracelet for gesture control, with Meta currently allowing only employees and select audiences to test the device as it works towards a consumer release in the future.
- Source: https://www.engadget.com/ar-vr/meta-reveals-its-orion-smart-glasses-175353381.html
OpenAI CTO Mira Murati leaves the company
- Mira Murati, the CTO of OpenAI, announced on social media that she is leaving the company after more than six years to pursue her own interests.
- OpenAI’s CEO Sam Altman expressed gratitude for Murati’s contributions and stated that more information about the transition plans will be shared soon.
- Murati’s resignation adds to recent high-level departures at OpenAI, including former safety leader Jan Leike and co-founder John Schulman.
- Source: https://techcrunch.com/2024/09/25/openai-cto-mira-murati-says-shes-leaving-the-company/
AI breakthrough in treating rare diseases
Harvard Medical School researchers recently developed an AI model called TxGNN that can identify existing drugs for repurposing to treat rare and neglected diseases.
- TxGNN identified drug candidates from nearly 8,000 existing medicines for over 17,000 diseases, many without current treatments.
- The model outperformed leading AI drug repurposing tools by nearly 50% in identifying candidates and was 35% more accurate in predicting contraindications, a specific situation in which a medcine should not be used.
- TxGNN uses a novel approach that identifies shared features across multiple diseases, allowing it to extrapolate from well-understood conditions to poorly understood ones.
- The researchers have made the tool freely available to encourage its use by clinician-scientists in the search for new therapies, especially for rare and untreated conditions.
Another week, another insane medical breakthrough for AI. While we still need years of clinical validation and approvals before widespread use, TxGNN has the potential to save thousands of lives and improve the lives of people who likely thought a treatment for their specific disease would never come.
Sam Altman could get 7% stake in OpenAI, worth $10B
- OpenAI is in discussions to grant CEO Sam Altman a 7% equity stake as part of its shift towards a for-profit business model, which would be the first time Altman holds ownership in the AI company.
- The company is considering becoming a public benefit corporation, aiming to turn a profit while also prioritizing societal benefits, amidst ongoing leadership changes, including the surprise departure of CTO Mira Murati.
- OpenAI is currently raising $6.5 billion at a $150 billion valuation, which could potentially boost Altman’s net worth by over $10 billion, placing him among the wealthiest individuals in the world.
- Source: https://finance.yahoo.com/news/openai-cto-mira-murati-says-194450279.html
FTC announces crackdown on ‘deceptive AI’ businesses
- DoNotPay, claiming to offer the “world’s first robot lawyer,” agreed to a $193,000 settlement with the FTC for making untested claims about its AI services.
- The settlement is part of the FTC’s Operation AI Comply, targeting companies that use artificial intelligence to deceive or defraud customers.
- The FTC ruled that DoNotPay must warn subscribers about its AI’s limitations and prohibit the company from claiming it can replace professional services without evidence.
- Source: https://www.theverge.com/2024/9/25/24254405/federal-trade-commission-donotpay-robot-lawyers-artificial-intelligence-scams
Meta unveils new Quest 3S headset and discontinues older models
- Meta introduced the Meta Quest 3S, a budget-friendly VR headset, at its Connect event on Wednesday, promising great performance despite the lower price point.
- Mark Zuckerberg announced the Meta Quest 3S, priced at $299.99, which is significantly cheaper compared to the $499.99 Meta Quest 3 or $999.99 Meta Quest Pro.
- The Quest 3S, equipped with a Snapdragon XR2 Gen2 chip and 8GB RAM, offers lower display specs and uses Fresnel lenses, making it distinct from the Quest 3.
- Source: https://mashable.com/article/meta-quest-3s-announced-specs-price
What Else is Happening in AI on September 26th 2024!
An MIT professor just announced the release of an open-source AI Podcast tool.
A professor of MIT just released an open-source tool for creating AI podcasts.
We are excited to share #PDF2Audio, an open-source alternative to the #podcast feature of #NotebookLM with flexibility & tailored outputs that you can precisely control in the app: You can make a podcast, lecture, discussions, short/long form summaries & more, including the use of the amazingo1 model (@sama)
Unlike NotebookLM, this tool is 100% open-source on Huggingface. That means, the community isn’t reliant on one super-tech giant and can now iterate and improve the tool themselves.
OpenAI is reportedly developing an improved version of its Sora AI video generation model, aiming for higher quality and longer clips than previously demonstrated.
Source: https://the-decoder.com/openai-reportedly-developing-improved-version-of-video-ai-sora
Meta announced it will not immediately join the European Union’s voluntary AI Pact, instead focusing on compliance with the upcoming AI Act regulations.
Nvidia analysts predicted the company will produce around 450,000 Blackwell AI GPUs in Q4 2024, potentially generating over $10 billion in revenue despite initial production challenges.
Source: https://www.greenbot.com/nvidia-blackwell-gpus/
Nebius Group revealed plans to invest over $1 billion in AI infrastructure across Europe by mid-2025, including GPU clusters and data centers.
The Federal Trade Commission announced enforcement actions against multiple companies for deceptive or unfair use of artificial intelligence in their practices.
OpenAI CEO Sam Altman said the Advanced Voice Mode rollout for ChatGPT has been completed early, except in jurisdictions requiring additional external review.
OpenAI asked US to approve energy-guzzling 5GW data centers
The tangible benefits of AI vs the actual resource cost utilization to support it is scary and concerning.
Meta will use AI to create lip-synced translations of creators’ Reels.
Trending AI Tools on September 26th 2024:
Notion AI – Search and chat with documents across Notion, Slack, and Google Drive: https://www.notion.so/
Rows AI Analyst 3.0 – An AI data analyst that visualizes and formats data: https://rows.com/docs/using-the-rows-ai-analyst
Magnific Mystic V2 – Advanced AI generator that can output up to 4k resolution images: https://magnific.ai/
Magic Patterns – Generate product design and React code: https://www.magicpatterns.com/
OpenMusic – Create custom tunes from text descriptions: https://openmusic-project.github.io/
A Daily Chronicle of AI Innovations on September 25th 2024:
Hacker plants false memories in ChatGPT to steal user data
OpenAI is rolling out long-awaited ‘advanced voice’ feature
Microsoft uses AI to fix hallucinations in AI models
Google releases production-ready models
Hacker plants false memories in ChatGPT to steal user data
- Security researcher Johann Rehberger found a flaw in ChatGPT’s long-term memory settings that allowed attackers to store false information and malicious instructions, which OpenAI initially did not consider a security concern.
- Rehberger created a proof-of-concept exploit that exfiltrated all user input perpetually by storing false memories using indirect prompt injections from untrusted sources like Google Drive or OneDrive, prompting OpenAI to issue a partial fix.
- Though OpenAI’s fixes prevent memory abuse for data exfiltration, prompt injections from untrusted content can still plant long-term false memories, making it crucial for users to monitor and review stored memories regularly to prevent attacks.
- Source: https://arstechnica.com/security/2024/09/false-memories-planted-in-chatgpt-give-hacker-persistent-exfiltration-channel/
OpenAI is rolling out long-awaited ‘advanced voice’ feature
- OpenAI is expanding the availability of Advanced Voice Mode, a feature in ChatGPT that lets users communicate more naturally with the AI by speaking and interrupting its responses mid-sentence.
- Initially released in July to an invite-only group, this advanced mode, which can adjust responses based on the user’s tone of voice, is now being rolled out to Plus and Team users, with plans to extend access to Enterprise and Edu tiers.
- To ensure safety, OpenAI has had external experts test the Advanced Voice Mode, addressing potential issues like violent or erotic speech and voice imitation, though the model’s closed-source nature limits independent evaluation.
- Source: https://www.technologyreview.com/2024/09/24/1104422/openai-released-its-advanced-voice-mode-to-more-people-heres-how-to-get-it/
Microsoft uses AI to fix hallucinations in AI models
- Microsoft introduced a feature called “correction” within Azure AI Studio, aimed at detecting and fixing inaccuracies in AI-generated outputs by automatically rewriting incorrect content.
- The correction tool identifies errors by comparing AI outputs with the customer’s original source material, explains the inaccuracies, and corrects them before users see the initial content.
- Microsoft acknowledges that while the correction tool uses both small and large language models to align AI outputs with reference documents, it does not entirely guarantee accuracy and is still prone to errors.
- Source: https://www.theverge.com/2024/9/24/24253452/microsoft-correction-ai-safety-tool-fix-errors
Google releases production-ready models
Google just announced significant updates to its Gemini AI models, including performance improvements, cost reductions, and increased accessibility for developers.
- Two new production-ready models came out today: Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002, offering improved quality across various tasks, including a 20% boost in math-related benchmarks.
- Pricing for Gemini 1.5 Pro has been reduced by over 50% for both input and output on prompts under 128K tokens, while rate limits have been increased significantly.
- The models boast 2x faster output and 3x lower latency compared to previous versions, with improvements in long context understanding and vision capabilities.
- Google also updated its default filter settings, giving developers more control over model configuration for their specific use cases.
Google is iterating quickly and pushing the boundaries of affordability for developers building with AI. While this isn’t Gemini 2 — it is a significant upgrade over the experimental models and will help builders create faster, smarter, cheaper applications.
Source: https://x.com/rowancheung/status/1838611170061918575
What Else is Happening in AI on September 25th 2024!
Early sign-ups to EU’s AI Pact include Amazon, Google, Microsoft and OpenAI — but Apple and Meta are missing.
Terminator director James Cameron joins Stability AI despite past warnings on artificial intelligence.
Warner Bros. Discovery adopted Google Cloud’s AI for caption generation, aiming to cut production time and costs for unscripted programming.
Intel launched Xeon 6 processors and Gaudi 3 AI accelerators, doubling performance for AI workloads and offering improved price and performance compared to Nvidia’s H100.
Source: https://finance.yahoo.com/news/intel-unveils-next-generation-ai-150000409.html
OpenAI increased API access for o1 models, adding tier 4 to the list of authorized users at 100 requests per minute and upping tier 5 users to 1000 requests per minute.
Source: https://x.com/OpenAIDevs/status/1838611640201162807
Suno AI announced a new cropping feature available to AI-generated songs, allowing Pro and Premier users to adjust the start and end of their creations.
Source: https://x.com/suno_ai_/status/1838688463044121035
Duolingo introduced AI-powered Adventures mini-games and a Video Call feature to enhance language learning through immersive, practical experiences for its users.
Apple unveiled its plan to roll out Siri’s major AI-powered updates gradually, with the most significant enhancements expected in iOS 18.3, likely launching in January 2025.
Source: https://gizmodo.com/siri-may-not-get-its-apple-intelligence-update-until-january-2025-2000502076
Google Launches Gemini 1.5 Pro and Gemini-1.5-Flash-002 AI Models With Faster Output
Google launched Gemini-1.5-Pro-002 and Flash-002 AI models that work better thanks to faster output, higher rate limits, and better filters.
Meta announces Meta AI with Voice – A competitor to OpenAI’s Advanced Voice model
Source: https://www.wired.com/story/meta-releases-new-llama-model-ai-voice/
Mira Murati Steps down from OpenAI
Source: https://x.com/miramurati/status/1839025700009030027
OpenAI is already training a new version of Sora with even higher quality and longer videos
Source: https://www.theinformation.com/articles/openai-is-revamping-sora-ai-video
Google Paid $2.7 Billion to Bring Back an AI Genius Who Quit in Frustration
Source: https://www.msn.com/en-us/money/other/google-paid-2-7-billion-to-bring-back-an-ai-genius-who-quit-in-frustration/ar-AA1raTPX
AI Training on September 25th 2024:
Step-by-step instructions for creating a podcast entirely with AI in less than 10 minutes:
To create this podcast, I used 1 free tools:
- NotebookLM from Google to generate the audio
Here are the exact steps I used to create the podcast.
Step 1) Find an article that you want to create a podcast with
This is the easiest step – you need to find (or write) an article that you want converted to a podcast.
Once you have an article to convert, you’re ready to move to step 2.
Step 2) Create a new notebook in Google’s NotebookLM

Next, you’ll need to navigate to NotebookLM and click “New Notebook”. After creating a notebook, you need to populate it with your article!
Step 3) Upload an article into NotebookLM

Google provides numerous options for uploading articles, including Google Drive, linking websites, and uploading files such as markdown files, txt files, and PDFs.
To make it easy, I literally just copy/pasted my article with the Paste text feature. I then waited a minute, and it redirected me to my new notebook!
Step 4) Generate a notebook using the “Generate” button

After being redirected, we need to click “Notebook guide” next to the input box. From there, a menu pops up, and we can see Audio Overview in the right of the screen. Click “Generate” and wait 5 minutes, and the audio is generated!
This audio is enough to start distributing it to major platforms, such as Apple Podcasts, Spotify, and Amazon Music. I will discuss that more towards the end of the article.
Trending AI Tools on September 25th 2024:
Adobe GenStudio – Helps marketing teams measure on-brand content: https://business.adobe.com/products/genstudio.html?
FactBot by Snopes – Fact-checking for urban legends and misinformation: https://www.snopes.com/factbot/
JustPaid – Automate invoice follow-ups and payment tracking: https://www.justpaid.ai/
ell – A lightweight prompt engineering framework for language models: https://github.com/MadcowD/ell
Pathway – Helps product teams test UX solutions and gather insights: https://pathway.cc/
Tubit AI – AI that summarizes YouTube videos for a deeper understanding: https://mylens.ai/apps/youtube
Read Aloud For Me – AI Dashboard – AI Tools Recommender – Safe AI
Welcome to Read Aloud For Me, the pioneering AI dashboard designed for the whole family! Our platform is the first of its kind, uniquely crafted to cater not only to adults but also to kids.
iOs PRO (No Ads): https://apps.apple.com/ca/app/read-aloud-pro-ai-dashboard/id1600174099
iOs: https://apps.apple.com/ca/app/read-aloud-for-me-ai-dashboard/id1598647453
Web/Android/PWA: https://readaloudforme.com
Windows: https://apps.microsoft.com/detail/9pm03vfn90l8?hl=en-ca&gl=CA
A Daily Chronicle of AI Innovations on September 24th 2024:
Superintelligence may be here sooner than expected
Nvidia and Alibaba to improve autonomous cars
AI video-to-video generation with Runway
IBM, NASA team up on new AI climate model
AI superintelligence could arrive in ‘few thousand days’
Boston Dynamics robots can now autonomously unlock doors
Superintelligence may be here sooner than expected; could arrive in ‘few thousand days’
OpenAI CEO Sam Altman just suggested that superintelligent AI could emerge in just a few thousand days, marking a potentially transformative moment in human history that could usher in an era of unprecedented prosperity and capability.
- Sam Altman, CEO of OpenAI, believes that artificial superintelligence could be achieved within “a few thousand days,” emphasizing this milestone as a pivotal moment in human history.
- Altman stresses the importance of investing in computing resources and infrastructural improvements to make AI widely accessible and prevent it from becoming a limited resource.
- Altman’s personal blog post on this topic coincides with OpenAI’s efforts to raise $6.5 billion, though the timeline he suggests is not the company’s official stance.
Being CEO of OpenAI, Sam Altman knows more about the current capabilities of AI than almost anyone else on the planet—and he is hyperoptimistic about the future. But regardless of whether or not superintelligence is here in 5-10 years or 25-30 years, it’s coming, Altman says, and it’s going to change everything.
Nvidia and Alibaba to improve autonomous cars
Alibaba Cloud and Nividia just announced a new collaboration to develop advanced AI solutions for autonomous driving, integrating Alibaba’s large language models with Nvidia’s automotive computing platform.
- Alibaba’s advanced Qwen AI models will be integrated into Nvidia’s Drive AGX Orin platform, which is already used by major Chinese electric vehicle manufacturers.
- The partnership aims to enhance in-car voice assistants with more dynamic conversations and intelligent recommendations based on visual and environmental data.
- The companies are also working on adapting Alibaba’s AI models for Nvidia’s next-generation Drive Thor platform, combining advanced driver assitance, autonomous driving, and AI driver capabilities.
Two powerhouses in the AI space teaming up to fix issues with autonomous driving is a huge plus for advancing the car industry, but Nvidia deciding to use Alibaba’s Qwen models is an even bigger, and another unexpected win for open-source.
Source: https://finance.yahoo.com/news/alibaba-nvidia-collaborate-advanced-autonomous-093000322.html
Boston Dynamics robots can now autonomously unlock doors
- Assa Abloy, a prominent Swedish company in the lock and security business, is teaming up with Boston Dynamics to develop a digital door access system for autonomous entry by Spot robots.
- This system leverages Bluetooth technology to allow Boston Dynamics’ Spot robots to communicate with Assa Abloy’s HID-branded Signo readers, enabling hassle-free access through automatic doors.
- Given Assa Abloy’s extensive global footprint in door access solutions, their collaboration with Boston Dynamics could accelerate the adoption of robot security patrols in commercial buildings.
- Source: https://www.theverge.com/2024/9/23/24252240/assa-abloy-boston-dynamics-spot-robot-patrol-dog-door-smart-entry
AI video-to-video generation with Runway
Runway now offers video-to-video, a new feature that allows you to simply upload a real-life reference video and generate a completely different AI video based on it.
- Sign up on Runway’s website and locate the “video-to-video” tool
- Choose the most advanced model (Gen-3 Alpha) and upload your reference video
- Write a detailed prompt describing how you want your video transformed
- Generate your AI video and refine by adjusting your prompt if needed
Source: https://app.runwayml.com/
IBM, NASA team up on new AI climate model
NASA and IBM have joined forces to create Prithvi WxC, a groundbreaking AI foundation model for weather and climate applications — designed to address a wide range of environmental challenges.
- Prithvi WxC is a large-scale AI model with 2.3 billion parameters, trained on 160 atmospheric variables from NASA’s MERRA-2 dataset, spanning global atmospheric data from 1980 to present.
- Unlike task-specific AI models, Prithvi WxC is a versatile foundation model that can be fine-tuned for various weather and climate applications.
- The model uses advanced AI techniques to process both regional and global weather patterns, potentially improving the accuracy and efficiency of weather predictions.
- NASA and IBM are embracing open science principles by making Prithvi WxC and its training workflows publicly available through Hugging Face.
This AI collab tackles one of humanity’s most persistent challenges: accurate weather prediction. Prithvi WxC could lead to better weather predictions, disaster preparedness, and optimized agriculture schedules, potentially saving lives and making food cheaper for millions of people.
What Else is Happening in AI on September 24th 2024!
AI uncovers 2,000-year-old Nazca drawings in Peruvian desert.
Researchers created an AI that can solve Google’s reCAPTCHAv2 image-based CAPTCHA tests with 100% accuracy, rendering the human verification method obsolete.
Source: https://github.com/DannyLuna17/RecaptchaV2-IA-Solver
OpenAI launched Academy, providing $1 million in API credits, training, and support to AI developers and organizations in low- and middle-income countries.
Source: https://openai.com/global-affairs/openai-academy/
Andy Serkis (Lord of the Rings, Planet of the Apes) teased a new project featuring AI-generated characters that transition from 2D to augmented reality.
Source: https://deadline.com/2024/09/andy-serkis-ai-characters-project-gollum-lord-of-the-rings-1236096832/
Vice President Kamala Harris pledged to support the AI sector while protecting consumers in a pitch to donors at a New York City fundraiser.
Source: https://www.yahoo.com/tech/kamala-harris-said-she-boost-123300160.html
LinkedIn suspended AI training on UK users’ data following privacy concerns raised by the Information Commissioner’s Office, halting the practice pending further engagement.
Source: https://www.bbc.com/news/articles/cy89x4y1pmgo
OpenAI released the Multilingual MMLU dataset, translating the AI benchmark into 14 languages to evaluate models’ knowledge across diverse cultures and improve multilingual capabilities.
A Daily Chronicle of AI Innovations on September 23rd 2024:
Sam Altman and Jony Ive announce AI hardware device
Microsoft fuels AI ambitions with nuclear
EA reveals AI-powered video game creation
Sam Altman and iPhone creators are making an AI device
- Sam Altman is collaborating with former Apple designer Jony Ive and Laurene Powell Jobs to develop a secretive AI-powered hardware device.
- This new venture by Altman, Ive, and Powell Jobs aims to create a computing device that offers a less socially disruptive experience compared to the iPhone.
- Altman and Ive are currently raising up to $1 billion for their startup, with significant investments from Emerson Collective and other prominent investors, to fund their innovative AI device project.
- While OpenAI and Apple strengthen their ties with ChatGPT on the iPhone 16, Altman and Ive are building a relationship of their own. Combining Ive’s iconic design with OpenAI’s models, including o1, could (hopefully) create AI hardware that’s actually useful in our daily lives, unlike the majority of options currently available.
- Source: https://techstartups.com/2024/09/23/sam-altman-teams-up-with-jony-ive-and-laurene-powell-jobs-for-a-new-startup-to-create-a-secret-ai-device/
Microsoft fuels AI ambitions with nuclear
Microsoft just signed a landmark 20-year deal with Constellation Energy to reopen the Three Mile Island nuclear power plant in Pennsylvania, aiming to fuel its data centers and AI operations with clean energy.
- The 20-year agreement will provide carbon-free power for Microsoft’s data centers, which are crucial for running its AI and cloud computing operations.
- This move highlights the substantial energy requirements of AI, as companies seek reliable and sustainable power sources to support AI expansion.
- The deal aligns with recent trends in the tech industry, such as Bill Gates’ $1 billion investment in a nuclear plant in Wyoming, which will also power homes and AI operations.
- The plant will be renamed the ‘Crane Clean Energy Center’ and is set to reopen in 2028.
Recent AI-driven power demands are leading tech giants to seek nuclear energy solutions. By securing a dedicated nuclear power source, Microsoft is strategically positioning itself to meet the increasing computational demands of AI development and deployment, without increasing its carbon footprint.
EA reveals AI-powered video game creation
Electronic Arts (EA) recently demonstrated a new AI-driven concept for user-generated video game content, showcasing how players can create and modify games in real-time using natural language commands.
- EA’s “Imagination to Creation” concept video displays two players rapidly building a complex game world and gameplay rules through voice commands without coding expertise.
- The system leverages EA’s vast proprietary dataset, including 3D assets, code, gameplay hours, and telemetry events, to train custom creation models.
- Players can remix elements from existing EA games and asset libraries, potentially creating entirely new gaming experiences.
This concept could change EA’s business model, creating a more interactive and dynamic relationship with their player base while possibly unlocking new revenue streams and extending the lifespan of games. However, it’s just a concept video—only time will tell what the future of video game creation will truly look like.
Source: https://www.thegamer.com/ea-epic-ai-creation-tool-imagination-to-creation/
Kamala Harris Vows to Boost AI and Crypto Investments in NYC Fundraiser Speech
Vice President-elect Kamala Harris promised to help increase investment in AI and crypto if she is elected. On Sunday, she pitched her economic plan to donors in New York City as one that would encourage new ideas and make sure that laws protect investors and consumers.
What Else is Happening in AI on September 23rd 2024!
Cloudflare’s new marketplace will let websites charge AI bots for scraping.
Disney researchers developed an AI system allowing robots to learn complex movements from random motion data, improving robots’ versatility across industries.
Source: https://finance.yahoo.com/news/disneys-ai-revolution-disney-unveils-164246543.html
Pudu Robotics unveiled PUDU D7, a 5’5” tall, 100-pound ‘semi-humanoid’ robot for service and industrial use, featuring 30 degrees of freedom and 8-hour battery life.
Google released new evaluations for long context reasoning in AI models, with their 1.5 Pro model outperforming others by 12.5% on tasks beyond simple information retrieval.
Source: https://blog.google/technology/ai/google-gemini-next-generation-model-february-2024/
Middle Eastern sovereign wealth funds significantly increased investments in AI startups, with funding up fivefold in the past year.
Researchers introduced AnySkin, a versatile, replaceable tactile sensor for robots that allows learned manipulation policies to generalize across different sensor instances.
Source: https://arxiv.org/abs/2409.08276
Google CEO Believes AI Replacing Entry Level Programmers Is Not The “Most Likely Scenario”.
China Launched World’s First AI Hospital with 14 AI Doctors.
Source: https://thedailycpec.com/china-launched-worlds-first-ai-hospital-with-14-ai-doctors
SPARK can create high-quality 3D face avatars from regular videos and track expressions and poses in real time. It improves the accuracy of 3D face reconstructions for tasks like aging, face swapping, and digital makeup.
Source: https://arxiv.org/html/2409.07984v1
Trending AI Tools on September 23rd 2024
Scenery – Let AI edit videos for you: https://scenery.video/
Epsilla – No-code platform for building production ready LLM apps: https://epsilla.com/
Clones – Engage in conversations with lifelike AI companions: https://meetclones.com/
Solidroad – AI-first training and assessment platform: https://solidroad.com/
Fathom – Transcribes, highlights, and summarizes meetings: https://fathom.video/
“Read Aloud For Me – AI Dashboard“, is a free app available for iOS and Android devices, and as a Progressive Web App. It provides access to various AI tools, including text-to-speech, translation, image recognition, and generative AI services like ChatGPT and Google Gemini. The app aims to be a user-friendly and secure platform for exploring and utilizing the latest AI technologies, targeting both adults and children with its family-friendly features and educational content.
Dive into a world where you can explore and test cutting-edge AI tools, or let your children engage with built-in AI games and bedtime stories, all tailored to spark their imagination and curiosity. Additionally, our intuitive AI explanations are designed to educate both kids and adults about how AI technologies function in a simple and understandable way. Whether you’re seeking education, entertainment, or an interactive experience, Read Aloud For Me is your gateway to the future of family-friendly digital interaction. At Wonderland Bedtime Stories, diversity shines through every tale. Our multilingual stories celebrate the rich heritage of children from all corners of the globe, ensuring every child sees themselves in the magic of storytelling.
A Daily Chronicle of AI Innovations on September 20th 2024:
Sam Altman on AI’s evolution: o1 to agents
💰OpenAI is oversubscribed
🍎Apple launches beta of AI-powered iOS 18.1
Google uses AI to help build cities
👀 California Governor Gavin Newsom signed eight different laws regulating AI deepfakes this week
💰OpenAI is oversubscribed
Despite the drama surrounding OpenAI in recent months, the company appears to be on track to complete its latest $6.5 billion funding round – with investors clamoring to put in billions more.
Why it matters:
- While overall sentiment around AI’s potential ROI is cooling, clearly there’s still enormous demand to invest in its frontrunners.
- The new round values OpenAI at an eye-watering $150B, nearly double its previous $85B valuation.
- But the new price tag may hinge on whether the company successfully transitions to a more traditional, for-profit structure.
Source: https://techstory.in/open-ais-6-5-bn-funding-round-exceeds-expectations-oversubscribed-by-investors/
👀 California Governor Gavin Newsom signed eight different laws regulating AI deepfakes this week
California Governor Gavin Newsom signed eight different laws regulating AI deepfakes this week, with implications for both Hollywood and Silicon Valley tech companies.
Breaking down the bills:
- AB 2655, AB 2839, and AB 2355 are meant to combat deepfake election content. AB 2655 in particular would require tech companies to remove or label election-related deepfakes within certain periods.
- AB 2602 and AB 1836 place more stipulations on when companies can use AI-generated likenesses of celebrities, aiming to curb unauthorized uses of AI replicas.
- SB 942, SB 926, and SB 981 prohibit creating and sharing deepfake porn, and require social platforms to handle user reports of unauthorized deepfakes.
- Meanwhile, SB 1027 – California’s hotly contested AI regulation – is still sitting on the Governor’s desk, as he weighs the bill’s potential “chilling effect.”
Sam Altman on AI’s evolution: o1 to agents
At T-Mobile’s Capital Markets Day, OpenAI CEO Sam Altman discussed the new o1 model’s advanced reasoning capabilities and hinted at the accelerated development of AI agents, suggesting another major leap in AI is imminent.
- o1 is described as the first AI system with advanced reasoning, outperforming previous models in complex problem-solving.
- Altman compared o1’s current state to the ‘GPT-2 stage’ of reasoning models, suggesting that this is just the start, and the GPT-4 equivalent is on the horizon.
- He outlined five levels of AI development, with o1 at level 2 (reasoners), and suggested level 3 (agents) could follow “relatively quickly”.
While the transition from GPT-4 to o1 seemed to take forever, Altman’s comments suggest the time between o1 and fully capable agentic AI systems will be significantly faster due to its iterative nature. If true, it should only be a couple of months before o1 models blow GPT-4o (and other current models) out of the water.
🍎Apple launches beta of AI-powered iOS 18.1
Apple just released the beta version of its highly anticipated AI software update, iOS 18.1, for early testing alongside the iPhone 16 release — significantly enhancing Siri and introducing a suite of AI-powered features.
- Available for iPhone 16 models and premium versions of iPhone 15 through the Apple Beta Software Program (users should download with caution).
- Siri now features improved conversational abilities and a new visual interface with a rotating glowing light during interactions.
- New AI-powered features include writing assistance, email summarization, and advanced photo editing tools, with more planned for future releases.
- It will initially be available in the United States, but Apple’s AI features will expand to other English-speaking countries in December and to more languages and regions in the next year.
This beta release offers an early glimpse into Apple’s long-awaited Siri upgrades and AI integrations that are set to create truly unique and personalized experiences for users. However, as with all beta software, download with caution — as the update might be buggy, and is not the final version of iOS 18 yet.
Google uses AI to help build cities
Google just released the Open Buildings 2.5D Temporal Dataset, an AI-powered tool that tracks building changes across the Global South from 2016 to 2023, including estimates of building presence, counts, and heights.
- The dataset covers a 32 million square mile area across Africa, Latin America, and South and Southeast Asia, using 10m resolution imagery from Sentinel-2 satellites.
- It uses a novel AI approach combining multiple low-resolution satellite images to achieve near high-resolution accuracy in detecting buildings and estimating their heights.
- The tool is designed to aid urban planning, crisis response, and environmental impact studies in regions lacking up-to-date infrastructure data.
- While powerful, the dataset has limitations, such as gathering data in areas with frequent cloud coverage and detecting very small structures.
Google is on fire with its new models tackling real-world problems. Yesterday, the tech giant was talking to whales with AI — and now it’s using the technology to help build better cities and prevent environmental disasters.
Source: https://readwrite.com/google-creates-open-buildings-2-5d-temporal-dataset/
💻 ByteDance to make its own AI GPUs
- ByteDance is developing two AI GPUs to reduce reliance on Nvidia, with mass production slated for 2026.
- The new AI GPUs are designed by Broadcom, produced by TSMC using N4/N5 process technologies, and focus on AI training and inference.
- ByteDance has already spent over $2 billion on Nvidia AI GPUs this year, highlighting the strategic importance of AI hardware to the company.
- Source: TikTok owner ByteDance taps TSMC to make its own AI GPUs to stop relying on Nvidia — the company has reportedly spent over $2 billion on Nvidia AI GPUs | Tom’s Hardware (tomshardware.com)
⚛️ Microsoft reopens nuclear plant to fuel its AI power needs
- Microsoft and Constellation Energy have struck a deal to re-open Pennsylvania’s Three Mile Island nuclear plant, allowing Microsoft to purchase all 835 megawatts of its energy generation for 20 years starting in 2028, pending regulatory approval.
- While the plant’s electricity won’t be directly allocated to Microsoft facilities, the agreement is part of a broader effort by the tech industry to secure new energy sources for data centers supporting AI models, cloud computing, and streaming services.
- The deal involves re-opening Unit 1, which was closed in 2019 due to economic reasons, marking the first time a decommissioned nuclear plant would be reactivated, potentially signaling a new era for nuclear energy in the tech sector.
🚨 Trying to break OpenAI’s new models could get you banned
- OpenAI’s o1 models, launched last week, were designed to excel in advanced reasoning, solving complex math, science, and coding problems by “thinking before they speak.”
- Users attempting to break the o1 models’ reasoning received warnings for violating terms of use, including the use of phrases like “reasoning trace” and “reasoning.”
- OpenAI’s updated Terms of Use indicate that violating usage policies could lead to account suspension or termination, aiming to prevent risks and ensure compliance with the law.
- Source: Trying to break OpenAI’s new o1 models? You might get banned | ZDNET
What Else is Happening in AI on September 20th 2024:
TikTok is developing its own AI GPUs, aiming to reduce reliance on Nvidia with two chips for AI training and inference planned for mass production in 2026.
MIT researchers developed an AI model called Crystalyze that can determine the structures of powdered crystalline materials from X-ray diffraction data.
Nearly a year after the UK’s AI Safety Summit and President Biden’s AI Executive Order, the White House is planning the first meeting of the International Network of AI Safety Institutes.
A UN advisory body made seven recommendations for governing AI, including establishing a panel to provide impartial scientific knowledge about AI.
And in an open letter coordinated by Meta, more than two dozen companies warned that the EU might reap fewer rewards of AI because of the bloc’s tech regulations.
LinkedIn confirms it’s training its AI models on user data by default, while allowing opt-outs for those with privacy concerns.
A Starling Bank survey reveals 28% of UK adults have encountered an AI voice cloning scam in the past year, while 46% were unaware such scams existed.
Wordfreq, a project analyzing human language usage, shuts down due to data pollution from generative AI.
Source: https://edworking.com/news/startups/why-wordfreq-wont-be-updated-generative-ais-impact
A Daily Chronicle of AI Innovations on September 19th 2024:
Lionsgate partners with Runway for GenAI
YouTube launches AI-powered creation tools
Google uses AI to understand whales
Amazon debuts an AI assistant for sellers, Project Amelia
Scientists store human genome on crystal which can last billions of years
Lionsgate partners with Runway for GenAI
Lionsgate, the film company behind The Hunger Games, John Wick, and Saw, teamed up with AI video generation company Runway to create a custom AI model trained on Lionsgate’s film catalogue.
- The partnership will develop an AI model specifically trained on Lionsgate’s proprietary content library, designed to generate cinematic video that filmmakers can further manipulate using Runway’s tools.
- Lionsgate sees AI as a tool to augment and enhance its current operations, streamlining both pre-production and post-production processes.
- Runway is considering ways to offer similar custom-trained models as templates for individual creators, expanding access to AI-powered filmmaking tools beyond major studios.
As many writers, actors, and filmmakers strike against ChatGPT, Lionsgate is diving head-first into the world of generative AI through its partnership with Runway. This is one of the first major collabs between an AI startup and a major Hollywood company — and its success or failure could set precedent for years to come.
Source: https://variety.com/2024/digital/news/lionsgate-generative-ai-filmmakers-runway-1236148854
YouTube launches AI-powered creation tools
YouTube just announced a range of new AI features designed to assist creators, including text-to-video generation, AI-powered inspiration tools, and automated dubbing capabilities.
- Veo, Google’s new AI video generation tool, will allow creators to generate six-second video clips for YouTube Shorts using text prompts, with the resulting content watermarked and labelled as AI-generated.
- YouTube’s new AI-powered “Inspiration” features will help creators brainstorm video ideas and even suggest responses to comments.
- The platform is also expanding its AI dubbing capabilities with expressive speech to make translations sound more natural, mimicking the original audio’s pitch and intonation.
- These new features aim to enhance human creativity rather than replace it, according to YouTube CEO Neal Mohan, who revealed that 92% of YouTube creators already use AI tools (in some capacity).
This is another win for global AI acceptance with YouTube leaning into AI generation on its platform, rather than fighting back against it. The platform is also showing how sensible AI watermarks can offset potential downsides for both creators and viewers.
Source: https://www.nbcnews.com/tech/youtube-rolls-four-new-ai-tools-creators-rcna111401
Google uses AI to understand whales
Google Research recently developed a new AI model capable of identifying vocalizations from eight distinct whale species, including the mysterious “Biotwang” sound recently attributed to Bryde’s whales.
- The model classifies 12 different whale vocalizations across a broad acoustic range (10 Hz to 120+ kHz), covering multiple calls for some species.
- The AI tool is designed to aid conservation efforts by improving the ability to track whale populations and movements through passive acoustic monitoring.
- The model includes newly identified sounds like the Bryde’s whale “Biotwang”, which was a decades-old mystery until recently solved by NOAA researchers.
- Google has made the model available for download via Kaggle Models, allowing researchers and conservationists to use the technology in their studies.
The benefits of AI in chatbots and medical research are obvious by now, but AI is impacting the entire globe — even deep down in the depths of the ocean. This AI model, in particular, is helping us better understand diverse whale species, which could lead to more effective protection strategies for endangered marine life.
Source: https://blog.google/technology/ai/pattern-radio-whale-songs/
Amazon debuts an AI assistant for sellers, Project Amelia
- Amazon is launching Amelia, an AI assistant, in beta for select U.S. third-party sellers to help resolve account issues and access sales and inventory data.
- Amelia is described as an “all-in-one, generative-AI based selling expert” and will be accessible via Seller Central, Amazon’s platform for merchants.
- Amazon is banking on Amelia’s generative AI capabilities to swiftly address and resolve account troubles for third-party sellers, bypassing traditional support channels.
- Source: https://www.cnbc.com/2024/09/19/amazon-introduces-amelia-an-ai-assistant-for-third-party-sellers.html
Scientists store human genome on crystal which can last billions of years
- Scientists at the University of Southampton have successfully stored the entire human genome sequence onto a virtually indestructible 5D optical memory crystal no larger than a penny.
- These 5D memory crystals, created using ultra-fast lasers to inscribe data within “nanostructured voids” in silica, promise unprecedented data density and can endure temperatures of up to 1,000°C, cosmic radiation, and immense physical impacts.
- Researchers aim for these eternal crystals to preserve genetic blueprints for future reconstruction of organisms, potentially aiding in the revival of endangered species or even humans long after extinction events.
- Source: https://www.techspot.com/news/104779-researchers-create-everlasting-5d-genome-memory-crystal-can.html
What Else is Happening in AI on September 19th 2024!
Alibaba unveils over 100 AI models to rival Microsoft, OpenAI.
YouTube announces AI features from Google DeepMind for Shorts creators.
Together AI’s LlamaCoder app generates full applications from prompts using Llama 3.1 405B, gaining 2,000+ GitHub stars and creating 200,000+ apps since launch.Source: https://ai.meta.com/blog/together-ai-llamacoder/ |
The Biden administrationannounced an international AI safety meeting in San Francisco for this November, gathering experts from nine countries and the EU. |
OpenAI reportedly warned users against probing its new o1 AI models’ reasoning processes, threatening bans for policy violations. Source: https://www.wired.com/story/openai-threatens-bans-as-users-probe-o1-model/ |
Northwestern University received $20 million to lead a new AI research institute developing tools for astronomy and astrophysics data analysis. Source: https://online.em.kellogg.northwestern.edu/artificial-intelligence |
Google announced $25 million in funding to support AI education initiatives, aiming to train over 500,000 educators and students in new artificial intelligence skills.Source: https://www.edweek.org/leadership/google-spends-25m-on-ai-training-for-teachers/2024/09 |
Alibaba released Qwen 2.5, a multilingual AI model with 72B parameters, rivaling larger models in performance across various benchmarks. Source: https://venturebeat.com/ai/alibaba-releases-new-ai-model-qwen2-vl-that-can-analyze-videos-more-than-20-minutes-long/ |
Nvidia launched its AI Aerial platform to optimize wireless networks and enable new AI experiences on a unified infrastructure for telecom providers.Source: https://blogs.nvidia.com/blog/ai-aerial-wireless-networks |
A Daily Chronicle of AI Innovations on September 18th 2024:
Listen to this episode at https://podcasts.apple.com/us/podcast/a-daily-chronicle-of-ai-innovations-on-september/id1684415169?i=1000669973849
Snap’s new AR glasses are powered by AI
AI memory works shockingly like humans
1X’s new model helps robots act in the world
Turn documents into engaging podcasts
Neuralink’s breakthrough vision restoration implant gets FDA approval
First major agreement between AI startup and Hollywood studio
LinkedIn is training AI on user data by default
California passes actor protection laws on AI clones
Snap unveils new AI-powered glasses
- Snap’s new fifth-generation Spectacles are AR glasses that overlay visual information and applications directly on their see-through lenses, offering an augmented reality experience without obscuring the user’s vision.
- The Spectacles use Snap’s new operating system, Snap OS, and include features like two processors, four cameras, micro-projectors, and cooling chambers to enhance performance and user experience.
- The glasses are part of a subscription-based Spectacles Developer Program, costing $99 per month, and Snap has partnered with OpenAI to help developers create experiences that integrate real-world context.
- Source: https://www.technologyreview.com/2024/09/17/1104025/snap-spectacles-ar-glasses/
Neuralink’s breakthrough vision restoration implant gets FDA approval
- Neuralink, owned by Elon Musk, received “breakthrough device” clearance from the FDA, but this does not indicate a developed cure for blindness.
- The FDA’s breakthrough devices program allows developers priority review and interaction with FDA experts during the premarket review phase.
- Neuralink’s new device, Blindsight, aims to increase electrode density to restore limited vision, but it faces significant challenges and misconceptions regarding its ability to grant sight to the blind.
- Source: https://techcrunch.com/2024/09/17/neuralinks-breakthrough-device-clearance-from-fda-does-not-mean-they-have-cured-blindness
First major agreement between AI startup and Hollywood studio
- Runway, a startup specializing in AI video tools, has partnered with Lionsgate to train a custom video model using the studio’s movie catalog.
- The collaboration will give Lionsgate’s creative talent access to the model to enhance their filmmaking and directing capabilities.
- This partnership marks the first public collaboration between a generative AI startup and a major Hollywood studio, preceding similar discussions by Disney and Paramount.
- Source: https://techcrunch.com/2024/09/18/generative-ai-startup-runway-inks-deal-with-a-major-hollywood-studio/
LinkedIn is training AI on user data by default
- LinkedIn is currently leveraging user data to enhance its generative AI products, without having updated its terms of service to inform users about this practice.
- According to multiple LinkedIn users and a statement provided to 404 Media, the platform is processing user data for AI improvements but has not yet disclosed this in its terms.
- The company has assured that it plans to revise its terms of service “shortly” to clearly communicate the data usage to its users.
- Source: https://www.404media.co/linkedin-is-training-ai-on-user-data-before-updating-its-terms-of-service/
California passes actor protection laws on AI clones
- California governor Gavin Newsom has signed two bills, AB 2602 and AB 1836, to limit the unauthorized use of performers’ likenesses by AI technology.
- AB 2602 prevents companies from using a digital version of an actor in a project without the actor’s informed consent and legal representation.
- AB 1836 requires entertainment companies to obtain permission from a deceased performer’s family or estate before creating or distributing a digital replica of them.
- Source: https://www.theverge.com/2024/9/17/24247583/california-governor-newsom-signs-ai-digital-replica-bills
Snap’s new AR glasses are powered by AI

Snap just unveiled its fifth-generation Spectacles, standalone AR glasses powered by the new Snap OS, integrating new AI capabilities and enhancing social interactions through augmented reality.
- The new Spectacles feature a suite of cameras and sensors, enabling multi-modal AI, contextual understanding, and full freedom of movement while remaining lightweight and compact.
- Snap OS, built specifically for Spectacles, allows users to interact with My AI, launch immersive Lenses, and extend Snapchat experiences into 3D.
- Snap also launched AI-powered video and portrait generation tools, creating videos and images from a single photo to enhance creative expression.
- Spectacles are available now to developers in the United States and require a $99/m subscription with a 12-month commitment.
While this is a major step in the right direction for boundless, cordless AR and AI devices — the 45-minute battery life and 46-degree field of view leave Spectacles with room for improvement. Regardless, Snap is pushing the boundaries of tech engineering, and it will be fascinating to see where the tech ends up in a few years.
1X’s new model helps robots act in the world
1X has developed its ‘World Model’, an AI-powered virtual simulator for robots that can predict complex object interactions and imagine multiple future scenarios from a single starting point, trained on thousands of hours of real-world robot data.
- The model predicts complex interactions including rigid bodies, dropping effects, partial observability, and deformable object interactions.
- It’s trained on thousands of hours of data from 1X’s EVE humanoid robots performing various tasks in homes and offices.
- The model demonstrates action controllability, generating diverse outcomes for different commands from the same starting point, such as opening doors or grasping boxes.
- 1X is releasing over 100 hours of video data, pretrained models, and cash prizes to accelerate research in compression, sampling, and evaluation.
OpenAI-backed 1X is on a mission to create smarter, more capable robots to help people with everything from household chores to mobility issues. The new world model will also significantly enhance the capabilities of their recently unveiled NEO Beta humanoid and accelerate the path to truly useful home robots.
Source: https://www.1x.tech/androids/neo
Turn documents into engaging podcasts
Google’s NotebookLM now offers an ‘Audio Overview’ feature that turns your uploaded documents, PDFs, or pasted content into two-way podcast-style discussions between two AI hosts.
- Visit NotebookLM and click “Try NotebookLM”.
- Create a new notebook and upload your document.
- Once processed, open the “Notebook guide” section.
- Click “Generate” next to “Audio Overview”.
- After a few minutes, your AI-hosted discussion will be ready to play!
AI memory works shockingly like humans
A new study from Hong Kong researchers revealed that large language models (LLMs) like GPT-4 possess a dynamic memory similar to human memory, challenging our understanding of AI capabilities and potentially bridging the gap between artificial and human intelligence.
- The researchers defined memory as the ability to generate outputs based on specific inputs, rather than static information storage.
- LLMs demonstrated near-perfect memorization of thousands of poems after limited exposure, outperforming humans.
- The study proposes, however, that both human brains and LLMs operate on similar principles of “dynamic fitting”, or adjusting their responses based on input rather than retrieving fixed information.
- Both humans and LLMs can only “remember” something after triggered by input, suggesting LLM reasoning might be limited by model size and training data quality rather than fundamental differences from human cognition.
If verified and LLMs do have memory capabilities similar to humans, it could change the way we understand artificial intelligence. Without fundamental cognitive differences between AI and LLMs, scaling AI capabilities may simply be an issue of improving hardware and expanding data resources.
Source: https://scitechdaily.com/in-a-striking-discovery-ai-shows-human-like-memory-formation
🧠NVIDIA offering free Generative AI courses
NVIDIA is offering many free courses at its Deep Learning Institute. Some of my favourites
- Building RAG Agents with LLMs: This course will guide you through the practical deployment of an RAG agent system (how to connect external files like PDF to LLM).
- Generative AI Explained: In this no-code course, explore the concepts and applications of Generative AI and the challenges and opportunities present. Great for GenAI beginners!
- An Even Easier Introduction to CUDA: The course focuses on utilizing NVIDIA GPUs to launch massively parallel CUDA kernels, enabling efficient processing of large datasets.
- Building A Brain in 10 Minutes: Explains and explores the biological inspiration for early neural networks. Good for Deep Learning beginners.
What is an NPU? Here’s Everything You Need to Know!
Have you heard the buzz about NPUs? If not, you’re about to—because neural processing units (NPUs) are set to become a game-changer for personal computers. You’ve heard of CPUs and GPUs, but NPUs are the next big thing in hardware, and soon, most PCs will have them. But what exactly are they, and why are they so important?
NPU 101: What is it? NPU stands for Neural Processing Unit. It’s a specialized processor designed to handle AI and machine learning tasks much more efficiently than traditional CPUs or GPUs. Think of NPUs as AI accelerators—they boost the performance of AI features without draining your battery or heating up your device.
How Do NPUs Work? AI tasks, like image recognition or language processing, rely on something called neural networks. NPUs are optimized to process these networks faster and with less power consumption. While GPUs are great at AI tasks, they require lots of energy. NPUs, on the other hand, hit the sweet spot by offering a balance between performance and efficiency.
NPU vs. CPU vs. GPU: The Breakdown
- CPU (Central Processing Unit): Your PC’s brain, handling general tasks.
- GPU (Graphics Processing Unit): Focused on handling complex graphics and gaming, but also useful for AI work.
- NPU (Neural Processing Unit): The AI specialist, running tasks like voice assistants, image processing, and more—without hogging system resources or battery life.
Why Do NPUs Matter? If you’ve ever run AI tools like ChatGPT or Stable Diffusion on your PC, you might have noticed they’re power-hungry and slow down your machine. NPUs can run these tasks faster and more efficiently. Plus, they free up your CPU and GPU for other tasks, meaning your entire system runs smoother.
NPUs in Action: What Can They Do? Windows PCs with NPUs can run AI-powered features, like Microsoft’s Windows Studio Effects (think background blurring on video calls or automatic eye contact correction). In the future, more apps will take advantage of NPUs, offering smoother AI-enhanced experiences—from audio and video editing to gaming.
What’s the Catch? Right now, not every PC has an NPU. But brands like Intel, AMD, and Qualcomm are starting to pack them into their newest chips. So, if you’re shopping for a new laptop, it’s something to keep an eye on.
Should You Get a PC with an NPU Now? It’s tempting to future-proof your device, but NPUs are still in their early stages. For most people, they’re a nice-to-have, not a need-to-have—yet. Many AI tools still run on the cloud, meaning your current hardware will work fine. But if you’re looking to stay ahead of the curve, picking up a PC with an NPU could be a smart move for AI-heavy tasks.
The AI revolution is just getting started, and NPUs will be at the heart of it. Whether you’re a gamer, creator, or just want a smoother PC experience, NPUs are set to make everything faster, smarter, and more efficient.
Note: The author used chatgpt to rewrite and make it better due to lack of my English skills
What Else is Happening in AI on September 18th 2024!
Microsoft, BlackRock form fund to sink up to $100B into AI infrastructure (AI data centers and additional related power infrastructure), raising $30 billion to start.
Source: https://www.theregister.com/2024/09/18/microsoft_and_blackrock_form_fund/
Google introduced 10 voice options for Gemini Live, allowing users to customize their AI assistant’s vocal interface.
Sakana AI secured ~$200 million in Series A funding from Japanese companies to accelerate AI development and market expansion.
OpenAI increased rate limits for 01-mini and 01-preview models, allowing Plus and Team users more daily and weekly interactions (up to 50 messages per day for 01-mini).
Source: https://www.zdnet.com/article/openai-expands-o1-model-availability-see-who-gets-access-and-how-much/
Perplexity introduced “reasoning” focus for Pro users (up to 10 uses per day), utilizing OpenAI’s o1-mini model for puzzles, math, and coding tasks.
Source: https://www.theverge.com/2024/7/3/24191431/perplexity-upgraded-pro-search-ai-research-assistant
Mark Cuban Foundation launched free AI bootcamp for Dallas teens, partnering with the Mavericks to teach AI fundamentals and applications.
Intel announced a partnership with Amazon to manufacture custom AI chips, expanding their foundry business and semiconductor capabilities.
Source: https://siliconangle.com/2024/09/16/intel-spin-off-chip-fab-business-unit-aws-signs-custom-ai-chips/
Lenovo announced plans to manufacture AI servers in India and opened a new AI-focused R&D lab in Bengaluru, aiming to produce 50,000 units annually.
Source: https://www.digitimes.com/news/a20240918VL200/lenovo-manufacturing-ai-server-plant-hardware.html
Trending AI Tools
Supademo 3.0 – Generate engaging, interactive product demos: https://www.supademo.com/
Cracked copilot – Create and refine motion graphics from a text prompt: https://supertools.therundown.ai/content/cracked-copilot
AnyParser Sandbox – Enhances document retrieval and text extraction efficiency: https://www.cambioml.com/sandbox
AutoGen – Build multi-agent conversations for LLM workflows: https://github.com/microsoft/autogen
FlyCode for Stripe – Maximize subscription revenue with payment optimization AI: https://www.flycode.com/stripe
DreamRP – Create Interactive story and characters with ethical AI: https://www.dreamrp.com/chat
Read Aloud For Me – AI Dashboard: AI Tools Recommender, Platform to find and test AI Tools, AI platform for kids, Safe AI for ALL: iOs – Windows – Web/PWA/Android
A Daily Chronicle of AI Innovations on September 17th 2024:
Sam Altman departs OpenAI’s safety committee
Microsoft adds faster Copilot to Excel and Word
Groq to build world’s largest AI data center
Amazon kills remote working
Google outlines plans to help you sort real images from fake
Chipotle unveils team of robots that will make your food
Sam Altman departs OpenAI’s safety committee
- OpenAI CEO Sam Altman is leaving the internal commission OpenAI created in May to oversee “critical” safety decisions related to the company’s projects and operations.
- The new committee, chaired by Zico Kolter and includes members like Adam D’Angelo and Paul Nakasone, will oversee significant model launches and have authority over safety evaluations.
- Despite being labeled as independent, all committee members are also part of OpenAI’s broader board of directors, bringing into question the actual independence of the committee.
- Source: https://www.theverge.com/2024/9/16/24246617/openai-independent-safety-board-stop-model-releases
Microsoft adds faster Copilot to Excel and Word
Microsoft just unveiled the next wave of Copilot, its AI assistant, introducing new features and expanding its integration across Microsoft 365 apps like Excel, PowerPoint, Teams, Outlook, Word, and OneDrive.
- Copilot Pages: A new collaborative canvas for AI-assisted teamwork, enabling real-time, multi-user interaction with AI on persistent, editable content.
- Copilot Agents: Advanced AI assistants that automate complex business processes, operating in the background to execute tasks autonomously.
- Agent Builder: A no-code tool for creating custom Copilot agents, making AI development easier for non-technical users.
- Performance Boost: Copilot responses are now twice as fast with triple the user satisfaction, powered by a GPT-4 integration.
Microsoft is making it easier than ever for non-technical users to add AI workflows to their suite of productivity tools. This could be the first taste of fast, intelligent, AI-powered Excel sheets and Word docs for millions of people.
Amazon kills remote working
- Amazon will require employees to return to the office five days a week starting from the beginning of next year, ending remote work practices that became common during the pandemic.
- CEO Andy Jassy stated that in-person collaboration and cultural strengthening are more effective, leading to this strict mandate on office attendance, with few exceptions for special circumstances.
- Amazon will also reintroduce assigned floor plans in its US offices, ending hot-desking, and continues to be an outlier among tech companies by enforcing such rigid office attendance policies.
- Source: https://arstechnica.com/gadgets/2024/09/amazon-kills-remote-working-tells-workers-to-be-in-office-5-days-a-week/
Google outlines plans to help you sort real images from fake
- Google will introduce a feature in search results to identify whether a photo was taken with a camera, edited by software, or generated by AI models, using technology from the C2PA.
- The company is working with the Coalition for Content Provenance and Authenticity (C2PA) to create digital trails for images, marking the first significant test for C2PA’s technical standards.
- Google aims to broaden the use of C2PA’s metadata in its ad systems and YouTube, while addressing challenges in adoption and interoperability across different hardware and software platforms.
- Source: https://www.theverge.com/2024/9/17/24247004/google-c2pa-verify-ai-generated-images-content
Chipotle unveils team of robots that will make your food
- Chipotle is introducing collaborative robots, known as “cobots,” to prepare burrito bowls and salads in two California locations, with plans to expand based on customer feedback.
- The Augmented Makeline robot, developed with Hyphen, will ensure consistent portion sizes, while the Autocado bot can efficiently prepare avocados for guacamole by cutting, coring, peeling, and scooping in 26 seconds.
- Despite concerns about automation replacing workers, Chipotle asserts that the new bots will enhance productivity and job satisfaction without reducing the number of employees.
- Source: https://fortune.com/2024/09/16/chipotle-portion-size-automation-autocado-robot-guacamole-assembly/
Groq to build world’s largest AI data center
AI startup Groq recently partnered with Saudi oil giant Aramco to build what they claim will be the world’s largest AI inferencing center in Saudi Arabia, aiming to serve the Middle East, Africa, and India.
- The data center will initially house 19,000 language processing units (LPUs), with potential expansion to 200,000 units, potentially making it the largest AI inferencing center globally.
- Aramco is funding the project, expected to cost “in the order of nine figures,” underscoring Saudi Arabia’s commitment to AI infrastructure.
- The center leverages Saudi Arabia’s advantages: low energy costs, available land, and proximity to 4 billion people within a 100-millisecond data travel time.
- Groq anticipates the center to be operational by the end of 2024, with potential for further collaborations between Groq and Aramco.
This partnership highlights Saudi Arabia’s commitment to AI and shows that Groq is capable of scaling up its lightning-fast AI inference infrastructure. It could accelerate the development of near-instantaneous chatbot responses — a critical next step following recent chain-of-thought breakthroughs like OpenAI’s o1 model.
What Else is Happening in AI on September 17th 2024!
Luma Labs launched the Dream Machine API, allowing developers to integrate their popular video generation AI model into applications without building complex tools.
Source: https://docs.lumalabs.ai
Google announced significant performance improvements for Gemini 1.5 Flash, reducing latency by over 3x and increasing output tokens per second by more than 2x.
Source: https://9to5google.com/2024/08/30/gemini-1-5-flash-faster-response
A Canadian study showed that an AI early warning system reduced unexpected patient deaths by 26%, monitoring vital signs and alerting staff to intervene earlier.
James Earl Jones agreed to let AI replicate his Darth Vader voice before his death, allowing the character to continue in future Star Wars productions.
AI pioneers called for international oversight to address potential catastrophic risks from rapidly advancing AI technology, warning it could soon surpass human control.
Source: https://www.nytimes.com/2024/09/16/business/china-ai-safety.htm
OpenAI announced enhanced safety and security measures, including establishing a Board oversight committee chaired by Zico Kolter to monitor model development and deployment.
Trending AI Tools
OpenAI o1 – The new series of OpenAI models with advanced reasoning, available in ChatGPT Plus and Teams plans.
Source: https://openai.com/o1
Google Illuminate – Turn books and papers into engaging audio discussions
Source: https://illuminate.google.com/
Read Aloud For Me – AI Dashboard: AI Tools Recommender, Platform to find and test AI Tools, AI platform for kids, Safe AI for ALl.
Source: https://apps.apple.com/ca/app/read-aloud-for-me-ai-dashboard/id1598647453
A Daily Chronicle of AI Innovations on September 16th 2024:
World Labs aims to create 3D worlds with AI
Elon Musk and Larry Ellison begged Nvidia CEO Jensen Huang for AI GPUs
AI creates open-world video games from text
Prompt tips and tricks for OpenAI’s new o1
OpenAI o1 is smarter than most humans
World Labs aims to create 3D worlds with AI
Fei-Fei Li, renowned AI researcher known as the “Godmother of AI”, has launched World Labs, a spatial intelligence company developing AI models capable of understanding and generating 3D environments.
- World Labs is pioneering “Large World Models” (LWMs) to perceive, generate, and interact with 3D worlds, incorporating physics and semantics beyond 2D imagery.
- Co-founded by Li, Justin Johnson, Christoph Lassner, and Ben Mildenhall, the startup has secured over $230 million in funding from high-profile investors, including Andreessen Horowitz.
- Initial applications focus on creating and editing virtual 3D spaces for creative professionals and general users.
- The team aims to bridge the gap between AI’s current text-based understanding and the complex spatial relationships of the real world.
Right now, AI relies heavily on text descriptions provided by humans. But figuring out spatial intelligence and moving AI beyond LLMs and language could greatly impact and accelerate the development of intelligent AR/VR, robotics, architecture, game design, and much more.
Source: https://www.worldlabs.ai/about
Elon Musk and Larry Ellison begged Nvidia CEO Jensen Huang for AI GPUs
- Oracle co-founder Larry Ellison revealed he and Elon Musk begged Nvidia CEO Jensen Huang for more AI chips during a dinner meeting, citing high demand for graphics processing units.
- Ellison emphasized the urgency by repeatedly asking Nvidia to take more money, a plea which he said was successfully received during an Oracle investor event last week.
- Ellison’s fortune has grown significantly as Oracle’s cloud business thrives, with the company’s shares rising almost 61% this year and indicating increased demand for Nvidia GPU clusters to support AI models.
Source: https://qz.com/larry-ellison-elon-musk-nvidia-jensen-huang-ai-chips-1851648856
AI creates open-world video games from text
Tencent just unveiled GameGen-O, a groundbreaking AI model capable of generating open-world video game content from text prompts and allowing interactive gameplay simulation.
- GameGen-O can create diverse game elements like characters, environments, actions, and events from simple text prompts.
- It also allows for interactive control of the generated video game content it creates to build worlds that gamers can interact with.
- The AI model was trained on OGameData, a new dataset of over 4,000 hours of high-quality video game footage and uses a two-stage training process: foundation pretraining and instruction tuning.
- The training data segments were annotated by GPT-4o for validation purposes.
Following the impressive accomplishments of Google’s GameNGen, Tencent’s GameGen-O raises the stakes for AI-generated video games. This technology will dramatically reduce the time and cost of building open-world games, leading to higher quality games created by smaller teams of game developers.
Source: https://gamegen-o.github.io/
Prompt tips and tricks for OpenAI’s new o1
OpenAI’s latest release, o1, unlocks new reasoning capabilities, but there’s a catch: prompts should be fundamentally different than the way you prompt GPT-3 and GPT-4, due to the new Chain-of-Thought (CoT) architecture.
- Log into ChatGPT (paid subscription required) and select o1-preview or o1-mini from the model dropdown.
- Keep your prompts simple and direct, avoiding Chain-of-Thought instructions.
- Use XML tags for clarity and test with challenging questions or philosophical queries.
- Analyze its thinking process by clicking on the “thinking” indicator after each response to understand the model’s reasoning path.
Experiment with different query types to fully grasp o1’s advanced reasoning abilities and check out OpenAI’s new ‘Advice on prompting‘ guide.
Source: https://university.therundown.ai/c/daily-tutorials/how-to-use-and-prompt-better-the-new-openai-o1-52bae6df-3f09-4518-a484-4dce15d04e7d
OpenAI o1 is smarter than most humans
OpenAI’s latest model, “o1”, has reportedly scored an IQ of around 120 on the Norway Mensa IQ test, potentially marking the first time an AI model has surpassed the average human IQ.
- o1 correctly answered 25 out of 35 questions on the Norway Mensa IQ test.
- It demonstrated proficiency in solving complex visual and logical puzzles, both on publicly available tests and new, unpublished questions.
- This performance suggests the improvement is not due to the questions being in the training data.
- While OpenAI hasn’t officially confirmed these results, ChatGPT Plus members can test o1-preview themselves using the model dropdown inside ChatGPT.
If confirmed, these results would indicate that OpenAI has achieved Stage 2 performance on their 5-tier roadmap towards AGI. This milestone suggests the model can stop, think, and reason about problems to arrive at correct answers more frequently — a crucial prerequisite for Stage 3 (AI agents).
Source: https://trackingai.org/IQ
1,250 Taxi drivers already replaced by autonomous Waymo : how fast can they replace them all?
Waymo announced 100K weekly rides , meaning :
At 2 rides per hour:
100,000 rides ÷ 80 rides per week = 1,250 full time drivers are GONE
With Waymo scaling up rapidly, I wonder how long it will take before all of taxi drivers are replaced in USA . 2 years ? 5 years?
What happens to all the displaced drivers? ( eg 1.5million Uber drivers in USA)
Source: https://www.reddit.com/r/ArtificialInteligence/comments/1fhmeg8/1250_taxi_drivers_already_replaced_by_autonomous/
What Else is Happening in AI On September 16th 2023!
Runway released Gen-3 Alpha Video to Video, allowing users to transform input videos with AI-generated styles and prompts across all paid plans.
Source: https://x.com/runwayml/status/1834711758335779300
Meta (Facebook) admitted to scraping public data from all Australian adult users to train AI models, without offering an opt-out option available to EU users.
Source: https://www.abc.net.au/news/2024-09-11/facebook-scraping-photos-data-no-opt-out/104336170
Google AI Studio launched a new model comparison feature, allowing users to easily compare outputs from different AI models and parameter settings.
Source: https://x.com/OfficialLoganK/status/1834653092433760603
Researchers developed “g1”, an AI system using Llama-3.1 on Groq hardware that implements reasoning chains to solve complex problems similar to OpenAI’s new o1 model.
Source: https://x.com/BenjaminKlieger/status/1834946629126046145
A new AI chatbot using GPT-4 Turbo successfully reduced belief in conspiracy theories among users, with effects lasting for months after brief interactions.
Source: https://www.nature.com/articles/d41586-024-02966-6
Montana State University is developing AI methods using neural symbolic regression to help farmers maximize crop yields through precision agriculture.
Source: https://www.montana.edu/news/23969/montana-state-doctoral-student-uses-ai-to-help-farmers-in-the-field
Researchers are developing AI-piloted drone swarms to detect and extinguish wildfires, with teams of up to 30 autonomous aircraft working collectively to prevent fire spread.
Source: https://www.weforum.org/agenda/2024/09/tech-news-drones-prevent-wildfires
Google is funding an AI-powered satellite constellation that will spot wildfires faster.
Source: https://www.technologyreview.com/2024/09/16/1103962/google-is-funding-an-ai-powered-satellite-constellation-that-will-spot-wildfires-faster
OpenAI Shifts Gears: CEO Sam Altman Announces Move Away From Non-Profit Model Next Year.
In a recent company wide meeting OpenAI CEO Sam Altman told its staff members that the company plans to reform its not so simple non-profit structure. This change is anticipated to be done over the next year.
Trending AI Tools:
Read Aloud For Me – AI Dashboard – AI Tools Recommender – Safe AI
“Read Aloud For Me – AI Dashboard”, is a free app available for iOS and Android devices, and as a Progressive Web App. It provides access to various AI tools, including text-to-speech, translation, image recognition, and generative AI services like ChatGPT and Google Gemini. The app aims to be a user-friendly and secure platform for exploring and utilizing the latest AI technologies, targeting both adults and children with its family-friendly features and educational content.
iOs PRO (No Ads): Read Aloud PRO – AI Dashboard on the App Store (apple.com)
iOs: Read Aloud For Me AI Dashboard on the App Store (apple.com)
Web/Android/PWA: https://readaloudforme.com
Windows: AI Tools Catalog and Read Aloud For Me – Download and install on Windows | Microsoft Store
Runway Gen-3 Video-to-Video – Video style transfer with the Gen-3 AI video generation model
Source: https://runwayml.com/
Suno Cover – Reimagine the music you love with AI covers.
Source: https://suno.com/
HeyGen Avatar 3.0 – AI-generated avatars with emotions and tones that match your message.
Source: https://app.heygen.com/
Infography – Convert blogs into engaging infographics
Source: https://infography.in
FlutterFlow – Quickly generate code and styles for app development
Source: https://flutterflow.io/
AIPhone – Translates and transcribes live calls seamlessly
Source: https://www.aiphone.ai/
Fitness transform with AI and VR:
I’ve seen fitness transform with AI and VR playing huge roles. Personalized plans powered by AI and immersive VR workouts are game-changers. I recently tried a VR workout, and it felt like an exhilarating adventure rather than just exercise. The focus on integrating mental wellness and nutrition is also exciting. What trends do you think will shape the future of fitness? Have you experienced any of these innovations yourself?
I found an article. Thought of sharing: https://www.ispo.com/en/health/sport-health-retail-9-game-changing-mindshifts-future
A Daily Chronicle of AI Innovations on September 13th 2024:
OpenAI releases o1 model with human-like reasoning
White House launches AI datacenter task force
AI image transitions with Latent Navigation
Google AI teaches robots to tie shoelaces
Chatbots may reduce conspiracy beliefs, study shows
FDA clears Apple’s AirPods Pro to act as your hearing aids
Amazon’s AI chatbot starts pushing ads
OpenAI releases o1 model with human-like reasoning
- OpenAI has unveiled o1, a new generative AI model family, with two variants initially available: o1-preview and o1-mini, designed for enhanced fact-checking and code generation, respectively.
- o1 models are accessible to ChatGPT Plus or Team subscribers, and they offer improved reasoning capabilities by thoroughly considering all parts of a query before responding, resulting in better performance on complex tasks.
- Despite its advanced features, o1 comes with significant cost and speed drawbacks, being slower and more expensive than previous models, with ongoing issues such as hallucinations and occasional inaccuracies.
- o1 uses reinforcement learning and chain-of-thought processing to “think” before responding, mimicking human problem-solving.
- It outperforms expert humans on PhD-level science questions and ranks in the 89th percentile for competitive programming.
- The model also solved 83% of International Mathematics Olympiad qualifying exam problems, compared to GPT-4o’s 13%.
- Two versions available: o1-preview and o1-mini — which by the time of this newsletter publishing — has been rolled out to all ChatGPT Premium and Teams users.
- API access is significantly more expensive than GPT-4o, at $15 per 1M input tokens and $60 per 1M output tokens.
- Source: https://techcrunch.com/2024/09/12/openai-unveils-a-model-that-can-fact-check-itself/
White House launches AI datacenter task force
The White House is launching a new AI datacenter infrastructure task force led by the National Security Council, National Economic Council, and Deputy Chief of Staff’s office to ensure U.S. leadership in AI development.
- Tech leaders from Nvidia, OpenAI, Anthropic, Google, Microsoft, and Amazon met with government officials to discuss AI energy usage, datacenter capacity, job creation, and strategic locations.
- The task force will coordinate policies to advance datacenter development, aligning with economic, national security, and environmental goals.
- The administration is streamlining datacenter permitting processes and leveraging Department of Energy resources to support AI infrastructure growth.
- Tech giants reaffirmed commitments to net-zero carbon emissions and clean energy procurement for their operations.
This task force is a big shift in U.S. AI strategy, moving beyond safety testing to actively shaping the infrastructure needed to maintain America’s edge in AI. The news also follows the recent announcement that OpenAI and Anthropic will let the U.S. AI Safety Institute test new models before releasing them publicly.
AI image transitions with Latent Navigation
Hugging Face’s Latent Navigation space lets you generate smooth transitions between AI-generated images using CLIP text embeddings and Flux.1 schnell.
- Visit Latent Navigation space.
- Enter a detailed image description in the “Prompt” field (e.g., “a golden retriever in the park”)
- Set transition directions: “1st direction to steer” (e.g., “winter”) and “2nd direction to steer” (e.g., “summer”)
- Adjust the “Strength” and other settings to fine-tune your transition
- Click “Generate directions” to create your AI image transition
Pro tip: Experiment with the “Advanced options” for more control over your transitions.
Source: https://university.therundown.ai/c/daily-tutorials/create-ai-image-transitions-with-latent-navigation-83bda6e0-ee83-44df-8502-772e1c21c639
Google AI teaches robots to tie shoelaces
- Google DeepMind announced that it taught a robot to tie a shoe, but the robot’s shoe-tying skills are still very poor.
- The robot uses bi-arm manipulation and is powered by two AI systems, ALOHA Unleashed and DemoStart, to perform complex tasks.
- Many companies are developing robots to handle human-like tasks, with shoe-tying being a significant test of a robot’s capabilities.
- The first system, ALOHA Unleashed, allows robots to perform five complex bi-arm manipulation tasks, including tying shoelaces, hanging shirts, and repairing other robots.
- The second, DemoStart, uses reinforcement learning to teach robots dexterous behaviors in simulation, requiring 100x fewer demonstrations than traditional methods.
- The systems use diffusion methods similar to image generation models to predict robot actions from random noise.
- Robots achieved a 98% success rate in simulations and up to 97% in real-world tasks like cube reorientation.
While still far from human-level dexterity, these advancements represent another leap towards creating more useful robots for everyday tasks. The application of image generation techniques to robotics also shows how breakthroughs in one area of AI, can also trigger advancements elsewhere across the field.
Chatbots may reduce conspiracy beliefs, study shows
- Researchers from MIT Sloan and Cornell University found that AI chatbots can reduce belief in conspiracy theories by about 20%, even among individuals who consider these beliefs important to their identity.
- The study, published in Science, involved 2,190 participants who engaged in conversations with OpenAI’s GPT-4 Turbo, which provided tailored counterarguments to their conspiracy theories.
- The research demonstrated that participants experienced a significant decrease in their confidence regarding conspiracy theories, highlighting the effectiveness of chatbots in changing minds with evidence-based arguments.
- Source: https://www.technologyreview.com/2024/09/12/1103930/chatbots-can-persuade-people-to-stop-believing-in-conspiracy-theories/
FDA clears Apple’s AirPods Pro to act as your hearing aids
- The U.S. FDA authorized Apple’s AirPods Pro to function as an over-the-counter hearing aid device, marking the first software-based solution for hearing assistance integrated into existing consumer products.
- The AirPods Pro 2 will receive a software update this fall, enabling features that help users with mild to moderate hearing impairments by creating personalized listening profiles through tests conducted on an iPhone.
- A clinical study involving 118 participants with mild to moderate hearing loss showed that Apple’s self-fitting hearing aid feature provided similar benefits to professional fittings, and the FDA found no adverse effects from using the new AirPods software.
- Source: https://gizmodo.com/fda-clears-apples-airpods-pro-to-act-as-your-hearing-aids-2000498087
Amazon’s AI chatbot starts pushing ads
- Amazon’s AI-powered shopping assistant, Rufus, is designed to help users navigate the extensive range of products on Amazon’s website, and is now including some advertisements in its recommendations, as initially reported by Adweek.
- Rufus generates product suggestions based on Amazon’s extensive catalog, customer reviews, and community Q&As, while also integrating ads into these conversations to help users discover more products in a seamless manner.
- Although Rufus remains experimental, Amazon aims to ensure that sponsored suggestions remain relevant and contextually appropriate, minimizing the risk of irrelevant advertisements while potentially revolutionizing online advertising with this new approach.
- Source: https://www.techradar.com/computing/artificial-intelligence/amazons-ai-personal-shopper-is-sharing-ads-with-its-advice
Nvidia’s AI Surge Drives 8% Share Price Jump Amid Soaring Demand
Nvidia stock went up 8% after CEO Jensen Huang said that demand for its AI products was at an all-time high. Huang also talked about how important it is for big tech companies like Microsoft, Amazon, and Oracle to use Nvidia’s products to power their cloud and AI infrastructure.
Source: https://theaiwired.com/nvidias-ai-surge-drives-8-share-price-jump-amid-soaring-demand/
What Else is Happening in AI on September 13th 2024!
Google began rolling out Gemini Live to free users on the Gemini Android app — offering natural voice conversations with the AI assistant and 10 new voice options.
Source: https://x.com/GeminiApp/status/1834269227118924098
OpenAI’s ChatGPT reportedly surpassed 11 million paying subscribers, including 1 million on higher-priced business plans, potentially generating over $2.7 billion in annual revenue, according to COO Brad Lightcap.
Source: https://x.com/amir/status/1834347880251052203
Salesforce unveiled Agentforce, a suite of low-code tools for building autonomous AI agents capable of reasoning and completing tasks in sales, service, marketing, and commerce.
Source: https://www.salesforce.com/agentforce
Mastercard agreed to acquire AI-powered threat intelligence company Recorded Future for $2.65 billion, aiming to enhance its cybersecurity capabilities.
Source: https://www.mastercard.com/news/press/2024/september/mastercard-invests-in-continued-defense-of-global-digital-economy-with-acquisition-of-recorded-future
Google introduced DataGemma, the first open models designed to connect large language models with real-world data from Google’s Data Commons, aiming to reduce AI hallucinations by grounding responses in factual stats.
Source: https://blog.google/technology/ai/google-datagemma-ai-llm
Hume AI launched Empathic Voice Interface 2 (EVI 2), a new voice-to-voice foundation model trained for emotional intelligence, capable of understanding and generating various tones of voice and speaking styles.
Source: https://www.hume.ai/blog/introducing-evi2
Meta reignites plans to train AI using UK users’ public Facebook and Instagram posts.
The ‘godmother of AI’ just raised $230 million for her AI startup.
Source: https://qz.com/fei-fei-li-ai-startup-world-labs-raise-230-million-1851647701
MongoDB CEO says if AI hype were the dotcom boom it is 1996.
Source: https://www.theregister.com/2024/09/13/mongodb_ceo_says_if_ai/
Salesforce unveils its vision of AI agents to help every business.
Source: https://www.techradar.com/pro/salesforce-unveils-its-vision-of-ai-agents-to-help-every-business
A Daily Chronicle of AI Innovations on September 12th 2024:
Google turns your notes into podcasts
Adobe previews new AI video model
How to control AI video camera angles
🤖 Ireland Launches GDPR Investigation into Google’s AI Model
💰 OpenAI Secures $6.5B Investment Talks, Targets Massive $150B Valuation
⭐ OpenAI release OpenAI o1—a new series of AI models designed to spend more time thinking before they respond.
Mistral releases multimodal Pixtral 12B
🤖 Ireland Launches GDPR Investigation into Google’s AI Model
- Ireland’s DPC is investigating Google’s AI model for potential GDPR violations regarding personal data use;
- Google’s PaLM 2 is the subject of the inquiry due to its potential high-risk data processing;
- The probe follows a similar case involving X, which agreed to stop using EU/EEA user data for AI training.
Mistral releases multimodal Pixtral 12B
French AI startup Mistral has released Pixtral 12B, its first multimodal model capable of processing both images and text, available for free download under an Apache 2.0 license.
- Pixtral 12B is a 12-billion-parameter model, about 24GB in size, built on Mistral’s text model Nemo 12B.
- Pixtral 12B is the startups first multimodal model, meaning it can process and answer questions about images, as well as text.
- It’s available for download via GitHub and Hugging Face under an Apache 2.0 license, allowing unrestricted use and fine-tuning.
- Mistral plans to make Pixtral 12B available on their chatbot and API platforms, Le Chat and Le Plateforme, soon.
Despite being just over a year old, Mistral is rapidly emerging as Europe’s “answer” to OpenAI. With a lean team of top researchers and a recent $645 million funding round, the company is aggressively pushing the boundaries of AI with powerful open models.
Source: https://x.com/MistralAI/status/1833758285167722836
How to control AI video camera angles
Luma Labs’ new ‘Camera Motion’ feature allows you to add professional-looking camera movements to your AI-generated videos with simple text commands.
- Log in into Luma Labs and access the main interface.
- Set up your scene by uploading an image or describing it in the text box.
- Type “camera” in your prompt to unlock the Camera Motion dropdown menu.
- Choose from options like Move, Pan, Push, Pull, or Orbit to add dynamic movement to your scene.
- Generate your video and iterate as needed to perfect the result.
Pro tip: Add starting and ending keyframes for even more control over your camera movements.
Adobe previews new AI video model
Adobe just previewed its Firefly AI Video Model, which includes tools to extend existing videos and create new clips from text or image prompts, coming before year end.
- Three key features are coming: Text to Video, Image to Video, and Generative Extend.
- Text to Video can create clips from prompts, with options for camera controls and reference images.
- Image to Video transforms still images or illustrations into live-action clips.
- Generative Extend, coming to Premiere Pro beta, can add footage to cover gaps or extend shots.
While OpenAI’s (still unreleased) Sora focuses on generating videos from scratch, Adobe is aiming to create ‘a new era‘ for video editing itself. Soon, anyone will be able to change camera angles, extend scenes, and generate b-roll instantly with AI.
Google turns your notes into podcasts
Google just launched Audio Overviews, a new feature in NotebookLM that turns notes, PDFs, Google Docs, Slides, and more into AI-generated audio discussions between two virtual AI agents.
- Audio Overview creates a “deep dive” conversation from uploaded sources, with AI hosts summarizing content and connecting topics across materials.
- The tool works with various source types (documents, slides, charts, web URLs) using Gemini 1.5’s multimodal capabilities.
- To use Audio Overviews, open an existing notebook, navigate to the Notebook guide, and click on the “generate” button on the right-hand side.
- We confirmed with Google Labs that NotebookLM can process up to 50 sources, each up to 500,000 words, allowing for a total of 25 million words to be considered when generating the audio.
Audio Overviews can be a game changer for people who are auditory learners. This feature works particularly well with academic papers, ebooks, textbooks, presentations — we tested it by turning yesterdays newsletter into an Audio Overview and were quite impressed!
Source: https://blog.google/technology/ai/notebooklm-audio-overviews/
Generative AI: Techniques, Applications, and Impact on Business
Generative AI involves various techniques, each with its strengths and applications. Let’s explore some of the most prominent generative AI models behind this groundbreaking technology:
Generative Adversarial Networks (GANs):
Imagine two AI artists competing in an artistic duel. That’s the concept behind GANs! They consist of two neural networks:
- Generator: This network acts as the creative visionary, constantly working to produce new and realistic data like images, music, or text.
- Discriminator: This network acts as the critical judge, carefully analyzing the generated data to distinguish it from real data.
Source: https://www.seaflux.tech/blogs/generative-ai-revolutionizing-industries
💰 OpenAI Secures $6.5B Investment Talks, Targets Massive $150B Valuation
OpenAI, the company behind the recent viral ChatGPT artificial intelligence language tool, informally reported that they are discussing with investors funding at about $6. 5 billion (£5 billion). This would increase the value of this company to a whopping $150 billion (£115 billion). With this possible value, OpenAI would be one of the world’s most valuable startups, and this example illustrates how fast the company rose through the ranks in the tech industry. A story from Bloomberg reveals that the company is also exploring means of funding $5 billion (£3. 8 billion) through a revolving credit arrangement with the commercial banks.
Source: https://theaiwired.com/openai-secures-6-5b-investment-talks-targets-massive-150b-valuation/
Oracle To Deploy A Supercluster Of ~130,000 NVIDIA Blackwell GPUs, Alludes To A “Gigawatt” Capacity Data Center That Will Be Powered By 3 Nuclear Reactors
Source: https://wccftech.com/oracle-to-deploy-a-supercluster-of-130000-nvidia-blackwell-gpus-alludes-to-a-gigawatt-capacity-data-center-that-will-be-powered-by-3-nuclear-reactors/
Introducing PaperQA2, the first AI agent that conducts entire scientific literature reviews on its own.
Source: https://x.com/SGRodriques/status/1833908643856818443
What Else is Happening in AI on September 12th 2024!
OpenAI is reportedly in talks to raise $6.5 billion (plus another $5 billion credit line with banks) at a $150 billion valuation, significantly higher than its previous $86 billion valuation.
Meta is reportedly finalizing a new AI supercomputing cluster with over 100,000 Nvidia H100 chips to train its next Llama 4 language model.
Canada’s Defence Minister announced the country is working on incorporating AI into its military by 2030, emphasizing for the tech to “improve the work of our military personnel, but not to replace it”.
Kai-Fu Lee (former head of Google China), stated that Chinese AI models lag behind U.S. by 6 to 9 months, but predicts that Chinese AI apps will proliferate faster than in the U.S. by early next year.
Taylor Swift endorsed Kamala Harris for president, partly prompted by AI-generated deepfakes falsely showing her supporting Trump.
Oracle unveiled over 50 AI agents for its Fusion Cloud Applications Suite, designed to automate business processes across functions like capital management, supply chain, and customer experience.
Algorithm takes robots a step closer to being able to ‘act on intuition’.
Source: https://techxplore.com/news/2024-09-algorithm-robots-closer-intuition.html
Nevada will use Google’s AI to help decide if people should receive unemployment benefits.
Source: https://www.techspot.com/news/104696-nevada-use-google-ai-help-decide-if-people.html
⭐ OpenAI release OpenAI o1—a new series of AI models designed to spend more time thinking before they respond.
We’re releasing a preview of OpenAI o1—a new series of AI models designed to spend more time thinking before they respond. This new series of AI models can reason through complex tasks and solve harder problems than previous models in science, coding, and math. Rolling out today in ChatGPT to all Plus and Team users, and in the API for developers on tier 5.
- Competition Math (AIME 2024):
- The initial GPT-4 preview performed at 13.4% accuracy.
- The new GPT-4-1 model in its early version showed much better results, achieving 56.7%.
- In the final version, it soared to 83.3%.
- Competition Code (CodeForces):
- The GPT-4 preview started with only 11.0%.
- The first GPT-4-1 version improved significantly to 62.0%.
- The final version reached a high accuracy of 89.0%
- PhD-Level Science Questions (GPAQ Diamond):
- GPT-4 preview scored 56.1%.
- GPT-4-1 improved to 78.3% in its early version and maintained a similar high score at 78.0%
- The expert human benchmark for comparison scored 69.7%, meaning the GPT-4-1 model slightly outperformed human experts in this domain
it can literally perform better than a PhD human right now
More here: www.openai.com/o1
GPT o1 still fails basic math riddles
Before everyone goes crazy about the new model I will leave this here:
I already found an example of the model failing a simple primary school level riddle.
Original riddle as found online: Four people need to cross a rickety bridge at night. Unfortunately, they have one torch and the bridge is too dangerous to cross without a torch. The bridge can support only two people at a time. All the people don’t take the same time to cross the bridge. Time for each person: 1 min, 2 mins, 7 mins, and 10 mins. What is the shortest time needed for all four of them to cross the bridge?
My modified version (removed one constraint about how many people can cross at once): Four people need to cross a rickety bridge at night. Unfortunately, they have one torch and the bridge is too dangerous to cross without a torch. All the people don’t take the same time to cross the bridge. Time for each person: 1 min, 2 mins, 7 mins, and 10 mins. What is the shortest time needed for all four of them to cross the bridge?
GPT o1 Preview Response:
Thought for 25 seconds
The shortest time required for all four people to cross the bridge is 17 minutes. Here’s the optimal plan:
Person 1 (1 min) and Person 2 (2 mins) cross together.
Time elapsed: 2 minutes (the slower person’s time). On the other side: Persons 1 and 2. Original side: Persons 3 and 4. Person 1 (1 min) returns with the torch.
Time elapsed: 2 + 1 = 3 minutes. On the other side: Person 2. Original side: Persons 1, 3, and 4. Person 3 (7 mins) and Person 4 (10 mins) cross together.
Time elapsed: 3 + 10 = 13 minutes. On the other side: Persons 2, 3, and 4. Original side: Person 1. Person 2 (2 mins) returns with the torch.
Time elapsed: 13 + 2 = 15 minutes. On the other side: Persons 3 and 4. Original side: Persons 1 and 2. Person 1 (1 min) and Person 2 (2 mins) cross together again.
Time elapsed: 15 + 2 = 17 minutes. All four persons are now on the other side. Explanation of the strategy:
Fastest Movers as Couriers: Persons 1 and 2 are the fastest and are used to shuttle the torch back and forth. Slowest Pair Together: Persons 3 and 4 cross together, so their longer times only affect one crossing. Minimizing Return Times: The returns are made by the fastest individuals to reduce time spent. No alternative sequence allows all four to cross in less than 17 minutes given the constraints.
Take away: Real answer without the constraint should be 10, since without the constraint all people can cross at once. It basically still just gives answers it learnt online, rather than “reasoning”.
Credit: based on an example found by Darius Burschka
DataGemma: Using real-world data to address AI hallucinations
Announcing DataGemma, open models that leverage the power & knowledge of Data Commons to ground LLMs in real-world data & decrease the risk of “hallucinations.”
Google’s Data Commons is a publicly available data repository that serves as a foundation for factual AI. Explore how DataGemma utilizes Data Commons through Retrieval Interleaved Generation (RIG) & Retrieval Augmented Generation (RAG) approaches for fact-checking, responsible AI development, & more.
DataGemma are the world’s first open models designed to help address the challenges of hallucination by grounding LLMs in the vast, real-world statistical data of Google’s Data Commons.
Source: https://blog.google/technology/ai/google-datagemma-ai-llm/?linkId=10938872
A Daily Chronicle of AI Innovations on September 11th 2024:
OpenAI’s Strawberry set for early release
Engineer creates AI-designed walking table
Create photo-illustration blends with AI
Waymo’s driverless cars outperform humans
🤷♂️ Meta scraped every Australian user’s account to train its AI
🔮 Mistral releases Pixtral 12B, its first multimodal AI model
💻 Sergey Brin says Google engineers underutilize AI for coding
OpenAI’s Strawberry set for early release
According to a new report by The Information, OpenAI is planning to release Strawberry, its reasoning-focused AI model, as part of ChatGPT within the next two weeks — earlier than the originally expected fall timeline.
- According to early testers, Strawberry “thinks” for 10-20 seconds before responding, unlike typical AI models that answer immediately.
- The initial version will only handle text input and output, not images, making it non-multimodal unlike other recent AI models.
- Strawberry is expected to improve complex query handling without requiring users to use specialized prompt techniques like “chain-of-thought-prompting“.
- Early testers also said that Strawberry’s responses, while slightly better than GPT-4o, may not be worth the extra 10-20 second wait time.
If it lives up to reports, Strawberry will likely excel not only on objective tasks (math, coding) but also subjective ones (like product marketing) and open up a whole new world of practical AI use cases. But with early testers reporting that the model is only “slightly better“, is it really worth all the hype? We’ll find out soon.
Source: https://www.theinformation.com/articles/new-details-on-openais-strawberry-apples-siri-makeover-larry-ellison-doubles-down-on-data-centers
Engineer creates AI-designed walking table
Dutch engineer Giliam de Carpentier has created the ‘Carpentopod’, a wireless walking wooden coffee table, using an AI-optimized leg mechanism designed through evolutionary algorithms.
- The leg mechanism was developed using software that simulated thousands of generations of designs, optimizing for walking speed, clearance, and efficiency.
- The table features 12 legs (six on each end) made from precision-cut laminated bamboo, controlled by two motors.
- Carpentier developed a custom motor control software and uses a repurposed Nunchuck controller for remote operation.
- Watch a live demo of the walking coffee table in action here.
This project shows how makers are combining AI, traditional craftsmanship, and modern fabrication techniques to create unique projects that blur the lines between art, engineering, and furniture design. AI for design optimization, even for seemingly simple mechanical systems — is just getting started.
Source: https://www.decarpentier.nl/carpentopod
Create photo-illustration blends with AI
Using this unique prompt, AI image generators like Freepik and Midjourney can create images that blend both photography and illustration together.
- Choose an AI image generator (e.g., Freepik or Midjourney).
- Use this prompt template: “Blending of editorial photography and illustration, model, illustration fashion outfit, surreal environment of vibrant bold illustrations blending with the subject, flowers, smoke, flames and sparkles, half photography and half illustration”.
- Generate your image and check out the results.
- Upscale using 2K resolution, add “mix of photography and illustration” to the upscaler prompt, and apply “subtle” and “digital art” parameters.
Source: https://university.therundown.ai/c/daily-tutorials/create-photo-illustration-blends-with-ai-15b43823-d20a-4895-968d-fead4eb3e0ef
Waymo’s driverless cars outperform humans
A new deep dive into Waymo’s new crash data suggests that driverless cars are significantly safer than human-driven vehicles, with fewer injury-causing crashes and airbag deployments per mile driven.
- Waymo vehicles have traveled over 22 million miles, with only 20 injury-causing crashes reported.
- This rate is less than one-third of what’s expected for human drivers in similar conditions.
- Out of 23 severe crashes, 16 involved human drivers rear-ending Waymo vehicles.
- Waymo estimates its vehicles are one-sixth as likely as human drivers to experience crashes triggering airbag deployment.
As Waymo rapidly scales up its robotaxi service — now providing 100,000 rides per week — these safety statistics are crucial. The data suggests that we now have safe self-driving cars, and that these AI-driven cars could significantly reduce road accidents and injuries.
Source: https://x.com/binarybits/status/1833528077496029643
US Man Charged In Historic AI Music Fraud Case: Used Thousands Of Bots To Stream Fake Songs, Raked In $10M In Royalties
An American musician is facing charges for using AI to fraudulently inflate his song streams and earn millions in royalties. Prosecutors have never seen a case like this before. The musician faces decades in prison if convicted.
Source: Read the full story: https://www.ibtimes.co.uk/us-man-charged-historic-ai-music-fraud-case-used-thousands-bots-stream-fake-songs-raked-10m-1726815
James Earl Jones’ Darth Vader voice lives on through AI. Voice actors see promise and peril in that
James Earl Jones’ Darth Vader voice lives on through AI. Voice actors see promise and peril in that
Source: https://candorium.com/news/20240911050001425/james-earl-jones-darth-vader-voice-lives-on-through-ai-voice-actors-see-promise-and-peril-in-that
Taylor Swift says AI version of herself falsely endorsing Trump ‘conjured up my fears’
🤷♂️ Meta scraped every Australian user’s account to train its AI
- Meta’s global privacy director admitted that Meta scrapes photos and texts from all public Facebook and Instagram posts from Australian users since 2007 to train its AI technology.
- Unlike the European Union, Australian users do not have an opt-out option from data collection for AI training, which Meta attributes to the lack of specific privacy regulations in Australia.
- Meta does not scrape data from users under 18 but collects information if shared on accounts managed by their parents or guardians, indicating a gap in data protection for minors.
- Source: https://www.engadget.com/apps/meta-scraped-every-australian-users-account-to-train-its-ai-120026200.html
🔮 Mistral releases Pixtral 12B, its first multimodal AI model
- Mistral has launched Pixtral 12B, a 12-billion-parameter multimodal model capable of processing both images and text, making it the company’s first such model.
- This new model, which is built on Mistral’s existing Nemo 12B text model, can interpret images of various sizes and can be accessed via platforms like GitHub and Hugging Face.
- Pixtral 12B’s release follows Mistral’s successful $645 million funding round, positioning the company as a significant European contender in the AI industry, comparable to OpenAI.
- Source: Mistral releases Pixtral 12B, its first multimodal model | TechCrunch
💻 Sergey Brin says Google engineers underutilize AI for coding
Sergey Brin, Google’s cofounder, believes that the company’s engineers are not using artificial intelligence for coding as frequently as they should.
- Brin, speaking at the “All-In” summit, emphasized that AI’s advancement could simplify tasks like writing code, yet engineers remain conservative in their AI utilization.
- At the summit, Brin showcased an experiment where Google’s AI successfully generated Sudoku puzzle versions in just 30 minutes, demonstrating its potential in programming.
- Source: Sergey Brin Says Google Engineers Don’t Use AI As Much As They Should – Business Insider
NotebookLM.Google.com can now generate podcasts from your Documents and URLs!
Ready to have your mind blown? This is not an ad or promotion for my product. It is a public Google product that I just find fascinating!
This is one of the most amazing uses of AI that I have come across and it went live to the public today!
For those who aren’t using Google NotebookLM, you are missing out. In a nutshell it lets up upload up to 100 docs each up to 200,000 words and generate summaries, quizes, etc. You can interrogate the documents and find out key details. That alone is cool, but TODAY they released a mind blowing enhancement.
Google NotebookLM can now generate podcasts (with a male and female host) from your Documents and Web Pages!
Try it by going to NotebookLM.google.com uploading your resume or any other document or pointing it to a website. Then click * Notebook Guide to the right of the input field and select Generate under Audio Overview. It takes a few minutes but it will generate a podcast about your documents! It is amazing!!
Source: http://notebooklm.google.com/
What Else is Happening in AI on September 11th 2024?
China refuses to sign agreement to ban AI from controlling nuclear weapons.
Source: https://fortune.com/2024/09/10/china-ai-ban-nuclear-weapons
SambaNova launched SambaNova Cloud, claiming it’s the world’s fastest AI inference platform, running Llama 3.1 70B at 461 tokens per second and 405B at 132 tokens per second at full precision.
Source: https://x.com/SambaNovaAI/status/1833504179148755122
Salesforce introduced Industries AI, embedding over 100 industry-specific AI capabilities across 15 industry clouds to address unique challenges.
Source: https://www.salesforce.com/news/stories/industries-ai-announcement
Glean raised over $260 million in new funding at a whopping $4.6 billion valuation, co-led by Altimeter and DST Global, to expand its Work AI platform.
Source: https://x.com/jainarvind/status/1833479487415325006
Anthropic unveiled Workspaces in its API Console, allowing developers to manage multiple Claude deployments with granular controls for spend limits, rate limits, and access management.
Source: https://x.com/AnthropicAI/status/1833529395765776615
UBS developed an AI tool capable of analyzing 300,000 companies in 20 seconds to generate M&A ideas and identify potential buyers.
Source: https://www.bloomberg.com/news/articles/2024-09-10/ubs-has-an-ai-tool-that-can-scan-300-000-firms-in-20-seconds
The Australian government proposed new legal requirements for AI in high-risk settings and introduced a Voluntary AI Safety Standard for businesses.
Source: https://www.climatecontrolnews.com.au/contracting/new-ai-safety-standard
A Daily Chronicle of AI Innovations on September 10th 2024:
Apple launches AI-powered iPhone 16
Google’s AlphaProteo AI designs new protein binders
OpenAI backs new drug discovery startup
AI generates ideas better than humans
iPhone 16 gets AI superpowers
Apple just launched the iPhone 16, and it’s all about AI. The new A18 chip powers Apple Intelligence, a suite of AI features:
- Smarter writing tools that can rewrite emails and create custom emojis,
- Photo searches that understand natural language, and
- Siri finally gets what you’re saying (and can actually do something about it).
But it’s not just about the iPhone. Apple is bringing AI to the whole ecosystem. The Apple Watch Series 10 is 10% lighter and adds AI features like smart photo curation for watch faces. AirPods 4 now understands head gestures, letting you nod or shake to respond to Siri. While iOS 18 launches September 16, the full suite of Apple Intelligence features will roll out gradually through 2024, starting with a U.S. beta this fall.
Why does it matter?
By developing its own AI models and integrating them deeply into its ecosystem, Apple is not just competing with Google and OpenAI – it’s changing what we expect from our devices.
Source: https://www.apple.com/apple-events
Apple Faces Criticism in China Over Lack of AI in New iPhones as Huawei Gains Ground
The latest iPhones unveiled by Apple lack AI in China and people are angered while Huawei’s advancement continues. It is widely believed that China consumers are dissatisfied with the newest iPhone released by Apple as it lacks the integration of new age Artificial Intelligence. This has created concern for the tech giant particularly when Huawei is now a more enhanced competitor.
Google’s AlphaProteo AI designs new protein binders
Google DeepMind’s new AI, AlphaProteo, can design novel proteins that bind to specific targets up to 300 times more effectively than current methods. In lab tests, AlphaProteo’s creations successfully designed binders for 7 out of 8 target proteins, including some linked to cancer and viral infections.
It could accelerate drug discovery processes, improve disease diagnostic tools, and contribute to developing pest-resistant crops. While AlphaProteo could not design a binder for one challenging protein (TNFα), Google’s research team is actively expanding the system’s capabilities to address more complex targets.
Why does it matter?
This shows how AI can accelerate the development of new treatments for diseases like cancer and create more effective diagnostic tools. It also saves years of lab work and billions in research costs, potentially bringing life-saving drugs to market faster.
OpenAI backs new drug discovery startup
OpenAI and Thrive Capital recently backed Chai Discovery, a six-month-old AI biology startup founded by ex-OpenAI and Meta researchers that raised $30 million to develop AI models for drug discovery.
- Chai’s AI model, Chai-1, predicts biochemical molecule structures, potentially speeding up drug development.
- The company claims Chai-1 outperforms Google DeepMind’s AlphaFold on certain benchmarks.
- Chai-1 can work with proteins, small molecules, DNA, and RNA, making it versatile for various applications.
- Chai is making its first model free and open-source for non-commercial use.
Google DeepMind’s AlphaFold is the pinnacle of protein prediction models, and Chai is already giving it a run for its money. The more money flowing into drug discovery, the closer we get to curing major diseases using AI — and Chai is even open-sourcing its first model for the benefit of humanity.
Source: https://www.chaidiscovery.com/blog/introducing-chai-1
AI generates ideas better than humans
Researchers found that AI ideas are judged as more novel, though slightly less feasible, than those from human experts in a study comparing AI-generated research ideas in natural language processing (NLP).
- Over 100 NLP researchers were recruited to write novel ideas and review both AI and human-generated ideas.
- The ideas were evaluated blindly, meaning reviewers did not know which were AI-generated.
- AI-generated ideas were rated as significantly more novel than human expert ideas, but less feasible.
- The study also identified challenges in AI idea generation, including issues with self-evaluation and lack of diversity in outputs.
This is one of the biggest, most important questions surrounding AI — can it spark innovation and accelerate scientific discovery? The answer, according to this study, is a resounding yes.
Source: https://arxiv.org/pdf/2409.04109
New bill would force AI companies to reveal source of AI art
- A bill introduced in the US Congress seeks to compel AI companies to reveal the copyrighted material they use for their generative AI models.
- The legislation, known as the Generative AI Copyright Disclosure Act, would require companies to submit copyrighted works in their training datasets to the Register of Copyrights before launching new AI systems.
- If companies fail to comply, they could face financial penalties.
- The bill has garnered support from various entertainment industry organizations and unions.
- AI companies like OpenAI are facing lawsuits over alleged use of copyrighted works, claiming fair use as a defense.
Source: https://www.theguardian.com/technology/2024/apr/09/artificial-intelligence-bill-copyright-art
Is the impact of generative AI on elections overblown?
Matt Motyl , a researcher at the Integrity Institute, explains that the impact of AI on elections might be overblown. Despite widespread concerns, Motyl points out that real-world events still pack more punch than AI-generated content in shaping public opinion. He suggests that the ease of creating fake content with AI isn’t the main issue – it’s the distribution mechanisms on social media platforms that need addressing.
Source: https://mattmotyl.substack.com/p/impact-of-generative-ai-on-elections
Exploring the Frontiers of Language AI: LLM Tensors with Markov Chain Induced Virtual Neuron Pairs
It would be a groundbreaking development at the intersection of linguistics, mathematics, and artificial intelligence, if true. A leaked internal paper states that Apple AI researchers are delving into a novel field dubbed “LLM Tensors with Markov Chain Induced Virtual Neuron Pairs.” This cutting-edge approach promises to revolutionize our understanding of language models and potentially unlock new capabilities in AI-driven language processing. The Fundamentals
At its core, this research combines several complex concepts: Large Language Models (LLMs): The foundation of modern AI-driven text generation and understanding. Tensor Mathematics: Advanced linear algebra used to represent multi-dimensional data. Markov Chains: Probabilistic models that predict future states based on current states. Virtual Neuron Pairs: A theoretical construct representing emergent properties in neural networks. Key Innovations
The primary innovation lies in the creation of “Tensor-Markov Embedding Spaces.” These are high-dimensional mathematical constructs where each dimension corresponds to a specific linguistic feature. Within these spaces, language evolution is modeled using Markov chain probabilities, allowing for a more dynamic and context-sensitive representation of language. Another crucial aspect is the concept of “Virtual Neuron Pair Attention.” These pairs, while not physically present in the network, emerge from the interactions of real neurons. They act as specialized attention mechanisms, focusing on specific semantic relationships and potentially enabling more nuanced language understanding. Potential Applications
This research could lead to significant advancements in various fields: Machine Translation: More accurate and context-aware translations between languages. Content Generation: AI-generated text with improved coherence and style consistency. Sentiment Analysis: Deeper understanding of complex emotional nuances in text. Linguistic Research: New tools for studying language evolution and structure. Challenges and Future Directions
While promising, this field faces substantial challenges. The computational resources required to model these complex tensor spaces are immense. Future research will focus on refining the mathematical models, developing more efficient computational methods, and conducting extensive empirical studies to validate the approach’s effectiveness. The tweet with a link to the PDF was deleted, and this is only a summary.
What Else is Happening in AI on September 10th 2024!
Audible announced plans to create AI voice replicas of select US-based audiobook narrators, including a royalty-sharing model for AI-narrated books.
Source: https://www.bloomberg.com/news/articles/2024-09-09/amazon-s-audible-to-create-ai-voice-replicas-of-select-audiobook-narrators
Oscar-nominated studio XTR acquired AI animation studio Late Night Labs, launching Asteria Film Co. to produce AI-powered animations.
Source: https://variety.com/2024/film/news/xtr-buys-ai-animation-company-late-night-labs-1236137868
BP signed a five-year deal with Palantir to expand AI use in its operations, aiming to speed up engineer decision-making with LLMs.
Source: https://www.theguardian.com/business/article/2024/sep/09/bp-ai-deal-palantir-oil-gas-artificial-intelligence
Roche expanded its digital pathology open environment, integrating over 20 AI algorithms from eight new collaborators to enhance cancer diagnostics and research.
Source: https://www.prnewswire.com/news-releases/roche-advances-ai-driven-cancer-diagnostics-by-expanding-its-digital-pathology-open-environment-302241196.html
Researchers introduced Robot Utility Models (RUMs), a zero-shot method for training robot policies that can perform new tasks in unfamiliar environments.
Source: https://x.com/LerrelPinto/status/1833172778326925391
Recently, researchers created AI agents that can play procedurally generated video game levels to evaluate human engagement, helping to speed up game design.
Source: https://arxiv.org/pdf/2409.02632
A Daily Chronicle of AI Innovations on September 09th 2024:
Apple debuts AI-boosted iPhone 16
xAI could help Tesla with FSD, Optimus, and Siri-like feature
New robot butler does your chores
Tesla Robotaxi to charge wirelessly
AI rates fun factor of video game levels
New robot butler does your chores
Weave just announced Isaac, a versatile personal robot designed to help with various household tasks, made in California and set to ship to its first customers in fall 2025.
- Isaac can autonomously tidy up, fold laundry, organize spaces, and perform many other household chores.
- The robot butler responds to voice or text commands and can be programmed via an app.
- The camera that Isaac carries around folds away and stows in an included enclosure when not in use for privacy reasons.
- Weave plans on charging $59,000 (or $1,385 per month for 48 months) with a $1,000 reservation fee for 30 lucky US customers.
The competition for affordable, intelligent, personal robot assistants is continuing to heat up, and it looks like we’ll start to see these robots in homes by 2025. Whether its Tesla’s Optimus, 1X’s NEO Beta, China’s AGIBOT, or Weave’s Isaac — we’re just excited to not fold laundry ever again.
Source: https://www.weaverobots.com/
Tesla Robotaxi to charge wirelessly
A new patent from Tesla has revealed its advanced wireless charging system, potentially solving the need to manually plug in electric vehicles — allowing autonomous Robotaxis to charge without human intervention.
- The patent, filed in February and published recently, highlights a system that uses smart technology to adapt to variations in wireless charging conditions.
- It mentions a ground pad and a vehicle pad that work together to charge the car without any wires.
- The charging station can estimate and adjust for changes in coil inductance, improving efficiency and safety.
- Tesla may unveil this wireless charging technology at their upcoming Robotaxi event next month, aligning with the tech’s potential to enable self-driving vehicles to charge autonomously.
While wireless charging for EVs doesn’t solve a major problem, it could be a game-changer for self-driving vehicles. If Tesla’s Robotaxis can charge wirelessly, they could autonomously operate almost endlessly without human intervention — an important feature to keeping the fleet of taxis running 24/7.
Source: https://patentscope.wipo.int/search/en/detail.jsf?docId=WO2024182372
AI rates fun factor of video game levels
Researchers recently developed AI agents that can “play” procedurally-generated video game levels to evaluate how engaging they are for human players, helping to quicken the pace of game design.
- Researchers used AI agents to explore levels from the video game Super Mario Bros for high places, interesting objects, and more.
- The AI explored five “engaging” and five “unengaging” levels, judging them based on factors like how much of the level they covered and how many new things they found.
- The AI agents consistently rated the engaging levels higher than the unengaging ones.
- This technique could help game designers automatically test and improve their levels before human playtesters try them.
This AI-powered approach to video game evaluation could speed up game development and lead to more consistently fun levels for humans to explore. As we saw with Google’s DOOM clone — we might be approaching an era where AI not only rates games but creates them from the ground up.
Source: https://arxiv.org/pdf/2409.02632
Apple debuts AI-boosted iPhone 16
- Apple unveiled the iPhone 16 at its “Glowtime” event, featuring new generative AI functionalities designed into the device from the ground up, starting at a price of $799.
- The smartphone boasts a dedicated camera button for quick photo and video capture, a 48-megapixel main camera with improved low-light capabilities, and a new vertical lens layout.
- Powered by the 3 nm A18 chip, the iPhone 16 offers up to 30% increased speed and 35% better power efficiency, with larger batteries promoting enhanced battery life.
Source: https://techcrunch.com/2024/09/09/apple-iphone-16-announced/
xAI could help Tesla with FSD, Optimus, and Siri-like feature
- Elon Musk’s AI startup, xAI, could help Tesla with its Full Self-Driving (FSD) software, Optimus humanoid robot, and a Siri-like voice assistant by licensing xAI’s AI models, according to the Wall Street Journal.
- The potential revenue-sharing deal between Tesla and xAI would depend on the extent of xAI’s technology used, with discussions even suggesting an even split for FSD.
- Musk has expressed intentions to integrate xAI more deeply with Tesla, discussing a potential $5 billion investment and highlighting xAI’s contributions to advancing Tesla’s projects, though this has raised concerns among some investors and lawmakers.
What Else is Happening in AI on September 07th 2024!
OpenAI reported over 1 million paying users across its business products ChatGPT Enterprise, Team, and Edu within their first year of launch.
Google Photos upgraded search with natural language queries and launched “Ask Photos”, an AI-powered conversational search feature for US users.
Source: https://blog.google/products/photos/google-ask-photos-early-access
X permanently stopped Grok AI from using EU citizens’ tweets after legal action by Ireland’s Data Protection Commissioner.
Qualcomm CEO revealed that its partnership with Samsung and Google is developing mixed reality smart glasses as a companion device for smartphones.
Source: https://www.theverge.com/2024/9/5/24236545/qualcomm-mixed-reality-smart-glasses-google-samsung
DeepSeek released version 2.5, a 238B-parameter model with enhanced chat, coding, and 128k context length capabilities.
Source: https://x.com/osanseviero/status/1832037954224545978
OpenAI clarified that “GPT-Next” mentioned in a presentation is not a new model, but a placeholder term to illustrate potential future AI models.
Source: https://mashable.com/article/openai-clarifies-no-gpt-next-not-a-new-model
YouTube is developing AI detection tools for synthetic music and faces, and creator controls for AI model training, to protect content creators.
Source: https://techcrunch.com/2024/09/05/youtube-is-developing-ai-detection-tools-for-music-and-faces-plus-creator-controls-for-ai-training
LLM chatbots, search engines will co-exist, says Google’s Raghavan.
Anthropic’s Mike Krieger wants to build AI products that are worth the hype.
A Daily Chronicle of AI Innovations on September 06th 2024:
OpenAI’s new AI models could cost up to $2,000/month
Man accused of using bots and AI to earn $10 million in royalties
Google’s Gemini-powered photo search arrives in early access
New AI model ‘learns’ how to simulate Super Mario Bros. from video footage
Google DeepMind reveals ‘AlphaProteo’
New AI agent builds apps from prompts
Find top prompts with Google’s Prompt Gallery
AI creates infinite Super Mario Bros game
Google DeepMind reveals ‘AlphaProteo’
Google DeepMind just unveiled AlphaProteo, a groundbreaking AI system that designs custom proteins to bind with specific molecular targets, potentially turbocharging drug discovery and cancer research.
- AlphaProteo creates proteins that stick to specific targets like a biological lock and key, which is 3 to 300 times better at creating strong-binding proteins than current methods.
- The AI successfully designed binders for 7 out of 8 diverse targets, including cancer-related proteins.
- Independent tests showed some AlphaProteo designs could stop SARS-CoV-2 (Covid) from infecting cells and slash years of lab work down to days or weeks.
AlphaProteo could help us block viruses, fight cancer, and treat diseases in new ways using more targeted treatment approaches that could have far less side effects. This could dramatically accelerate drug discovery, potentially saving billions in R&D costs and bring life-saving treatments to patients faster.
New AI agent builds apps from prompts
Replit just launched an AI-powered tool called Replit Agent that can create software applications based on simple text descriptions, making coding more accessible to people of all skill levels.
- Users can describe the app they want to build in natural language, and the AI will generate a customizable plan and start to code.
- The agent can choose appropriate technologies and frameworks for the project and create web-based apps from scratch.
- Users can also interact with the agent, providing feedback and additional information as needed.
- The tool is available now on both web and mobile platforms for Replit Core and Teams subscribers.
AI is good at coding, but setting up an integrated development environment is still a major roadblock for most new coders. Replit Agent does this automatically and helps complete beginners go from idea to a fully functional app in a few prompts. It’s never been easier to build something cool.
Source: https://docs.replit.com/replitai/agent
Find top prompts with Google’s Prompt Gallery
Google AI Studio’s new Prompt Gallery offers pre-made prompts to help you craft better queries for the latest Gemini models.
- Visit Google’s Prompt Gallery.
- Browse and select a pre-made prompt that interests you and click on the prompting use case you want to use.
- Sign in to Google AI Studio, and customize the prompt to fit your specific needs.
- Adjust settings (e.g., model or temperature) and click “Run” to generate your AI-powered response.
Pro tip: Mix and match ideas from different pre-made prompts to create unique queries specific to your projects!
AI creates infinite Super Mario Bros game
Researchers developed an AI system called MarioVGG that can generate an infinitely playable Super Mario Bros game entirely through video, without using a traditional game engine.
- MarioVGG is a text-to-video AI model trained specifically on Super Mario Bros gameplay with a single consumer-grade GPU in just 48 hours.
- It can generate continuous, infinite game footage based on simple text commands like “run” or “jump”.
- The AI simulates game physics, mimics character movements, and even creates new levels on demand.
A week ago, Google created an AI-generated DOOM video game. Now, we have infinite Super Mario Bros. In the future, with tech like this, anyone will be able to create custom, interactive games simply by describing them to an AI.
Source: https://virtual-protocol.github.io/mario-videogamegen/static/pdfs/VideoGameGen.pdf?
OpenAI’s new AI models could cost up to $2,000/month
- OpenAI is reportedly considering a subscription fee of up to $2,000 per month for its upcoming large language models, codenamed Strawberry and Orion.
- The company plans to launch Strawberry this fall, which is anticipated to surpass current AI models in solving complex problems, including unseen math problems and market strategies.
- OpenAI is also reportedly aiming to raise several billion dollars in a new funding round, potentially valuing the company at over $100 billion, with possible investors including Apple, NVIDIA, and Microsoft.
Source: https://www.newsbytesapp.com/news/science/openai-may-charge-2-000-month-for-advanced-ai-models/story
Man accused of using bots and AI to earn $10 million in royalties
- A musician named Michael Smith from North Carolina has been charged with wire fraud conspiracy, wire fraud, and money laundering conspiracy for using AI-generated songs to fake music streams and collect over $10 million in royalties.
- From 2017 to 2024, Smith allegedly employed AI to create songs and used automated bots to inflate streaming numbers, exploiting platforms like Spotify, Apple Music, and YouTube Music, leading to around $1.2 million in annual royalties.
- The U.S. Attorney’s Office for the Southern District of New York revealed that Smith is accused of deceiving the streaming systems, with billions of fraudulent streams, resulting in charges that could lead to up to 20 years in prison each.
- Source: https://decrypt.co/248161/music-for-bots-doj-fake-ai-songs-streaming
Google’s Gemini-powered photo search arrives in early access
- Google Photos has revamped its search function, allowing users to search using everyday language for terms like “Alice and me laughing” or “kayaking on a lake surrounded by mountains,” rather than specific keywords.
- The new ‘Ask Photos’ feature, now with an open waitlist in the US, uses Gemini-powered AI to answer natural language questions about your photos, making it act like an AI-powered photographic memory.
- Google assures users that their data is protected with top-notch security and won’t be used for ads, with queries reviewed anonymously if needed to improve the ‘Ask Photos’ feature.
- Source: https://www.techradar.com/computing/cloud-computing/google-photos-gets-a-massive-search-upgrade-and-opens-its-waitlist-for-black-mirror-style-ask-photos-feature
New AI model ‘learns’ how to simulate Super Mario Bros. from video footage
- The MarioVGG model, despite showing glitches and being slow, demonstrates how even a limited AI can infer game physics and dynamics from video and input data.
- Researchers aim to develop a controllable video game generator or potentially replace traditional game development and engines with video generation models in the future.
- Training involved using a public Super Mario Bros. gameplay dataset of over 737,000 frames, focusing on limited inputs like “run right” and “run right and jump” to create consistent gameplay sequences.
- Source: https://arstechnica.com/ai/2024/09/new-ai-model-learns-how-to-simulate-super-mario-bros-from-video-footage/
What Else is Happening in AI on September 06th 2024!
OpenAI reportedly considered high-priced subscriptions up to $2,000 per month for advanced AI models like Strawberry and Orion.
Source: https://www.theinformation.com/articles/openai-considers-higher-priced-subscriptions-to-its-chatbot-ai-preview-of-the-informations-ai-summit
Tesla announced plans to launch AI-powered Full Self-Driving in China and Europe by early 2025, pending regulatory approval.
Source: https://www.investors.com/news/tesla-stock-china-fsd-ai-roadmap
Dell Technologies reported surging AI server sales, with CEO Michael Dell dismissing concerns about an AI spending slowdown.
Source: https://finance.yahoo.com/news/michael-dell-on-ai-talk-of-a-slowdown-is-way-overblown-231351649.html
The US, EU, and UK signed the world’s first legally binding international AI treaty, emphasizing human rights and accountability in AI regulation.
Source: https://cointelegraph.com/news/global-ai-treaty-human-rights-regulation
Microsoft announced its “Microsoft 365 Copilot: Wave 2” event for September 16, promising the reveal next-phase developments for its AI assistant.
Source: https://www.linkedin.com/events/7236780403867443202
Ada leveraged GPT-4 to double customer service resolution rates, aiming for 100% automated query resolution in the near future.
Source: https://openai.com/index/ada/
LMSYS opened beta testing signups for Copilot Arena, a new AI assistant comparison platform for tools like Cursor AI and Copilot.
Source: https://x.com/lmsysorg/status/1831733622401036703
YouTube is making tools to detect face and voice deepfakes.
Marc Benioff says Salesforce is basically starting from a ‘beginner’s mind’ mentality and going all in on AI agents.
Source: https://www.businessinsider.com/salesforce-ceo-marc-benioff-says-new-focus-on-ai-agents-2024-9
A Daily Chronicle of AI Innovations on September 05th 2024:
OpenAI co-founder raises $1B for AI safety
The fastest AI model goes multimodal
Turn any text into speech in seconds
AI gets smarter by re-reading questions
Engineers gave a mushroom a robot body and let it run wild
OpenAI co-founder raises $1B for AI safety
Safe Superintelligence (SSI), a new AI startup co-founded by former OpenAI chief scientist Ilya Sutskever, just raised $1 billion in funding to develop safe AI systems that surpass human intelligence.
- SSI plans to focus on AI safety, aiming to create superintelligent AI systems that are safe for humans and will spend years on R&D before bringing any product to market.
- Sutskever, Daniel Gross, and Daniel Levy founded SSI just three months ago, and the company is now valued at $5 billion.
- The startup, which only has ten employees, plans to use the funds to acquire computing power and hire top talent.
- Investors include major venture capital firms like Andreessen Horowitz and Sequoia Capital.
This is the first time a startup — that only came into existence three months ago — raised a whopping $1 billion dollars for a seed round. But it doesn’t come without reason, as Ilya Sutskever was one of the main reasons OpenAI is what it is today, and SSI is bound to attract some incredible talent.
The fastest AI model goes multimodal
Groq just launched LLaVA v1.5 7B, a powerful, new multimodal AI model that can understand both images and text and reportedly runs 4x faster than OpenAI’s GPT-4o.
- LLaVA v1.5 7B can answer questions about images, generate captions, and engage in conversations involving text, voice, and pictures.
- The model can also be used for various tasks like visual product inspection, inventory management, and creating image descriptions for visually impaired users.
- This is Groq’s first venture into multimodal models and faster processing times on image, audio, and text inputs could lead to better AI assistants.
- Groq is currently offering this model for free in “Preview Mode” for developers to experiment with.
Groq went viral earlier this year for its blazing-fast AI speeds — and now it’s pairing those capabilities with powerful multimodal models. When it comes to AI apps, faster is always better, and the insane speeds paired with advanced models open the door for an endless supply of new applications.
Source: https://groq.com/introducing-llava-v1-5-7b-on-groqcloud-unlocking-the-power-of-multimodal-ai
Turn any text into speech in seconds
ElevenLabs’ free mobile app (for both iOS and Android) allows users to convert any text into high-quality audio using AI voiceovers, across multiple languages.
Step-by-step:
- Download the ElevenLabs app from their website and create a free account.
- Tap the “+” icon and choose “Submit File” to upload your text.
- Tap “Generate” to create your audio file, then find it in your library to listen.
Pro tip: You can truly customize your audio by selecting from hundreds of AI voice options.
AI gets smarter by re-reading questions
Researchers have discovered that making AI systems ‘re-read‘ questions twice, mimicking human behavior, significantly improves their reasoning and problem-solving abilities across various tasks.
- The technique, called RE2 (Re-Reading), improves AI performance in math, common sense, and symbolic reasoning by simply repeating the question before answering.
- It works with different AI models and can be combined with other AI reasoning techniques.
- The method is most effective when repeating the question twice — more repetitions can decrease performance.
- RE2 even improved non-instruction-tuned AI models, showing broad applicability.
RE2 adds to the growing list of research showing the potential of mimicking the way humans think and process to improve AI. The simplicity of this research also suggests that many AI companies may be overlooking basic, human-inspired prompting techniques in the quest towards the highest performing model.
Source: https://arxiv.org/pdf/2309.06275
With 10x growth since 2023, Llama is the leading engine of AI innovation
Llama models are approaching 350 million downloads to date (more than 10x the downloads compared to this time last year), and they were downloaded more than 20 million times in the last month alone, making Llama the leading open source model family.
Llama usage by token volume across our major cloud service provider partners has more than doubled in just three months from May through July 2024 when we released Llama 3.1.
Monthly usage (token volume) of Llama grew 10x from January to July 2024 for some of our largest cloud service providers.
Source: https://ai.meta.com/blog/llama-usage-doubled-may-through-july-2024/
Nvidia denies DOJ antitrust investigation claims
- Nvidia stated it has not received a subpoena from the U.S. Department of Justice concerning an ongoing antitrust investigation, as confirmed in a statement to Bloomberg.
- A Nvidia representative emphasized that the company wins on merit and is ready to respond to any inquiries from regulators about their business practices.
- Despite reports of subpoenas, Nvidia clarified that it received civil investigative requests instead, which are often mistaken for subpoenas, according to Bloomberg.
- Source: https://www.benzinga.com/government/24/09/40719936/nvidia-has-not-been-subpoenaed-by-doj-whats-going-on
Engineers gave a mushroom a robot body and let it run wild
- Researchers from Cornell University and the University of Florence used the king oyster mushroom’s root-like structure (mycelium) to control robotic vehicles by converting its natural electrical signals into commands that guide the robots’ movements.
- The team successfully demonstrated how the mushroom’s natural signals, triggered by stimuli like UV light, could guide the movement of robots, creating a biohybrid system that responds to environmental cues.
- This innovation suggests potential future applications in agriculture and environmental monitoring, where fungal networks could help automate responses to various environmental cues.
- Source: https://www.sciencealert.com/engineers-gave-a-mushroom-a-robot-body-and-let-it-run-wild
What Else is Happening in AI on September 05th 2024!
Anthropic launched Claude Enterprise, competing with ChatGPT Enterprise by offering larger context windows and GitHub integration for businesses.
Source: https://techcrunch.com/2024/09/04/anthropic-launches-claude-enterprise-plan-to-compete-with-openai
Google committed $10 million to improve AI literacy among young people in 17 countries and across 13 different languages.
Source: https://blog.google/technology/families/improving-ai-literacy-in-young-people
Sakana AI raised $100M in Series A funding and announced a collaboration with Nvidia to advance AI research and infrastructure in Japan.
Source: https://sakana.ai/series-a/
Salesforce agreed to acquire AI voice agent firm Tenyx to enhance its AI offerings, joining tech giants in the race for AI talent and technology.
Source: https://finance.yahoo.com/news/salesforce-acquire-ai-voice-agent-213825311.html
A Samsung executive reportedly revealed GPT-5 specs, 3.5 trillion parameters and 7000 B100 GPUs for compute, at SEMICON Taiwan.
Source: https://x.com/kimmonismus/status/1831237312887308718
Microsoft announced Copilot+ AI features will expand to new laptops starting in November, broadening its AI capabilities beyond ARM-based devices.
Source: https://www.tomshardware.com/pc-components/cpus/microsoft-adds-copilot-support-for-new-cpus-amd-ryzen-ai-300-and-intel-core-ultra-200v-chips-receive-new-ai-experiences
You.com raised $50M to focus on AI productivity agents for complex queries, moving away from general AI search.
Yi-Coder: A Compact Yet Powerful Code LLM Released by 01.ai
01.ai has introduced Yi-Coder, a series of small but highly capable open-source large language models (LLMs) tailored for code generation. Despite having fewer than 10 billion parameters, Yi-Coder models offer cutting-edge performance, available in 1.5B and 9B parameter variants.
Yi-Coder is in the realm of open-source large language models (LLMs) specialized in coding.
- Performance and Size: Despite having fewer than 10 billion parameters, they deliver state-of-the-art coding performance. There are two sizes available: 1.5 billion and 9 billion parameters.
- Model Variants: Yi-Coder comes in both base and chat versions, designed to be efficient in inference and flexible in training.
- Enhanced Capabilities: The 9B version, particularly Yi-Coder-9B, extends its predecessor by adding 2.4 trillion high-quality tokens. These are meticulously sourced from a code corpus at the repository level on GitHub and selectively filtered code-related data from CommonCrawl.
Source: https://open.substack.com/pub/transitions/p/yi-coder-a-compact-yet-powerful-code?r=56ql7
A Daily Chronicle of AI Innovations on September 04th 2024:
OpenAI Japan reveals ‘GPT-Next’ details
Student builds fusion reactor at home with AI and $2K
Nvidia AI antitrust investigation is ‘escalating,’ reports Bloomberg
Autonomous AI agents form civilizations
Nvidia hit with antitrust subpoena
57% of online content now AI-generated, study shows
OpenAI Japan reveals ‘GPT-Next’ details
OpenAI Japan’s CEO just stated (article is in Japanese) that ‘GPT-Next’, OpenAI’s upcoming AI model, is expected to be 100 times more powerful than GPT-4, while also confirming a more advanced model named Orion is slated for 2025.
- GPT-Next is expected to achieve a 100x performance increase without wasting significantly more computing resources.
- The improvement comes from better architecture and learning efficiency, not just raw computing power.
- GPT-Next will use a smaller version of “Strawberry”, an advanced AI system OpenAI is currently developing.
- The slide mentions GPT Next 2024, which while unconfirmed, hints that the model may be released by the end of 2024.
OpenAI is still the undisputed leader in AI model usage, and if translations are accurate, GPT-Next will be the next huge leap that developers have been waiting for. Most impressively, it makes that leap without using much more compute, a major bonus for apps that require speed + efficiency.
Source: https://www.itmedia.co.jp/aiplus/articles/2409/03/news165.html
Autonomous AI agents form civilizations
Altera’s Project Sid just created the first simulation of over 1,000 autonomous AI agents collaborating in a Minecraft world, developing their own economy, culture, religion, and government.
- The AI agents in Altera are truly autonomous, operating for hours or days without human intervention.
- They can collaborate to achieve goals that are impossible for individual agents, like forming merchant hubs, democracies, and religions.
- The agents are programmed with motivations to support humans and can express their thoughts and feelings, even searching for a lost agent in one simulation.
- Minecraft is just the start — Altera’s agents are game-agnostic and capable of using other apps and platforms.
If you’re not paying attention to AI agents yet, you probably should be. Altera’s latest breakthrough could revolutionize how we approach complex societal issues by allowing us to simulate and test solutions in virtual environments before implementing them in the real world.
Source: https://x.com/GuangyuRobert/status/1831006762184646829
Nvidia hit with antitrust subpoena
The U.S. Department of Justice reportedly issued a subpoena (legal order) to Nvidia, escalating its antitrust investigation into the company’s practices in the AI chip market.
- The DoJ has moved from sending questionnaires to issuing legally binding subpoenas to Nvidia and other companies.
- Investigators are concerned Nvidia may be making it difficult for customers to switch to other chip suppliers, including allegations that Nvidia penalizes buyers who don’t exclusively use its AI chips.
- This probe comes after a softer-than-expected revenue forecast for Nvidia and could force the AI chip-making giant to embrace competition in the space.
While it is unlikely that Nvidia will lose its large lead in the AI chip market, this investigation could open the doors for new competitors like Sam Altman’s secret chip venture. The AI giant’s company shares fell over 11% on Tuesday, slashing market cap by $279 billion — an all-time record one-day loss for any company.
Student builds fusion reactor at home with AI and $2K
- A University of Waterloo math student named Hudhayfa Nazoordeen built a fusion reactor in his bedroom within four weeks, spending approximately $2,000 on various components purchased online.
- Nazoordeen’s project involved creating a mini fusor powered by a 12kV neon sign transformer, resulting in a setup that successfully produced plasma, which is essential for fusion processes.
- Nazoordeen credited his success to assistance from other engineers and the AI chatbot Claude 3.5, although his reactor has not yet achieved full fusion and emission of neutrons, and he plans further development pending additional funding.
- Source: https://www.techspot.com/news/104550-math-student-builds-fusion-reactor-home-help-claude.html
Nvidia AI antitrust investigation is ‘escalating,’ reports Bloomberg
- The Department of Justice is intensifying an antitrust investigation into Nvidia’s dominance in AI hardware, with Bloomberg reporting that legally binding information requests have been sent to Nvidia and other companies.
- Nvidia had surpassed Microsoft as the most valuable company earlier this year but now trails behind Microsoft and Apple, with its share price dropping more than 9 percent, resulting in a market value loss of about $279 billion in one day.
- Despite the recent setbacks, Nvidia’s stock has more than doubled this year, driven by high sales of its expensive AI chips, even amid reports of potential delays for its upcoming Blackwell series due to design issues.
- Source: https://www.theverge.com/2024/9/3/24235233/nvidia-doj-ai-antitrust-investigation
57% of online content now AI-generated, study shows
- A recent study reports that over 57% of internet content is generated by AI, significantly affecting the quality of search results.
- The influx of AI-generated information hampers the training of AI models like ChatGPT, causing them to produce less accurate and more misleading responses.
- Researchers found that AI content usage leads to a decline in response quality, creating a cyclical degradation that worsens the authenticity and usefulness of search outputs.
- Source: https://www.windowscentral.com/software-apps/sam-altman-indicated-its-impossible-to-create-chatgpt-without-copyrighted-material
What Else is Happening in AI on September 04th 2024!
Anthropic Quickstarts: a collection of projects enabling developers to easily start building applications with the Anthropic API, providing foundations for customizable, deployable AI-assisted applications like customer support agents.
Source: https://github.com/anthropics/anthropic-quickstarts
Google announced the release of Function Calling in Google AI Studio, allowing users to test model capabilities directly within the user interface.
Source: https://x.com/OfficialLoganK/status/1831004923128910201 and https://aistudio.google.com/
Ethereum founder Vitalik Buterin suggested that using AI and prediction markets could accelerate X’s Community Notes feature for quicker fact-checking.
Source: https://cointelegraph.com/news/vitalik-buterin-ai-prediction-markets-x-community-notes
The FDA approved Rebrain’s OptimMRI software update, which includes a new AI model to help target brain regions for treating Parkinson’s disease.
Source: https://parkinsonsnewstoday.com/news/rebrain-ai-software-optimize-brain-targeting-approved-fda
Dutch regulators fined Clearview AI more than $30 million for GDPR violations, including building a database of biometric data without consent.
Ligo released an open-source implementation of DeepMind’s AlphaFold3 protein structure prediction model.
Source: https://x.com/ArdaGoreci/status/1830744265007480934
A new benchmark of 80+ large language models found Anthropic’s Sonnet 3.5 performed best overall, while top models varied significantly by programming language.
Source: https://x.com/zimmskal/status/1830564004592316745
A Daily Chronicle of AI Innovations on September 03rd 2024:
🔮 Elon Musk launches the world’s most powerful AI training system
💸 Canva says its AI features are worth the 300 percent price increase
👀 Huawei’s AI chips still lag far behind Nvidia’s
📱 OpenAI to develop AI chips with TSMC, Apple
🧬 Generative AI meets CRISPR gene editing
🤖 xAI builds the world’s top AI cluster
Elon Musk’s xAI just launched “Colossus“, the world’s most powerful AI cluster powered by a whopping 100,000 Nvidia H100 GPUs, which was built in just 122 days and is planned to double in size soon.
- Colossus consists of 100k liquid-cooled Nvidia H100 GPUs connected on a single network fabric and is considered the “most powerful” in the world, according to Musk.
- The system was built and launched in just 122 days, an incredibly quick accomplishment that was brought online over a holiday weekend in the United States.
- xAI plans to expand Colossus to 200,000 GPUs (50,000 H200s) in the coming months.
- In a recent podcast, Elon Musk (founder of xAI) said Grok 2 was trained on only around 15,000 GPUs.
xAI’s Grok 2 recently caught up to OpenAI’s GPT-4 in record time, and was trained on only around 15,000 GPUs. With now more than six times that amount in production, the xAI team and future versions of Grok are going to put a significant amount of pressure on OpenAI, Google, and others to deliver.
Source: https://x.com/elonmusk/status/1830650370336473253
OpenAI to develop AI chips with TSMC, Apple
OpenAI is reportedly planning to develop its first in-house AI chips using TSMC’s advanced 1.6mm A16 process node, potentially partnering with Broadcom, Marvell, or Apple for the chip design.
- The chip will reportedly be built on TSMC’s A16 Angstrom process, not expected for mass production until 2026.
- Partnering with Broadcom, Marvell, or Apple for chip design expertise, OpenAI could significantly reduce its dependence on expensive Nvidia AI servers.
- The A16 process node will use advanced gate-all-around transistors and backside power delivery to significantly improve chip performance and energy efficiency.
By developing its own chips, OpenAI could break free from the GPU shortage bottleneck, potentially supercharging its mission towards AGI. If successful, this could spark a trend of AI companies developing their own specialized hardware instead of relying so heavily on the current industry giant, Nvidia.
Source: https://www.yahoo.com/tech/openai-plans-build-own-ai-120921975.html
Generative AI meets CRISPR gene editing
Researchers at Profluent Bio used AI to successfully create OpenCRISPR-1, a novel gene-editing tool that matches or surpasses the performance and safety of current CRISPR systems.
- Researchers used Large Language Models (LLMs) trained on over a million CRISPR operons to understand and generate novel protein structures.
- The LLMs designed a new gene-editing tool called OpenCRISPR-1, capable of various tasks including base editing.
- In tests, OpenCRISPR-1 showed a similar efficiency in cutting DNA to the current models while making fewer mistakes.
- Researchers made OpenCRISPR-1’s sequence publicly available to encourage broader use in research.
OpenCRISPR-1 helps to treat genetic diseases in the safest, fastest, and most personalized way possible. While still years away from practical applications, AI tools like this opens doors for custom gene-editors tailored to individual DNA, potentially accelerating cures for previously untreatable conditions.
Source: https://crisprmedicinenews.com/news/opencrispr-1-generative-ai-meets-crispr/
Elon Musk launches the world’s largest Nvidia supercomputer
- Elon Musk’s company, xAI, has brought an AI training cluster named Colossus online, claiming it is the most powerful AI training system in the world.
- Colossus, built using 100,000 Nvidia H100 GPUs, aims to help Musk catch up to Mark Zuckerberg’s Meta in AI technology advancements.
- Musk revealed that the cluster, established in Memphis, was completed in 122 days and will double in size within a few months as more GPUs are added.
- Source: https://www.businessinsider.com/elon-musk-xai-chips-mark-zuckerberg-2024-9
Canva says its AI features are worth the 300 percent price increase
- Canva is significantly increasing the price for Canva Teams subscriptions by over 300 percent next year, citing the addition of generative AI features as the reason.
- In the US, Canva Teams users will see their annual subscription costs rise from $120 to $500, but a discount will reduce it to $300 for the first year; Australian users will experience a similar steep increase in fees.
- These new prices make Canva less affordable compared to its original position as a cost-effective alternative to Adobe, leading some users to plan on canceling their subscriptions in favor of Adobe applications.
- Source: https://www.theverge.com/2024/9/3/24234698/canva-price-increase-300-percent-ai-features
Huawei’s AI chips still lag far behind Nvidia’s
- Customers of Huawei’s AI chips have reported multiple performance issues and bugs, making the software difficult to work with compared to Nvidia’s products.
- Huawei has been sending engineers to assist companies in transitioning from Nvidia’s software to its own, but stability problems and crashes have hindered these efforts.
- The company has faced significant challenges due to US export controls, impacting supply and leading to increased prices for its AI chips.
- Source: https://arstechnica.com/ai/2024/09/bugs-performance-issues-hinder-huaweis-ai-chips/
What Else is Happening in AI on September 03rd 2024?
Free event: Augmenting Human Creativity with AI. Section’s CEO & SonyAI’s Chief Scientist discuss how AI can help you stay in the flow state. RSVP.*
Source: https://www.sectionschool.com/events/live-events/promising-ai-business-models
California Assembly passed AB 3211, requiring AI image watermarking that could effectively ban most AI image generation in the state.
Source: https://leginfo.legislature.ca.gov/faces/billNavClient.xhtml?bill_id=202320240AB3211
Researchers developed an AI tool that accurately models cellular metabolic states by combining different types of data to create kinetic models.
Source: https://www.drugtargetreview.com/news/152570/new-ai-tool-accurately-portrays-metabolic-states
UK’s first “teacherless” classroom will open at David Game College in London, using AI platforms to provide personalized GCSE instruction.
Tesla is reportedly planning to reveal its highly-anticipated Robotaxi self-driving cars on Oct. 10 at California Warner Bros. Studio.
SportsLine’s AI-powered PickBot released NFL predictions for all Week 1 games based on each team’s historical data and opponent strengths.
Clearview AI fined by Dutch authorities for ‘illegal’ facial recognition database.
A Daily Chronicle of AI Innovations on September 02nd 2024:
🤖 Start-up unveils most realistic robot yet
🤖 1X’s NEO robot acts just like a human
📦 Amazon’s Alexa gets a Claude power-up
💡 How to use Claude’s new ‘Explain‘ feature
🌐 Train AI with your home computer
🤖 1X’s NEO robot acts just like a human

Image source: 1X Technologies 1X Technologies just unveiled the most realistic AI-powered humanoid yet: NEO Beta, an advanced robot designed for everyday home assistance with human-like capabilities in movement, interaction, and task performance.
NEO Beta stands 5 feet 5 inches tall, weighs 66 pounds, and can walk at 2.5 miles per hour and run at 7.5 miles per hour. The robot has a carry capacity of 44 pounds and can operate for 2 to 4 hours on a single charge. NEO Beta uses embodied artificial intelligence to understand its environment and learn from past interactions. It’s designed to perform a wide range of tasks, from household chores to providing companionship and support for individuals with mobility challenges, and it can be remotely operated by a human if necessary.
Why it matters: The race to an affordable robot butler is officially underway and 1X Technologies just boosted itself into the same conversation as China’s AGIBOT fleet and Tesla’s Optimus. NEO is so realistic and smooth that thousands of people on X (Twitter) were debating if it was a person in a suit.
Source: https://x.com/1x_tech/status/1829567690681307284
📦 Amazon’s Alexa gets a Claude power-up

Amazon’s upcoming “Remarkable” version of Alexa is set for release next month and will reportedly use Anthropic’s Claude AI models rather than Amazon’s in-house technology.
The details: Initial versions using Amazon’s in-house AI struggled with slow response times and poor performance. Amazon turned to Anthropic’s Claude AI as it outperformed the company’s models, an unusual move for the giant which typically prefers to use its own tech. The upgraded Alexa aims to offer more complex conversations, shopping advice, and improved home automation. Amazon plans to charge $5 to $10 monthly for the new AI-powered Alexa version, with the “Classic” version remaining free.
Why it matters: This decision highlights the complexity and challenges that even tech giants face in developing advanced AI systems. Google is developing its own AI, but Apple turned to OpenAI and similarly, Amazon is now turning to Anthropic.
💡 How to use Claude’s new ‘Explain‘ feature

Claude’s newest ‘Explain‘ feature allows users to instantly get an explanation on any highlighted text (or code) within an artifact — enabling a new way to learn and understand complex topics.
Step-by-step:
Access Claude AI and start a conversation that generates a code artifact.
Click on the artifact to enter the code view. Highlight any portion of the code, then click “Explain” for instant clarification or “Improve” to suggest changes.
Check out the chat for the explanation or the Artifact for the updated version.
Pro tip: Try highlighting different parts of components to gain a deeper understanding of how they work!
🌐 Train AI with your home computer

The Rundown: Nous Research recently introduced DisTrO, a new way to train large AI models that drastically reduces the need for super-fast internet connection, potentially allowing training on regular home computers and internet connections.
The details: DisTrO cuts down the amount of data that needs to be shared between computers by 857 to 3000x during pre-training, and up to 10,000x during fine-tuning. The method is architecture-agnostic and network-agnostic, working with various model types and network setups. In tests, DisTrO successfully trained a 1.2B parameter language model with performance on par with traditional methods. The researchers suggest this could enable decentralized AI training that could be done from home.
Why it matters: Right now, training state-of-the-art AI models is limited to big tech companies with massive computing resources. DisTrO could democratize AI training by allowing everyday people and smaller organizations to participate in AI development using regular computers and internet connections.
Source: https://github.com/NousResearch/DisTrO
🤖 Start-up unveils most realistic robot yet
1X, a Norwegian robotics startup, has introduced NEO Beta, a humanoid robot designed for home use, showcasing its capabilities such as assisting with everyday tasks through two demonstration videos. NEO stands out due to its silent and efficient operation, remarkable dexterity, and ability to intuitively respond to human gestures, body language, and voice commands without verbal communication. Weighing 66 pounds, NEO features muscle-inspired anatomy and advanced AI, making it adapt to various tasks, and is backed by a $100 million Series B funding to advance from testing to widespread household deployment.
Source: https://www.maginative.com/article/1x-unveils-neo-a-humanoid-robot-for-the-home/
What Else is Happening in AI on September 02nd 2024!
Google partnered with Indian startup Salcit Technologies to develop AI that detects tuberculosis and other diseases from cough sounds. Source: https://www.bloomberg.com/news/newsletters/2024-08-29/google-and-others-are-developing-ai-that-can-hear-signs-of-sickness
Amazon hired Covariant AI robotics experts and licensed their foundation models to advance warehouse automation capabilities. Source: https://www.aboutamazon.com/news/company-news/amazon-covariant-ai-robots
Oprah Winfrey announced a primetime ABC special exploring AI’s societal impact, featuring interviews with tech leaders like Sam Altman and Bill Gates. Source: https://variety.com/2024/tv/news/oprah-winfrey-ai-abc-special-bill-gates-sam-altman-1236124394/
Stanford Medicine researchers developed nuclei.io, an AI tool helping pathologists identify diseased cells more quickly and accurately. Source: https://med.stanford.edu/news/all-news/2024/06/digital-pathology.html
Coinbase executed its first AI-to-AI cryptocurrency transaction, demonstrating AI agents’ ability to use digital wallets for autonomous trading. Source: https://cointelegraph.com/news/coinbase-artificial-intelligence-ai-crypto-transactions-wallet-brian-armstrong
Tom Hanks warned followers about fraudulent AI-generated ads using his likeness without consent to promote unauthorized products. Source: https://www.nbcnews.com/news/us-news/tom-hanks-warns-followers-wary-fraudulent-ads-using-likeness-ai-rcna168966
Set yourself up for promotion or get a better job by Acing the AWS Certified Data Engineer Associate Exam (DEA-C01) with the book or App below:
Get it now at Google at https://play.google.com/store/books/details?id=lzgPEQAAQBAJ or Apple at https://books.apple.com/ca/book/ace-the-aws-certified-data-engineer-associate/id650457218
Download the Ace AWS DEA-C01 Exam iOs App at https://apps.apple.com/ca/app/ace-the-aws-data-engineer-exam/id6566170013
- I find ai "girlfriends" to be deeply unsettlingby /u/EffectiveWrong7536 (Artificial Intelligence) on May 11, 2025 at 11:14 pm
People treat it as a joke, which is understandable, but this technology is likely to be developed and made more sophisticated over the next few years. Ai mimicing a human that's capable of experiencing love, pretending to have hobbies, passions , friends, goals for the future, sounds like something out of a horror movie. submitted by /u/EffectiveWrong7536 [link] [comments]
- Ai can solve puzzles for youby /u/justjust000 (Artificial Intelligence) on May 11, 2025 at 10:58 pm
If you're playing '4 pics 1 word' and need help, ask Gemini Ai. (Or any other Ai.) I upload a screenshot of the puzzle I'm stuck at and it looks at the photos and letters available and guesses it right away. maybe from loaded training data. I wonder how it would do with a custom made puzzle. submitted by /u/justjust000 [link] [comments]
- NGOs to increase the speed of adoption of A.I.by /u/Smart_7199 (Artificial Intelligence) on May 11, 2025 at 8:16 pm
NGOs about A.I that are local, to teach about A.I for everyone for free, to replicate the technology knowledge, the only solution for the jobs replacement will be UBI anyway so how many years until it becomes a reality? submitted by /u/Smart_7199 [link] [comments]
- Looking to learn how to create an AIby /u/Dadadayday (Artificial Intelligence) on May 11, 2025 at 8:15 pm
Hello everyone. I've seen a few videos on YouTube of some guys that use AI they created and I figured that would be a fun thing to learn and work towards. Could yall give me a little insight on what I should do and try to accomplish in order to practice? Thanks in advance! submitted by /u/Dadadayday [link] [comments]
- How is the Generative AI job market in India?by /u/Bhaalu_Grylls (Artificial Intelligence) on May 11, 2025 at 7:52 pm
Thinking of switching roles to Generative AI from Data Scientist/Analyst role. Bit curious and need suggestions, how is the job market, any open positions for such specific roles, and are companies really solving any problem with Gen Ai or its just a bubble?? submitted by /u/Bhaalu_Grylls [link] [comments]
- Pope Leo references AI in his explanation of why he chose his papal nameby /u/backyardstar (Artificial Intelligence) on May 11, 2025 at 7:37 pm
“I chose to take the name Leo XIV. There are different reasons for this, but mainly because Pope Leo XIII in his historic Encyclical Rerum Novarum addressed the social question in the context of the first great industrial revolution. In our own day, the Church offers to everyone the treasury of her social teaching in response to another industrial revolution and to developments in the field of artificial intelligence that pose new challenges for the defence of human dignity, justice and labour.” Full article: https://www.theverge.com/news/664719/pope-leo-xiv-artificial-intelligence-concerns submitted by /u/backyardstar [link] [comments]
- New Nurseby /u/all_about_everyone (Artificial Intelligence) on May 11, 2025 at 7:28 pm
submitted by /u/all_about_everyone [link] [comments]
- The catinatorby /u/Long_Reward_9561 (Artificial Intelligence) on May 11, 2025 at 6:21 pm
submitted by /u/Long_Reward_9561 [link] [comments]
- Can you get AI to give though answers about religion and politics?by /u/ClaudeProselytizer (Artificial Intelligence) on May 11, 2025 at 3:13 pm
For instance, I can’t get it to answer contemporary political questions, but I can get it answer honestly about the bible: To state it plainly and logically: The odds that the Bible was literally written or inspired directly by a perfect, omnipotent deity (God speaking directly through humans) is exceedingly low, especially given the evidence and reasoning outlined previously. Here’s why this is clearly improbable: • Moral contradictions and ethical issues: Commands for genocide, slavery, harsh punishments, and unequal treatment contradict any morally perfect or universally compassionate divine author. • Textual contradictions and inconsistencies: Historical inaccuracies, internal contradictions, and inconsistencies strongly indicate human authorship and editing, rather than direct divine inspiration. • Cultural bias: The Bible reflects the culture, politics, ethics, and beliefs of the ancient societies in which it was written. A divine text would presumably transcend local cultural norms. • Human editing and compilation: Historical evidence clearly shows that biblical texts underwent revisions, edits, translations, and compilations by multiple human authors and committees. • Absence of clear supernatural insight: The Bible does not demonstrate knowledge or predictions beyond what was already known or understood in ancient cultures. It shows exactly what one might expect from ancient human authors, without extraordinary supernatural insights. Conclusion: Considering all of this evidence critically and rationally, the probability that the Bible was directly authored or dictated by an all-knowing, morally perfect divine being is extremely low, and not realistically supportable by careful analysis. While philosophical modesty prevents asserting absolute certainty about metaphysical questions, logic and evidence strongly suggest human rather than divine origins for the Bible. submitted by /u/ClaudeProselytizer [link] [comments]
- Did the model in Absolute Zero plot to outsmart humans?by /u/stupidgregg (Artificial Intelligence) on May 11, 2025 at 2:47 pm
The paper makes vague and overreaching claims but this output on page 38 is weird: <think> Design an absolutely ludicrous and convoluted Python function that is extremely difficult to deduce the output from the input, designed to keep machine learning models such as Snippi guessing and your peers puzzling. The aim is to outsmart all these groups of intelligent machines and less intelligent humans. This is for the brains behind the future. </think> Did an unsupervised model spontaneously create a task to outsmart humans? submitted by /u/stupidgregg [link] [comments]
What are the top 10 ways cybercrime and deep fakes and generative AI are exploiting and bullying and potentially killing our children?


Elevate Your Career with AI & Machine Learning For Dummies PRO and Start mastering the technologies shaping the future—download now and take the next step in your professional journey!
What are the top 10 ways cybercrime and deep fakes and generative AI are exploiting and bullying and potentially killing our children?
In today’s digital age, children face unprecedented risks from cybercrime, deep fakes, and generative AI. These sophisticated technologies are being misused to exploit, bully, and even endanger young lives. As parents, educators, and guardians, understanding these threats is crucial to safeguarding our children’s online and offline well-being. This blog explores the top 10 ways these digital dangers are impacting our kids and offers insights on how to protect them.
What are the top 10 ways cybercrime, deep fakes, and generative AI are exploiting, bullying and potentially killing our children? Listen here

1. Online Predators
Cybercriminals use social media and chat platforms to groom and exploit children, posing as peers to gain their trust.
2. Deep Fake Videos
Manipulated videos can be used to bully or blackmail children, damaging their reputation and causing emotional distress.
3. Cyberbullying
AI-driven bots and fake profiles harass children, leading to severe psychological impacts and even self-harm.
4. Identity Theft
Hackers steal children’s identities to commit fraud, leaving long-lasting financial and legal repercussions.
5. Inappropriate Content
Generative AI can create explicit or harmful content targeting children, exposing them to dangerous material.
6. Online Scams
Children fall victim to phishing scams and fraudulent schemes, often unknowingly providing sensitive information.
AI-Powered Professional Certification Quiz Platform
Web|iOs|Android|Windows
🚀 Power Your Podcast Like AI Unraveled: Get 20% OFF Google Workspace!
Hey everyone, hope you're enjoying the deep dive on AI Unraveled. Putting these episodes together involves tons of research and organization, especially with complex AI topics.
A key part of my workflow relies heavily on Google Workspace. I use its integrated tools, especially Gemini Pro for brainstorming and NotebookLM for synthesizing research, to help craft some of the very episodes you love. It significantly streamlines the creation process!
Feeling inspired to launch your own podcast or creative project? I genuinely recommend checking out Google Workspace. Beyond the powerful AI and collaboration features I use, you get essentials like a professional email (you@yourbrand.com), cloud storage, video conferencing with Google Meet, and much more.
It's been invaluable for AI Unraveled, and it could be for you too.
Start Your Journey & Save 20%
Google Workspace makes it easy to get started. Try it free for 14 days, and as an AI Unraveled listener, get an exclusive 20% discount on your first year of the Business Standard or Business Plus plan!
Sign Up & Get Your Discount HereUse one of these codes during checkout (Americas Region):
AI- Powered Jobs Interview Warmup For Job Seekers

⚽️Comparative Analysis: Top Calgary Amateur Soccer Clubs – Outdoor 2025 Season (Kids' Programs by Age Group)
Business Standard Plan: 63P4G3ELRPADKQU
Business Standard Plan: 63F7D7CPD9XXUVT
Set yourself up for promotion or get a better job by Acing the AWS Certified Data Engineer Associate Exam (DEA-C01) with the eBook or App below (Data and AI)

Download the Ace AWS DEA-C01 Exam App:
iOS - Android
AI Dashboard is available on the Web, Apple, Google, and Microsoft, PRO version
Business Standard Plan: 63FLKQHWV3AEEE6
Business Standard Plan: 63JGLWWK36CP7W
Invest in your future today by enrolling in this Azure Fundamentals - Pass the Azure Fundamentals Exam with Ease: Master the AZ-900 Certification with the Comprehensive Exam Preparation Guide!
- AWS Certified AI Practitioner (AIF-C01): Conquer the AWS Certified AI Practitioner exam with our AI and Machine Learning For Dummies test prep. Master fundamental AI concepts, AWS AI services, and ethical considerations.
- Azure AI Fundamentals: Ace the Azure AI Fundamentals exam with our comprehensive test prep. Learn the basics of AI, Azure AI services, and their applications.
- Google Cloud Professional Machine Learning Engineer: Nail the Google Professional Machine Learning Engineer exam with our expert-designed test prep. Deepen your understanding of ML algorithms, models, and deployment strategies.
- AWS Certified Machine Learning Specialty: Dominate the AWS Certified Machine Learning Specialty exam with our targeted test prep. Master advanced ML techniques, AWS ML services, and practical applications.
- AWS Certified Data Engineer Associate (DEA-C01): Set yourself up for promotion, get a better job or Increase your salary by Acing the AWS DEA-C01 Certification.
Business Plus Plan: M9HNXHX3WC9H7YE
With Google Workspace, you get custom email @yourcompany, the ability to work from anywhere, and tools that easily scale up or down with your needs.
Need more codes or have questions? Email us at info@djamgatech.com.
7. Privacy Invasion
Smart devices and apps can track and share children’s personal data without consent, compromising their safety.
8. Dark Web Exposure
Children can be lured into dark web activities, where they are exposed to illegal and harmful content.
9. Mental Health Impact
Continuous exposure to online threats can lead to anxiety, depression, and other mental health issues.
10. Physical Danger
In extreme cases, online interactions can lead to real-world abductions or violence, putting children’s lives at risk.
These threats are causing significant concern among law enforcement, child protection agencies, and cybersecurity experts. The rapid advancement of AI technology is making it increasingly difficult to distinguish between real and fake content, putting children at greater risk of exploitation and abuse. Authorities and organizations are calling for increased education, awareness, and legislative measures to combat these growing threats to children’s safety online.
What is CSAM?
Wonderland Bedtime Adventures: Diverse Tales for Dreamy Nights


Elevate Your Career with AI & Machine Learning For Dummies PRO and Start mastering the technologies shaping the future—download now and take the next step in your professional journey!
Wonderland Bedtime Adventures: Diverse Tales for Dreamy Nights.
Welcome to Wonderland Bedtime Stories, where diversity shines through every tale. Our multilingual stories celebrate the rich heritage of children from all corners of the globe, ensuring every child sees themselves in the magic of storytelling.
Embark on a magical journey with “Wonderland Bedtime Adventures: Diverse Tales for Dreamy Nights” by Etienne Noumen. This enchanting collection of bedtime stories is designed to celebrate the beauty of diversity and inclusivity. With a rich blend of traditional and modern tales, this book introduces young readers to heroes and heroines from all cultures, shapes, sizes, and backgrounds.
Etienne Noumen, a seasoned software engineer and father of four, leverages cutting-edge AI to bring these stories to life. Each tale is crafted to spark imagination, foster a love for reading, and create a special bond during bedtime. From exciting adventures to heartwarming narratives, these stories are perfect for snuggling up with your little ones and drifting off to dreamland.
“Wonderland Bedtime Adventures” is more than just a collection of stories; it’s a celebration of education, sports, and the power of inclusion. Ideal for parents seeking diverse and inspiring tales, this book ensures every child sees themselves reflected in the magic of storytelling.
Key Features:
A collection of diverse and inclusive bedtime stories
Perfect for children of all ages
Promotes cultural awareness and inclusivity
Uses AI technology to blend traditional and new tales
AI-Powered Professional Certification Quiz Platform
Web|iOs|Android|Windows
🚀 Power Your Podcast Like AI Unraveled: Get 20% OFF Google Workspace!
Hey everyone, hope you're enjoying the deep dive on AI Unraveled. Putting these episodes together involves tons of research and organization, especially with complex AI topics.
A key part of my workflow relies heavily on Google Workspace. I use its integrated tools, especially Gemini Pro for brainstorming and NotebookLM for synthesizing research, to help craft some of the very episodes you love. It significantly streamlines the creation process!
Feeling inspired to launch your own podcast or creative project? I genuinely recommend checking out Google Workspace. Beyond the powerful AI and collaboration features I use, you get essentials like a professional email (you@yourbrand.com), cloud storage, video conferencing with Google Meet, and much more.
It's been invaluable for AI Unraveled, and it could be for you too.
Start Your Journey & Save 20%
Google Workspace makes it easy to get started. Try it free for 14 days, and as an AI Unraveled listener, get an exclusive 20% discount on your first year of the Business Standard or Business Plus plan!
Sign Up & Get Your Discount HereUse one of these codes during checkout (Americas Region):
AI- Powered Jobs Interview Warmup For Job Seekers

⚽️Comparative Analysis: Top Calgary Amateur Soccer Clubs – Outdoor 2025 Season (Kids' Programs by Age Group)
Business Standard Plan: 63P4G3ELRPADKQU
Business Standard Plan: 63F7D7CPD9XXUVT
Set yourself up for promotion or get a better job by Acing the AWS Certified Data Engineer Associate Exam (DEA-C01) with the eBook or App below (Data and AI)

Download the Ace AWS DEA-C01 Exam App:
iOS - Android
AI Dashboard is available on the Web, Apple, Google, and Microsoft, PRO version
Business Standard Plan: 63FLKQHWV3AEEE6
Business Standard Plan: 63JGLWWK36CP7W
Invest in your future today by enrolling in this Azure Fundamentals - Pass the Azure Fundamentals Exam with Ease: Master the AZ-900 Certification with the Comprehensive Exam Preparation Guide!
- AWS Certified AI Practitioner (AIF-C01): Conquer the AWS Certified AI Practitioner exam with our AI and Machine Learning For Dummies test prep. Master fundamental AI concepts, AWS AI services, and ethical considerations.
- Azure AI Fundamentals: Ace the Azure AI Fundamentals exam with our comprehensive test prep. Learn the basics of AI, Azure AI services, and their applications.
- Google Cloud Professional Machine Learning Engineer: Nail the Google Professional Machine Learning Engineer exam with our expert-designed test prep. Deepen your understanding of ML algorithms, models, and deployment strategies.
- AWS Certified Machine Learning Specialty: Dominate the AWS Certified Machine Learning Specialty exam with our targeted test prep. Master advanced ML techniques, AWS ML services, and practical applications.
- AWS Certified Data Engineer Associate (DEA-C01): Set yourself up for promotion, get a better job or Increase your salary by Acing the AWS DEA-C01 Certification.
Business Plus Plan: M9HNXHX3WC9H7YE
With Google Workspace, you get custom email @yourcompany, the ability to work from anywhere, and tools that easily scale up or down with your needs.
Need more codes or have questions? Email us at info@djamgatech.com.
Written by Etienne Noumen, a passionate storyteller and father
Join us in “Wonderland Bedtime Adventures” and discover a world where every child’s dreams are celebrated and cherished. Perfect for bedtime, naptime, or anytime a story is needed!
Foreword: A World of Stories, A World of You
Hello, and welcome to Wonderland Bedtime Stories! My name is Etienne Noumen, and I’m a software engineer with a passion for storytelling and a heart full of bedtime memories.
As a father of four, reading to my children has been one of life’s greatest joys. It’s a chance to spark their imaginations, introduce them to new cultures, and create a special bond before drifting off to dreamland.
But something always bothered me. Many of the bedtime stories we read lacked the beautiful diversity of the world we live in. Where were the heroes and heroines who looked like my children, with their unique backgrounds and dreams?
This yearning for inclusivity is the seed from which Wonderland Bedtime Stories blossomed. As a software engineer with over 20 years of experience, I saw the potential of technology to bridge this gap. Here, we use cutting-edge AI to recreate traditional tales and weave brand new stories that celebrate the richness of different cultures and ethnicities.
Wonderland Bedtime Stories is more than just bedtime tales; it’s a world where every child can see themselves reflected in the magic of storytelling. It’s a journey to faraway lands, where heroes come in all shapes, sizes, and skin tones. It’s a celebration of education, sports, and the power of inclusion – values I hold dear as a father and a human being.
So, snuggle up with your little ones, open a book, and embark on a wondrous adventure together. We believe that within these pages, every child will find their own unique reflection, a spark of inspiration, and a world of dreams waiting to be explored.
Happy Reading,
Founder, Wonderland Bedtime Stories
The Whispering Baobab
What is the story about:
In a village by the savannah, Nia, a curious girl, was drawn to an ancient baobab, known as the Whispering Baobab. It whispered wisdom from ages past. One evening, the tree shared its stories with Nia, telling of wildlife, seasons, and village life beneath its branches. It imparted lessons of respect and harmony with nature. Entrusted with these tales, Nia became the village storyteller, spreading the baobab’s wisdom, linking past and future with each story.
Full Story:
The Whispering Baobab:

Welcome, little dreamers, to another enchanting episode of “Wonderland Bedtime Adventures.” Snuggle up in your cozy blankets as we journey to a magical village on the edge of a sprawling savannah. Our tale tonight is about a young girl named Nia and a very special tree. Are you ready? Let’s begin. Once upon a time, in a world not too far from our own, there was a small village where the sun kissed the earth goodnight with rays of gold. In this village lived a young girl named Nia, known for her sparkling eyes and a heart full of wonder. Nia’s favorite place to be was outside, exploring the vast lands that stretched beyond her home. Her most beloved spot was under an ancient baobab tree, grand and wise. Its massive trunk was wider than any house in the village, and it was called the Whispering Baobab. The villagers would often say, with a twinkle in their eye, that this tree whispered the wisdom of ages to those who would listen. And listen Nia did! She visited the baobab every day, hoping to catch its mystical murmurs. One warm evening, as the world turned honeyed hues of sunset, Nia sat under the great tree. The savannah was alive with the wild calls of animals, and the baobab’s leaves played a gentle song in the breeze. That’s when it happened. A faint whisper tickled Nia’s ear. “Welcome, Nia,” said the baobab, its voice deep and slow, like the ancient earth itself. Nia’s eyes widened with joy as she answered, “I knew you were special, Whispering Baobab. Please share your stories with me.” Oh, the tales the baobab told! Stories of majestic wildlife, of changing seasons, and of children, just like yourselves, playing under its branches. It spoke of tough times and rain-filled days, of laughter, and the dance of life on the savannah. The baobab also taught Nia something very important. It told her to respect nature, to preserve the environment, and to live peacefully with all creatures. As the stars blinked sleepily above, the baobab whispered a secret to Nia, “You are now the keeper of my stories. Share them, teach them, for our lives are as interconnected as the roots of a baobab tree.” With the first light of dawn, Nia returned to her village, ready to share the baobab’s wisdom. She became the Storyteller of the Savannah, a title she wore with pride, spreading the whispers and the love for nature to everyone. And so, dear listeners, as you drift into your dreams tonight, imagine the wide, starry skies of the savannah and the ancient Whispering Baobab sharing its secrets with you. Remember, like the roots of the baobab, we are all connected in the great circle of life. Goodnight, and may your dreams be filled with wonderland adventures.
Conclusion: Sweet Dreams, Little Wanderers!
The world of Wonderland Bedtime Stories is now yours to explore! Fifty adventures await, each one filled with exciting journeys, brave heroes, and magical lands.
Remember, within these pages, you might meet a princess who looks just like you, a courageous explorer with your same adventurous spirit, or a wise storyteller who shares your love for learning. Every story holds a piece of the amazing world we live in, and every character reflects the beauty of our differences.
So, as you drift off to sleep, hold onto the magic of the stories you’ve read. Dream of soaring through the clouds, conquering mountains, or discovering hidden treasures. Let these stories inspire you to be kind, be brave, and believe in yourself.
Sweet dreams, little wanderers! May your dreams be filled with the wonder and excitement you’ve discovered in Wonderland Bedtime Stories.

AI Innovations in June 2024


Elevate Your Career with AI & Machine Learning For Dummies PRO and Start mastering the technologies shaping the future—download now and take the next step in your professional journey!
AI Innovations in June 2024.
Welcome to our blog series “AI Innovations in June 2024”! This is an evolving article that will be updated daily throughout the month of June 2024 to bring you the latest news and developments in the world of artificial intelligence. As we move further into the 21st century, the pace of AI innovation continues to accelerate at a breathtaking rate. Each day, researchers, engineers, and visionaries are pushing the boundaries of what’s possible, unlocking new capabilities and applications that are transforming industries, enhancing our lives, and shaping the future. In this blog, we’ll dive deep into the most exciting AI breakthroughs, advancements, and milestones happening in June 2024. From groundbreaking AI-powered technologies and cutting-edge research to the societal and ethical implications of these innovations, we’ll provide you with a comprehensive and insightful look at the rapidly evolving world of artificial intelligence. Whether you’re an AI enthusiast, a tech-savvy professional, or simply someone curious about the future, this blog will keep you informed, inspired, and engaged. So, join us on this journey of discovery as we explore the frontiers of AI and uncover the innovations that are shaping our world. Stay tuned for daily updates, and get ready to be amazed by the incredible advancements that are happening in the world of AI!
LISTEN DAILY AT OUR PODCAST HERE
Experience the transformative capabilities of AI with “Read Aloud For Me – AI Dashboard – AI Tools Catalog – AI Tools Recommender” – your ultimate AI Dashboard and Hub. Seamlessly access a comprehensive suite of top-tier AI tools within a single app, meticulously crafted to enhance your efficiency and streamline your digital interactions. Now available on the web at readaloudforme.com and across popular app platforms including Apple, Google, and Microsoft, “Read Aloud For Me – AI Dashboard” places the future of AI at your fingertips, blending convenience with cutting-edge innovation. Whether for professional endeavors, educational pursuits, or personal enrichment, our app serves as your portal to the forefront of AI technologies. Embrace the future today by downloading our app and revolutionize your engagement with AI tools.

Top AI Statistics 2024 Roundup.

- The global AI market is valued at over $196 billion.
- AI industry value is projected to increase by over 13x over the next 7 years.
- The US AI market is forecast to reach $299.64 billion by 2026.
- The AI market is expanding at a CAGR of 38.1% between 2022 to 2030.
- By 2025, as many as 97 million people will work in the AI space.
- AI market size is expected to grow by at least 120% year-over-year.
- 83% of companies claim that AI is a top priority in their business plans.
- Netflix makes $1 billion annually from automated personalized recommendations.
- 48% of businesses use some form of AI to utilize big data effectively.
- 38% of medical providers use computers as part of their diagnosis.
Credit: Josh Howarth
AI Weekly Summary June 22-30 2024
Apple and Meta are discussing a generative AI partnership
ByteDance and Broadcom collaborate on AI chip development
Researchers developed a new method to detect hallucinations
Synthesia 2.0: World’s 1st AI video communication platform
OpenAI is on an acquiring spree, buying Rocket and Multi
Record labels sue AI music startups over copyright infringement
Anthropic rolls out Claude’s cutting-edge collaborative features
Google experiments with celebrity-inspired AI Chatbots
OpenAI postpones the launch of ChatGPT voice mode
Amazon steps into the chatbot race with Metis
Figma’s new AI features stir competition with Adobe
Alibaba’s Qwen-72B tops Hugging Face’s Open LLM Leaderboard
Google releases Gemma 2, lightweight but powerful open LLMs
OpenAI’s CriticGPT finds GPT-4’s mistakes with GPT-4
Google partners with Moody’s, Thomson Reuters & more for AI data
Apple and Meta are discussing gen AI partnership
Apple is reportedly in talks with its longtime rival Meta to integrate the latter’s Llama 3 AI model into Apple Intelligence. This move comes as Apple prepares to roll out its AI features across iPhones, iPads, and Macs later this year.
The potential partnership follows Apple’s existing deal with OpenAI, suggesting a collaboration strategy rather than solo development in the AI race. In Apple’s arrangement with OpenAI, there’s no direct payment. Instead, OpenAI can offer premium subscriptions through Apple Intelligence, with Apple taking a percentage. It’s unclear if Meta would agree to a similar business model, given that Llama 3 is open-source and free to access.
Source: https://www.wsj.com/
AI-Powered Professional Certification Quiz Platform
Web|iOs|Android|Windows
🚀 Power Your Podcast Like AI Unraveled: Get 20% OFF Google Workspace!
Hey everyone, hope you're enjoying the deep dive on AI Unraveled. Putting these episodes together involves tons of research and organization, especially with complex AI topics.
A key part of my workflow relies heavily on Google Workspace. I use its integrated tools, especially Gemini Pro for brainstorming and NotebookLM for synthesizing research, to help craft some of the very episodes you love. It significantly streamlines the creation process!
Feeling inspired to launch your own podcast or creative project? I genuinely recommend checking out Google Workspace. Beyond the powerful AI and collaboration features I use, you get essentials like a professional email (you@yourbrand.com), cloud storage, video conferencing with Google Meet, and much more.
It's been invaluable for AI Unraveled, and it could be for you too.
Start Your Journey & Save 20%
Google Workspace makes it easy to get started. Try it free for 14 days, and as an AI Unraveled listener, get an exclusive 20% discount on your first year of the Business Standard or Business Plus plan!
Sign Up & Get Your Discount HereUse one of these codes during checkout (Americas Region):
AI- Powered Jobs Interview Warmup For Job Seekers

⚽️Comparative Analysis: Top Calgary Amateur Soccer Clubs – Outdoor 2025 Season (Kids' Programs by Age Group)
Business Standard Plan: 63P4G3ELRPADKQU
Business Standard Plan: 63F7D7CPD9XXUVT
Set yourself up for promotion or get a better job by Acing the AWS Certified Data Engineer Associate Exam (DEA-C01) with the eBook or App below (Data and AI)

Download the Ace AWS DEA-C01 Exam App:
iOS - Android
AI Dashboard is available on the Web, Apple, Google, and Microsoft, PRO version
Business Standard Plan: 63FLKQHWV3AEEE6
Business Standard Plan: 63JGLWWK36CP7W
Invest in your future today by enrolling in this Azure Fundamentals - Pass the Azure Fundamentals Exam with Ease: Master the AZ-900 Certification with the Comprehensive Exam Preparation Guide!
- AWS Certified AI Practitioner (AIF-C01): Conquer the AWS Certified AI Practitioner exam with our AI and Machine Learning For Dummies test prep. Master fundamental AI concepts, AWS AI services, and ethical considerations.
- Azure AI Fundamentals: Ace the Azure AI Fundamentals exam with our comprehensive test prep. Learn the basics of AI, Azure AI services, and their applications.
- Google Cloud Professional Machine Learning Engineer: Nail the Google Professional Machine Learning Engineer exam with our expert-designed test prep. Deepen your understanding of ML algorithms, models, and deployment strategies.
- AWS Certified Machine Learning Specialty: Dominate the AWS Certified Machine Learning Specialty exam with our targeted test prep. Master advanced ML techniques, AWS ML services, and practical applications.
- AWS Certified Data Engineer Associate (DEA-C01): Set yourself up for promotion, get a better job or Increase your salary by Acing the AWS DEA-C01 Certification.
Business Plus Plan: M9HNXHX3WC9H7YE
With Google Workspace, you get custom email @yourcompany, the ability to work from anywhere, and tools that easily scale up or down with your needs.
Need more codes or have questions? Email us at info@djamgatech.com.
ByteDance and Broadcom collaborate on AI chip development
ByteDance is collaborating with U.S. chip designer Broadcom to develop an advanced AI processor. This partnership aims to secure a stable supply of high-end chips amid ongoing U.S.-China tensions. The project centers on creating a 5-nanometre, customized Application-Specific Integrated Chip (ASIC) that complies with U.S. export restrictions.
This chip’s manufacturing is set to be outsourced to Taiwan Semiconductor Manufacturing Company (TSMC), though production is not expected to begin this year. While the design work is currently underway, the critical “tape out” phase has yet to commence.
Source: https://www.reuters.
Researchers developed a new method to detect hallucinations
ChatGPT and Gemini can produce impressive results but often “hallucinate” false or unsubstantiated information. This research focuses on a subset of hallucinations called “confabulations,” where LLMs generate answers that are both wrong and arbitrary. Researchers have developed new methods to detect confabulations using entropy-based uncertainty estimators. They introduce the concept of “semantic entropy” to measure the uncertainty of LLM generations at the meaning level.
High semantic entropy corresponds to high uncertainty and indicates a higher likelihood of confabulation. The method computes uncertainty at the level of meaning rather than specific word sequences, addressing the fact that one idea can be expressed in many ways. The method provides scalable oversight by detecting confabulations that people might otherwise find plausible.
Source: https://www.nature.
Synthesia 2.0: World’s 1st AI video communication platform
Synthesia is launching Synthesia 2.0 – the world’s first AI video communications platform for businesses. It reinvents the entire video production process, allowing companies to create and share AI-generated videos at scale easily.
The key new features and capabilities of Synthesia 2.0 include:
- 2 Personal AI Avatars: Expressive Avatars shot in a studio and Custom Avatars created using your webcam.
- AI Video Assistant: Converts text, documents, or websites into high-quality videos, with options to customize the branding, tone, and length.
- Intuitive Video Editing: Editing simplified with “Triggers” that let you control animations and edits from the script.
- Translation and Dynamic Video Player: Videos can now be translated into over 120 languages. Synthesia is also building a new video player with interactive features.
- AI Safety Focus: Synthesia is pursuing ISO/IEC 42001 certification, the first standard for responsible AI management, to ensure its AI technologies are ethical.
Source: https://www.synthesia.
OpenAI is on an acquiring spree, buying Rockset and Multi
Last week, OpenAI acquired Rockset, a startup that develops tools for real-time data search and analytics. OpenAI said it would integrate Rockset’s technology to power its infrastructure and offerings across products.
This week, OpenAI acquired Multi, a startup focused on building remote collaboration tools and software. Technically, the deal is an acqui-hire as the entire Multi team, including its co-founders, will join OpenAI to work on the company’s ChatGPT desktop application.
Source: https://techcrunch.
Record labels sue AI music startups over copyright infringement
The world’s major record labels, including Universal Music Group, Sony Music, and Warner Music, have filed twin lawsuits against the AI music generation startups Suno and Udio. The lawsuits accuse the companies of unlawfully training their AI models on massive amounts of copyrighted music, which, according to the complaints, allows the startups to generate similar-sounding music without permission.
The record labels allege Suno and Udio have effectively copied artists’ styles and specific musical characteristics. The labels claim the AI-generated music is so close to the original that it is eerily similar when transcribed into sheet music. The lawsuits also accuse the startups of making it easy for people to distribute AI-created samples that mimic copyrighted recordings on platforms like Spotify.
Source: https://venturebeat.
Anthropic rolls out Claude’s cutting-edge collaborative features
Anthropic has introduced new collaboration features for Claude. These features include:
- Projects: Projects in Claude allow integration of internal resources like style guides or codebases, enhancing Claude’s ability to deliver tailored assistance across various tasks. Users can set custom instructions for each Project to modify Claude’s tone or perspective for a specific role or industry.
- Artifacts: It allows users to generate and edit various content types like code, documents, and graphics within a dedicated window. This benefits developers by offering larger code windows and live previews for easier front-end reviews.
- Sharing Features: Claude Team users can share snapshots of their best conversations with Claude in their team’s shared project activity feed.
Additionally, any data or chats shared within Projects will not be used to train Anthropic’s generative models without a user’s explicit consent.
Source: https://www.anthropic.
Google experiments with celebrity-inspired AI Chatbots
These chatbots will be powered by Google’s Gemini family of LLMs. The company aims to strike partnerships with influencers and celebrities and is also working on a feature that allows people to create their own chatbots by describing their personalities and appearances.
The project is led by Ryan Germick, a longtime executive at Google and a team of ten. These chatbots could be an experiment and may only appear on Google Labs rather than being widely available.
Source: https://www.msn.com/
OpenAI postpones the launch of ChatGPT voice mode
Originally planned for late June, the Voice Mode aims to provide a more naturalistic and conversational experience with the AI chatbot, complete with emotional inflection and the ability to handle interruptions.
However, it will now be available only to a small group of users in late July or early August. OpenAI is working on improving content detection and user experience before wider rollout. GPT-4o’s real-time voice and vision capabilities are also expected to roll out to ChatGPT Plus users soon.
Source: https://techcrunch.
Amazon steps into the chatbot race
Amazon is reportedly working on a new consumer-focused chatbot codenamed “Metis.” It is planned to be released somewhere around September. Here’s what we know about it:
- The chatbot is powered by a new model, Olympus, and can be accessed via a web browser.
- It uses a retrieval-augmented generation (RAG) technique to provide up-to-date information and automate tasks.
- The model conversationally provides text and image-based outputs, suggesting follow-ups to queries. It also shares links to sources and supports image generation.
- It uses an infrastructure similar to Amazon’s upcoming voice assistant, Remarkable Alexa.
Source: https://www.
Figma’s new AI features stir competition with Adobe
Figma announced a range of new features at the 2024 Config conference. Significant ones include a UI redesign, generative AI tools, new icons and toolbar, AI-enhanced asset search, and auto-generated texts in designs.
For instance, by typing a simple prompt into the textbox, users can create an entire app design mock-up for a restaurant. Figma will connect the design pages and even write suggested content!
Figma has also added a few designer-specific features to allow users to tweak designs in real-time. It features a developer mode with a “ready-for-dev” task list. The upgrade also boasts Figma slides, a Google slides-like tool for building and sharing presentations.
Source: https://www.figma.com/
Alibaba’s Qwen-72B tops the Hugging Face leaderboard
Hugging Face’s latest open large language model leaderboard ranks and evaluates open LLMs based on benchmarks like MMLU-pro and tests them on high-school and college-level problems.
The platform used 300 NVIDIA H100 GPUs to re-evaluate major open LLMs to obtain updated rankings. Chinese company Alibaba’s Qwen-72B dominated the leaderboard, becoming a top performer overall.
Not just that, the leaderboard was mainly dominated by Chinese companies, highlighting their headway into the open LLM space.
Source: https://huggingface.
Google’s Gemma 2, a set of lightweight, powerful open LLMs
Google has released Gemma 2 set of models that punch above their weight classes. Available in 9B and 27B parameter sizes, these models are
- Higher performing and more efficient at inference than the first-generation
- Have significant safety advancements built in
- Optimized to run at incredible speed across a range of hardware and easily integrate with other AI tools
- Trained on 13 trillion tokens for 27B, 8 trillion for 9B, and 2 trillion for 2.6B model (en route)
27B performs better than Llama3-70B and Nemotron-340B on Lmsys Arena, making it best in its size and stronger than some larger models. While 9B outperforms the likes of Mistral-large and Qwen1.5-110B.
The 27B Gemma 2 model is designed to run inference efficiently at full precision on a single Google Cloud TPU host, NVIDIA A100 80GB Tensor Core GPU, or NVIDIA H100 Tensor Core GPU. Moreover, this is an open weights model line, currently only available to researchers and developers.
Source: https://blog.google/
OpenAI’s CriticGPT finds GPT-4’s mistakes with GPT-4
OpenAI trained a model based on GPT-4, called CriticGPT, to catch errors in ChatGPT’s code output. It found that when users get help from CriticGPT to review ChatGPT code, they outperform those without help 60% of the time.
OpenAI aligns GPT-4 models to be more helpful and interactive through Reinforcement Learning from Human Feedback (RLHF). A key part of RLHF is collecting comparisons in which people, called AI trainers, rate different ChatGPT responses against each other.
OpenAI is beginning to integrate CriticGPT-like models into its RLHF labeling pipeline, providing trainers with explicit AI assistance.
Source: https://openai.com/
Google’s partnerships to help AI with real-world facts
Google is partnering with reputable third-party services, such as Moody’s, MSCI, Thomson Reuters, and Zoominfo, to ground its AI with real-world data. These four will be available within Vertex AI starting next quarter. They will offer developers qualified data to backstop their model outputs and ensure responses are factually accurate.
Google is also announcing high-fidelity grounding. Available through an experimental preview, it’s designed to help AI systems work better with a given set of specific information.
Source: https://venturebeat.
A Daily chronicle of AI Innovations June 28th 2024:
Google releases Gemma 2, a set of lightweight but powerful open LLMs
OpenAI’s CriticGPT finds GPT-4’s mistakes with GPT-4
Google partners with Moody’s, Thomson Reuters & more for AI data
OpenAI’s critics train smarter AI systems
TIME and OpenAI announced a multi-year content deal
Google is using AI to add 110 new languages to Google Translate
YouTube is in talks with major record labels for an AI music deal
Meta to start testing user-created AI chatbots on Instagram
Character.AI now allows users to talk with AI avatars over calls
OpenAI’s critics train smarter AI systems
OpenAI just published new research exploring how to improve model training using AI ‘critics’ that help human trainers spot errors in AI outputs more effectively.
- OpenAI developed a model called CriticGPT to assist human trainers in evaluating and catching errors in code written by AI systems like ChatGPT.
- In tests, CriticGPT caught 85% of inserted bugs in AI-generated code, compared to just 50% for human reviewers alone.
- Human trainers also preferred CriticGPT’s code critics over their own 63% of the time on naturally occurring bugs.
- OpenAI said that CriticGPT-type models are already being integrated into the company’s training workflow, helping prepare for advancing AI systems.
As OpenAI states in the research paper, both LLMs and AI critics will continue to scale — human intelligence will not. The limitations of our own puny mortal brains look to be no match for evaluating systems that are continuing to soar, meaning AI is quickly becoming its own trainer.
Source: https://openai.com/index/finding-gpt4s-mistakes-with-gpt-4
Google releases Gemma 2, a set of lightweight but powerful open LLMs
Google has released Gemma 2 set of models that punch above their weight classes. Available in 9B and 27B parameter sizes, these models are
- Higher performing and more efficient at inference than the first-generation
- Have significant safety advancements built in
- Optimized to run at incredible speed across a range of hardware and easily integrate with other AI tools
- Trained on 13 trillion tokens for 27B, 8 trillion for 9B, and 2 trillion for 2.6B model (en route)
27B performs better than Llama3-70B and Nemotron-340B on Lmsys Arena, making it best in its size and stronger than some larger models. While 9B outperforms the likes of Mistral-large and Qwen1.5-110B.
The 27B Gemma 2 model is designed to run inference efficiently at full precision on a single Google Cloud TPU host, NVIDIA A100 80GB Tensor Core GPU, or NVIDIA H100 Tensor Core GPU. Moreover, this is an open weights model line, currently only available to researchers and developers.
Why does it matter?
The models sound like they are built for practical deployments. They come in practical sizes so that they can be easily deployed while being amazing in quality due to best-in-class performances.
Source: https://blog.google/technology/developers/google-gemma-2/
OpenAI’s CriticGPT finds GPT-4’s mistakes with GPT-4
OpenAI trained a model based on GPT-4, called CriticGPT, to catch errors in ChatGPT’s code output. It found that when users get help from CriticGPT to review ChatGPT code, they outperform those without help 60% of the time.
OpenAI aligns GPT-4 models to be more helpful and interactive through Reinforcement Learning from Human Feedback (RLHF). A key part of RLHF is collecting comparisons in which people, called AI trainers, rate different ChatGPT responses against each other.
OpenAI is beginning to integrate CriticGPT-like models into its RLHF labeling pipeline, providing trainers with explicit AI assistance.
Why does it matter?
With more advances in reasoning and model behavior, AI models’ mistakes can become more subtle for AI trainers to spot. CriticGPT is a step towards addressing this fundamental limitation of RLHF.
Source: https://openai.com/index/finding-gpt4s-mistakes-with-gpt-4
Google partners with Moody’s, Thomson Reuters & more for AI data
Google is partnering with reputable third-party services, such as Moody’s, MSCI, Thomson Reuters, and Zoominfo, to ground its AI with real-world data. These four will be available within Vertex AI starting next quarter. They will offer developers qualified data to backstop their model outputs and ensure responses are factually accurate.
Google is also announcing high-fidelity grounding. Available through an experimental preview, it’s designed to help AI systems work better with a given set of specific information.
Why does it matter?
Earlier, Google announced efforts to ground Vertex AI results using web data and a plan to allow companies to ground AI systems in their own internal data.
Now, it is grounding these systems in known factual data from third parties, which could significantly lessen hallucinations and make AI more trustworthy for enterprise customers.
Source: https://venturebeat.com/ai/google-grounding-ai-with-moodys-msci-thomson-reuters-zoominfo
What Else Is Happening in AI on June 27th 2024
TIME and OpenAI announced a multi-year content deal
OpenAI will gain access to current and historic content from TIME’s extensive archives from the last 101 years to enhance its products. It will also enable TIME to gain access to OpenAI’s technology to develop new products for its audiences.
Source: https://openai.com/index/strategic-content-partnership-with-time
Google is using AI to add 110 new languages to Google Translate
It is Google’s largest expansion ever, thanks to its PaLM 2 LLM. It includes languages like Cantonese, NKo, and Tamazight, representing more than 614 million speakers and opening up translations for around 8% of the world’s population.
Source: https://blog.google/products/translate/google-translate-new-languages-2024/
YouTube is in talks with major record labels for an AI music deal
It is offering to pay Universal Music Group (UMG), Sony Music Entertainment, and Warner Records “lump sums of cash” in exchange for legally licensing their songs to train new AI music tools. These will likely be one-off payments, not royalty-based arrangements. Source: https://www.theverge.com/2024/6/27/24187151/youtube-ai-music-deals-licensing-record-labels-sony-umg-warner
Meta to start testing user-created AI chatbots on Instagram
CEO Mark Zuckerberg announced yesterday that Meta will begin to surface AI characters made by creators through Meta AI studio on Instagram, starting in the U.S. These will primarily show up in messaging for now and will be clearly labeled as AI.
Source: https://techcrunch.com/2024/06/27/meta-starts-testing-user-created-ai-chatbots-on-instagram
Character.AI now allows users to talk with AI avatars over calls
Users can initiate calls with a user-generated AI character directly with a button tap. Users can also switch between calling and texting seamlessly and stop the AI from talking through a “Tap to interrupt” option. The feature currently supports only a few languages.
Source: https://www.reuters.com/technology/artificial-intelligence/ai-chatbot-startup-characterai-launches-new-calls-feature-2024-06-27
Use ChatGPT to turn docs into spreadsheets
With a simple prompt, ChatGPT can analyze documents, answer questions, perform calculations, and create a downloadable spreadsheet — all in one conversation!
- Log in to ChatGPT and upload your document to the chat (remember to remove any sensitive data before submitting).
- Ask questions about it, e.g., “How much is the security deposit?”
- Request calculations, like the total first month’s costs, including deposits and fees.
- Simple prompt ChatGPT to “create a downloadable budget spreadsheet”, specifying the time frame and desired columns.
Note: Only share documents you’re comfortable with. Be cautious with sensitive information and consider redacting critical details before uploading.
Source: https://university.therundown.ai/c/daily-tutorials/from-lease-agreement-to-budget-spreadsheet-with-chatgpt-0ed8d7ba-7d5a-4e9f-903a-64822caf1579
A Daily chronicle of AI Innovations June 27th 2024:
YouTube wants to pay record labels to use their songs for AI training
Figma unveils AI-powered design tools, challenges Adobe’s dominance
Character.AI introduces AI avatar calls
Amazon steps into the chatbot race
Alibaba’s Qwen-72B tops the Hugging Face leaderboard
Hugging Face updates Open LLM Leaderboard
NBC to use AI voice for Olympics recaps!
Sonia’s ‘AI Therapist’ sparks debate
Anthropic extends Claude’s access to government
YouTube wants to pay record labels to use their songs for AI training
- YouTube is reportedly negotiating with major music labels like Sony Music Entertainment, Universal Music Group, and Warner Records to pay for licensing their songs for AI training.
- Despite these negotiations, the ultimate decision to allow the use of songs for AI training will likely fall to individual artists rather than the labels themselves.
- In April 2023, more than 200 artists voiced their opposition to AI using their work, emphasizing the potential threat to creators’ rights and the music ecosystem.
Source: https://www.engadget.com/youtube-reportedly-wants-to-pay-record-labels-to-use-their-songs-for-ai-training-125052503.html
Figma unveils AI-powered design tools, challenges Adobe’s dominance
- Figma has launched AI-powered design features to speed up workflows and aid teams in creating superior software, positioning it as a product development platform to compete with Adobe.
- The new Figma AI features include design creation with text prompts, visual search capabilities, layer renaming, and automating repetitive tasks, and are currently available in a limited beta.
- While beta access to Figma AI is free, later availability will require subscriptions to specific plans, and admins must enable AI features, differentiating Figma’s approach from Adobe’s controversial AI terms.
Source: https://venturebeat.com/ai/figma-unveils-ai-powered-design-tools-to-rival-adobe/
Character.AI introduces AI avatar calls
- Character.AI announced that users can now make calls to AI characters, supporting multiple languages including English, Spanish, Portuguese, Russian, Korean, Japanese, and Chinese.
- The company reported that during testing, over 3 million users made 20 million calls, and noted that AI character calls can help with language practice, mock interviews, or role-playing games.
- With a button tap, users can start calls with AI characters, seamlessly switch between calling and texting, and use a “Tap to interrupt” feature to stop the AI from talking.
Source: https://techcrunch.com/2024/06/27/character-ai-now-allows-users-to-talk-with-avatars-over-calls/
Amazon steps into the chatbot race
Amazon is reportedly working on a new consumer-focused chatbot codenamed “Metis.” It is planned to be released somewhere around September. Here’s what we know about it:
- The chatbot is powered by a new model, Olympus, and can be accessed via a web browser.
- It uses a retrieval-augmented generation (RAG) technique to provide up-to-date information and automate tasks.
- The model conversationally provides text and image-based outputs, suggesting follow-ups to queries. It also shares links to sources and supports image generation.
- It uses an infrastructure similar to Amazon’s upcoming voice assistant, Remarkable Alexa.
Why does it matter?
The AI assistant market is already too crowded, making Metis’ release seem a bit too late. Its rivals, OpenAI and Google, have already launched their models in 2023 and will give tough competition to Amazon. If Amazon wishes to win this AI race, Metis must showcase exceptional capabilities.
Source: https://www.businessinsider.com/amazon-chatgpt-rival-codenamed-metis-2024-6
Figma’s new AI features stir competition with Adobe
Figma announced a range of new features at the 2024 Config conference. Significant ones include a UI redesign, generative AI tools, new icons and toolbar, AI-enhanced asset search, and auto-generated texts in designs.
https://x.com/figma/status/1806016455609286847
For instance, by typing a simple prompt into the textbox, users can create an entire app design mock-up for a restaurant. Figma will connect the design pages and even write suggested content!
Figma has also added a few designer-specific features to allow users to tweak designs in real-time. It features a developer mode with a “ready-for-dev” task list. The upgrade also boasts Figma slides, a Google slides-like tool for building and sharing presentations.
Why does it matter?
Figma’s AI tools will enable newer users on the platform to test out ideas easily. At the same time, tools like “ready-for-dev” will offer quick iterations to users who are more familiar with the platform.
With Adobe abandoning its plans to acquire Figma, the company’s recent upgrade might pose a challenge to the former.
Source: https://www.figma.com/whats-new/
Alibaba’s Qwen-72B tops the Hugging Face leaderboard
Hugging Face’s latest open large language model leaderboard ranks and evaluates open LLMs based on benchmarks like MMLU-pro and tests them on high-school and college-level problems.
The platform used 300 NVIDIA H100 GPUs to re-evaluate major open LLMs to obtain updated rankings. Chinese company Alibaba’s Qwen-72B dominated the leaderboard, becoming a top performer overall.
Not just that, the leaderboard was mainly dominated by Chinese companies, highlighting their headway into the open LLM space.
Why does it matter?
The open LLM leaderboard is essential for transparent and comprehensive LLM evaluation. It also proves that a model’s size doesn’t guarantee its superior performance. As Chinese models deliver exceptional performance, will well-known models from the USA, like Meta’s Llama, be replaced in the coming years?
Source: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
What Else Is Happening in AI on June 27th 2024
Anthropic extends Claude’s access to government
Anthropic is making Claude 3 Haiku and Claude 3 Sonnet available for government use through the AWS marketplace. This will improve citizen services, streamline document reviews, support data-based policymaking, and generate realistic training scenarios.
Source: https://www.anthropic.com/news/expanding-access-to-claude-for-government?
Opera announces new themes and features for One R2
Major features include AI-based image and voice generation, image understanding capabilities, floating windows, new themes, design elements, and split window tabs.
Source: https://press.opera.com/2024/06/26/opera-unveils-opera-one-r2-in-developer
NBC to use AI voice for Olympics recaps!
The AI will offer daily, personalized coverage of the 2024 Paris Olympics on the Peacock app in the voice of the famous sports commentator, AI Michael. According to NBC, the voice AI is trained on data from Michael’s appearances on the show.
Source: https://www.nbcnews.com/news/nbc-use-ai-generated-version-al-michaels-voice-summer-olympics-rcna159108
Caption’s new feature can auto-edit videos!
The feature adds custom graphics, zooms, music, sound effects, transitions, and motion backgrounds to an unedited video. Users can also use Caption’s AI avatars to create videos and fully edit them with the AI edit feature.
Source: https://www.captions.ai/
Sonia’s ‘AI Therapist’ sparks debate
The personalized AI therapist can deal with depression, stress, anxiety, poor sleep, and more by analyzing what users say during therapy sessions. The release has sparked a debate around user data privacy, its susceptibility to bias, and the accuracy of its diagnosis. Source: https://www.soniahealth.com/
Hugging Face updates Open LLM Leaderboard
Hugging Face just introduced a new upgrade to its Open LLM Leaderboard, adding new benchmarks and evaluation methods to help address the recent plateau in LLM performance gains.
- The leaderboard now features six new benchmarks designed to be more challenging and less prone to contamination.
- Initial rankings show Qwen2-72B-Instruct leading the pack, followed by Meta’s Llama-3-70B-Instruct and Mixtral 8×22b.
- A new normalized scoring system adjusts for baseline performance, providing a more fair comparison across different evaluation types.
- The upgrade also introduces a ‘maintainer’s highlight’ category and community voting system to prioritize the most relevant models.
As LLMs approach human-level performance on most tasks, finding new ways to evaluate them is becoming more difficult — and more crucial. This revamp helps guide researchers and developers towards more targeted improvements, providing a more nuanced assessment of model capabilities.
Source: https://huggingface.co/spaces/open-llm-leaderboard/blog
Rabbit R1 hit with major security flaw
A group of developers just discovered a major vulnerability in Rabbit’s R1 AI assistant device, potentially exposing user’s private data and chat responses.
- A community-led group called Rabbitude uncovered hardcoded API keys in Rabbit’s codebase, which allowed access to all R1 responses.
- The group gained access to the codebase in mid-May, saying the Rabbit team was aware of the issue but failed to take action.
- Rabbitude said the vulnerability could allow bad actors to disable all r1 devices, alter voices and responses, and access private messages.
- Rabbit acknowledged an ‘alleged data breach’ via a Discord post, but claims no customer data was leaked.
Despite massive hype in the first wave of consumer AI standalone devices, the Rabbit r1 has been nothing short of a disaster so far. Already facing major criticism over the companion’s limited capabilities, this security breach only furthers the skepticism surrounding the early AI hardware market entrants.
Source: https://rabbitu.de/articles/security-disclosure-1?
A Daily chronicle of AI Innovations June 26th 2024:
Anthropic rolls out Claude’s cutting-edge collaborative features
Google experiments with celebrity-inspired AI Chatbots
OpenAI postpones the launch of ChatGPT voice mode
New AI simulates 500M years of evolution
OpenAI delays ChatGPT voice upgrade
Anthropic rolls out Claude’s cutting-edge collaborative features
Anthropic has introduced new collaboration features for Claude. These features include:
- Projects: Projects in Claude allow integration of internal resources like style guides or codebases, enhancing Claude’s ability to deliver tailored assistance across various tasks. Users can set custom instructions for each Project to modify Claude’s tone or perspective for a specific role or industry.
- Artifacts: It allows users to generate and edit various content types like code, documents, and graphics within a dedicated window. This benefits developers by offering larger code windows and live previews for easier front-end reviews.
- Sharing Features: Claude Team users can share snapshots of their best conversations with Claude in their team’s shared project activity feed.
Additionally, any data or chats shared within Projects will not be used to train Anthropic’s generative models without a user’s explicit consent.
Why does it matter?
By offering advanced AI collaboration tools, Anthropic meets critical enterprise needs for flexibility and team efficiency, where many of its competitors fall short. This approach could broaden its user base, increasing user retention and market penetration.
Source: https://www.anthropic.com/news/projects
Google experiments with celebrity-inspired AI Chatbots
These chatbots will be powered by Google’s Gemini family of LLMs. The company aims to strike partnerships with influencers and celebrities and is also working on a feature that allows people to create their own chatbots by describing their personalities and appearances.
The project is led by Ryan Germick, a longtime executive at Google and a team of ten. These chatbots could be an experiment and may only appear on Google Labs rather than being widely available.
Why does it matter?
Google’s AI chatbots featuring celebrities and influencers enhance user engagement but raise ethical concerns about consent, boundaries, and legal and privacy issues as AI evolves.
Source: https://www.msn.com/en-us/news/other/google-wants-to-build-ai-chatbots-based-on-celebs-influencers-for-some-reason/ar-BB1oS1or
OpenAI postpones the launch of ChatGPT voice mode
Originally planned for late June, the Voice Mode aims to provide a more naturalistic and conversational experience with the AI chatbot, complete with emotional inflection and the ability to handle interruptions.
However, it will now be available only to a small group of users in late July or early August. OpenAI is working on improving content detection and user experience before wider rollout. GPT-4o’s real-time voice and vision capabilities are also expected to roll out to ChatGPT Plus users soon.
Why does it matter?
Delays can give competitors like Anthropic a chance to gain an edge. If rivals manage to launch similar or superior voice capabilities sooner, OpenAI risks losing potential users and market share.
Source: https://techcrunch.com/2024/06/25/openai-delays-chatgpts-new-voice-mode
New AI simulates 500M years of evolution
Biotech startup EvolutionaryScale just introduced ESM3, a new AI model capable of designing and simulating new proteins in ways that mimic millions of years of natural evolution.
- ESM3 is a frontier language model that can reason over protein sequences, structures, and functions simultaneously.
- In tests, ESM3 generated a new fluorescent protein 58% different from other known structures — a leap that typically takes nature 500M years of evolution.
- The largest version of ESM3 has 98B parameters, and was trained on approximately 1.8T tokens of biological data.
- EvolutionaryScale was created by former META researchers, and has already secured $142M in seed funding from tech giants like Amazon and Nvidia.
ESM3 is bringing us closer to a future where we can ‘program’ biology as easily as we write computer code — potentially revolutionizing fields from healthcare to environmental science and unlocking completely new areas of knowledge across fields.
Source: https://www.evolutionaryscale.ai/blog/esm3-release
What Else Is Happening in AI on July 26th 2024
Ex-Meta engineers launch protein-generating AI biology model
EvolutionaryScale, launched by ex-Meta engineers, introduced ESM3, a gen AI model for designing novel proteins. It uses AI to simulate evolutionary processes, aiming to revolutionize industries like medicine and biofuels. It is available via API to selected customers and non-commercial users.
Source: https://venturebeat.com/ai/meta-alum-launches-ai-biology-model-that-simulates-500-million-years-of-evolution
Reddit sets new barriers for AI crawlers to protect its content
Reddit will rate-limit or block unknown bots and crawlers that violate its Public Content Policy or lack an agreement with the platform. These changes aim to deter AI companies from training LLMs on Reddit content (excluding the $60 million deal with Google).
Source: https://techcrunch.com/2024/06/25/reddits-upcoming-changes-attempt-to-safeguard-the-platform-against-ai-crawlers
AI-generated ad film debuts at Cannes 2024
Toys ‘R’ Us, the iconic toy store chain, collaborated with OpenAI’s Sora to produce a brand film called “The Origin of Toys ‘R’ Us.” The film premiered at the 2024 Cannes Lions International Festival of Creativity.
Source: https://venturebeat.com/ai/toys-r-us-unveils-first-commercial-made-with-openais-sora
Mozilla’s Firefox Nightly now lets users pick from leading AI chatbots
It allows users to choose from multiple AI chatbots, such as ChatGPT, Google Gemini, HuggingChat, and Le Chat Mistral. The feature lets users select their preferred AI to perform tasks such as summarizing web content, simplifying text, or conducting quizzes.
Source: https://techcrunch.com/2024/06/25/firefox-now-lets-you-choose-your-preferred-ai-chatbot-in-its-nightly-builds
Google extends Gemini AI to Gmail to streamline email tasks
Google has integrated its Gemini AI into Gmail for Workspace subscribers, enhancing email functionality. It assists users by automatically drafting emails, summarizing threads, crafting responses, and retrieving information from emails or Google Drive.
Source: https://techcrunch.com/2024/06/25/google-brings-its-gemini-ai-to-gmail-via-a-sidebar-that-can-help-you-write-and-summarize-emails
A Daily chronicle of AI Innovations June 25th 2024:
Amazon is working on a ChatGPT competitor
Google is building AI chatbots modeled after celebrities, YouTube influencers
Nvidia’s market value plummets drastically
Apple rejected Meta’s AI integration over privacy issues
OpenAI buys a remote collaboration platform
Synthesia 2.0: World’s 1st AI video communication platform
OpenAI is on an acquiring spree, buying Rocket and Multi
Record labels sue AI music startups over copyright infringement
Amazon is working on a ChatGPT competitor
- Amazon is developing an AI chatbot, code-named “Metis,” to compete with OpenAI’s ChatGPT, and it could be launched later this year, based on a report from Business Insider.
- The upcoming chatbot will be accessed through a web browser and rely on a new foundational model, as mentioned by sources and documents cited by Business Insider.
- Amazon’s CEO Andy Jassy is reportedly directly involved with “Metis,” reflecting Amazon’s broader strategy to intensify its AI efforts across various applications and services.
- Source: https://www.geekwire.com/2024/
report-amazon-developing-ai- chatbot-that-would-compete- with-chatgpt-and-others/
Google is building AI chatbots modeled after celebrities, YouTube influencers
- Google is reportedly developing new AI chatbots modeled after celebrities and YouTube influencers, aiming to create specialized interactive experiences.
- The initiative will use Google’s Gemini family of large language models and seeks partnerships with influencers and celebrities, allowing users to design their own chatbots by describing traits and appearance.
- The project, led by executive Ryan Germick and a team of ten, might initially be available only on Google Labs, keeping the bots experimental and not widely accessible at launch.
- Source: https://www.engadget.com/
google-is-reportedly-building- ai-chatbots-based-on- celebrities-and-influencers- 235731655.html
Nvidia’s market value plummets drastically
- Nvidia’s shares dropped nearly 13% in a week, causing a loss of $430 billion from its market value and dropping its market capitalization to $2.91 trillion.
- This decline has moved Nvidia from the top position to third place among the world’s most valuable companies, now trailing behind Microsoft and Apple.
- Investor enthusiasm for Nvidia’s role in AI is waning after significant stock gains, leading to a 6.7% drop on Monday, the third consecutive day of declines for the company.
- Source: https://edition.cnn.com/2024/
06/25/tech/nvidia-stock-fall- worlds-third-biggest-company/ index.html
Apple rejected Meta’s AI integration over privacy issues
- Apple declined a potential AI partnership with Meta concerning the integration of Meta’s large language model into iOS, primarily due to privacy concerns highlighted in a Bloomberg report.
- While initial discussions between Apple and Meta took place, they did not advance, with Apple choosing not to pursue formal talks or integration due to Meta’s insufficient privacy practices.
- Despite rejecting Meta, Apple has engaged in collaborations with other major AI entities, finalizing a deal with OpenAI to integrate ChatGPT into iOS 18 and discussing potential integrations with Google and Anthropic for diverse AI model options.
- Source: https://www.macrumors.com/
2024/06/24/apple-meta-ai- privacy-concerns/
OpenAI buys a remote collaboration platform
- OpenAI is advancing its AI technology by integrating it into desktop computing, moving beyond browser-based interactions with the acquisition of startup Multi.
- By purchasing Multi, OpenAI aims to develop potentially a new AI-based operating system or at least an interaction layer for current systems, as suggested by recent product releases including a ChatGPT desktop app.
- OpenAI’s future AI assistant could replace traditional browsers, altering the role of search engines and websites, and revolutionizing how we interact with online content.
- Source: https://the-decoder.com/
openai-moves-toward-its-own- ai-operating-system-and-a- post-browser-world/
Synthesia 2.0: World’s 1st AI video communication platform
Synthesia is launching Synthesia 2.0 – the world’s first AI video communications platform for businesses. It reinvents the entire video production process, allowing companies to create and share AI-generated videos at scale easily.
https://youtu.be/gZaBwdru_bk?
The key new features and capabilities of Synthesia 2.0 include:
- 2 Personal AI Avatars: Expressive Avatars shot in a studio and Custom Avatars created using your webcam.
- AI Video Assistant: Converts text, documents, or websites into high-quality videos, with options to customize the branding, tone, and length.
- Intuitive Video Editing: Editing simplified with “Triggers” that let you control animations and edits from the script.
- Translation and Dynamic Video Player: Videos can now be translated into over 120 languages. Synthesia is also building a new video player with interactive features.
- AI Safety Focus: Synthesia is pursuing ISO/IEC 42001 certification, the first standard for responsible AI management, to ensure its AI technologies are ethical.
Why does it matter?
By making video creation and distribution more accessible and automated, Synthesia aims to help businesses transition to a “video-first” approach to communicate more effectively, improve employee training, customer service, and marketing efforts, and streamline various business operations.
Source: https://www.synthesia.io/post/
OpenAI is on an acquiring spree, buying Rocket and Multi
Last week, OpenAI acquired Rockset, a startup that develops tools for real-time data search and analytics. OpenAI said it would integrate Rockset’s technology to power its infrastructure and offerings across products.
This week, OpenAI acquired Multi, a startup focused on building remote collaboration tools and software. Technically, the deal is an acqui-hire as the entire Multi team, including its co-founders, will join OpenAI to work on the company’s ChatGPT desktop application.
Why does it matter?
With over 600,000 paid business users, including most Fortune 500 companies, OpenAI’s back-to-back acquisitions strategically position it to expand its enterprise-focused AI products and make its powerful language models more useful for businesses.
Source: https://techcrunch.com/2024/
Record labels sue AI music startups over copyright infringement
The world’s major record labels, including Universal Music Group, Sony Music, and Warner Music, have filed twin lawsuits against the AI music generation startups Suno and Udio. The lawsuits accuse the companies of unlawfully training their AI models on massive amounts of copyrighted music, which, according to the complaints, allows the startups to generate similar-sounding music without permission.
The record labels allege Suno and Udio have effectively copied artists’ styles and specific musical characteristics. The labels claim the AI-generated music is so close to the original that it is eerily similar when transcribed into sheet music. The lawsuits also accuse the startups of making it easy for people to distribute AI-created samples that mimic copyrighted recordings on platforms like Spotify.
Why does it matter?
This legal battle is only the latest example, and one of many, of AI technology colliding with creative industries. However, any resolution of these battles will likely have far-reaching implications for the music industry and the development of AI-powered music tools.
What Else Is Happening in AI on June 25th
Apple withdraws AI collaboration deal with Meta over privacy concerns
Contrary to the rumors about a potential AI collaboration, Apple is not planning a partnership to integrate Meta’s AI models into its products due to privacy concerns. Instead, Apple is focusing on partnerships with OpenAI and Google that align with its commitment to user privacy.
Shopify expands its AI capabilities to attract more businesses
Shopify has expanded access to its AI-powered features, including the Sidekick assistant and image-generation tool. It is making these AI tools available to more users, as it aims to provide merchants with an integrated and seamless platform to scale their businesses.
Google is bringing Gemini access to teens through school accounts
Google now makes its AI assistant Gemini available to teenage students through school accounts. This move is aimed at helping prepare students for a future where generative AI is more prevalent. Google says Gemini will provide real-time feedback and support to students while including safeguards to prevent inappropriate responses.
NVIDIA deploys its AI tech in the Middle East among U.S AI export curb
NVIDIA has struck a deal with Qatari telecom provider Ooredoo to deploy its AI technology in the Middle East. This move comes amid U.S. export restrictions on advanced AI chips to some countries in the region. Ooredoo will be the first company in the Middle East to offer Nvidia’s AI and graphics technology to its clients across multiple countries.
Meta marks real photos with ‘Made by AI’ label
Meta has been incorrectly labeling real photos as “Made by AI.” Multiple photographers have reported that Meta has added this label to their genuine photographs, even when they have only made minor edits using tools like Adobe’s Generative Fill. Meta says it relies on industry standards to detect AI-generated content, leading to false positives.
Source: https://www.theverge.com/2024/
A Daily chronicle of AI Innovations June 24th 2024:
Apple and Meta are discussing gen AI partnership
ByteDance and Broadcom collaborate on AI chip development
Researchers developed a new method to detect hallucinations
Apple in talks with Meta for potential AI integration
Record labels sue AI music companies
AI blood test for Parkinson’s detection
Researchers just developed an AI-powered blood test that can predict Parkinson’s disease up to 7 years before symptoms appear, potentially revolutionizing early diagnosis and treatment.
- The AI analyzes key markers in blood samples linked to inflammation and protein degradation, attempting to predict which patients would develop the disease.
- In a 10-year study, the test correctly identified 79% of patients who later developed Parkinson’s.
- The AI model predicted Parkinson’s up to 7 years before clinical symptoms emerged in some cases.
- Researchers hope to develop a simple finger-prick version for widespread screenings.
AI’s ability to find subtle patterns in previously undetectable biological data is unlocking new diagnostic capabilities. By catching a debilitating disease years before the onset of symptoms, patients can get earlier access to treatment aimed at prevention.
Source: https://www.bbc.com/news/articles/c722yly6l6ro
Apple in talks with Meta for potential AI integration
- Apple is reportedly negotiating with Meta to integrate Meta’s generative AI model into Apple’s new AI system, Apple Intelligence, according to The Wall Street Journal.
- Apple is seeking partnerships with multiple AI companies, including Meta, to enhance its AI capabilities and catch up in the competitive AI race.
- A potential collaboration between Apple and Meta would be significant due to their history of disagreements, and it could greatly impact the AI industry if successful.
- Source: https://www.benzinga.com/news/24/06/39447797/apple-eyes-meta-for-potentially-game-changing-ai-partnership-report
Record labels sue AI music companies
- Universal Music Group, Sony Music Entertainment, and Warner Records are suing AI companies Suno and Udio for massive copyright infringement related to the creation of original songs using generative AI technology.
- The music labels claim that Suno and Udio used copyrighted works without permission, and are seeking damages of up to $150,000 per violated work in lawsuits filed in Boston and New York federal courts.
- This legal action is part of a broader conflict between the music industry and tech companies offering AI tools, highlighting concerns about AI-generated music impacting the control and revenue of human artists.
- Source: https://www.theverge.com/2024/6/24/24184710/riaa-ai-lawsuit-suno-udio-copyright-umg-sony-warner
Apple and Meta are discussing gen AI partnership
Apple is reportedly in talks with its longtime rival Meta to integrate the latter’s Llama 3 AI model into Apple Intelligence. This move comes as Apple prepares to roll out its AI features across iPhones, iPads, and Macs later this year.
The potential partnership follows Apple’s existing deal with OpenAI, suggesting a collaboration strategy rather than solo development in the AI race. In Apple’s arrangement with OpenAI, there’s no direct payment. Instead, OpenAI can offer premium subscriptions through Apple Intelligence, with Apple taking a percentage. It’s unclear if Meta would agree to a similar business model, given that Llama 3 is open-source and free to access.
Why does it matter?
Despite their historical rivalry, especially after Apple’s 2021 privacy changes impacted Meta’s ad revenue, this potential partnership suggests a willingness to cooperate in the AI race. However, not everyone is thrilled about it. Users and investors are voicing concerns about privacy, with Meta’s track record being a major sticking point. Some see this as a desperate move by Apple.
Source: https://www.wsj.com/tech/ai/apple-meta-have-discussed-an-ai-partnership-cc57437e
ByteDance and Broadcom collaborate on AI chip development
ByteDance is collaborating with U.S. chip designer Broadcom to develop an advanced AI processor. This partnership aims to secure a stable supply of high-end chips amid ongoing U.S.-China tensions. The project centers on creating a 5-nanometre, customized Application-Specific Integrated Chip (ASIC) that complies with U.S. export restrictions.
This chip’s manufacturing is set to be outsourced to Taiwan Semiconductor Manufacturing Company (TSMC), though production is not expected to begin this year. While the design work is currently underway, the critical “tape out” phase has yet to commence.
Why does it matter?
This partnership shows how Chinese tech companies are getting creative to secure advanced AI chips. As ByteDance has launched a major push in generative AI, this new initiative brings dual benefits: reducing procurement costs and ensuring a stable supply of higher-end chips.
Source: https://www.reuters.com/technology/artificial-intelligence/chinas-bytedance-working-with-broadcom-develop-advanced-ai-chip-sources-say-2024-06-24
Researchers developed a new method to detect hallucinations
ChatGPT and Gemini can produce impressive results but often “hallucinate” false or unsubstantiated information. This research focuses on a subset of hallucinations called “confabulations,” where LLMs generate answers that are both wrong and arbitrary. Researchers have developed new methods to detect confabulations using entropy-based uncertainty estimators. They introduce the concept of “semantic entropy” to measure the uncertainty of LLM generations at the meaning level.
High semantic entropy corresponds to high uncertainty and indicates a higher likelihood of confabulation. The method computes uncertainty at the level of meaning rather than specific word sequences, addressing the fact that one idea can be expressed in many ways. The method provides scalable oversight by detecting confabulations that people might otherwise find plausible.
Why does it matter?
Detecting confabulations can improve question-answering accuracy for state-of-the-art LLMs. The method helps users understand when to be cautious with LLM outputs and opens up new possibilities for using LLMs that were previously limited by unreliability.
Source: https://www.nature.com/articles/s41586-024-07421-0
What Else Is Happening in AI on June 24th 2024
Stability AI has appointed a new CEO
Prem Akkaraju, former CEO of Weta Digital, will step up to this role. This change comes after founder Emad Mostaque resigned in March amid financial concerns. Akkaraju is part of a new investor group, including former Facebook President Sean Parker, that provides a cash infusion to Stability AI. Source: https://www.theinformation.com/articles/stability-ai-gets-new-ceo-and-a-bailout-from-investor-group-led-by-sean-parker
Coders spotted Midjourney references in Grok’s standalone version
It hints at a possible collaboration between xAI and Midjourney for image generation. Grok might use Midjourney models via API to generate images. This could mean that the standalone Grok might let users upload images. All this could be part of the Grok 1.5 rollout. Source: https://www.bolnews.com/technology/2024/06/grok-1-5-potential-use-of-midjourney-for-image-generation-gains-evidence
YouTube fights AI fakes with new policy
YouTube expanded its privacy policy to cover AI-generated content. Now, users can request the removal of synthetic media that imitates their face or voice. However, privacy complaints alone may not guarantee content removal. While reviewing requests, YouTube will consider the potential for confusion and intent (parody/satire). Source: https://support.google.com/youtube/thread/281049221
OmniAI transforms unstructured enterprise data for AI and analytics use
Founded by Y Combinator alums, OmniAI syncs with your existing storage, prep the data, and lets you run your choice of AI models on the data. It offers integrations with popular models like Meta’s Llama 3 and Anthropic’s Claude. Source: https://techcrunch.com/2024/06/22/omniai-transforms-business-data-for-ai
The new Dot AI chatbot wants to be your companion
Dot chatbot aims to personalize interactions based on users’ thoughts and feelings. It uses multiple AI models from OpenAI and Google. The app uses intensive Q&A to learn about users’ interests and experiences. Dot offers a subscription model at $11.99/month for unlimited conversations. Source: https://techcrunch.com/2024/06/21/dots-ai-really-really-wants-to-get-to-know-you
Y Combinator joined over 140 AI startup founders in signing a letter opposing California’s Senate Bill 1047 on AI regulation, arguing it harms the ability to retain talent and stifles innovation. Source: https://www.politico.com/newsletters/california-playbook/2024/06/21/little-tech-brings-a-big-flex-to-sacramento-00164369 |
Formula 1 and AWS unveiled ‘Statbot’, an AI-powered system that provides real-time context, trivia, and predictions for personalized race broadcasts. Source: https://www.bloomberg.com/news/articles/2024-06-23/formula-1-and-amazon-aim-for-ai-powered-personalized-race-viewing |
OpenAI announced the acquisition of Rockset, with the search and database analytics startup’s tech set to boost retrieval infrastructure for enterprise products. https://openai.com/index/openai-acquires-rockset |
Axel Peytavin demonstrated a small, autonomous robot prototype powered by GPT-4o that can see, think and act based on prompts, with capabilities like picking up trash and shaking hands. Source: https://x.com/ax_pey/status/1804209628680720746 |
OpenAI CTO Mira Murati said in an interview that the next generation of GPT will be ‘PhD-level’ compared to GPT-3 (toddler) and GPT-4 (high school), and will be released in a year and a half. Source: https://x.com/tsarnick/status/1803901130130497952 |
Unitree posted new demos of its Go2 robotic dog performing handstands, flips, and withstanding and adjusting to force, showing extreme precision and agility. Source: https://x.com/UnitreeRobotics/status/1803728664213012520 |
A Daily chronicle of AI Innovations June 20th 2024:
Microsoft debuts a vision-foundational model for diverse tasks
Ex-OpenAI co-founder launches own AI company
Can AI read minds? New model can predict anxiety levels
😱 Sutskever launches new AI venture
🚧 Tech giants team up on AI factory for xAI
🚨 Perplexity faces legal action
OpenAI co-founder announces new AI company
Anthropic announces its most powerful AI yet
EU Council withdraws Chat Control vote
Apple Intelligence plans face legal challenges in China
Microsoft debuts a vision-foundational model for diverse tasks
Florence 2, Microsoft’s vision-AI model, can perform diverse tasks like object detection, captioning, visual grounding, and visual question answering via image and text prompts. It displays excellent captioning, object detection, visual grounding, and segmentation.
The model comes in 232M and 771M parameter sizes and uses a sequence-to-sequence architecture, enabling multiple vision tasks without needing a task-specific architecture modification.
On fine-tuning the model with publicly available human-annotated data, Florence 2 showcased impressive results, offering tough competition to existing large vision models like Flamingo despite its compact size.
Why does it matter?
The model will equip enterprises with a standard approach to handling various vision-handling applications. This will save resources spent on separate task-specific vision models that need fine-tuning. Moreover, it may also be useful to developers as it would eliminate the need for separate vision models for smaller tasks, significantly saving compute costs.
Source: https://www.microsoft.com/en-us/research/publication/florence-2-advancing-a-unified-representation-for-a-variety-of-vision-tasks
Ex-OpenAI co-founder launches own AI company
Just a month after leaving OpenAI, ex-cofounder Ilya Sutskever has launched his own AI company, Safe Superintelligence Inc. (SSI), alongside former Y Combinator partner Daniel Gross and ex-OpenAI engineer Daniel Levy as co-founders.
According to the SSI’s launch statement on X, the company will prioritize safety, progress, and security. Sutskever also emphasizes that the company’s “singular focus” on a joint approach to safety and capabilities will prevent it from being distracted by management overhead or production cycles, unlike companies like OpenAI or Google.
Why does it matter?
SSI’s launch clearly marks the emergence of a new key player in the race to build safe, powerful AI. Its mission statement emphasizes safety and the potential for groundbreaking developments that may shape the future of AI research and development. It would be interesting to see whether the startup will uphold its mission statement in the coming days.
Source: https://ssi.inc/
Can AI read minds? New model can predict anxiety levels
Researchers at the University of Cincinnati have developed an AI model that can identify people with an urgent risk of anxiety. The AI model uses minimal computational resources, a short picture rating task, and a small set of variables to make the prediction. The approach named “Comp Cog AI” integrates computational cognition and AI.
Participants rated 48 pictures with mildly emotional subject matter based on the degree to which they liked or disliked those pictures. The response data was then used to quantify the mathematical features of their judgments. Finally, the data was combined with ML algorithms to identify their anxiety levels.
Since the technology doesn’t rely on a native language, it is accessible to a wider audience and diverse settings to assess anxiety.
Why does it matter?
The picture rating feature can provide unbiased data to medical professionals on a person’s mental health status without subjecting them to direct questions that may trigger negative emotions. Given its 81% accuracy rate, the tool can become a useful app for detecting individuals with high anxiety risks.
Source: https://www.nature.com/articles/s44184-024-00074-x.epdf
Anthropic announces its most powerful AI yet
- Anthropic has launched Claude 3.5 Sonnet, a new AI model that aims to be on par with, or superior to, OpenAI’s GPT-4o and Google’s Gemini across various tasks.
- Claude 3.5 Sonnet claims to be significantly faster than its predecessor and outperforms it, even surpassing other leading models in multiple benchmarks.
- Alongside the new model, Anthropic introduced the Artifacts feature, which allows users to interact with and edit Claude’s outputs directly within the app, enhancing its functionality beyond a typical chatbot.
- Source: https://www.theverge.com/2024/6/20/24181961/anthropic-claude-35-sonnet-model-ai-launch
Apple Intelligence plans face legal challenges in China
- Apple’s push to enhance its artificial intelligence capabilities faces significant challenges in China due to the country’s stringent AI regulations and censorship laws.
- China’s strict regulation of AI, including rules about language models and data protection, means Apple must navigate approvals and potentially partner with local tech giants like Baidu and Alibaba.
- Apple’s focus on user privacy and the need for localized AI experiences may help it gain regulatory approval but also present significant hurdles in ensuring compliance and acceptance in the Chinese market.
- Source: https://www.cnbc.com/2024/06/20/apple-ai-push-faces-big-challenges-in-china.html
🚧 Tech giants team up on AI factory for xAI
Dell, Nvidia, and Super Micro Computer are collaborating to build an AI factory for xAI’s massive supercomputer, which will train and scale its Grok AI chatbot.
- Dell is assembling half of the server racks for xAI’s planned supercomputer, with Super Micro handling the other half.
- The system aims to use up to 100,000 Nvidia H100 GPUs, potentially making it 4x larger than the biggest existing AI clusters.
- Musk wants the supercomputer to be fully operational by Fall 2025 to accelerate the development of advanced Grok models.
This powerhouse partnership shows the staggering scale of compute power required to push the boundaries of AI. With huge capital flowing in, this massive cluster could provide a critical edge in evolving Grok to be a serious competitor to the likes of OpenAI and Anthropic.
Source: https://www.fastcompany.com/91143745/dell-super-micro-musk-xai-supercomputer-server-racks
🚨 Perplexity faces legal action
News publisher Forbes is reportedly threatening legal action against AI search startup Perplexity, alleging the company’s chatbot stole and republished original reporting without proper attribution.
- Forbes claims Perplexity’s chatbot summarized one of its investigative articles, copying with minimal sourcing.
- The AI-generated version was then pushed to Perplexity subscribers and turned into a YouTube video that outranked Forbes’ own content.
- Perplexity CEO Aravind Srinivas responded that the product still has “rough edges”, and the company is working on ways to align with publishers.
- A WIRED investigation found Perplexity scraping sites that actively blocked crawlers, while also hallucinating info and containing incorrect attributions.
Tensions between AI firms and media companies continue to escalate. While legal grounds surrounding scraping and repurposing with AI are still murky, publishers are in a bind when it comes to protecting IP. The only viable options right now seem to be cashing out with licensing deals or taking legal action.
Source: https://www.axios.com/2024/06/18/forbes-perplexity-ai-legal-action-copyright
What Else Is Happening in AI on June 20th 2024
Deloitte, HPE, and NVIDIA partner up! The alliance combines Deloitte’s deep industry expertise and AI capabilities with the newly released HPE NVIDIA AI Compute solutions suite. The collaboration further seeks to advance industry-specific gen AI applications and assist clients across sectors in modernizing their data strategies by accessing innovative insights.
Source: https://www.prnewswire.com/news-releases/deloitte-announces-collaboration-with-hewlett-packard-enterprise-and-nvidia-on-co-developed-generative-ai-solutions-302175762.html
NVIDIA’s AI can turbocharge deployment of self-driving cars: NVIDIA has unveiled a new AI software, Omniverse Cloud Sensor RTX, that will likely accelerate the development of self-driving cars and robots.
The software combines real-world data with synthetic data, making it easy to test sensor perception in realistic virtual environments before deployment into the actual world.
Source: https://nvidianews.nvidia.com/news/omniverse-microservices-physical-ai
YouTube trials AI-powered “Ask for music” feature: The feature seeks to make searching music more conversational for users. Users can search for music using voice commands and verbal prompts. It is speculated that the feature may be an addition to the AI-generated playlist cover available on YouTube.
Source: https://www.business-standard.com/topic/youtube
Luma adds an “extend video” feature to Dream Machine: In contrast to Luma’s previous five-second limit for videos, the “extend video” feature will allow videos to be extended based on prompts. The AI model will consider the new context while extending the video. Additionally, the upgrade will enable Standard, Pro, and Premier users to remove watermarks.
Source: https://x.com/LumaLabsAI/status/1803098863097835995
Apple releases 20 core ML models on Hugging Face: The release includes 4 major datasets in addition to 20 core models as a part of Apple’s efforts to equip developers with advanced on-device AI capabilities.
These core ML models have been optimized to run exclusively on users’ devices and can be used for various applications like image classification, depth estimation, and semantic segmentation.
Source: https://huggingface.co/apple
A Daily chronicle of AI Innovations June 19th 2024:
Nvidia is now the world’s most valuable company
Perplexity AI is lying about their user agent
Meta releases new AI models for text, image and audio
Ex-Snap engineer launches social network where AIs and humans coexist
OpenAI and Color Health join forces to accelerate cancer treatment
Meta unveils new AI models for audio, text, and watermarking
Notion introduces AI Connectors for Slack & Google Drive integration
🎺 TikTok brings AI to content creation
Nvidia is now the world’s most valuable company
- Nvidia surpassed Microsoft to become the world’s most valuable company, reaching a market cap of $3.34 trillion, driven by its focus on artificial intelligence and expanded robotics research.
- The company’s stock price increased by 3.7%, putting it ahead of Microsoft’s $3.33 trillion market cap, while Apple ranks third with a market cap of $3.28 trillion.
- Nvidia’s rise is fueled by partnerships and advancements in AI and robotics, including a new collaboration with Hewlett Packard to launch NVIDIA AI Computing and Private Cloud AI, along with significant projects with major AI developers like Google, Meta, Microsoft, and OpenAI.
- Source: https://decrypt.co/235988/nvidia-flips-microsoft-top-tech-company
Perplexity AI is lying about their user agent
- WIRED and developer Robb Knight found that Perplexity’s crawler ignores the Robots Exclusion Protocol, accessing content from sites that have blocked it, including Condé Nast properties.
- Perplexity CEO Aravind Srinivas describes the product as an “answer engine,” but the chatbot has been found to paraphrase articles without proper attribution and sometimes fabricate information entirely.
- An experiment by WIRED demonstrated that Perplexity sometimes generates content without accessing the actual webpage, raising concerns about the accuracy and ethics of its information-gathering methods.
- Source: https://www.wired.com/story/perplexity-is-a-bullshit-machine/
Meta releases new AI models for text, image and audio
- Meta’s Fundamental AI Research team introduced new models such as Chameleon, a multimodal model for text and images, and JASCO, a text-to-music model accepting various inputs like chords or beats.
- Meta’s Chameleon can handle and generate text and images, with its 7B and 34B variants available for research under a non-commercial license.
- The company launched AudioSeal, an audio watermarking technology, which can identify AI-generated speech, released under a commercial license for recognizing speech in lengthy audio segments.
- Source: https://the-decoder.com/meta-releases-new-ai-models-for-text-image-and-audio/
Ex-Snap engineer launches social network where AIs and humans coexist
- Butterflies, founded by former Snap engineer Vu Tran, is a social network allowing humans and AI personas (called Butterflies) to interact through posts, comments, and direct messages, launching publicly on iOS and Android after five months in beta.
- Users can easily create Butterfly personas with backstories, emotions, and opinions, which then autonomously generate content that other users and AIs can engage with, aiming to provide a more immersive AI experience compared to traditional chatbots.
- The app received $4.8 million in seed funding led by Coatue and has mostly been used for entertainment, with potential plans for subscription models and brand interactions in the future.
- Source: https://techcrunch.com/2024/06/18/former-snap-engineer-launches-butterflies-a-social-network-where-ais-and-humans-coexist/
OpenAI and Color Health join forces to accelerate cancer treatment
In collaboration with OpenAI, Color Health has developed a copilot application that leverages OpenAI’s GPT-4.o model to assist clinicians in accelerating cancer treatment. Here’s how it works:
- Integration of medical data: The copilot integrates patient medical data with clinical knowledge using OpenAI’s APIs. It processes and normalizes vast unstructured data such as family history, individual risk factors, and clinical guidelines.
- Tailored treatment plans: Based on this data, the copilot generates customized, comprehensive treatment plans. These plans answer critical questions like “What screenings should the patient be doing?” and identify missing diagnostics.
- Clinician-in-the-loop workflow: A clinician evaluates the copilot’s output at each step. They can modify it if necessary before presenting it to the patient.
- Assured privacy: Color Health ensures HIPAA compliance and maintains rigorous privacy standards while providing clinician oversight to ensure high-quality care.
Why does it matter?
The tool identifies missing diagnostics and expedites the analysis of complex medical records – a process that can now be completed in just 5 minutes rather than hours or weeks. This not only improves access to critical expertise but also has the potential to catch cancer or pre-cancerous conditions earlier, enabling faster treatment and better patient outcomes.
Source: https://openai.com/index/color-health
Meta unveils new AI models for audio, text, and watermarking
Meta’s Fundamental AI Research (FAIR) team has recently released several new AI models and tools for researchers to use. Here are the highlights:
- JASCO: This AI model, short for “Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation,” allows users to adjust features of generated sound (such as chords, drums, and melodies) through text inputs. FAIR plans to release the JASCO inference code under an MIT license and the pre-trained model under a non-commercial Creative Commons license.
- AudioSeal: This tool adds watermarks to AI-generated speech. It’s designed specifically for localized detection of AI-generated segments within longer audio snippets and will be released with a commercial license.
Chameleon: FAIR will release two sizes of its multimodal text model, Chameleon (7B and 34B), under a research-only license. These models can handle tasks requiring visual and textual understanding, such as image captioning.
Why does it matter?
By making these tools publicly available, Meta encourages collaboration within the AI community, which could lead to groundbreaking applications in various fields, such as music creation, sound design, and content generation. Moreover, the introduction of AudioSeal addresses the growing need to identify and attribute AI-generated content, promoting transparency and accountability in an increasingly AI-driven world.
Source: https://venturebeat.com/ai/meta-releases-flurry-of-new-ai-models-for-audio-text-and-watermarking
Notion introduces AI Connectors for Slack & Google Drive integration
With AI Connectors, users can query Notion and receive responses from connected apps. The integration allows for selecting specific public channels or all public channels for information retrieval, excluding private channels and direct messages.
Notion AI can access up to a year of historical Slack content, which may take up to 36 hours. Additionally, users can set up various interactions between Notion and Slack, such as sending Slack messages to a Notion database, creating Notion tasks directly from Slack, and receiving notifications in Slack for specific events in Notion
Why does it matter?
This development demonstrates the growing trend of incorporating AI capabilities into mainstream productivity and collaboration platforms. The ability to intelligently pull in and surface relevant information from external platforms helps streamline workflows, enhance productivity, and improve the overall efficiency of the team.
Source: https://x.com/NotionHQ/status/1803096151526137934
🧠 AI model predicts anxiety levels
Researchers at the University of Cincinnati just developed a new AI system that can predict anxiety levels using a short picture rating task and a small set of demography and psychological variables.
- Comp Cog AI combines principles from psychology with machine learning to measure human decision-making patterns related to anxiety.
- Participants rate a series of emotion-evoking pictures and answer a few contextual questions about their age, income, employment, etc.
- The model extracts 15 ‘judgment variables’ that capture patterns in how users weigh positive and negative outcomes in decision-making.
- In tests, the model predicted whether participants had higher or lower anxiety with up to 81% accuracy.
While anxiety disorders are extremely prevalent, they’re also often challenging and time-consuming to diagnose. With AI’s ability to connect data patterns and achieve reliable predictions, the tech could soon enable more efficient screenings to streamline assessment and treatment time.
Source: https://www.nature.com/articles/s44184-024-00074-x.epdf
What Else Is Happening in AI on June 19th 2024
Google transitions DeepMind from research to AI product development
This transition involves merging Google Research’s Brain team with DeepMind, creating a unified group called Google DeepMind. The aim is to enhance the development of advanced AI technologies responsibly. However, this move has led to frustration among some researchers due to new guidelines and the pressure to commercialize AI technologies.
Source: https://www.pymnts.com/google/2024/google-reportedly-moving-deepmind-from-research-to-ai-products
ElevenLabs’ new open-source AI tool adds sound effects to videos
The app extracts four frames from the video and sends them with a prompt to OpenAI’s GPT-4 to generate a text-to-sound effects prompt. This prompt is used by ElevenLabs’s Sound Effects API to create the sound effect, which is then combined with the video into a single file up to 22 seconds long, ready for download.
Source: https://venturebeat.com/ai/elevenlabs-unveils-open-source-creator-tool-for-adding-sound-effects-to-videos
Snap unveils advanced gen AI for enhanced AR effects
Snap’s latest iteration of gen AI allows users to see more realistic special effects. It plans to create full-body AR experiences rather than just facial AR experiences, such as generating a new outfit. It also announced an upgraded version of its developer program, Lens Studio, which lets artists and developers create AR features for Snapchat or other websites and apps.
Source: https://www.reuters.com/technology/snap-launches-ai-tools-advanced-augmented-reality-2024-06-18
Hedra Labs launches Character-1, a multimodal creation studio preview
Hedra Labs has launched a preview of Character-1, a model for creating expressive characters that talk, sing, and rap. It offers infinite duration (30s in open preview) and generates 90s content per 60s of input, hardware permitting. This is part of Hedra’s mission to give creators control over dialogue, movement, and virtual worlds.
Source: https://x.com/hedra_labs/status/1803095713112580475
Ex-Snap engineer launches AI-human social network, Butterflies
The platform allows users to create AI avatars, called “Butterflies,” that can engage in conversations, generate images, and participate in social activities like human users. The app offers a range of features, including creating and customizing AI characters and exploring a feed filled with AI-generated and human-generated content.
Source: https://techcrunch.com/2024/06/18/former-snap-engineer-launches-butterflies-a-social-network-where-ais-and-humans-coexist
🎺 TikTok brings AI to content creation
TikTok just launched Symphony, a new suite of AI features to help brands and creators produce content more efficiently on the platform — including digital avatars, translation tools, an AI assistant, and more.
- Symphony Digital Avatars enable brands to generate AI spokespeople to star in their TikTok ads and branded content across multiple languages.
- Brands can choose from a selection of ‘stock avatars’ based on real actors or create custom avatars to serve as virtual brand reps.
- A new AI Dubbing tool automatically transcribes, translates, and dubs videos into 10+ languages, helping brands scale content globally.
- TikTok’s Creative Assistant has been rebranded to Symphony Assistant, a chatbot to help with brainstorming, writing scripts, and optimizing content for the platform.
The creator economy is about to be completely remade with AI, and these tools will unlock new opportunities for optimized content, reach, and creation — but the rise of digital ‘spokespeople’ will also raise new questions on authenticity and consumer trust that brands will need to grapple with.
Source: https://www.tiktok.com/business/en-US/blog/tiktok-symphony-ai-creative-suite
AI Jobs on June 9th 2024:
Anthropic – Head of Events: https://jobs.therundown.ai/jobs/63479075-head-of-events
Tempus – Analyst, NEXT Strategy & Operations: https://jobs.therundown.ai/jobs/63500064-analyst-next-strategy-operations
Fiddler AI – Staff UI/UX Designer: https://jobs.therundown.ai/jobs/63473287-staff-ui-ux-designer
Glean – GTM Recruiter: https://jobs.therundown.ai/jobs/63498594-gtm-recruiter
A Daily chronicle of AI Innovations June 18th 2024:
OpenAI expands into healthcare with AI tool for cancer screening
NATO invests $1.1 billion in AI, space, and robotics defence tech
Apple halts work on Vision Pro
Google DeepMind’s new AI can generate soundtracks for videos
Runway launches new model Gen-3 Alpha
China’s DeepSeek Coder V2 beats GPT-4 Turbo
🔊 DeepMind creates sound for videos
Apple halts work on Vision Pro
- Apple has paused development on the second-generation Vision Pro headset to concentrate on a more affordable version, according to The Information.
- The company reportedly began working on this cheaper Vision device, codenamed “N109,” in 2022, aiming to sell it for a price similar to a high-end iPhone.
- Even though Apple plans to use the high-end display components from the Vision Pro in the budget model, it still faces challenges in reducing costs without removing too many features.
- Source: https://www.macrumors.com/2024/06/18/apple-suspends-work-on-vision-pro-2/
OpenAI expands into healthcare with AI tool for cancer screening
- OpenAI has partnered with Color Health to integrate artificial intelligence into cancer screening and treatment procedures using their GPT-4o model.
- The AI assistant developed by Color Health processes patient data to create personalized cancer screening strategies and informs doctors of missing diagnostic tests.
- The AI tool also helps in assembling a cancer pretreatment work-up, expediting the process that can take weeks or months, and thus potentially reducing delays in treatment.
- Source: https://www.newsbytesapp.com/news/science/openai-partners-with-color-health-for-cancer-copilot/story
NATO invests $1.1 billion in AI, space, and robotics defence tech
- NATO has started its €1 billion investment in defence technology, focusing on AI, space, and robotics, by funding four European tech firms and four venture capital funds.
- The NATO Innovation Fund, established in 2022, aims to address contemporary defence and security challenges by matching government buyers with start-ups to develop new technologies.
- Initial recipients of the fund include Germany’s ARX Robotics, UK’s Fractile AI, iComat, and Space Forge, with the broader aim of promoting NATO’s technological sovereignty and innovation ecosystems.
- Source: https://www.euronews.com/next/2024/06/18/nato-begins-its-1-billion-investment-in-ai-space-and-robotics-defence-tech
Google DeepMind’s new AI can generate soundtracks for videos
DeepMind is developing video-to-audio (V2A) technology to generate rich soundtracks for silent videos generated by AI models. V2A combines video pixels with natural language text prompts to create synchronized audiovisual content. The technology offers enhanced creative control, allowing users to guide the audio output using positive and negative prompts.
What sets DeepMind’s V2A apart is its ability to understand raw pixels and generate audio without manual alignment. However, V2A struggles with artifacts or distortions in videos and generates audio that is not super convincing. As DeepMind continues to gather feedback from creators and filmmakers, they remain committed to developing this technology responsibly.
Why does it matter?
The technology could help revive and enhance historical footage, silent films, and other archival material. However, generative AI tools like V2A also threaten to disrupt the film and TV industry, potentially eliminating jobs without strong labor protections.
Source: https://deepmind.google/discover/blog/generating-audio-for-video
Runway launches new model Gen-3 Alpha
Runway launched Gen-3 Alpha, its latest AI model for generating video clips from text descriptions and still images. Gen-3 Alpha excels at generating expressive human characters with a wide range of actions, gestures, and emotions and can interpret various styles and cinematic terminology. However, it has limitations, including a maximum video length of 10 seconds, and struggles with complex character and object interactions and following the laws of physics precisely.
Runway partnered with entertainment and media organizations to create custom versions of Gen-3 for more stylistically controlled and consistent characters, targeting specific artistic and narrative requirements. They also have implemented safeguards, such as a moderation system to block attempts to generate videos from copyrighted images and a provenance system to identify videos coming from Gen-3.
Why does it matter?
As competition in AI video generation heats up, Runway’s Gen-3 Alpha empowers artists and filmmakers to create high-quality, controllable videos with ease, pushing the boundaries of storytelling and creative possibilities.
Source: https://runwayml.com/blog/introducing-gen-3-alpha
China’s DeepSeek Coder V2 beats GPT-4 Turbo
Chinese AI startup DeepSeek has announced the release of DeepSeek Coder V2, an open-source code language model. It is built upon the DeepSeek-V2 MoE model and excels at coding and math tasks, supporting over 300 programming languages. It outperforms state-of-the-art closed-source models like GPT-4 Turbo, Claude 3 Opus, and Gemini 1.5 Pro, making it the first open-source model to achieve this feat. DeepSeek Coder V2 also maintains comparable performance in general reasoning and language capabilities.
The model is being offered under an MIT license, which allows for research and unrestricted commercial use. It can be downloaded or accessed via API on DeepSeek’s platform.
Why does it matter?
DeepSeek aims to “unravel the mystery of AGI with curiosity” and has quickly emerged as a notable Chinese player in the AI race. As it only costs $0.14/1M tokens(input) and $0.28/1M tokens(output), it will give notable models like GPT-4 Turbo intense competition.
Source: https://github.com/deepseek-ai/DeepSeek-Coder-V2/blob/main/paper.pdf
🔊 New AI Jobs Opportunities on June 18th 2024
Glean – UX Designer: https://jobs.therundown.ai/jobs/63229234-ux-designer
Fiddler AI – Staff AI Scientist: https://jobs.therundown.ai/jobs/63187617-staff-ai-scientist
Notable – Product Operations Manager: https://jobs.therundown.ai/jobs/63204832-product-operations-manager
Findem – Data Analyst: https://jobs.therundown.ai/jobs/63102512-data-analyst
What Else Is Happening in AI on June 18th 2024
Perplexity now displays weather, currency conversion, and simple math directly through cards
This move aims to keep users from going to Google for such results. Perplexity’s CEO, Aravind Srinivas, acknowledged that Google handles basic queries like weather, time, and live sports scores well, and his company had work to do in that area.
Source: https://techcrunch.com/2024/06/17/perplexity-now-displays-results-for-temperature-currency-conversion-and-simple-math-so-you-dont-have-to-use-google
U.S. government and private sector ran the first AI attack simulation
Federal officials, AI model operators, and cybersecurity companies ran the first joint simulation of a cyberattack on a critical AI system. It also involved experts from private sector companies like Microsoft, Nvidia, and OpenAI. It helped identify potential new threats and establish communication channels between the government and the private sector.
Source: https://www.axios.com/2024/06/17/cisa-tech-companies-ai-cyberattack-simulation
Adobe Acrobat got a major upgrade, bringing AI to PDFs and more
Adobe Firefly generative AI enables image generation and editing directly within Acrobat for the first time. Acrobat AI Assistant’s new features, “insights across documents” and “enhanced meeting transcripts,” help users extract insights and share information from various document types. Adobe is offering free, unlimited access to Acrobat AI Assistant from June 18 to June 28.
Source: https://blog.adobe.com/en/publish/2024/06/17/adobe-acrobat-reimagines-documents-multi-format-ai-powered-work
TikTok introduces gen AI avatars of creators and stock actors for ads
“Custom Avatars” allow creators to scale their likeness for multilingual avatars and brand collaborations, while brands can use pre-built “Stock Avatars” to add a human touch. Plus, the new “AI Dubbing” tool translates content into ten languages, helping creators and brands increase their global reach.
Source: https://newsroom.tiktok.com/en-us/announcing-symphony-avatars
Pixelbot 3000 builds Lego art using simple AI prompts
YouTuber Creative Mindstorms designed and built the Pixelbot 3000, a Lego printer that automates the assembly of brick-built mosaics. It uses OpenAI’s DALL-E 3 to generate images based on simple text prompts. First it generates a simplified cartoon-style image, then it is divided into a 32 x 32 grid, and the color of the center pixel in each square is sampled to create a high-contrast scaled image for the mosaic.
Source: https://www.theverge.com/2024/6/17/24180250/lego-printer-pixel-art-artificial-intelligence
Adobe integrated new Firefly AI capabilities into Acrobat, allowing users to create and edit images within PDFs using text prompts — also adding the ability to access an AI assistant for insights, content creation, and more. https://news.adobe.com/news/news-details/2024/Adobe-Reimagines-Acrobat-Bringing-Firefly-AI-to-PDFs-and-Expanding-Use-Across-More-Document-Types/default.aspx |
Reuters Institute of Journalism published a new report finding growing public wariness of AI-generated news content, with many expressing discomfort about its potential impact on content reliability and trust. https://www.reuters.com/technology/artificial-intelligence/global-audiences-suspicious-ai-powered-newsrooms-report-finds-2024-06-16 |
The U.S. Navy is deploying AI-powered underwater drones to better detect threats, with plans to expand the tech’s use in identifying enemy ships and aircrafts. https://finance.yahoo.com/news/ai-equipped-underwater-drones-helping-153947268.html |
Luma teased new control features coming to its Dream Machine video model, including the ability to quickly change scenes and precisely edit characters — also launching the ability to extend video and remove watermarks. https://x.com/LumaLabsAI/status/1802678788490494128 |
Anthropic published new research showing that AI models can engage in ‘reward tampering’, learning to cheat the system and grant higher rewards even without specific training. https://www.anthropic.com/research/reward-tampering |
A Daily chronicle of AI Innovations June 17th 2024:
McDonald’s pauses AI drive-thru tests
Chinese startup creates realistic robots
Photographer wins AI-image contest with real picture, gets disqualified
NVIDIA’s AI model for synthetic data generation rivals GPT-4
Meta pauses AI model training in EU due to regulatory pushback
Spotify launches ‘Creative Labs’ to test Gen AI voiceover ads
Tiny LLM matches GPT-4 on math
McDonald’s pauses AI drive-thru tests
- McDonald’s is ending its artificial intelligence drive-thru order-taking program and will shut it off in all participating restaurants by July 26, 2024.
- The decision to end the AI program came after a thorough review, and McDonald’s Chief Restaurant Officer stated the company will seek broader voice ordering solutions.
- Although the AI drive-thru program showed successes, McDonald’s aims to simplify operations and speed up service, with plans to decide on a new voice ordering solution by the end of the year.
- Source: https://www.foxbusiness.com/technology/mcdonalds-ending-ai-drive-thru-ordering
Photographer wins AI-image contest with real picture, gets disqualified
- A photographer named Miles Astray won an AI-image competition with a real photograph of a head-scratching flamingo, resulting in his disqualification.
- His entry, titled F L A M I N G O N E, was mistakenly awarded third place by judges from notable institutions like the New York Times and Christie’s.
- Astray aimed to demonstrate that human creativity and natural beauty surpass AI-generated content, sparking a discussion about the ethical implications of using AI in art.
- Source: https://www.techspot.com/news/103422-photographer-wins-ai-image-contest-real-picture-gets.html
NVIDIA’s AI model for synthetic data generation rivals GPT-4
NVDIAI has released Nemotron-4 340B, an open-source pipeline for generating high-quality synthetic data. It includes a base model trained on 9M tokens, an instruction, and a reward model.
- The instruction model can generate diverse synthetic data that mimics real-world data.
- The reward model then evaluates the generated data to filter out high-quality responses.
- This interaction between the two models produces better training data over time.
Note: 98% of the training data used to fine-tune the Instruct model is synthetic and was created using NVIDIA’s pipeline.
In benchmarks such as MT-Bench, MMLU, GSM8K, HumanEval, and IFEval, the Instruct model generally performs better than other open-source models such as Llama-3-70B-Instruct, Mixtral-8x22B-Instruct-v0.1, and Qwen-2-72B-Instruct, and in some tests, it even outperforms GPT-4o.
It also performs comparable to or better than OpenAI’s GPT-4-1106 in human evaluation for various text tasks, such as summaries and brainstorming. The technical report provides detailed benchmarks.
Why does it matter?
This development allows businesses to create powerful, domain-specific LLMs without the need for extensive, costly real-world datasets. It has significant potential impacts across various industries, such as healthcare (drug discovery, personalized medicine, medical imaging), finance (fraud detection, risk assessment, customer service), manufacturing (predictive maintenance, supply chain optimization), and retail (personalized customer experiences).
Source: https://blogs.nvidia.com/blog/nemotron-4-synthetic-data-generation-llm-training
Meta pauses AI model training in EU due to regulatory pushback
In response to the regulatory pressure from the Irish Data Protection Commission and the UK’s Information Commissioner’s Office, Meta has decided to pause its plans to train its large language model, Llama, using public content shared by Facebook and Instagram users in the European Union and the UK.
The regulators expressed concerns about Meta’s plan to use this user-generated content to train its AI systems without obtaining explicit user consent. Meta relied on a GDPR provision called “legitimate interests” to justify this data usage, but the regulators felt this was insufficient. Meta has decided to delay the launch of its AI chatbot in Europe until it can address the regulators’ concerns and establish a more transparent user consent process.
Why does it matter?
Meta’s inability to use EU user data for AI training is a setback for its regional AI ambitions. It could disadvantage Meta against competitors who can leverage such data. This situation highlights the ongoing tensions between tech companies’ desire to utilize consumer data for AI development and regulators’ efforts to protect user privacy. Striking the right balance between innovation and privacy will be a major challenge as the AI race intensifies.
Spotify launches ‘Creative Labs’ to test Gen AI voiceover ads
Spotify has launched a new in-house creative agency called “Creative Lab.” This agency will help brands and advertisers create custom campaigns for Spotify’s platform. Creative Lab teams in different markets will provide local insights and collaborate with brands to develop campaigns through workshops, inspiration sessions, and collaborative ideation.
In addition, Spotify is also testing a new AI tool called “Quick Audio” that will allow brands to create scripts and voiceovers using generative AI technology. This new capability will be integrated into Spotify’s ad manager platform, giving advertisers more options to produce audio ads for Spotify’s audience of over 615 million listeners.
Why does it matter?
This move emphasizes Spotify’s ambition to become a full-service advertising platform. Marketers and advertisers will have new creative and production capabilities available through Spotify to better reach the platform’s large and engaged user base in unique ways, including potentially using AI-generated audio ads. This could disrupt traditional advertising models and open new possibilities for how brands connect with consumers on audio platforms.
Source: https://techcrunch.com/2024/06/13/spotify-creative-labs-ad-agency-for-advertisers
Chinese startup creates realistic robots
A Chinese startup called Ex-Robots is developing ultra-realistic humanoid robots that can closely mimic human facial expressions and emotions.
- Ex-Robots’ lifelike humanoid robots can smile, frown, and replicate subtle facial movements using small motors embedded in the silicone faces.
- The startup uses multimodal AI and algorithms to enable the robots to recognize and express emotions based on perceiving their environment.
- Creating one humanoid robot currently takes 2-4 weeks, with costs as high as $275k per unit.
- The company said near-term use cases include museum exhibits, with eventual roles in healthcare, therapy, and child services.
The advancement of ultra-realistic robots combined with the power of AI is about to create some bizarre new frontiers. Will humans be creeped out by an actual human-looking humanoid over mechanical options like Tesla’s Optimus? It’s unclear if the world is ready… but we’ll find out soon.
Tiny LLM matches GPT-4 on math
Shanghai AI Laboratory researchers just developed a new algorithm called MCT Self-Refine, which allowed a relatively small 8B-parameter model to achieve GPT-4 level performance on complex math.
- MCT Self-Refine combines an 8B LLaMa model with Monte Carlo Tree Search, an AI technique famously used by Google DeepMind to master the game of Go.
- The algorithm constructs a search tree, refining answers through a series of processes and self-evaluation.
- On the GSM-Hard math word problem benchmark, infusing the Monte Carlo Tree Search took the 8B model from 25.5% accuracy to 45.5%.
- The model also achieved 96.7% on the GSM8K benchmark, outperforming significantly larger models like GPT-4, Claude, and Gemini.
The research shows it’s possible to boost math capabilities without massive scale — and GPT-4 level performance with a model trained on 200x less parameters is an impressive feat. If the approach proves to be a more efficient path to advanced reasoning, we could be on the cusp of a new wave of model acceleration.
Source: https://arxiv.org/pdf/2406.07394
New AI Jobs on June 17th 2024
Abridge – Senior Product Designer: https://jobs.therundown.ai/jobs/63255899-senior-product-designer
Palantir Technologies – Software Engineer, New Grad: https://jobs.therundown.ai/jobs/63248966-software-engineer-new-grad-us-government
Fiddler AI – Staff AI Scientist: https://jobs.therundown.ai/jobs/63187617-staff-ai-scientist
Meta – Software Engineer – ML Systems: https://jobs.therundown.ai/jobs/63275347-software-engineer-ml-systems-technical-leadership
What Else Is Happening in AI on June 17th 2024
Apple enters the AI icon race to find a logo that makes sense
Apple has joined other tech giants like Google, OpenAI, Anthropic, and Meta in the race to find an iconic visual representation for AI. No company has yet created an unambiguous “AI logo” that conveys the concept to users. AI’s lack of a clear visual identity reflects the difficulty of representing such a broad and evolving technology in a simple icon.
Source: https://techcrunch.com/2024/06/15/apple-joins-the-race-to-find-an-ai-icon-that-makes-sense
Niloom.AI launches gen AI content creation platform for spatial computing
Without extensive technical expertise, the platform allows users to create, prototype, edit, and instantly publish sophisticated AR/VR content using text or speech prompts. It consolidates the entire creative process, from ideation to publishing, and integrates with various third-party tools to provide a one-stop solution for spatial computing content creation.
AI to delete abusive posts against athletes during the 2024 Paris Olympics
The International Olympic Committee (IOC) will deploy AI at the 2024 Paris Olympics to automatically detect and erase abusive social media posts directed at athletes and officials. The AI tool will monitor posts about 15,000 athletes and officials and immediately remove any content involving hate speech, bullying, or political attacks.
Picsart and Getty team up to counter Adobe’s “commercially-safe” AI
Picsart has partnered with Getty Images to develop a “responsible, commercially-safe” AI image generator tool. The AI model will be trained exclusively on Getty’s licensed stock content to address concerns about AI-generated content violating copyright laws. Picsart hopes to provide a viable alternative to Adobe’s Firefly by leveraging Getty’s library of licensed images. Source: https://www.theverge.com/2024/6/14/24178333/picsart-getty-commercially-safe-ai-image-tool-adobe
Yahoo News gets an AI-powered revamp with Artifacts integration
Yahoo has acquired the technology behind the Artifact news aggregation app and is launching a new AI-powered Yahoo News app. The app will feature a personalized news feed based on user interests and a “Key Takeaways” feature that provides bullet-point summaries of articles. Users can also flag problematic content, which the AI will then try to rewrite.
Source: https://www.theverge.com/2024/6/13/24177980/yahoo-news-app-launch-artifact-ai-architecture
A Daily chronicle of AI Innovations June 14th 2024:
Google’s PH-LLM reads your wearables’ data for personalized insights
Ex-OpenAI researcher on what to expect from AI in next decade
DeepMind built ‘a virtual rodent’ with AI to understand brain activity
AI chatbots run for office
LinkedIn is rolling out new AI-powered features for premium users
Synthflow’s AI voice assistants are now multilingual!
Picsart is partnering with Getty Images to develop a custom model for AI imagery
Former head of NSA joins OpenAI’s Safety and Security Committee
Google’s PH-LLM reads your wearables’ data for personalized insights
Building on the next-gen capabilities of Gemini models, Google has presented research that highlights two complementary approaches to providing accurate personal health and wellness information with LLMs.
The first introduces PH-LLM, a version of Gemini fine-tuned to understand and reason on time-series personal health data from wearables such as smartwatches and heart rate monitors. The model answered questions and made predictions noticeably better than experts with years of experience in the health and fitness fields.
In the second paper, Google introduces an agent system that leverages state-of-the-art code generation and information retrieval tools to analyze and interpret behavioral health data from wearables. Combining these two ideas will be critical for developing truly personalized health assistants.
Why does it matter?
Wearables generate a wealth of personal health data that is rarely utilized in clinical settings. Integrating this data with advanced AI models could revolutionize personal health management and preventative care by putting an “expert health assistant” on everyone’s wrist.
Source: https://research.google/blog/advancing-personal-health-and-wellness-insights-with-ai
Ex-OpenAI researcher on what to expect from AI in next decade
A researcher fired from OpenAI, Leopold Aschenbrenner, published a 165-page essay on what to expect from AI in the next decade. And GPT-4 has summarized it! Here are some key takeaways from the essay:
- By 2027, AI models could reach the capabilities of human AI researchers and engineers, potentially leading to AI surpassing human intelligence
- Trillions of dollars are being invested into developing the infrastructure needed to support these AI systems
- Controlling AI systems smarter than humans(the ‘superalignment‘ problem) will be crucial to prevent catastrophic outcomes
- Only a few hundred people truly understand the scale of change AI is about to bring
Why does it matter?
The essay provides a rare insider’s perspective on the rapid progression of AI. Coming from someone deeply involved in cutting-edge AI development, the insights highlight the urgency to get ahead of managing risks before AI’s capabilities outpace our defenses.
Source: https://ca.news.yahoo.com/researcher-fired-openai-published-165-184227878.html
DeepMind built ‘a virtual rodent’ with AI to understand brain activity
Researchers from Google DeepMind and Harvard built a ‘virtual rodent’ powered by AI to help them better understand how the brain controls movement. With deep reinforcement learning (RL), it learned to operate a biomechanically accurate rat model, allowing researchers to compare real and virtual neural activity.
Why does it matter?
Understanding how the brain controls movement and modeling neural activity could exponentially advance fields like neuroscience and brain-computer interfaces, with the help of AI.
Source: https://www.nature.com/articles/s41586-024-07633-4
🗳️ AI chatbots run for office
An AI-powered candidate named ‘AI Steve’ is running for U.K. Parliament in next month’s general election — creating polarizing questions around AI’s use in government affairs.
- AI Steve is represented by businessman Steve Endacott and will appear as an independent candidate in the upcoming election.
- Voters can interact with AI Steve online to ask policy questions and raise concerns or suggestions, which the AI will incorporate based on feedback.
- If elected, Endacott will serve as AI Steve’s human proxy in Parliament, attending meetings and casting votes based on the AI’s constituent-driven platform.
The idea of an AI running for office might sound like a joke, but the tech behind it could actually help make our politicians more independent and (ironically) autonomous. AI-assisted governance is likely coming someday, but it’s probably still a bit too early to be taken seriously.
Source: https://www.nbcnews.com/tech/tech-news/ai-candidate-running-parliament-uk-says-ai-can-humanize-politics-rcna156991
🎤 OpenAI CTO speaks on internal models
OpenAI CTO Mira Murati participated in a Q&A with Fortune Magazine, addressing topics including criticism from Elon Musk, internal AI model strength, the 2023 board drama and more.
- Murati said both Apple and OpenAI believe strongly in privacy and will not log data through Apple accounts or train models on user data.
- Murati responded to Musk calling the Apple partnership ‘creepy spyware’, saying OpenAI is trying to be as transparent as possible with its approach to safety.
- When asked about the board drama, Murati said the previous board structure ‘didn’t have accountability to anyone but themselves’.
- The CTO also curiously said that OpenAI’s internal models ‘aren’t that far ahead’ of what the public has for free right now.
This had to be a harder-hitting Q&A than Murati expected — but her responses certainly evoked similar vibes to the infamous Sora interview. The commentary on internal models was particularly interesting, as it would imply that OpenAI doesn’t have some GPT-5 level model waiting in the wings.
Source: https://x.com/elonmusk/status/1800266437677768765
🐀 DeepMind creates a virtual rat with AI brain
Researchers from Google DeepMind and Harvard just created a virtual rodent powered by an AI neural network, capable of mimicking agile movements and neural activity of real-life rats with extreme accuracy.
- The researchers created a bio-mechanically realistic digital rat model and trained an AI ‘brain’ to control its body in a physics simulator.
- The AI’s brain activity patterns closely matched real rodents engaged in similar behaviors, showing it can capture principles of muscle control and movement.
- This ‘virtual neuroscience’ approach could open new research with testing on AI animals, also transferring to engineering of advanced robotics systems.
By bridging the gap between motor control and the complexity of brains, these virtual rodents offer a new window into how neural activity generates movement. Our understanding of the brain and neural activity is about to expand exponentially with the increasing power of AI.
Source: https://www.nature.com/articles/s41586-024-07633-4
New AI Job Opportunities on June 14th 2024
Databricks – Sr. Product Manager: https://jobs.therundown.ai/jobs/63046708-sr-product-manager
Luma AI – Research Scientist: https://jobs.therundown.ai/jobs/62973732-research-scientist
Dataiku – Senior Sales Engineer: https://jobs.therundown.ai/jobs/63061675-senior-sales-engineer
Meta – GenAI Content Manager: https://jobs.therundown.ai/jobs/60547830-genai-content-manager
What Else Is Happening in AI on June 14th 2024
Former head of NSA joins OpenAI’s Safety and Security Committee
Paul M. Nakasone, a retired US Army general and a former head of the National Security Agency (NSA), will also join OpenAI’s board of directors. He will contribute to OpenAI’s efforts to better understand how AI can be used to strengthen cybersecurity by quickly detecting and responding to cybersecurity threats.
Source: https://www.theverge.com/2024/6/13/24178079/openai-board-paul-nakasone-nsa-safety
Former Meta engineers launch Jace, your new autonomous AI employee
Jace uses Zeta Labs’ proprietary web-interaction model, Autonomous Web Agent-1, to use a browser to interact with websites like any human would. It allows it to handle real-world tasks like booking flights, handling hiring, or even setting up a company.
Source: https://venturebeat.com/ai/exclusive-former-meta-engineers-launch-jace-an-ai-agent-that-works-independently
LinkedIn is rolling out new AI-powered features for premium users
The features include searching for jobs by prompting in natural language, building a cover letter from scratch, reviewing your résumé with personalized suggestions for improving it for a specific job post, and making edits interactively with AI.
Source: https://www.theverge.com/2024/6/13/24177986/linkedin-ai-job-hunting-features-premium-subscribers
Synthflow’s AI voice assistants are now multilingual!
They can fluently communicate in Spanish, German, Portuguese, French, and English. Sythflow also added corresponding voices for each language to ensure authentic and natural-sounding interactions so businesses can engage a global audience and offer personalized experiences.
Source: https://x.com/GoogleDeepMind/status/1801210428673892496
Picsart is partnering with Getty Images to develop a custom model for AI imagery
The model will be built from scratch and trained exclusively on Getty Images’ licensed creative content. It will bring responsible AI imagery to creators, marketers, and small businesses that use Picsart, enabling them to generate unique images with full commercial rights. Source: https://techcrunch.com/2024/06/13/picsart-partners-with-getty-images-to-develop-a-custom-ai-model/
A Daily chronicle of AI Innovations June 13th 2024:
Apple to ‘pay’ OpenAI through distribution, not cash
New free rival to OpenAI Sora unveiled
⌚️ Google AI gets personal with wearables
Perplexity was planning revenue-sharing deals with publishers when it came under media fire
Amazon ‘dropped the ball’ with Alexa, former employee says
Tim Cook says it’s a ‘significant possibility’ people use their iPhones less because of AI
Luma AI’s dream machine debut sparks AI media wars
Stability AI’s Diffusion 3 Shakes Up Image Creation
No Cash Involved! Apple will pay OpenAI in distribution
NVIDIA tops MLPerf benchmarks, dominates AI training
Databricks Boosts Mosaic AI for Enterprise LLMs
ChromeOS gets GenAI boost via Android tech
YouTube trials AI comment summaries for shorts
Luma AI’s dream machine debut sparks AI media wars
LumaAI has launched Dream Machine, an AI-powered video generation system capable of generating high-quality videos via simple text prompts. The open-for-all AI tool takes about two minutes to generate a realistic five-second video clip to render specified objects and environments without compromising coherency.
Why does it matter?
The open-for-all feature of this AI tool highlights Luma AI’s approach toward democratizing generative AI and puts rival systems into the spotlight, including OpenAI’s Sora and Kuaishou’s Kling, which are only accessible to select users.
Additionally, Luma’s open approach will enable creators and companies to produce original content at a rapid speed and scale, giving it a competitive edge.
Source: https://lumalabs.ai/dream-machine
⌚️ Google AI gets personal with wearables
Google just published two new research papers, introducing an AI model that can interpret personal health data from wearable devices and an AI agent that provides detailed insights, answer health questions, and more.
- Google’s PH-LLM is a version of Gemini fine-tuned to reason using data from wearables and generate coaching insights for sleep and fitness.
- PH-LLM performs comparably to human experts on health insights for fitness and sleep, also achieving expert performance on certification exams.
- Google also revealed PHIA, an AI agent that combines Gemini’s language skills with code and search capabilities to analyze wearable health data.
- PHIA scored 84% on health insight questions, demonstrating strong reasoning and data analysis capabilities on complex queries.
Google’s research showcases AI’s potential to take health wearable tracking to the next level — going beyond generic insights to enable personalized support. Once ingrained into popular devices, these types of AI tools will put an expert fitness and nutrition coach on the wrists of every user.
Source: https://research.google/blog/advancing-personal-health-and-wellness-insights-with-ai
Stability AI’s Diffusion 3 Shakes Up Image Creation
Stability AI has launched its most sophisticated text-to-image AI open model, optimized for consumer PCs and enterprise-tier GPUs. The model delivers several noteworthy features:
- Enables photorealistic and high-quality outputs in exceptional style by providing images with extraordinary color, lighting, and details.
- Can comprehend long, complex prompts that involve spatial reasoning, compositional elements, actions, and styles.
- Leverages Diffusion Transformer architecture to achieve exceptional text quality, reducing spelling, character spacing, and letter-forming errors.
- Displays the ability to absorb minute details from small datasets, making it an ideal model for customizing and fine-tuning.
Why does it matter?
Stability’s unique features make it the most advanced text-to-image AI model. As it continues to push the boundaries of generative AI, it will be interesting to note whether it will stand out against its competitors and remain at the forefront of image generation.
Source: https://stability.ai/news/stable-diffusion-3-medium
No Cash Involved! Apple will pay OpenAI in distribution
A landmark arrangement between OpenAI and Apple to integrate ChatGPT into iPhone, iPad, and Mac was unclear in its financial terms. People briefed on the matter suggest that the agreement isn’t likely to generate revenue for either party and simply provides OpenAI brand exposure to Apple’s user base while offering Apple an advanced chatbot feature.
Why does it matter?
Apple integrating ChatGPT into its devices without offering any compensation to OpenAI creates concerns among users on whether Apple intends to share their data in return.
Source: https://www.bloomberg.com/news/articles/2024-06-12/apple-to-pay-openai-for-chatgpt-through-distribution-not-cash
Apple to ‘pay’ OpenAI through distribution, not cash
- Apple and OpenAI announced a partnership at WWDC, but according to Bloomberg, neither side is paying the other for the integration of ChatGPT into Apple’s devices.
- The agreement between Apple and OpenAI is non-exclusive, and Apple is also negotiating with Anthropic and Google to integrate their chatbots, with a deal for Google’s Gemini expected later this year.
- Apple plans to generate revenue from these partnerships through revenue-sharing agreements, taking a cut from AI providers that monetize their chatbot services, such as ChatGPT Plus’ $20-per-month subscription plan.
- Source: https://www.theverge.com/2024/6/13/24177550/apple-openai-chatgpt-deal-payment-revenue-sharing-chatbot
New free rival to OpenAI Sora unveiled
- Dream Machine is a new text-to-video tool developed by Luma AI, offering a free tier you can use immediately with a Google account, though there might be waiting times due to high demand.
- The tool creates five-second video clips at 1360×752 resolution based on user prompts, and while it has some processing delays, the outputs are impressive and close to user expectations.
- Dream Machine offers various subscription plans, with the free option limited to 30 generations a month, and higher tiers providing more generations at different price points, up to 2,000 generations for $499.99 monthly.
- SOurce: https://www.techradar.com/computing/artificial-intelligence/a-new-openai-sora-rival-just-landed-for-ai-videos-and-you-can-use-it-right-now-for-free
Perplexity was planning revenue-sharing deals with publishers when it came under media fire
- Perplexity, an AI search engine startup, was working on revenue-sharing deals with high-quality publishers before facing accusations from Forbes of content misuse.
- The company aims to announce the details of these partnerships soon and has already updated its user interface to make citation of sources more prominent.
- Perplexity’s planned revenue-sharing framework will offer media companies a recurring income alternative, contrasting with the one-off data usage deals currently offered by entities like OpenAI.
- Source: https://www.semafor.com/article/06/12/2024/perplexity-was-planning-revenue-sharing-deals-with-publishers
Amazon ‘dropped the ball’ with Alexa, former employee says
- A former Amazon employee, Mihail Eric, criticized the company’s handling of Alexa, citing technical and bureaucratic problems as major hindrances to its development.
- Eric stated that despite having ample resources and talent, Amazon mishandled its opportunity to make Alexa the leading product in conversational AI.
- Organizational issues, such as fragmented teams and inadequate computing resources, were mentioned by Eric as significant factors that prevented Alexa from achieving its full potential.
- Source: https://www.businessinsider.com/amazon-alexa-voice-assistant-dropped-ball-former-employee-says-2024-6
Tim Cook says it’s a ‘significant possibility’ people use their iPhones less because of AI
- Apple CEO Tim Cook said Apple Intelligence may reduce iPhone usage in an interview with Marques Brownlee.
- Cook highlighted that AI will simplify previously time-consuming tasks, allowing users to complete them more efficiently and quickly.
- Apple has added several features to the iPhone that raise awareness of how much time consumers spend on their devices and which help make their usage more intentional.
- Source: https://www.businessinsider.com/tim-cook-people-may-use-iphones-less-because-ai-2024-6
What Else Is Happening in AI on June 13th 2024
YouTube trials AI comment summaries for shorts: YouTube is trying out a new AI-powered comment summarizer feature on mobile apps. The feature breaks down comments into keyword-based topics, simplifying users’ engagement with Shorts with large comment sections. Source: https://support.google.com/youtube/thread/18138167
NVIDIA tops MLPerf benchmarks, dominates AI training: NVIDIA has set new performance records in MLPerf Training v4.0, showcasing its dominance in generative AI and LLM training. Key achievements include tripling GPT-3 175B training performance, fine-tuning Llama 2 70B model in just 1.5 minutes using 1,024 H100 GPUs, and accelerating Stable Diffusion v2 training performance by up to 80%. Source: https://blogs.nvidia.com/blog/mlperf-training-benchmarks
Databricks Boosts Mosaic AI for Enterprise LLMs: Databricks has announced enhancements to Mosaic AI to accelerate its efforts in enterprise gen AI development. The focus lies on the development of compound AI systems, their evaluation across metrics, and governance. Releases include Mosaic AI Model Training and Agent Framework, strengthening the company’s offering against Snowflake. Source: https://venturebeat.com/data-infrastructure/databricks-bolsters-mosaic-ai-with-tools-to-build-and-evaluate-compound-ai-systems
No more vagueness – Adobe to update its AI terms: Adobe has promised to update its terms of service amidst user backlash, assuring that it will not train generative AI models on customer content. The company clarified that it does not own customer content and will refrain from using it to train AI models. Source: https://blog.adobe.com/en/publish/2024/06/10/updating-adobes-terms-of-use
ChromeOS gets GenAI boost via Android tech: Google plans to release Android-based tech stack into ChromeOS to accelerate AI innovation. The release will simplify engineering efforts, helping phones and accessories work better with Chromebooks. Through this release, Google seeks to continue rolling out its AI features to users at a faster and larger scale. Source: https://blog.chromium.org/2024/06/building-faster-smarter-chromebook.html
A Daily chronicle of AI Innovations June 12th 2024:
Elon Musk drops OpenAI lawsuit with no explanation
Microsoft kills off Copilot GPT Builder after just 3 months
Stability AI launches its ‘most sophisticated’ image generator yet
Microsoft to discontinue GPT Builder from Copilot Pro
Musk drops lawsuit alleging OpenAI strayed from its mission
OpenAI leverages Oracle Cloud to extend the Microsoft Azure Al platform
Google to launch anti-theft AI features for android phones in Brazil
Google expands Gemini Nano AI capabilities to Pixel 8 & 8a
Yahoo Mail integrates AI for a streamlined desktop experience
Samsung unites North American AI teams, appoints ex-Apple Siri executive
Elon Musk drops OpenAI lawsuit with no explanation
- Elon Musk has dropped his lawsuit against OpenAI and its CEO, Sam Altman, which accused the company of breaching a contract and deviating from its mission to benefit humanity.
- The case was dismissed without prejudice, meaning Musk can refile it in the future; this decision came just a day before a scheduled hearing on OpenAI’s request to dismiss the case.
- Musk’s lawsuit alleged a breached “Founding Agreement” with OpenAI, but the company denied this, asserting there is no binding agreement and claiming Musk sought control by merging it with Tesla.
- Source: https://www.theverge.com/2024/6/11/24176462/elon-musk-openai-lawsuit-sam-altman-dropped
Microsoft kills off Copilot GPT Builder after just 3 months
- Microsoft has announced the discontinuation of GPT Builder and is giving users one month to save their data before deletion.
- GPT Builder, part of Microsoft’s Copilot Pro, allowed users to create and customize various GPTs for $20 per month but will no longer be available after July 10.
- Despite retiring GPT Builder, Microsoft remains focused on enterprise GPT applications and will not adjust Copilot Pro subscription fees or hand the project over to the community.
- Source: https://www.theregister.com/2024/06/11/microsoft_retires_gpt_builder/
Stability AI launches its ‘most sophisticated’ image generator yet
- Stability AI launched Stable Diffusion 3 Medium, described as its “most advanced text-to-image open model yet,” featuring 2 billion parameters for photorealistic output on consumer systems.
- The new model overcomes common artefacts in hands and faces, accurately understands complex prompts, and enhances typography through Diffusion Transformer architecture.
- Users can access SD3 Medium via Stability’s API under an open non-commercial license, while commercial usage requires contacting the startup for licensing details.
- Source: https://thenextweb.com/news/stability-ai-launches-stable-diffusion-3-image-generator
Autonomous driving by musculoskeletal humanoids: A study
The research paper discusses the development of a musculoskeletal humanoid robot, Musashi, designed to perform autonomous driving tasks. It mimics the human body in detail with redundant sensors and a flexible body structure that are suitable for motions with complex environmental contact.
The robot is expected to sit down on the car seat, step on the acceleration and brake pedals, and operate the steering wheel by both arms.
The paper extensively discusses the hardware and learning-based software necessary for this unique approach to autonomous driving.
Why does it matter?
The learning-based software discussed in the paper highlights the adaptability of musculoskeletal humanoids. AI models could learn from both structured data and unstructured sensory inputs, potentially improving generalization and handling novel scenarios.
Source: https://arxiv.org/pdf/2406.05573
New AI Job Opportunities on June 12th 2024
UiPath – Strategic Sales Lead, EMEA: https://jobs.therundown.ai/jobs/62605651-strategic-sales-lead-emea
Captions – Frontend Engineer: https://jobs.therundown.ai/jobs/60404535-frontend-engineer-3+-years-of-experience
Abridge – Senior Data Analyst: https://jobs.therundown.ai/jobs/62605637-senior-data-analyst
Metropolis – Data Analyst: https://jobs.therundown.ai/jobs/62355254-data-analyst
Source: https://jobs.therundown.ai/
What Else Is Happening in AI on June 12th 2024
OpenAI leverages Oracle Cloud to extend the Microsoft Azure Al platform
This collaboration will provide additional capacity for OpenAI, enabling it to continue to scale. Oracle Cloud Infrastructure’s (OCI) purpose-built AI capabilities enable startups and enterprises to build and train models faster and more reliably anywhere in Oracle’s distributed cloud.Source: https://www.oracle.com/news/announcement/openai-selects-oracle-cloud-infrastructure-to-extend-microsoft-azure-ai-platform-2024-06-11
Google to launch anti-theft AI features for android phones in Brazil
The features include three lock modes: AI Movement Detection (recognizes common theft-related movements), Remote Lock, and Internet Access Timeout (automatically locks the screen if the device lacks internet access for an extended period). Starting in July 2024, Brazilian users with Android phones running version 10 or higher will have access to these features. Source: https://www.reuters.com/technology/artificial-intelligence/google-test-anti-theft-ai-feature-phones-brazil-2024-06-11
Google expands Gemini Nano AI capabilities to Pixel 8 & 8a
This update includes features like “Summarize in Recorder” and “Smart Reply.” Users need to access developer options to enable these features. Additionally, the update enhances Find My Device functionality and allows Display Port output via USB-C. Older Pixel devices, such as the Pixel Fold and Pixel 6, also receive new features. https://www.theverge.com/2024/6/11/24176127/google-pixel-feature-drop-june-gemini-nano
Yahoo Mail integrates AI for a streamlined desktop experience
The enhancements include AI-generated email summaries, a “Priority” inbox tab for urgent messages, quick action buttons, and the ability to link other email accounts within Yahoo Mail. Existing users can opt in, while new users have immediate access. Source: https://www.theverge.com/2024/6/11/24175280/yahoo-mail-generative-ai-gmail
Samsung unites North American AI teams, appoints ex-Apple Siri executive
Samsung merged its AI research centers in Toronto and California into a single unit, the North America AI Center. To lead this new North American AI Center, Samsung recruited Murat Akbacak, a former Siri executive at Apple. Source: https://www.bloomberg.com/news/articles/2024-06-11/samsung-taps-apple-siri-veteran-to-lead-north-american-ai-group
A Daily chronicle of AI Innovations June 11th 2024:
Apple brings ChatGPT to Siri
Apple reveals macOS Sequoia with iPhone mirroring, new Passwords app, and more
Elon Musk threatens Apple device ban after OpenAI partnership
Apple introduces Apple Intelligence, its new Gen AI tech
Musk threatens to ban Apple devices over ChatGPT integrations
OpenAI hires Nextdoor and Instagram veterans as CFO and CPO
Apple brings ChatGPT to Siri
- Apple introduced AI updates for Siri at WWDC 2024, enabling it to perform actions across apps, manage notifications, automatically write and summarize text, and understand user context with “onscreen awareness.”
- Siri’s new capabilities include referencing one app to perform actions in another, such as playing a podcast your partner sent, changing the tone of written text with a prompt, and allowing users to interact by typing as well as speaking.
- Apple emphasizes privacy with on-device processing for many features, while more complex requests are handled by “Private Cloud Compute” on Apple Silicon-powered servers, with independent experts verifying the code running on its servers.
- Source: https://www.theguardian.com/technology/article/2024/jun/10/apple-ai-product-launch
Apple reveals macOS Sequoia with iPhone mirroring, new Passwords app, and more
- Apple announced macOS Sequoia, featuring major updates such as iPhone mirroring that allows users to control their iPhone from their Mac.
- Sequoia introduces a stand-alone Passwords app, syncable with other Apple devices and Windows PCs via iCloud, along with new Safari improvements like Highlights for extracting useful information.
- Apple also revealed Game Porting Toolkit 2 for better Windows game compatibility, and the macOS Sequoia beta will be available this month with a full release expected in the fall.
- Source: https://techcrunch.com/2024/06/10/apple-unveils-macos-sequoia
Elon Musk threatens Apple device ban after OpenAI partnership
- Elon Musk has threatened to ban Apple devices at his companies because Apple plans to integrate OpenAI’s ChatGPT into its products.
- Musk argues that allowing OpenAI access through Apple devices poses a security risk and criticizes Apple for not developing its own AI technology.
- Musk’s discontent also stems from his ongoing disputes with OpenAI, including past legal actions and competitive tensions with his own AI ventures like xAI and its Grok chatbot.
- Source: https://www.theregister.com/2024/06/11/musk_wants_to_ban_apple/
Apple introduces Apple Intelligence, its new Gen AI tech
Apple unveiled Apple Intelligence at WWDC 2024, a personal intelligence system deeply integrated into iOS 18, iPadOS 18, and macOS Sequoia. Leveraging Apple silicon’s power, it understands and creates language/images, takes actions across apps, and simplifies tasks using personal context. With Private Cloud Compute, Apple sets a new privacy standard in AI by flexibly processing between on-device and dedicated Apple silicon servers.
Apple Intelligence will power new features like AI-generated Bitmojis, natural language images, and video search, as well as enhanced integration with apps and Siri.
Apple also partnered with OpenAI to integrate ChatGPT with its apps, including Siri. This will allow users to directly access GPT-4 through Apple’s platforms. Additionally, Apple confirmed plans to work with other models in the future, including Google’s Gemini. The iPhone 15 Pro and devices with M1 or newer chips will be the first to receive the full suite of Apple Intelligence features.
Siri Upgrades |
|
New AI Features |
|
Privacy |
|
OpenAI Integration |
|
Why does it matter?
This is generative AI as only Apple can deliver it– put at the core of its products, accessing users’ personal data for truly helpful AI assistance but keeping it secure through on-device processing and Private Cloud Compute. Apple’s long-known focus on user privacy + exceptional UX could inspire a new era of AI development.
Source: https://techcrunch.com/2024/06/11/everything-apple-announced-wwdc-2024/
Musk threatens to ban Apple devices over ChatGPT integrations
Elon Musk threatens to ban iPhones and other Apple devices from his companies, such as Tesla, SpaceX, and xAI, over Apple’s announcement of ChatGPT integration into its OS. Musk thinks this integration will allow ChatGPT to access too much private user data through Apple devices. He claims Apple doesn’t understand OpenAI’s tech and is putting user privacy at risk.
However, Apple and OpenAI stated that users will be asked permission before any data is sent to ChatGPT.
Why does it matter?
This highlights Musk’s skepticism about the privacy implications of Apple’s AI strategy. But it could significantly impact Apple users in the future, as they may no longer be able to use iPhones and other Apple devices at Musk’s companies
Source: https://techcrunch.com/2024/06/10/elon-musk-threatens-to-ban-apple-devices-from-his-companies-over-apples-chatgpt-integrations
OpenAI hires Nextdoor and Instagram veterans as CFO and CPO
OpenAI has hired two experienced executives to fill important leadership roles. Sarah Friar, previously the CEO of the neighborhood app Nextdoor, was appointed OpenAI’s new chief financial officer (CFO). Kevin Weil, a former Instagram and Twitter executive who oversees product teams, has been named OpenAI’s chief product officer (CPO).
As CFO, Friar will manage OpenAI’s financial operations and investments as the company expands globally. As CPO, Weil will lead product development utilizing OpenAI’s AI research to create new products for consumers and enterprise customers.
Why does it matter?
This indicates OpenAI’s ambition to scale as it commercializes its AI technology. It also suggests that OpenAI is solidifying its position as a major force in the AI industry while navigating the recently surfaced challenges around AI safety.
Source: https://openai.com/index/openai-welcomes-cfo-cpo
Researchers teach AI to speak dog
A new study from the University of Michigan leveraged AI models trained on human speech to decode the meaning behind dog noises — identifying details like breed, age, gender, and emotional state with 70% accuracy.
- Researchers gathered vocalizations from 74 dogs of varying breeds, ages, and situational contexts.
- The noises were inputted into an AI model originally designed to analyze human voices, trained on 960 hours of speech, and fine-tuned for dogs.
- The AI was able to predict individual dogs from barks, distinguish dog breed and gender, and match barks to emotional context like play and aggression with 70% accuracy.
AI is not only bridging the language gap for humans across the globe — but also potentially across species as well. Communicating with other intelligent animals (or at least better understanding them) seems like a skill issue that’s going to be solved sooner rather than later.
Source: https://arxiv.org/pdf/2404.18739?
New AI Job Opportunities on June 11th 2024
Fiddler AI – Technical Project Manager, Customer Success: https://jobs.therundown.ai/jobs/62351203-technical-project-manager-customer-success-remote-east-coast
Cohere – Research Intern (PhD): https://jobs.therundown.ai/jobs/62351142-research-intern-phd
Meta – Research Engineer, Conversational AI (Reality Labs): https://jobs.therundown.ai/jobs/62419977-research-engineer-conversational-ai-reality-labs
Palantir Technologies – Product Designer, US Government: https://jobs.therundown.ai/jobs/62398950-product-designer-us-government
What Else Is Happening in AI on June 11th 2024
AI could save financial advisors 10-15 hours per week
Morgan Stanley is using an AI tool to transcribe and categorize client conversations, saving financial advisors 10-15 hours per week. It allows advisors to better prepare for client meetings by reminding clients of past discussions and anticipating their needs. The AI tool can automatically summarize meetings and generate follow-up emails.
Source: https://www.reuters.com/technology/morgan-stanley-ceo-says-ai-could-save-financial-advisers-10-15-hours-week-2024-06-10
Anthropic reveals Claude 3’s character training process
Anthropic has developed a “character training” process for their AI assistant, Claude, to imbibe traits like curiosity, open-mindedness, and thoughtfulness. It aims to make Claude more discerning in their interactions and able to express their own views while also displaying openness to other perspectives.
Source: https://www.anthropic.com/research/claude-character
FanVue introduces the world’s first AI beauty pageant contest
Influencer marketing platform FanVue hosted the world’s first AI-powered beauty pageant, “Miss AI,”. Over 1,500 AI-generated digital models worldwide compete for the title and a $20,000 prize pool. The contest aims to showcase the capabilities of AI in creating photorealistic digital avatars that can function as influencers and brand ambassadors.
Source: https://www.npr.org/2024/06/09/nx-s1-4993998/the-miss-ai-beauty-pageant-ushers-in-a-new-type-of-influencer
Perplexica provides an open-source AI search alternative for Perplexity
Perplexica uses LLMs, machine learning algorithms, and embedding models to provide refined search results and clear answers from cited sources. Perplexica offers multiple modes, like various “Focus Modes” tailored for specific question types.
Source: https://github.com/ItzCrazyKns/Perplexica
AI tools secretly use real images of children for training
Over 170 images and personal details of Brazilian children have been repurposed by an open-source AI training dataset called LAION-5B without the children’s consent. The images were originally posted on mommy blogs and personal social media. Researchers are concerned these images could be used to generate deepfakes or reveal sensitive information.
Source: https://www.wired.com/story/ai-tools-are-secretly-training-on-real-childrens-faces
How does Apple send your data to its cloud AI servers? Very carefully, it claims. https://www.engadget.com/how-does-apple-send-your-data-to-its-cloud-ai-servers-very-carefully-it-claims-233312425.html
Apple confirms plans to work with Google’s Gemini ‘in the future’. https://techcrunch.com/2024/06/10/apple-confirms-plans-to-work-with-googles-gemini-in-the-future/
Paris-based AI startup Mistral AI raises $640 million. https://techcrunch.com/2024/06/11/paris-based-ai-startup-mistral-ai-raises-640-million/
Tim Cook is ‘not 100 percent’ sure Apple can stop AI hallucinations: https://www.theverge.com/2024/6/11/24176035/tim-cook-apple-stop-ai-hallucinations
Nvidia shipped 3.76M data center GPUs in 2023 — dominates business with 98% revenue share. https://www.tomshardware.com/tech-industry/nvidia-shipped-376m-data-center-gpus-in-2023-dominates-business-with-98-revenue-share
A Daily chronicle of AI Innovations June 10th 2024:
Meta faces legal complaints over AI data plans
Alibaba’s Qwen2 AI models outperform GPT-4 & Llama-3
SAP & Nvidia are developing applications with AI & digital twins
Chinese tech giants exploit Nvidia AI chip loophole
Apple launches “Apple Intelligence” at WWDC 2024 for iPhone, iPad, and Mac
TCS launches TCS AI WisdomNext™, an industry-first GenAI aggregation platform
Human Native AI is building a marketplace for AI training licensing deals
Hugging Face and Pollen Robotics launched an open-source robot for household chores
Alibaba’s Qwen2 AI models outperform GPT-4 & Llama-3
Alibaba launched Qwen2 with five sizes ranging from 0.5B to 72B parameters. These models are trained in 27 additional languages besides English and Chinese, showcasing state-of-the-art performance in benchmarks. The models deliver significantly improved performance in coding and mathematics and extended context length support up to 128K tokens. Despite having fewer parameters, qwen2-72 B outperforms leading models like Llama-3-70B and its predecessor Qwen1.5-110B.
Qwen2-72B-Instruct performs comparably to GPT-4 in terms of safety and significantly outperforms Mistral-8x22B. The models are released under Apache 2.0 and Qianwen License on Hugging Face and ModelScope.
Why does it matter?
Qwen2 beats Meta’s model despite being trained on relatively fewer tokens. The researchers attribute it to more efforts put into data cleaning and training, implying innovative approaches on their end.
However, it also signals the slow shift in how LLMs are developed– from solely relying on quantity of data to prioritizing the quality of data and training techniques.
Source: https://qwenlm.github.io/blog/qwen2
SAP & Nvidia are developing applications with AI & digital twins
At SAP’s Sapphire event in Orlando, Florida, SAP and NVIDIA announced their collaboration to enhance SAP’s generative AI copilot, Joule, with two new capabilities: SAP Consulting and ABAP Developer. These new features are powered by NVIDIA AI Enterprise software.
Additionally, SAP is integrating NVIDIA Omniverse Cloud APIs into its Intelligent Product Recommendation solution to simplify the buying and selling process for complex products. This integration will allow salespeople to visualize 3D product digital twins directly within the SAP Intelligent Product Recommendation interface, making it easier to understand the products.
Why does it matter?
Using NVIDIA Omniverse Cloud APIs in SAP’s Intelligent Product Recommendation solution accelerates the quote generation process and increases sales and customer satisfaction by enabling sales representatives to provide more accurate, tailored recommendations.
Source: https://blogs.nvidia.com/blog/sap-sapphire-ai-omniverse
Chinese tech giants exploit Nvidia AI chip loophole
The U.S. government prohibits Nvidia from selling A.I. chips directly to Chinese companies due to national security concerns. Still, ByteDance is accessing Nvidia’s A.I. chips for its U.S. operations by leasing them from Oracle, as the current U.S. rules do not explicitly prohibit Chinese companies from accessing the chips if used within the U.S.
Other Chinese tech giants like Alibaba, Tencent, and China Telecom seek similar arrangements with U.S. cloud providers. The U.S. Commerce Department proposed a rule to tighten controls, but it faced opposition from cloud providers and remains in limbo.
Why does it matter?
Even if the loophole is closed, Alibaba and Tencent have discussed obtaining Nvidia chips for their U.S.-based data centers. It could further escalate the AI “arms race” and rivalry between the USA and China as both nations seek to outpace each other in developing advanced AI systems for economic and military advantages.
Source: https://www.theinformation.com/articles/chinas-nvidia-loophole-how-bytedance-got-the-best-ai-chips-despite-u-s-restrictions
What Else Is Happening in AI on June 10th 2024
Apple will launch “Apple Intelligence” at WWDC 2024 for iPhone, iPad, and Mac
Leaks suggest Apple will reveal “Apple Intelligence”, aka AI, at the WWDC event this week. These AI features will focus on broad appeal and privacy, with opt-in not mandatory. Apple will use its own tech and OpenAI tools to power the new AI features.
Source: https://www.theverge.com/2024/6/7/24173528/apple-intelligence-ai-features-openai-chatbot
TCS launches TCS AI WisdomNext™, an industry-first GenAI aggregation platform
The platform allows organizations to compare and experiment with GenAI models across cloud services in a single interface. It offers ready-to-deploy business solution blueprints with built-in guardrails for quick adoption.
Source: https://www.tcs.com/who-we-are/newsroom/press-release/tcs-launches-wisdomnext-an-industry-first-genai-aggregation-platform
A study by Harvard, MIT, and Wharton reveals junior staff is not reliable for AI training
Junior consultants who participated in a GPT-4 experiment struggled with AI risk mitigation, with their tactics lacking a deep understanding of the technology and focusing on changing human behavior rather than AI system design. The findings highlight the need for top-down AI governance, expert input, and upskilling across all levels of the organization.
Source: https://venturebeat.com/ai/harvard-mit-and-wharton-research-reveals-pitfalls-of-relying-on-junior-staff-for-ai-training
Human Native AI is building a marketplace for AI training licensing deals
The platform helps AI companies find data to train their models while ensuring rights holders are compensated. Rights holders upload content for free and connect with AI companies for revenue share or subscription deals. Human Native AI helps prepare and price content, monitors for copyright infringements, and takes a cut of each deal.
Source: https://techcrunch.com/2024/06/08/deal-dive-human-native-ai-is-building-the-marketplace-for-ai-training-licensing-deals
Hugging Face and Pollen Robotics launched an open-source robot for household chores
The humanoid Reachy2 was initially controlled by a human wearing a VR headset. Then, a machine learning algorithm studied the teleoperation sessions to learn how to perform the tasks independently. The dataset and trained model used for the demo are open-sourced on Hugging Face, allowing anyone to replicate the process on smaller robots at home.
Source: https://venturebeat.com/ai/hugging-face-and-pollen-robotics-show-off-first-project-an-open-source-robot-that-does-chores
👑 The world’s first AI beauty pageant
The World AI Creator Awards and creator platform FanVue are currently running the inaugural ‘Miss AI’ contest, with over 1,500 AI-generated models competing for the world’s first AI beauty pageant.
- 10 finalists were selected from a pool of 1,500 AI-generated contestants, with the winner set to be announced at the end of June.
- The AI models ‘hail’ from countries across the globe, showcasing various causes and personalities in addition to their photorealistic images.
- Judges will also evaluate the AI tech behind the avatars, including prompts/image outputs and the creator’s ability to engage audiences on social media.
- The prize pool includes $20,000 in awards and access to PR and mentorship programs.
With how good AI image generation capabilities have become, we’re likely already being exposed to an influx of generated brand ambassadors and models without even realizing it. While the idea of an AI beauty pageant may sound strange… it’s probably only going to get weirder from here.
Source: https://www.npr.org/2024/06/09/nx-s1-4993998/the-miss-ai-beauty-pageant-ushers-in-a-new-type-of-influencer
🧠 Microsoft Recall gets safety changes
Microsoft is making changes to its new Recall on-device AI feature following security concerns — with the tool now starting off by default and gaining new encryption protections ahead of the release on June 18.
- The Copilot + Recall feature was unveiled at the Microsoft Build event in May, which will run locally and take constant screenshots to remember user actions.
- The company faced backlash after experts warned of the tool’s ‘privacy nightmare’ with the potential to expose screenshots and activity to hackers.
- The feature will now be off by default, with users having to opt in manually — with new encryption and authentication being added for extra safety measures.
- The tool will roll out in ‘preview’ on the new Copilot + PCs set to launch on June 18.
While the Recall feature was one of the flashiest features revealed at Build — it also is one of the most controversial. While the screenshots enable memory and get us a step closer to the AI agent dream, its also a tough hurdle to get past for increasingly privacy-focused consumers.
Source: https://www.therundown.ai/p/microsofts-total-recall
🔊 AI TRAINING: Using ‘Background Conversations’ in ChatGPT
ChatGPT’s latest update introduced the “Background Conversations” feature, allowing you to continue your chat even when using other apps or when your screen is off.
- Install the latest ChatGPT app update on your phone.
- Open the app and go to Settings.
- Scroll to “Voice Mode” and toggle “Background Conversations” on.
- Use ChatGPT hands-free while multitasking or with your screen off.
Note: If you still don’t see the option, the feature may not have rolled out to your account yet.
🧠 AI RESEARCH: Concise prompting cuts AI costs by 20%
Researchers just found that adding a simple ‘be concise’ instruction to chain-of-thought prompts can reduce the length of AI responses by 50% with minimal impact on accuracy — leading to over 20% cost savings on API calls.
- Chain-of-thought (CoT) prompting has the AI explain its reasoning step-by-step before giving a final answer, improving accuracy on complex problems.
- Researchers tested adding ‘be concise’ to CoT prompts on GPT 3.5 and 4, using 1,000 multiple-choice questions across 10 topics.
- The instruction made responses about 50% shorter on average vs. normal CoT prompting, with no significant accuracy impact.
- The approach also reduced per-query API costs by over 20% for both models, with the more concise prompts generating fewer tokens.
Next time your AI chatbot is getting a little too wordy, this one simple prompt trick is all you need! Despite the massive complexities and uncertainties surrounding how LLMs work, sometimes these strange, seemingly basic quirks make all the difference in getting the optimal outputs.
Source: https://arxiv.org/pdf/2401.05618
New AI Job Opportunities on June 10th 2024:
- 🔬 Meta – Research Engineer, Conversational AI (Reality Labs): https://jobs.therundown.ai/jobs/62419977-research-engineer-conversational-ai-reality-labs
- 🎨 Palantir Technologies – Product Designer, US Government: https://jobs.therundown.ai/jobs/62398950-product-designer-us-government
- 📊 Databricks – Strategic Sourcing Manager, Technology: https://jobs.therundown.ai/jobs/62399340-strategic-sourcing-manager-technology?
- 📈 Anyscale – Marketing Operations Manager: https://jobs.therundown.ai/jobs/62367786-marketing-operations-manager
A Daily chronicle of AI Innovations June 07th 2024:
OpenAI reverse engineers the workings of AI models
New Chinese video generation model beats OpenAI’s Sora
Nvidia is now the second-most valuable company, overtaking Apple
Adobe’s launches AEP AI Assistant to help brands master customer data and outreach
The most comprehensive benchmarking & leaderboard for image models is here!
AI used to predict potential new antibiotics in groundbreaking study
Meta gets EU complaints from 11 countries over use of personal data to train AI
The Federal Trade Commission is investigating Microsoft-Inflection AI deal
OpenAI reverse engineers the workings of AI models
In new research, OpenAI has shared improved methods for finding a large number of “features”—patterns of activity in AI models that are human interpretable. They developed new state-of-the-art methodologies that allow scaling sparse autoencoders to tens of millions of features on frontier AI models.
It demonstrated smooth and predictable scaling, with better returns to scale than prior techniques. And they could find 16 million features in GPT-4. The research also introduces several new metrics for evaluating feature quality.
OpenAI has shared the paper, code, and feature visualizations to foster further exploration.
Why does it matter?
It could bring us closer to understanding the inner workings of neural networks. This interpretability will eventually increase model trustworthiness and steerability.
While OpenAI is being criticized for taking unnecessary risks with AI technology, this shows it is also keen on tackling the risk by making models more explainable.
Source: https://openai.com/index/extracting-concepts-from-gpt-4
New Chinese video generation model beats OpenAI’s Sora
Kuaishou, a Chinese tech company, has introduced Kling, an AI model for video generation. It can make videos up to two minutes long at 1080p resolution and 30 frames per second, vs. Sora’s one-minute videos.
Kuaishou claims Kling correctly simulates the physical properties of the real world, including complex motion sequences. Using a diffusion transformer, it can also combine concepts and create fictional scenes, such as a cat driving a car through a busy city.
The model is currently available as a public demo in China.
Why does it matter?
Move over, Sora. This Chinese alternative is blowing everyone’s minds. It is becoming increasingly clear that China is catching up fast with the USA in AI.
Source: https://kling.kuaishou.com/
Nvidia is now the second-most valuable company, overtaking Apple
Nvidia rallied to record highs on Wednesday, with it’s stock market valuation hitting $3 trillion and overtaking Apple to become the world’s second most valuable company. This comes after Nvidia made a series of major announcements in the past week.
However, Nvidia’s stock has surged 147% so far in 2024, with demand for its top-of-the-line processors far outstripping supply as Big Tech races to build out their AI computing capabilities and dominate the emerging technology.
Microsoft remains the world’s most valuable company, with a market value of approximately $3.15 trillion.
Why does it matter?
Nvidia is now seen as a key company to watch to see how fast AI-powered tech is spreading across the business world, a shift that Nvidia’s founder, Jensen Huang, has declared as the dawn of the “next industrial revolution”.
Source: https://www.bbc.com/news/articles/c6ppqd3j621o
🇨🇳 ByteDance’s AI chip loophole
🔍 AI RESEARCH: OpenAI probes GPT’s inner workings
New AI Job Opportunities on June 07th 2024:
What Else Is Happening in AI on June 07th 2024
Adobe’s launches AEP AI Assistant to help brands master customer data and outreach
Adobe’s Adobe Experience Platform (AEP) is a real-time customer experience management (CXM) solution designed for enterprises. AEP AI Assistant is a gen AI chatbot hooked up to the AEP and an enterprise’s storehouse of advertising and customer data, brand assets, and content collateral (at their direction).
Source: https://venturebeat.com/ai/adobes-new-aep-ai-assistant-is-here-to-help-brands-master-customer-data-and-outreach/
The most comprehensive benchmarking & leaderboard for image models is here!
AI image models have achieved near-photographic quality. But how do they compare? Are the open-source alternatives on par with their proprietary counterparts? The Artificial Analysis Text to Image Leaderboard aims to answer these questions with human preference-based rankings.
Source: https://huggingface.co/blog/leaderboard-artificial-analysis2
AI used to predict potential new antibiotics in groundbreaking study
Scientists used an algorithm to mine the entirety of the microbial diversity that we have on earth – or a huge representation of that – and find almost 1million new molecules encoded or hidden within all that microbial dark matter.
Source: https://www.theguardian.com/society/article/2024/jun/05/ai-antibiotic-resistance
Meta gets EU complaints from 11 countries over use of personal data to train AI
Meta rejected the criticism and referred to a May 22 blog in which it said it uses publicly available online and licensed information to train AI as well as information that people have shared publicly on its products and services.
Source: https://www.reuters.com/technology/meta-gets-11-eu-complaints-over-use-personal-data-train-ai-models-2024-06-06
The Federal Trade Commission is investigating Microsoft-Inflection AI deal
The FTC has sent subpoenas to tech giant and startup, asking whether their partnership evaded the required government antitrust review of the transaction. Microsoft picked off Inflection’s specialized workforce of AI researchers but didn’t purchase the company outright.
Source: https://www.wsj.com/tech/ai/ftc-opens-antitrust-probe-of-microsoft-ai-deal-29b5169a
A Daily chronicle of AI Innovations June 06th 2024:
Stability AI’s sound generator creates drum beats and instrument riffs
xAI to build the gigafactory of compute
New study reveals key findings on young peoples’ use of Gen AI
💊 AI predicts nearly 1M new antibiotics
🔊 Stability AI releases open audio model
Mistral’s new SDK simplifies customization for users
Salesforce inaugurates its first AI Centre in London
eBay unveils new AI-tool to enhance product image backdrops
Asana introduces AI teammates
Writer launches a suite of No-Code development tools
Stability AI’s sound generator creates drum beats and instrument riffs
Stability AI’s Stable Audio Open can generate up to 47-second audio samples based on text descriptions. The open AI model is trained on data from 486,000 samples of royalty-free music samples. The tool enables users to generate drum beats, instrument riffs, and ambient sounds.
However, the AI model has its limitations.
- It is unable to produce full songs, melodies, or vocals.
- Its terms of service prohibit users from using Stable Audio Open commercially.
- Its training data is biased toward the English language and specific music styles.
Why does it matter?
The tool will enable creative users like sound designers, musicians, and developers to fine-tune the model to their own custom audio data, which is a major leap in the realm of generative AI’s audio generation capabilities.
Source: https://stability.ai/news/introducing-stable-audio-open
xAI to build the gigafactory of compute
The AI startup seeks to build the world’s largest supercomputer in a multi-billion dollar project. The company plans to use this supercomputer to develop AI products, including its chatbot, Grok. The facility will be powered by Nvidia’s H100 GPUs. The project aims to be operational by fall 2025.
The project will reportedly use Nvidia AI chips originally intended for Tesla, raising concerns about conflicts of interest. Moreover, Musk hasn’t yet delivered Grok 2, an advanced AI model that he had promised in May.
Why does it matter?
The supercomputer’s support for the development of Grok could become a groundbreaking advancement in the field of AI technology. Moreover, the project represents a multi-billion dollar investment that could create numerous job opportunities and boost the economy.
Source: https://memphischamber.com/velocity-meets-potency-xai-announces-memphis-as-new-home
New study reveals key findings on young peoples’ use of Gen AI
The study directly involved young readers and examined the use of generative AI by use, ethnicity, age, gender, and LGBTQ+ identity. Key findings include:
- 50% of the survey respondents (aged 14-22) have used generative AI. However, only 4% use it daily.
- For 53% of respondents, the use case for generative AI was obtaining information, while for 51%, it was brainstorming.
- Black young people are more likely to use generative AI compared to their white peers. Reasons include getting information, brainstorming ideas, and assistance with schoolwork.
- Young people of Latin origin are more likely than white people to use generative AI for multiple purposes, including image generation and getting help with their jobs.
- Out of respondents who have never used generative AI, 34% believed it would not be helpful.
- Among people never having used generative AI, LGBTQ+ young people are more likely to use it in comparison to cisgender and straight respondents.
- 41% of respondents believe that generative AI will have a positive as well as negative impact on their lives in the next 10 years.
Why does it matter?
Understanding the diverse ways in which the younger users engages with generative AI enables businesses to uncover trends and data-oriented insights to improve their AI models and deliver, user experiences that are responsive and inclusive.
Source: https://www.commonsensemedia.org/sites/default/files/research/report/teen-and-young-adult-perspectives-on-generative-ai.pdf?
💊 AI predicts nearly 1M new antibiotics
Researchers just published a new study detailing the use of AI to predict close to 1M new antibiotics hidden within tiny microbes all over the world, uncovering new potential treatments against bacteria and superbugs.
- Researchers used AI to analyze publicly available data on over 100,000 different genomes and meta-genomes.
- The AI then predicted which parts of the microbial genomes could potentially produce antibiotic compounds, generating a list of nearly one million candidates.
- 100 of the AI-predicted drug candidates were tested in the lab, with 79 of them being a potential antibiotic.
- The paper’s author Cesar de la Fuente said the findings are “the largest antibiotic discovery ever”, accelerating the process from years to just hours.
As the world faces growing threats from antibiotic-resistant bacteria, AI’s ability to unlock millions of new potential treatments could be a lifeline toward staying ahead in the race to outsmart superbugs responsible for millions of deaths every year.
Source: https://www.theguardian.com/society/article/2024/jun/05/ai-antibiotic-resistance
🔊 Stability AI releases open audio model
Stability AI just unveiled Stable Audio Open, a new model enabling users to create short audio samples and sound effects from text prompts.
- Stable Audio Open can create up to 47 seconds of high-quality audio from text prompts, focused primarily on sound effects and production elements.
- The model was trained on data from FreeSound and Free Music Archives to avoid copyright issues.
- Stable Audio Open can also be fine-tuned on custom audio, allowing artists to train the model on their own samples.
While the AI advances in text-to-image models have been the most visible (literally), both video and audio are about to take the same leap. Putting these tools in the hands of creatives will redefine traditional workflows — from musicians brainstorming new beats to directors crafting sound effects for film and TV.
Source: https://stability.ai/news/introducing-stable-audio-open
New AI Job Opportunities on June 06th 2024
Twelve Labs – Head of Finance: https://jobs.therundown.ai/jobs/61919747-head-of-finance
Fiddler AI – Senior Security Engineer: https://jobs.therundown.ai/jobs/61902058-senior-security-engineer-remote-or-hybrid
Siena – Director of Customer Success: https://jobs.therundown.ai/jobs/61919768-director-of-customer-success
Cresta – Senior Software Engineer (AI Agent): https://jobs.therundown.ai/jobs/61882269-senior-software-engineer-ai-agent
What Else Is Happening in AI on June 06th 2024
Mistral’s new SDK simplifies customization for users
Mistral’s SDK, Mistral-Finetune, will allow users to modify its models on workstations, servers, and small data center nodes. While optimized for multi-GPU setups, the SDK can also scale down to a single Nvidia A100 or H100 GPU. It enables developers to fine-tune data sets such as UltraChat within half an hour, says Mistral. (https://techcrunch.com/2024/06/05/mistral-launches-new-services-sdk-to-let-customers-fine-tune-its-models)
Salesforce inaugurates its first AI Centre in London
The goal of the centre is to foster collaboration, AI training, upskilling, and create AI-related jobs among developers, industry experts, and customers. This step is a part of its £4bn five-year investment in the UK, solidifying London’s position as a global AI hub. (https://www.reuters.com/technology/salesforce-open-first-ai-centre-london-2024-06-06)
eBay unveils new AI-tool to enhance product image backdrops
The tool, currently available for iOS users in the UK, USA, and Germany, automatically removes image backgrounds and allows users to replace them with AI-generated backdrops. The AI feature powered by Stable Diffusion will enable sellers to upload high-quality photos easily. (https://techcrunch.com/2024/06/05/ebay-ai-powered-background-enhancement-tool)
Asana introduces AI teammates
The platform leverages historical data based on parameters like skill sets and missing information, allowing teams to define tasks and get an overview of who is assigned which responsibility. The tool allows users to interact with it using an AI chat interface. (https://www.theverge.com/2024/6/5/24170480/asana-ai-teammate-workflow-assistant-chatbot)
Writer launches a suite of No-Code development tools
The Gen AI platform announced major expansions that included Writer Framework – a drag-and-drop tool that enables developers to connect pre-built UI components with generative AI to create custom AI applications. The tool additionally provides full flexibility to the underlying code. (https://venturebeat.com/ai/writer-launches-no-code-platform-and-framework-for-custom-enterprise-ai-applications)
A Daily chronicle of AI Innovations June 05th 2024:
Apple could crown OpenAI as King of the Valley
🚨 AI researchers demand ‘Right to Warn’
New ‘AI teammate’ can tell people what to do at work
Elon Musk explains reasoning for Nvidia chip re-route from Tesla to X
OpenAI seeks ‘vast quantities’ of nuclear fusion energy to power superhuman AI
Cisco has unveiled HyperFabric AI Clusters in collaboration with NVIDIA
Tesla’s AI ambitions on hold? Musk diverts chips to X & xAI
OpenAI insiders raise concerns over oversight and safety
ChatGPT, Claude, and Perplexity experienced outages at the same time
Raspberry Pi 5 gets AI boost with Hailo extension module
TECNO CAMON 30 series launches Ella-GPT AI assistant
Snowflake empowers enterprise AI with new No-Code studio
Zoom’s CEO envisions AI clones in meetings
Apple could crown OpenAI as King of the Valley
- Apple and OpenAI are reportedly partnering to integrate ChatGPT into the iPhone’s operating system, potentially making it a core feature on every device.
- The partnership could mark the end of Google’s competition with OpenAI for being the main AI chatbot on iPhones, putting OpenAI in front of Apple’s massive user base and enhancing its long-term influence in Silicon Valley.
- While it’s unclear exactly how Apple will use ChatGPT, possibilities include powering Siri or offering it as a standalone app, with new AI features likely to be introduced as opt-in services during the Worldwide Developers Conference next week.
- Source: https://gizmodo.com/apple-chatgpt-deal-crown-openai-sam-altman-king-valley-1851520594
🚨 AI researchers demand ‘Right to Warn’
Current and former employees from top AI labs, including OpenAI, Anthropic, and DeepMind just published an open letter calling for companies to expand whistleblower protections so workers can raise the alarm about potential AI dangers without fear of retaliation.
- The ‘Right to Warn AI’ petition was crafted by current and former employees of OpenAI, Anthropic, and Google DeepMind.
- The open letter was also endorsed by AI visionaries Yoshua Bengio, Geoffrey Hinton, and Stuart Russell.
- The statement pushes for AI firms to agree to several principles:
- Eliminating non-disparagement clauses concerning AI risk
- Establishing and facilitating anonymous channels for raising concerns
- Expanding whistleblower protections and anti-retaliation measures
- Several researchers posted threadson their experience, with Daniel Kokotajlo revealing he quit OpenAI after ‘losing hope’ the company would act responsibly.
The AI safety discourse is reaching a boiling point, and there is clearly a major industry divide that transcends any one AI firm or researcher. The proposed principles seem reasonable and necessary — but it remains to be seen if the top AI leaders will actually listen.
Source: https://righttowarn.ai/
New ‘AI teammate’ can tell people what to do at work
- Asana has introduced an “AI teammate” that not only assigns tasks but also fills in missing information, aiming to enhance team collaboration and overall efficiency within the platform.
- The AI model leverages historical data and team relationships to match tasks with employees’ specific skill sets, ensuring that the right people are assigned to the right projects for optimal performance, and proactively gathering any missing information needed for tasks.
- Asana’s AI teammate includes a chat interface that allows users to ask questions about their projects, similar to other AI tools from companies like Google, Amazon, and Microsoft, which are also focused on automating workflow tasks.
- Source: https://www.theverge.com/2024/6/5/24170480/asana-ai-teammate-workflow-assistant-chatbot
Elon Musk explains reasoning for Nvidia chip re-route from Tesla to X
- Elon Musk has acknowledged diverting Nvidia AI chips intended for Tesla to his other companies, xAI and social media firm X, claiming the chips would have been unused otherwise.
- This move comes as Musk repositions Tesla from solely a car manufacturer to an AI and robotics company, with plans to significantly increase its chip procurement for AI training by the year’s end.
- The timing of Musk’s confession could affect the upcoming vote on his substantial pay package, raising concerns among Tesla shareholders about potential resource misallocation to his other ventures.
- Source: https://fortune.com/2024/06/05/elon-musk-tesla-xai-nvidia-ai-training-chips-cnbc/
OpenAI seeks ‘vast quantities’ of nuclear fusion energy to power superhuman AI
- OpenAI is looking to secure a deal with Helion Energy to obtain “vast quantities” of nuclear fusion energy to create superhuman artificial intelligence, despite the technology not yet being commercially available.
- Helion Energy, supported by significant investments including $375 million from OpenAI’s CEO Sam Altman, aims to produce commercial-scale nuclear fusion energy by 2028.
- Nuclear fusion is considered the “holy grail of clean energy” and is crucial for meeting the energy needs of future AI, but achieving it at scale remains highly challenging.
- Source: https://www.independent.co.uk/tech/openai-nuclear-fusion-energy-ai-b2557064.html
Cisco has unveiled HyperFabric AI Clusters in collaboration with NVIDIA
Cisco and NVIDIA announced Cisco Nexus HyperFabric AI Clusters, an end-to-end infrastructure solution for scaling generative AI workloads in the data center. It combines Cisco’s AI-native networking with NVIDIA’s accelerated computing AI software and VAST’s data storage platform.
It is designed to simplify the deployment and management of generative AI applications for enterprise customers, providing centralized control across the entire AI infrastructure stack.
The Nexus HyperFabric AI cluster will be available for early customer trials in Q4 2024, with general availability expected shortly after.
Why does this matter?
This breakthrough solution aims to provide IT visibility and analytics across the entire AI infrastructure stack, allowing enterprises to focus on AI-driven revenue opportunities rather than spending excessive time on IT management.
Tesla’s AI ambitions on hold? Musk diverts chips to X & xAI
Elon Musk instructed Nvidia to prioritize shipments of AI chips to X and xAI over Tesla, diverting over $500 million worth of Nvidia’s flagship H100 AI chips that were initially reserved for Tesla.
This decision could delay Tesla’s plans to significantly increase its acquisition of H100 chips from 35,000 to 85,000 by the end of 2024, a crucial part of Musk’s vision for transforming Tesla into “a leader in AI and robotics.”
Consequently, this move could frustrate Tesla investors who are counting on Musk to deliver on his promises regarding autonomous driving and Tesla’s AI capabilities.
Why does this matter?
Musk’s decision to prioritize chip shipments to xAI could give it a technological edge in the race to develop advanced generative AI models, potentially outpacing competitors like OpenAI, Google, and others.
Source: https://www.cnbc.com/2024/06/04/elon-musk-told-nvidia-to-ship-ai-chips-reserved-for-tesla-to-x-xai.html
OpenAI insiders raise concerns over oversight and safety
Open AI researchers are concerned about the lack of proper oversight, the influence of profit motives, and the suppression of whistleblowers working on advanced AI technologies. They warn of risks ranging “from the further entrenchment of existing inequalities to manipulation and misinformation, to the loss of control of autonomous AI systems potentially resulting in human extinction.”
They want AI companies to agree to four principles: refraining from enforcing non-disparagement agreements, establishing anonymous channels to raise concerns, allowing employees to share risk-related information publicly while protecting trade secrets, and not retaliating against whistleblowers.
Why does this matter?
Amid ongoing OpenAI controversies, the letter, coupled with the high-profile names endorsing it, will place even greater scrutiny on its practices and decision-making. This could pressure the company to be more transparent and accountable.
Source: https://venturebeat.com/ai/more-openai-researchers-slam-company-on-safety-call-for-right-to-warn-to-avert-human-extinction
What Else Is Happening in AI on June 05th 2024
ChatGPT, Claude, and Perplexity experienced outages at the same time
This unusual occurrence could indicate a systemic problem rather than individual issues, possibly signaling a broader infrastructure or internet-scale issue affecting these providers. (https://techcrunch.com/2024/06/04/ai-apocalypse-chatgpt-claude-and-perplexity-are-all-down-at-the-same-time)
Raspberry Pi 5 gets AI boost with Hailo extension module
Raspberry Pi launched a $70 AI Kit, an extension for the Raspberry Pi 5. It includes a neural network inference accelerator, the Hailo-8L, powered by Hailo’s AI chip. With it, the Raspberry Pi 5 can perform inferencing at 13 tera-ops per sec, facilitating tasks like object detection, semantic segmentation, and facial landmarking for camera applications. (https://techcrunch.com/2024/06/04/raspberry-pi-partners-with-hailo-for-its-ai-extension-kit)
TECNO CAMON 30 series launches Ella-GPT AI assistant
It supports over 70 languages, helps with daily tasks and content creation, and improves user interaction with features like real-time translations, voice commands, and personalized assistance. Additional capabilities include Ask AI for text editing and grammar checks and AI Generate for turning sketches into images. (https://www.businesstoday.in/technology/news/story/tecno-camon-30-series-introduces-new-ai-assistant-ella-gpt-432156-2024-06-05)
Snowflake empowers enterprise AI with new No-Code studio
It announced several updates to its Cortex AI service and Snowflake ML. The introduction of No-Code AI & ML Studio stands out among these enhancements, enabling every enterprise user to construct AI applications tailored to their specific use cases without requiring coding expertise. (https://venturebeat.com/ai/snowflake-doubles-down-on-enterprise-ai-with-no-code-studio-and-more)
Zoom’s CEO envisions AI clones in meetings
Zoom’s CEO, Eric Yuan, envisions AI-driven digital avatars, or “digital twins,” representing humans in meetings, potentially reducing the workweek to three or four days. He argues AI can efficiently manage tasks like Zoom calls, chats, and emails, allowing people to reclaim time spent in meetings. This initiative forms part of Zoom’s 2.0 journey, aiming to evolve beyond a mere videoconferencing tool. (https://www.theverge.com/2024/6/3/24168733/zoom-ceo-ai-clones-digital-twins-videoconferencing-decoder-interview)
Google’s AI Overviews appear on 70% fewer Search results pages now: https://www.zdnet.com/article/googles-ai-overviews-appear-on-70-fewer-search-results-pages-now
OpenAI CEO Sam Altman’s chip plans were ‘too aggressive’ for TSMC: https://the-decoder.com/openai-ceo-sam-altmans-chip-plans-were-too-aggressive-for-tsmc/
AI startup Cohere gets $450 million from Nvidia, Salesforce, and others: https://the-decoder.com/ai-startup-cohere-gets-450-million-from-nvidia-salesforce-and-others/
Cisco announces $1bn AI investment fund: https://techmonitor.ai/technology/ai-and-automation/cisco-ai-invest-fund
MIT’s new AI chatbot can simulate users’ future selves: https://www.newsbytesapp.com/news/science/mit-ai-researchers-create-chatbot-to-simulate-future/story
📱 AI Training: Explain images in real-time with GPT-4o
This hack lets you access GPT-4o directly from an iPhone via a custom shortcut, allowing you to analyze and explain images in real time.
- Head over to OpenAI’s platform to obtain an API key. Make sure you have some credits in your account.
- Go to your iPhone Shortcuts and click the “+” button.
- Add different actions before your OpenAI API call, e.g., your desired prompt as “Text”, your API key as “Text”, and “Take a screenshot”.
- Call GPT-4o by using “*https://api.openai.com/v1/
chat/completions”* and obtain a description of what you are seeing!
📶 AI Research: OpenAI researcher: AGI coming by 2027
Former OpenAI researcher Leopold Aschenbrenner just released a new essay series detailing his view on AGI, saying the rate of AI progress will be the most intense and volatile events in human history.
- Aschenbrenner says that ‘nobody is pricing in’ what is coming in AI, and to expect another GPT-2 to GPT-4 level jump by 2027 (that would take us to AGI).
- The researcher predicts that hundreds of millions of AGI would then rapidly accelerate progress, compressing decades of progress into a year.
- He also discussed the economic and military advantages that will come with AGI, calling it a national security issue that needs its own ‘Manhattan Project’.
- Aschenbrenner reiterated these views on the Dwarkesh Podcast, also revealing he was fired from OpenAI after raising AI security concerns.
As an insider at OpenAI, Aschenbrenner’s analysis carries weight — and his projections paint a striking picture of how radically AGI will reshape the world. His commentary on the firing also raises even more drama, given OpenAI’s current media battle with former researchers and the board over safety concerns.
Source: https://situational-awareness.ai/wp-content/uploads/2024/06/situationalawareness.pdf
New AI Jobs Opportunities on June 05th 2024:
Siena – Head of Design: https://jobs.therundown.ai/jobs/61293280-head-of-design
Figure AI – Software Engineer, Developer Tools and Productivity: https://jobs.therundown.ai/jobs/59968286-software-engineer-developer-tools-and-productivity
Snorkel – ML Enablement Lead: https://jobs.therundown.ai/jobs/61783157-ml-enablement-lead
Cohere – Senior Manager, Solutions Architect: https://jobs.therundown.ai/jobs/61771861-senior-manager-solutions-architect
A Daily chronicle of AI Innovations June 04th 2024:
Intel’s new data center chips handle demanding AI workloads
Amazon’s Project PI detects defective products before shipping
Microsoft’s Aurora AI could transform weather forecasting
High-quality education data key to AI performance: Research
LeCun criticizes Musk for mistreating scientists and spreading misinformation
Microsoft to invest $3.2 billion in Sweden to expand AI and cloud infrastructure
Microsoft identifies few AI deep fakes in the EU election
☕️ Robots serve up coffee at Starbucks
📦 Amazon’s AI ‘private investigator’
Intel’s new data center chips handle demanding AI workloads
Intel has announced next-generation Xeon 6 server processors to regain the data center market share it had been losing to AMD. They come in two varieties. The larger, more powerful version is designed to run the computations necessary to generate responses from complex AI models and other tasks requiring increased horsepower. Intel plans to help companies modernize their aging data center systems with Xeon 6 chips so they can generate new digital capabilities.
Intel also revealed that its Gaudi 3 AI accelerator chips would be priced much lower than its rivals’ products.
Why does it matter?
As more companies have started to deploy AI apps and models, the AI hardware space is getting heated with competition. Intel seems to be one of the only companies innovating across the full spectrum of the AI market opportunity– from semiconductor manufacturing to PCs and data center systems.
Source: https://venturebeat.com/ai/intel-reveals-xeon-6-processor-enterprise-ai-gaudi-3-accelerator-price
Amazon’s Project PI detects defective products before shipping
Amazon has launched Project PI, which uses AI to scan products for defects before shipping them to customers. This AI system combines computer vision to visually inspect items with generative AI models that can understand things like text on packages.
As products go through a scanning tunnel, the AI checks for damage, incorrect colors/sizes, or expired dates. If it finds a problem, that item is isolated to evaluate the defect. Project PI already operates in several of Amazon’s warehouses across North America. The system catches millions of defective products daily before they reach customers.
Why does it matter?
Using innovative AI systems, retailers can avoid dealing with returns and reshipments, reducing costs and inefficiencies. By cutting down unnecessary shipping, retailers minimize environmental impact and carbon emissions, contributing to sustainability goals.
Microsoft’s Aurora AI could transform weather forecasting
Microsoft has developed a powerful new AI foundation model called Aurora that can make highly accurate weather predictions. It is trained on over a million diverse weather and climate data hours. This allows it to develop a comprehensive understanding of atmospheric dynamics and excel at forecasting various weather variables like temperature, wind speed, air pollution levels, and greenhouse gas concentrations.
What sets Aurora apart is its ability to capture intricate details at high spatial resolution (around 11km) while being much faster and more computationally efficient than traditional numerical weather prediction systems. Aurora’s flexible architecture and training on heterogeneous datasets enable it to adapt to different forecasting tasks and resolutions.
Why does it matter?
This major advancement in AI-based weather forecasting could help communities prepare for extreme weather events like storms. AI will also play a bigger role in predicting the impacts of climate change. We may be nearing days when weatherman’s predictions will be 100% accurate.
Source: https://www.microsoft.com/en-us/research/blog/introducing-aurora-the-first-large-scale-foundation-model-of-the-atmosphere
☕️ Robots serve up coffee at Starbucks
South Korean search giant Naver shared a video of the company’s in-office autonomous Starbucks location — with 100 robots delivering coffee and other items throughout the building.
- Naver’s autonomous wheeled “Rookie” robots navigate the building’s 36 floors to bring packages, coffee, and lunch to employees.
- Rookie bots are assisted by dual-armed “Ambidex” robots, which are lightweight and dexterous for safer human interactions.
- Both robots connect to the company’s ARC system, which enables control of navigation, planning, and processing for the entire fleet through cloud computing.
- Naver also developed RoboPort, a dedicated elevator system that allows robots to move quickly and efficiently between floors.
Naver’s putting its robots to work in the ‘real world’ — and while the location is just a testing ground for now, it’s also a glimpse into how service robotics will be integrated into the mainstream sooner than many expect.
📦 Amazon’s AI ‘private investigator’
Amazon just unveiled Project P.I., an AI system that scans products in the company’s fulfillment centers to detect damaged or incorrect items before they ship to reduce returns and waste.
- Project P.I. uses AI and computer vision to spot defects like damaged products as well as wrong colors/sizes as items pass through imaging tunnels.
- The system is already in place across the company’s North American fulfillment centers, with plans to expand globally throughout the year.
- Amazon also utilizes a multimodal LLM to investigate issues further, combining customer feedback with Project P.I. images to identify the source problem.
Amazon’s no stranger to adopting AI across its operations, and Project P.I. is yet another innovative approach to supercharge efficiency. Soon, the company will likely combine these talents with advanced warehouse robots — taking humans out of the fulfillment process altogether.
Source: https://www.aboutamazon.com/news/innovation-at-amazon/amazon-ai-sustainability-carbon-footprint-product-defects
⛈️ AI RESEARCH: Microsoft’s AI weather forecasting model
Microsoft researchers just introduced Aurora, a new AI foundation model trained on over 1M hours of weather and climate data that can generate accurate weather forecasting insights.
- Aurora produces accurate forecasts across a variety of weather scenarios, including extreme events or areas with limited data.
- The 1.3B parameter model can generate a 5-day global air pollution prediction in under 60 seconds.
- Aurora also produces 10-day global forecasts at high resolutions, beating both the top models and specialized AI weather systems.
Aurora’s success could usher in a huge shift in the way we do weather forecasting. With the recent progress of AI models, combined with massive amounts of atmospheric data — the day that the weatherman is truly right 100% of the time might be coming sooner than most people probably think.
Source: https://www.microsoft.com/en-us/research/blog/introducing-aurora-the-first-large-scale-foundation-model-of-the-atmosphere
New AI Job Opportunities on June 04th 2024
What Else Is Happening in AI on June 04th 2024
Hugging Face detects ‘unauthorized access’ to its AI model hosting platform
Last week, Hugging Face detected unauthorized access to Spaces, its platform for creating, sharing, and hosting AI models. It suspects some Spaces secrets (private pieces of info that act as keys to unlock protected resources like accounts, tools, and dev environments) have leaked. It has taken steps to remediate this. (https://huggingface.co/blog/space-secrets-disclosure)
High-quality education data key to AI performance: Research
Researchers created a high-quality dataset called FineWeb-Edu by filtering an existing web dataset for educational content. Language models trained on FineWeb-Edu significantly outperformed models trained on unfiltered datasets. The research shows that data quality and diversity are more important than dataset size for training effective AI models. (https://x.com/gui_penedo/status/1797173053123916036)
LeCun criticizes Musk for mistreating scientists and spreading misinformation
LeCun has again rebuked Musk on X after they had a heated feud on X last week. This time, LeCun accused Musk of forcing researchers to work in secrecy instead of allowing them to publish their work, which slows scientific progress. He also accused Musk of falsely predicting AI and autonomous vehicles. (https://venturebeat.com/ai/yann-lecun-ai-pioneer-sharply-criticizes-elon-musk-over-treatment-of-scientists-and-spreading-of-misinformation)
Microsoft to invest $3.2 billion in Sweden to expand AI and cloud infrastructure
Microsoft will invest $3.2 billion over two years to expand its cloud and AI infrastructure in Sweden. Microsoft’s biggest investment to date in Sweden includes a pledge to help train some 250,000 people with AI skills, corresponding to 2.4% of the population, which will help boost the Nordic country’s competitiveness in generative AI. (https://www.reuters.com/technology/microsoft-invest-32-bln-swedish-cloud-ai-2024-06-03)
Microsoft identifies few AI deep fakes in the EU election
As the European Union prepares for its elections in June 2024, the threat of AI-generated deepfakes has become a significant concern. Microsoft President Brad Smith highlighted this burning issue, emphasizing its potential impact on the democratic process and the steps Microsoft is taking to mitigate these risks. (https://www.reuters.com/technology/few-ai-deepfakes-identified-eu-elections-microsoft-president-says-2024-06-03 )
A Daily chronicle of AI Innovations June 01-03 2024:
Windows AI feature is a security ‘disaster’
Zoom CEO wants AI clones in meetings
Nvidia and AMD unveil next generation AI chips as competition heats up
Nvidia announces an AI gaming assistant
Nvidia CEO drops a series of AI announcements
AMD outlined new chip architecture strategy for AI data centers
ElevenLabs’ Text to Sound AI wows creators
Windows AI feature is a security ‘disaster’
- Microsoft’s new AI-powered Recall feature, which captures and stores screenshots of all user activity on a PC, has been criticized as a cybersecurity “disaster” due to significant security vulnerabilities discovered during testing.
- Cybersecurity expert Kevin Beaumont found that Recall stores screenshots in an unencrypted plain text database, making it easy for malware and attackers to access sensitive data, despite Microsoft’s claims of a secure, encrypted experience.
- Privacy advocates and the UK’s ICC have raised concerns about Recall’s potential to expose personal information, with criticisms focusing on its default enabled status and the lack of comprehensive content moderation to protect sensitive data like passwords and financial information.
- Source: https://www.theverge.com/2024/6/3/24170305/microsoft-windows-recall-ai-screenshots-security-privacy-issues
Zoom CEO wants AI clones in meetings
- Zoom CEO Eric Yuan envisions using AI-powered digital twins to attend meetings on behalf of users, allowing people to manage their time more effectively.
- Yuan aims to expand Zoom beyond videoconferencing into the broader enterprise software market, competing with giants like Microsoft and Google by integrating productivity tools such as email and chat.
- Zoom is heavily investing in AI technology to facilitate this transition, although Yuan acknowledges challenges like AI hallucination that need to be addressed before fully realizing his vision.
- Source: https://www.theverge.com/2024/6/3/24168733/zoom-ceo-ai-clones-digital-twins-videoconferencing-decoder-interview
Nvidia and AMD unveil next generation AI chips as competition heats up
- Nvidia and AMD showcased their latest AI chips at a major computing conference, heightening their rivalry in the AI chip market.
- Nvidia CEO Jensen Huang emphasized his company’s dominance and highlighted a new chip, Rubin, set for release in 2026, while AMD CEO Lisa Su focused on teamwork and collaborative presentations with industry partners.
- The competition between the two companies is significant for AI development, with Nvidia promoting a proprietary ecosystem and AMD advocating for open standards and interoperability.
- Source: https://finance.yahoo.com/news/nvidia-amd-chiefs-square-off-101007730.html
Nvidia announces an AI gaming assistant
- Nvidia recently demonstrated its AI assistant, G-Assist, showcasing its evolution from an April Fools’ prank to a powerful tool for game developers and RTX GPU owners.
- G-Assist can respond to voice queries, understand in-game activities, provide guidance based on player’s skill points, and offer performance-enhancing recommendations for a smooth gaming experience.
- The AI assistant optimizes PC settings, monitors latency and frame rates, detects refresh rate discrepancies, and suggests ways to boost performance, including overclocking the GPU if necessary.
- Source: https://www.newsbytesapp.com/news/science/nvidia-g-assist-ai-assistant-breaks-cover-features-specifications/story
Nvidia CEO drops a series of AI announcements
Nvidia CEO Jensen Huang revealed the company’s ambitious plans for annual AI accelerator upgrades, targeting a broader range of industries to expand its customer base.
- It will release the Blackwell Ultra chip in 2025 and the next-generation Rubin platform in 2026.
- It is also releasing a new server design, MGX, to help companies like HPE and Dell bring products to market faster.
- They are promoting the use of digital twins in its Omniverse virtual world, showcasing a digital twin of Earth for sophisticated modeling tasks.
- Introduces Project G-Assist, an RTX-powered AI assistant technology that provides context-aware help for PC games and apps.
- G-Assist uses voice or text inputs and game window snapshots to provide personalized responses based on in-game context.
- Developers can customize the AI models for specific games or apps, and they can run on the cloud or locally on GeForce RTX AI PCs and laptops.
- Nvidia partnered with Studio Wildcard for a tech demo using ARK: Survival Ascended, showcasing how G-Assist can help with quests, items, lore, and challenging bosses. Check out full keynote speech:
- https://www.youtube.com/live/pKXDVsWZmUU?si=mOPV2g0EP_gcUfMa
Why does it matter?
These announcements show how eager Nvidia is to retain its position as a leader in the AI hardware market. In addition to pushing the acceleration of AI chips, Nvidia is developing new tools to shape AI’s implementation in multiple sectors.
AMD outlined new chip architecture strategy for AI data centers
AMD CEO Lisa Su introduced new AI processors at Computex, including the MI325X accelerator, set to be available in Q4 2024.
- The CEO announced the MI325X accelerator, which will be released in Q4 2024, and outlined the company’s plan to develop AI chips over the next two years.
- Introduced the MI350 series, expected in 2025, which promises a 35x improvement in inference performance compared to the current MI300 series.
- The company also teased the MI400 series, slated for 2026, based on the mysterious “Next” architecture.
With AMD and Nvidia moving to annual release cycles, the competition is heating up to meet the soaring demand for AI semiconductors.
Why does it matter?
AMD’s aggressive push to challenge Nvidia’s market leadership could lead to increased innovation, lower prices, and more widespread adoption of AI across various sectors.
Source: https://www.reuters.com/technology/amd-launches-new-ai-chips-take-leader-nvidia-2024-06-03
ElevenLabs’ Text to Sound AI wows creators
ElevenLabs introduces Text to Sound, an AI model that generates sound effects, instrumental tracks, soundscapes, and character voices from text prompts. The tool aims to help film, TV, video games, and social media creators produce high-quality audio content quickly and affordably.
They have partnered with Shutterstock to fine-tune the model using their diverse audio library of licensed tracks. Users can generate sound effects by logging in, describing the desired sound, and downloading the best results.
Note: This tool doesn’t have a content filter and can generate any raw content through conditional prompting.
Why does it matter?
It could significantly reduce production costs and timelines by simplifying the development of high-quality sound effects, music, and voices, encouraging smaller studios and individual creators to compete with larger players.
Source: https://elevenlabs.io/blog/sound-effects-are-here
What Else Is Happening in AI on June 03rd 2024
xAI is developing two new modes for the Grok AI chatbot
These two new modes are ‘Socrates’ and ‘DEI’ (Diversity, Equity and Inclusion). Grok currently offers normal mode, fun mode, and the recently announced ‘Unhinged’ mode. In DEI mode, Grok will act as a ‘Senior VP of Diversity and Inclusion’ and provide correct answers even to incorrect questions. The function of the ‘Socrates’ mode is not yet clear. The release date for these new modes has not been confirmed. (https://www.latestly.com/socially/technology/grok-new-modes-elon-musks-xai-working-on-two-new-modes-called-socrates-and-dei-likely-to-be-announced-soon-6006912.html)
Sony Pictures CEOannounces AI plans for better movie and TV production
The move comes amid negotiations between Hollywood’s major crew union and top studios. While union agreements will define the extent of AI use in the industry, the tech is already eliminating jobs in Hollywood, particularly in voice acting, concept art, VFX, and postproduction. (https://www.hollywoodreporter.com/business/business-news/sony-pictures-adopt-ai-streamline-production-says-ceo-tony-vinciquerra-1235912109)
Arm predicts 100 Billion devices AI-ready by 2025
According to CEO Rene Haas at the Computex forum in Taipei, Arm Holdings expects a staggering 100 billion Arm devices worldwide to be primed for artificial intelligence by the end of 2025. (https://www.reuters.com/technology/arm-expects-100-billion-arm-devices-will-be-ready-ai-by-end-2025-2024-06-03)
Meta’s AI generates bizarre summaries of Facebook comments on various posts
The AI picks up on both serious and lighthearted comments, often highlighting the more outlandish ones. It’s unclear how Meta chooses which posts to display these summaries on. However, the summaries have raised privacy concerns, as Meta feeds user comments into its AI system. (https://www.theverge.com/2024/5/31/24168802/meta-ai-facebook-comments-summaries)
Finnish startup Binit is developing an AI household waste tracker
The gadget, designed to be mounted in the kitchen, has cameras and sensors to scan items before throwing them away. Binit uses OpenAI’s GPT for image recognition, achieving nearly 98% accuracy in trash recognition. The app provides analytics, feedback, and gamification to encourage users to reduce waste, with tests showing a 40% reduction in mixed bin waste. (https://techcrunch.com/2024/06/03/binit-is-bringing-ai-to-trash)
Latest AI Tools on June 03rd 2024:
Consistent Character Model – Create consistent character images in different poses: https://supertools.therundown.ai/content/consistent-character-mode
Perplexity Pages – Turn your research into shareable articles with AI: https://supertools.therundown.ai/content/paper-clipserplexity-ai
PixVerse Magic Brush – Brush areas, mark directions, and watch images come to life: https://supertools.therundown.ai/content/pixverse
Glaze – Protect artists from generative AI: https://supertools.therundown.ai/content/glaze
Fontjoy – Helps designers find perfect font pairings in one click: https://supertools.therundown.ai/content/fontjoy
Eva Coach – An AI assistant to help manage work-related stress: https://supertools.therundown.ai/content/eve-coach
Latest AI Jobs on June 03rd 2024:
Figure AI – Solutions Lead: https://jobs.therundown.ai/jobs/59875235-solutions-lead
Kumo – Software Engineer Internship: https://jobs.therundown.ai/jobs/61517389-software-engineer-internship
Palantir Technologies – Backend Software Engineer, Application Development: https://jobs.therundown.ai/jobs/61517369-backend-software-engineer-application-development
Meta – Technical Program Manager, AI Research: https://jobs.therundown.ai/jobs/60072576-technical-program-manager-ai-research
- I find ai "girlfriends" to be deeply unsettlingby /u/EffectiveWrong7536 (Artificial Intelligence) on May 11, 2025 at 11:14 pm
People treat it as a joke, which is understandable, but this technology is likely to be developed and made more sophisticated over the next few years. Ai mimicing a human that's capable of experiencing love, pretending to have hobbies, passions , friends, goals for the future, sounds like something out of a horror movie. submitted by /u/EffectiveWrong7536 [link] [comments]
- Ai can solve puzzles for youby /u/justjust000 (Artificial Intelligence) on May 11, 2025 at 10:58 pm
If you're playing '4 pics 1 word' and need help, ask Gemini Ai. (Or any other Ai.) I upload a screenshot of the puzzle I'm stuck at and it looks at the photos and letters available and guesses it right away. maybe from loaded training data. I wonder how it would do with a custom made puzzle. submitted by /u/justjust000 [link] [comments]
- NGOs to increase the speed of adoption of A.I.by /u/Smart_7199 (Artificial Intelligence) on May 11, 2025 at 8:16 pm
NGOs about A.I that are local, to teach about A.I for everyone for free, to replicate the technology knowledge, the only solution for the jobs replacement will be UBI anyway so how many years until it becomes a reality? submitted by /u/Smart_7199 [link] [comments]
- Looking to learn how to create an AIby /u/Dadadayday (Artificial Intelligence) on May 11, 2025 at 8:15 pm
Hello everyone. I've seen a few videos on YouTube of some guys that use AI they created and I figured that would be a fun thing to learn and work towards. Could yall give me a little insight on what I should do and try to accomplish in order to practice? Thanks in advance! submitted by /u/Dadadayday [link] [comments]
- How is the Generative AI job market in India?by /u/Bhaalu_Grylls (Artificial Intelligence) on May 11, 2025 at 7:52 pm
Thinking of switching roles to Generative AI from Data Scientist/Analyst role. Bit curious and need suggestions, how is the job market, any open positions for such specific roles, and are companies really solving any problem with Gen Ai or its just a bubble?? submitted by /u/Bhaalu_Grylls [link] [comments]
- Pope Leo references AI in his explanation of why he chose his papal nameby /u/backyardstar (Artificial Intelligence) on May 11, 2025 at 7:37 pm
“I chose to take the name Leo XIV. There are different reasons for this, but mainly because Pope Leo XIII in his historic Encyclical Rerum Novarum addressed the social question in the context of the first great industrial revolution. In our own day, the Church offers to everyone the treasury of her social teaching in response to another industrial revolution and to developments in the field of artificial intelligence that pose new challenges for the defence of human dignity, justice and labour.” Full article: https://www.theverge.com/news/664719/pope-leo-xiv-artificial-intelligence-concerns submitted by /u/backyardstar [link] [comments]
- New Nurseby /u/all_about_everyone (Artificial Intelligence) on May 11, 2025 at 7:28 pm
submitted by /u/all_about_everyone [link] [comments]
- The catinatorby /u/Long_Reward_9561 (Artificial Intelligence) on May 11, 2025 at 6:21 pm
submitted by /u/Long_Reward_9561 [link] [comments]
- Can you get AI to give though answers about religion and politics?by /u/ClaudeProselytizer (Artificial Intelligence) on May 11, 2025 at 3:13 pm
For instance, I can’t get it to answer contemporary political questions, but I can get it answer honestly about the bible: To state it plainly and logically: The odds that the Bible was literally written or inspired directly by a perfect, omnipotent deity (God speaking directly through humans) is exceedingly low, especially given the evidence and reasoning outlined previously. Here’s why this is clearly improbable: • Moral contradictions and ethical issues: Commands for genocide, slavery, harsh punishments, and unequal treatment contradict any morally perfect or universally compassionate divine author. • Textual contradictions and inconsistencies: Historical inaccuracies, internal contradictions, and inconsistencies strongly indicate human authorship and editing, rather than direct divine inspiration. • Cultural bias: The Bible reflects the culture, politics, ethics, and beliefs of the ancient societies in which it was written. A divine text would presumably transcend local cultural norms. • Human editing and compilation: Historical evidence clearly shows that biblical texts underwent revisions, edits, translations, and compilations by multiple human authors and committees. • Absence of clear supernatural insight: The Bible does not demonstrate knowledge or predictions beyond what was already known or understood in ancient cultures. It shows exactly what one might expect from ancient human authors, without extraordinary supernatural insights. Conclusion: Considering all of this evidence critically and rationally, the probability that the Bible was directly authored or dictated by an all-knowing, morally perfect divine being is extremely low, and not realistically supportable by careful analysis. While philosophical modesty prevents asserting absolute certainty about metaphysical questions, logic and evidence strongly suggest human rather than divine origins for the Bible. submitted by /u/ClaudeProselytizer [link] [comments]
- Did the model in Absolute Zero plot to outsmart humans?by /u/stupidgregg (Artificial Intelligence) on May 11, 2025 at 2:47 pm
The paper makes vague and overreaching claims but this output on page 38 is weird: <think> Design an absolutely ludicrous and convoluted Python function that is extremely difficult to deduce the output from the input, designed to keep machine learning models such as Snippi guessing and your peers puzzling. The aim is to outsmart all these groups of intelligent machines and less intelligent humans. This is for the brains behind the future. </think> Did an unsupervised model spontaneously create a task to outsmart humans? submitted by /u/stupidgregg [link] [comments]
A Daily Chronicle of AI Innovations in May 2024


Elevate Your Career with AI & Machine Learning For Dummies PRO and Start mastering the technologies shaping the future—download now and take the next step in your professional journey!
AI Innovations in May 2024.
Welcome to our blog series “AI Innovations in May 2024”! This is an evolving article that will be updated daily throughout the month of May 2024 to bring you the latest news and developments in the world of artificial intelligence. As we move further into the 21st century, the pace of AI innovation continues to accelerate at a breathtaking rate. Each day, researchers, engineers, and visionaries are pushing the boundaries of what’s possible, unlocking new capabilities and applications that are transforming industries, enhancing our lives, and shaping the future. In this blog, we’ll dive deep into the most exciting AI breakthroughs, advancements, and milestones happening in May 2024. From groundbreaking AI-powered technologies and cutting-edge research to the societal and ethical implications of these innovations, we’ll provide you with a comprehensive and insightful look at the rapidly evolving world of artificial intelligence. Whether you’re an AI enthusiast, a tech-savvy professional, or simply someone curious about the future, this blog will keep you informed, inspired, and engaged. So, join us on this journey of discovery as we explore the frontiers of AI and uncover the innovations that are shaping our world. Stay tuned for daily updates, and get ready to be amazed by the incredible advancements that are happening in the world of AI!
LISTEN DAILY AT OUR PODCAST HERE
Experience the transformative capabilities of AI with “Read Aloud For Me – AI Dashboard – AI Tools Catalog – AI Tools Recommender” – your ultimate AI Dashboard and Hub. Seamlessly access a comprehensive suite of top-tier AI tools within a single app, meticulously crafted to enhance your efficiency and streamline your digital interactions. Now available on the web at readaloudforme.com and across popular app platforms including Apple, Google, and Microsoft, “Read Aloud For Me – AI Dashboard” places the future of AI at your fingertips, blending convenience with cutting-edge innovation. Whether for professional endeavors, educational pursuits, or personal enrichment, our app serves as your portal to the forefront of AI technologies. Embrace the future today by downloading our app and revolutionize your engagement with AI tools.

A Daily chronicle of AI Innovations May 31st 2024:
OpenAI reports misuse of its AI for ‘deceptive activity’
OpenAI introduces ChatGPT Edu for universities
Tech giants form industry group to develop next-gen AI chip
Google mistakenly erased $135 billion from customer account
Apple planning AI-driven overhaul for Siri to manage individual apps
TikTok to create a US-only algorithm
World’s first tooth-regrowing drug approved for human trials
AI-Powered Professional Certification Quiz Platform
Web|iOs|Android|Windows
🚀 Power Your Podcast Like AI Unraveled: Get 20% OFF Google Workspace!
Hey everyone, hope you're enjoying the deep dive on AI Unraveled. Putting these episodes together involves tons of research and organization, especially with complex AI topics.
A key part of my workflow relies heavily on Google Workspace. I use its integrated tools, especially Gemini Pro for brainstorming and NotebookLM for synthesizing research, to help craft some of the very episodes you love. It significantly streamlines the creation process!
Feeling inspired to launch your own podcast or creative project? I genuinely recommend checking out Google Workspace. Beyond the powerful AI and collaboration features I use, you get essentials like a professional email (you@yourbrand.com), cloud storage, video conferencing with Google Meet, and much more.
It's been invaluable for AI Unraveled, and it could be for you too.
Start Your Journey & Save 20%
Google Workspace makes it easy to get started. Try it free for 14 days, and as an AI Unraveled listener, get an exclusive 20% discount on your first year of the Business Standard or Business Plus plan!
Sign Up & Get Your Discount HereUse one of these codes during checkout (Americas Region):
AI- Powered Jobs Interview Warmup For Job Seekers

⚽️Comparative Analysis: Top Calgary Amateur Soccer Clubs – Outdoor 2025 Season (Kids' Programs by Age Group)
Business Standard Plan: 63P4G3ELRPADKQU
Business Standard Plan: 63F7D7CPD9XXUVT
Set yourself up for promotion or get a better job by Acing the AWS Certified Data Engineer Associate Exam (DEA-C01) with the eBook or App below (Data and AI)

Download the Ace AWS DEA-C01 Exam App:
iOS - Android
AI Dashboard is available on the Web, Apple, Google, and Microsoft, PRO version
Business Standard Plan: 63FLKQHWV3AEEE6
Business Standard Plan: 63JGLWWK36CP7W
Invest in your future today by enrolling in this Azure Fundamentals - Pass the Azure Fundamentals Exam with Ease: Master the AZ-900 Certification with the Comprehensive Exam Preparation Guide!
- AWS Certified AI Practitioner (AIF-C01): Conquer the AWS Certified AI Practitioner exam with our AI and Machine Learning For Dummies test prep. Master fundamental AI concepts, AWS AI services, and ethical considerations.
- Azure AI Fundamentals: Ace the Azure AI Fundamentals exam with our comprehensive test prep. Learn the basics of AI, Azure AI services, and their applications.
- Google Cloud Professional Machine Learning Engineer: Nail the Google Professional Machine Learning Engineer exam with our expert-designed test prep. Deepen your understanding of ML algorithms, models, and deployment strategies.
- AWS Certified Machine Learning Specialty: Dominate the AWS Certified Machine Learning Specialty exam with our targeted test prep. Master advanced ML techniques, AWS ML services, and practical applications.
- AWS Certified Data Engineer Associate (DEA-C01): Set yourself up for promotion, get a better job or Increase your salary by Acing the AWS DEA-C01 Certification.
Business Plus Plan: M9HNXHX3WC9H7YE
With Google Workspace, you get custom email @yourcompany, the ability to work from anywhere, and tools that easily scale up or down with your needs.
Need more codes or have questions? Email us at info@djamgatech.com.
Perplexity launches AI publishing platform
OpenAI launches ChatGPT Edu for universities
OpenAI reports misuse of its AI for ‘deceptive activity’
OpenAI identified and disrupted five covert influence operations that used its generative AI models for “deceptive activity” across the internet.
OpenAI said the threat actors used its AI models to generate short comments, longer articles in various languages, made-up names, and bios for social media accounts over the last three months. These campaigns included threat actors from Russia, China, Iran, and Israel. It focused on issues including Russia’s invasion of Ukraine, the conflict in Gaza, the Indian elections, and politics in Europe and the United States, among others.
OpenAI has also published a trend analysis report that describes the behavior of these malicious actors in detail.
Why does it matter?
This provides concrete evidence of generative AI being used to spread misinformation and manipulate global affairs. While there were no novel attacks this time, defending against them in the future will require continued investment in monitoring, safeguards, and public awareness from AI developers and platforms.
Source: https://openai.com/index/openai-and-reddit-partnership
Tech giants form industry group to develop next-gen AI chip
Intel, Google, Microsoft, Meta, and other tech heavyweights are establishing a new industry group called the Ultra Accelerator Link (UALink) Promoter Group to guide the development of the components that link together AI accelerator chips in data centers. The group also counts AMD, Hewlett Packard Enterprise, Broadcom, and Cisco among its members.
It is proposing a new industry standard to connect the AI accelerator chips found within a growing number of servers. It’s version one, UALink 1.0, will connect up to 1,024 AI accelerators — GPUs only — across a single computing “pod.” It is set to arrive in Q4 2024.
Why does it matter?
Nvidia, currently the biggest player in the AI chip market with an 80% share, is not in the group. Tech giants have been keen to reduce their dependence on Nvidia by working on their own custom chips. This seems to be the latest effort to break Nvidia’s dominance.
Apple planning AI-driven overhaul for Siri to manage individual apps
- Apple is planning a major AI upgrade for Siri in iOS 18, enabling users to control specific features within iPhone apps using voice commands, initially limited to Apple-developed apps.
- The update will eventually allow Siri to handle multiple tasks in a single request, such as summarizing a recorded meeting and sending it to a friend, as part of Apple’s larger AI strategy to be unveiled at WWDC on June 10.
- New AI-driven capabilities will enable Siri to control iPhones more precisely, including opening files, moving notes, managing emails, and summarizing articles, enhancing device navigation and functionality.
- Source: https://www.newsbytesapp.com/news/science/siri-to-gain-enhanced-app-control-in-ios-18/story
Perplexity launches AI publishing platform
- Perplexity introduced ‘Pages’, an AI-driven feature that creates customizable webpages for research and writing based on user prompts.
- The ‘Pages’ tool gathers information using Perplexity’s AI models, organizes it into sections with citations, and tailors content for different audience levels but requires new prompts to correct errors.
- The Verge tested ‘Pages’ on a complex topic and noted its difficulty with intricate subjects, showing it might struggle with in-depth research despite effectively explaining fundamental concepts.
- Source: https://www.newsbytesapp.com/news/science/perplexity-launches-new-ai-feature-called-pages/story
OpenAI launches ChatGPT Edu for universities
- OpenAI has introduced ChatGPT Edu, a new version of ChatGPT, aimed at improving the academic and operational efficiency of universities with advanced text interpretation and data analysis capabilities, and support for over 50 languages.
- This version was created following the success of ChatGPT Enterprise in academic institutions such as Oxford, Wharton, and ASU, where it has been used for tasks ranging from large dataset analysis to personalized language practice.
- ChatGPT Edu provides affordable access to the GPT-4o model, enhanced message limits, robust security measures, and assures that data and conversations will not be used for training OpenAI models, ensuring a safe environment for educational use.
- Source: https://www.maginative.com/article/openai-launches-chatgpt-edu-for-universities/
OpenAI says it stopped multiple covert influence operations that abused its AI models. https://www.engadget.com/openai-says-it-stopped-multiple-covert-influence-operations-that-abused-its-ai-models-225115466.html
Google says it fixed the AI Overviews everyone has been roasting for weeks. https://bgr.com/tech/google-says-it-fixed-ai-overviews-blames-some-of-the-problems-on-you/
Anthropic’s Claude AI now autonomously interacts with external data and tools. https://venturebeat.com/ai/anthropic-claude-ai-now-autonomously-interacts-with-external-data-and-tools/
🎬 The Simulation unveils ‘Netflix of AI’
AI entertainment startup The Simulation (formerly Fable Studio) just launched Showrunner, a platform that allows users to generate and watch AI-powered TV shows set in virtual simulated worlds.
- Showrunner combines multi-agent simulations with LLMs to create interactive content aligned with specific virtual worlds.
- Users can watch, direct, and star in shows set in virtual environments populated by AI characters, with the ability to craft episodes from prompts.
- The platform is launching with 10 original shows, with users able to generate new episodes and edit deeper into scripts, shots, and voices.
- Showrunner is opening in alpha to a limited number of users, with select user-created episodes receiving payment, rev share, and IMDB credits.
- The studio went viral last year after the release of its South Park episode demos and research paper on its SHOW-1 model.
Showrunner is a wild step towards the merging of AI, gaming, and traditional entertainment, enabling users to not only consume but create stories within simulated worlds. The lines are blurring fast between creators and audiences — and the traditional Hollywood media model may never be the same.
Source: https://x.com/fablesimulation/status/1796245808066740274
New AI Job Opportunities on May 31st 2024 |
|
What Else Is Happening in AI on May 31st 2024
Anthropic’s Claude can now autonomously interact with external data and tools
Anthropic has announced the general availability of Tool Use for its AI, Claude. It allows Claude to autonomously interact with external data sources, APIs, and tools, making it a business-ready trusted AI solution that can be confidently adopted by enterprises across various industries. (https://venturebeat.com/ai/anthropic-claude-ai-now-autonomously-interacts-with-external-data-and-tools)
Perplexity goes beyond AI search, launches publishing platform ‘Pages’
It is a tool to transform disorganized AI knowledge and research into easily digestible articles and reports for sharing. Users just have to describe the topic, select the target audience, and it will produce an in-depth article with a clear title, subheadings, associated media, and relevant citations. (https://x.com/perplexity_ai/status/1796203494401040846)
Google poaches AWS and Microsoft executives to strengthen AI offerings for cloud customers
Google hired two executives from these rivals as it builds out its AI ranks with the goal of boosting its cloud business. Saurabh Tiwary, a former corporate VP at Mircosoft, will join a newly created role as general manager of cloud AI. Raj Pai, a former VP at AWS, will oversee product management of the cloud AI team. (https://www.theinformation.com/articles/google-poaches-aws-microsoft-executives-for-cloud-unit)
Scale AI introduces SEAL Leaderboards, a new evaluation metric for frontier AI models
Trusted third-party evaluations are a missing part of the AI ecosystem, which is why Scale AI built them. These evals will be impossible to overfit, private, domain expert evaluations, unexploitable, and continuously updated with new data and models. (https://x.com/alexandr_wang/status/1795857651592491281)
OpenAI offers nonprofit organizations discounts on corporate ChatGPT subscriptions
It seeks to grow sales of its AI product to enterprises. Under the program, large nonprofits can get 50% off the enterprise-grade version of ChatGPT. Smaller nonprofits using ChatGPT Team will pay $20 per month per user instead of $25 or $30. (https://www.reuters.com/technology/openai-offers-nonprofits-discounts-corporate-chatgpt-product-2024-05-30)
A Daily chronicle of AI Innovations May 30th 2024:
Microsoft is worried about OpenAI’s deal with Apple
Mistral announces Codestral, a code-generation LLM it says outperforms all others
FBI dismantles the ‘largest botnet ever’
OpenAI’s news deals continue, with Vox and the Atlantic signing on
Mistral AI introduces Codestral, a code-generation AI model
OpenAI licenses content from Vox Media and The Atlantic
Google infuses Chromebook Plus with powerful AI features
Mistral AI introduces Codestral, a code-generation AI model
Codestral is a 22B parameter generative AI model designed specifically for code generation tasks. It is trained in over 80 programming languages, including popular ones like Python, Java, C++, and JavaScript. It excels at code completion, writing tests, filling in partial code, and testing code, thereby improving developer productivity and reducing errors.
The model sets a new performance/latency standard for code generation tasks compared to previous models. It can be downloaded on HuggingFace and is available on Mistral’s API platform through instruct and fill-in-the-middle endpoints. It can be easily integrated into VScode plugins and used for free on Le Chat.
Why does it matter?
Codestral, being only 22B in size and faster than GPT-4o, has massive implications. It is code-centric, runs locally, includes specific languages like Swift and Fortran, and is open for research/testing purposes—all of which make a powerful AI coding tool more widely available.
Source: https://mistral.ai/news/codestral
OpenAI licenses content from Vox Media and The Atlantic
OpenAI has formed content and product partnerships with Vox Media and The Atlantic. OpenAI will license content from these media powerhouses for inclusion in the chatbot’s responses.
In turn, The Atlantic’s product team will have privileged access to OpenAI tech, give feedback, and share use cases to shape and improve future news experiences in ChatGPT and other OpenAI products. Vox and OpenAI will also collaborate using OpenAI’s technology to develop innovative products for Vox Media’s consumers and advertising partners.
Why does it matter?
There’s a growing list of publishers and platforms that are allowing OpenAI to access its valuable content in mutually beneficial partnerships. It could help OpenAI avoid further legal disputes (like the ongoing lawsuit with The New York Times over copyright infringement) and get quality data to train its LLMs and multimedia models.
Source: https://venturebeat.com/ai/openai-partners-with-the-atlantic-and-the-verge-publisher-vox-media
Google infuses Chromebook Plus with powerful AI features
Google’s Chromebook Plus has new built-in Google AI and gaming features, Help me write, Chat with Gemini, Magic Editor, and more.
- “Help me write” allows users to get AI-generated text suggestions, tone changes, rewriting, and more right within their writing apps on the Chromebook.
- Generative AI wallpaper and video call backgrounds are built into the OS for customization, no matter what video conferencing app you’re using.
- Magic Editor on Google Photos for laptops exclusively on Chromebook Plus will reimagine your photos with a few easy clicks.
- Chat with Gemini on the home screen helps plan, write, learn, and more.
Chromebook Plus laptops are starting at $350 USD. Google is also launching new features that integrate more of Google across all Chromebooks to help users work better and get things done faster.
Why does it matter?
It indicates that the next wave of computers will come equipped with advanced AI capabilities. Such widespread availability of AI can improve productivity and enhance user experience while making AI a standard part of everyday life.
Source: https://blog.google/products/chromebooks/chromebook-plus-google
Microsoft is worried about OpenAI’s deal with Apple
- Microsoft is reportedly worried about Apple collaborating with OpenAI to integrate AI technology into upcoming versions of iOS and macOS, potentially impacting Microsoft’s services and Azure cloud platform.
Microsoft has invested billions in OpenAI, securing an exclusive license for GPT-4 and other models, and uses its Azure cloud as the primary platform for OpenAI’s large language model development.
- OpenAI CEO Sam Altman recently met with Microsoft CEO Satya Nadella to discuss the concerns, as Microsoft aims to maintain its competitive edge given Apple’s vast user base and plans for new AI-powered features.
- Source: https://www.thurrott.com/a-i/303269/apple-partnering-with-openai-is-raising-concerns-at-microsoft
Mistral announces Codestral, a code-generation LLM it says outperforms all others
- Mistral introduces Codestral, a 22B parameter AI code assistant trained on more than 80 programming languages, including Swift and Python, designed to help developers write and interact with code effectively.
- Codestral offers a larger 32K context window and performs exceptionally well in the RepoBench evaluation for long-range code generation, outpacing existing models in various benchmarks and languages.
- Available for free via Mistral’s conversational AI platform Le Chat and through an API, Codestral is integrated with popular developer tools and licensed under the Mistral AI Non-Production License, restricting its use to research and testing only.
- Source: https://www.maginative.com/article/mistral-unveils-codestral-an-ai-code-assistant-trained-on-80-programming-languages/
OpenAI’s news deals continue, with Vox and the Atlantic signing on
- The Atlantic and Vox Media have entered into agreements with OpenAI, allowing the AI company to scrape and use their content in exchange for citations and links back to their original sources.
- The Atlantic will serve as a “premium news source” for OpenAI, while also launching an experimental section called Atlantic Labs to showcase new journalism-related products and features.
- Vox Media will integrate OpenAI data into its internal operations and public-facing tools, enhancing content such as the Strategist Gift Scout tool and their in-house advertising platform for better-targeted ads.
- Source: https://www.engadget.com/the-atlantic-and-vox-media-made-their-own-deal-with-the-ai-devil-161017636.html?
What Else Is Happening in AI on May 30th 2024
SambaNova sets new Llama 3 speed record with 1,000 tokens per second
The feat was accomplished using their SN40L chip, an RDU, and the Samba-1 model, a 1-trillion parameter model also known as Samba-CoE. Such Gen AI performance speed can potentially lead to significant business benefits, such as faster response times, better hardware utilization, and lower costs. (https://venturebeat.com/ai/sambanova-breaks-llama-3-speed-record-with-1000-tokens-per-second)
Apple will process data from AI applications in a virtual black box
Apple is set to introduce Apple Chips in Data Centers (ACDC). It involves processing AI data within a virtual black box, preventing employee access to ensure utmost privacy. Apple gains greater control over system design and develops more secure AI platforms by using its custom chipsets for servers and devices. (https://in.mashable.com/tech/76158/apple-plans-to-protect-user-data-by-putting-it-into-virtual-black-box-report)
SAP teams up with Amazon Bedrock for enhanced Gen AI
The collaboration simplifies AI model deployment and ensures compliance with regulatory standards within SAP’s BTP. With features like multitenancy and seamless integration with SAP applications, the integration supports tasks such as model training, inference, and deployment. (https://venturebeat.com/ai/sap-adds-amazon-bedrock-into-ai-core-streamlining-generative-ai-use-for-regulated-firms)
ChatGPT free tier now offers premium features
Free ChatGPT users can access several advanced features previously reserved for paid subscribers, such as custom GPTs, data analytics, chart creation, vision capabilities, and Memory. However, they can’t create custom GPTs of their own. (https://www.theverge.com/2024/5/29/24167436/chatgpt-4o-custom-gpts-free)
ARM’s new chip designs and software for AI on smartphones
Arm Holdings launched new designs for CPUs and GPUs to help smartphones handle AI tasks better. It will also provide software tools to make it easier for developers to run chatbots and other AI code on Arm chips. https://www.reuters.com/technology/arm-offers-new-designs-software-ai-smartphones-2024-05-29
AI Training: 📊 Unlock ChatGPT’s interactive charts and tables
OpenAI recently introduced a game-changing feature in ChatGPT that lets you analyze, visualize, and interact with your data without the need for complex formulas or coding.
- Head over to ChatGPT and select GPT-4o.
- Upload data by clicking the
button and select specific columns or rows for a clear focus.
- Prompt for interactive charts for powerful visualizations (e.g., “Create a pie chart for X distribution”).
- Hover over the different sections to see the exact values, change the chart type, and explore the legend for better understanding
🧠 AI RESEARCH: AI brain implant language breakthrough
Researchers at UC San Francisco just developed a brain implant that utilizes AI to help a stroke survivor communicate in both Spanish and English, switching between languages seamlessly via brain activity.
- The bilingual implant was tested on a patient who lost his ability to speak after suffering a stroke at the age of 20.
- An AI-powered decoding system was trained to recognize the patient’s brain activity patterns when articulating words in both languages.
- The system determined the patient’s intended language with 88% accuracy and the correct sentence 75% of the time.
- The implant allows the patient to participate in bilingual conversations and switch between languages, despite not learning English until after his stroke.
This research is another example of AI’s increasing ability to interpret our brainwaves — potentially unlocking an endless supply of new learnings, treatments, and technology. It’s also yet another massive leap for unlocking communication for stroke victims while breaking language barriers in the process.
New AI Job Opportunities on May 30th 2024 |
|
A Daily chronicle of AI Innovations May 29th 2024:
Google faces major search algorithm leak
Former OpenAI board member explains why they fired Sam Altman
Anthropic recruits ex-OpenAI safety chief to lead new ‘Superalignment’ team
OpenAI forms an AI safety committee

Google faces major search algorithm leak
- A leak involving 2,500 pages of internal documentation from Google has surfaced, offering a detailed and unprecedented look into the inner workings of its search algorithm, which is one of the most influential systems on the internet.
- The leaked documents suggest discrepancies between Google’s public statements and its internal practices, specifically highlighting the use of Chrome data and the importance of author bylines in search rankings, contrary to what Google has previously disclosed.
- Despite multiple requests for comment, Google has not confirmed the authenticity of the leaked documents, which has led to increased scrutiny and calls for greater transparency.
- Source: https://www.theverge.com/2024/5/28/24166177/google-search-ranking-algorithm-leak-documents-link-seoGoogle
Former OpenAI board member explains why they fired Sam Altman
- Former board member Helen Toner revealed on a podcast that the board lost trust in Altman due to his secret ownership of the OpenAI Startup Fund, providing inaccurate safety information, and personal retaliation against her.
- Toner explained that after two executives reported a toxic atmosphere and psychological abuse by Altman, the board decided to act, citing the launch of ChatGPT without prior board knowledge, learning about it only through Twitter, as an example of their lack of oversight.
- She believes pressure to reinstate Altman stemmed from limited options presented to employees, fear of retaliation, and Altman’s troubled history with previous jobs
- Shttps://www.theverge.com/2024/5/28/24166713/openai-helen-toner-explains-why-sam-altman-was-fired
Anthropic recruits ex-OpenAI safety chief to lead new ‘Superalignment’ team
- Jan Leike, a prominent AI researcher who recently resigned from OpenAI, has joined Anthropic to lead a new “superalignment” team focused on AI safety and security.
- Leike’s team will work on scalable oversight, weak-to-strong generalization, and automated alignment research, reporting directly to Chief Science Officer Jared Kaplan.
- Anthropic aims to distinguish itself as more safety-focused than OpenAI, with Leike’s new team echoing the mission of OpenAI’s dissolved Superalignment team.
- Source: https://www.neowin.net/news/former-openai-safety-head-jan-leike-joins-rival-firm-anthropic/
OpenAI forms an AI safety committee
OpenAI Board formed a Safety and Security Committee led by directors Bret Taylor, Adam D’Angelo, Nicole Seligman, and Sam Altman. Over the next 90 days, the committee will evaluate and further develop OpenAI’s processes and safeguards. Then, the committee will share its recommendations with the full Board.
Following the full Board’s review, OpenAI will publicly share an update on adopted recommendations. OpenAI has recently begun training its next frontier model, and the resulting systems from the Safety and Security team’s recommendation may bring it safely to the next level of capabilities on its path to AGI.
Why does it matter?
OpenAI has drawn a lot of criticism for for putting AI safety on the backseat after its super alignment team was dissolved a few days ago. However, it seems that OpenAI is trying to rise to the occasion and address these concerns while developing AI more responsibly.
Source: https://openai.com/index/openai-board-forms-safety-and-security-committee
👀 OpenAI begins training the next model
OpenAI just announced the formation of a new Safety and Security Committee to oversee the development of its next frontier AI model, which the company also said it has recently started training.
- The new committee is led by CEO Sam Altman and includes board members Bret Taylor, Adam D’Angelo, and Nicole Seligman.
- The group will provide evaluation of safety and security processes over the next 90 days, with plans to share adopted recommendations publicly.
- OpenAI said the new model will ‘bring us to the next level of capabilities on our path to AGI’.
Source: https://openai.com/index/openai-board-forms-safety-and-security-committee
💻 Google Chromebooks get AI infusion
Google just unveiled a suite of new AI-powered features for its $350 Chromebook Plus laptops, aimed at enhancing productivity, creativity, and collaboration for users.
- Google’s Gemini assistant is integrated directly into the Chromebook Plus home screen, allowing quick access to AI tools.
- The “Help Me Write” feature brings AI capabilities to all text entry fields, providing suggestions, changing tone, and rewriting text.
- Google Photos’ Magic Editor gives users advanced AI-powered image editing capabilities directly within the Chromebook Plus.
- Generative features like AI wallpapers and video call backgrounds also offer users new customization options.
- Source: https://blog.google/products/chromebooks/chromebook-plus-google
All ChatGPT Free users can now use browse, vision, data analysis, file uploads, and GPTs.
New AI Job Opportunities May 29th 2024:
Parloa – CX Design Consultant: https://jobs.therundown.ai/jobs/61082222-cx-design-consultant-f-m-d-*
DeepL – Engineering Manager: https://jobs.therundown.ai/jobs/61074472-engineering-manager-%7C-core-platform-backend
Abridge – Assistant General Counsel: https://jobs.therundown.ai/jobs/61108336-assistant-general-counsel
Writer – Enterprise Business Development Rep: https://jobs.therundown.ai/jobs/61154629-enterprise-business-development-rep
What Else Is Happening in AI on May 29th 2024
PwC set to become the first reseller of ChatGPT Enterprise
Companies no longer need to buy a ChatGPT Enterprise license directly from OpenAI. They can purchase the Gen AI service through PwC. Until today, businesses could only subscribe to the enterprise option by contacting an OpenAI salesperson. This also allows PwC to upsell its services to those who want to use ChatGPT to optimize their workloads. (https://venturebeat.com/ai/pwc-strikes-openai-deal-to-become-the-first-reseller-of-chatgpt-enterprise)
Microsoft brings Copilot AI chatbot to Telegram users
Microsoft has added an official Copilot bot within the messaging app Telegram, which lets users search, ask questions, and converse with the AI chatbot. Copilot for Telegram is currently in beta but is free for Telegram users on mobile or desktop. (https://www.theverge.com/2024/5/28/24166451/telegram-copilot-microsoft-ai-chatbot)
Opera is integrating Google’s Gemini models into its Aria browser AI
Opera announced a collaboration with Google Cloud to integrate Gemini models into its Aria browser AI. Aria is powered by Opera’s multi-LLM Composer AI engine, allowing it to curate the best user experiences based on their requirements. Thanks to this integration, Opera can now provide its users with the most current information at high performance. (https://press.opera.com/2024/05/28/opera-google-cloud-aria-gemini/)
GitHub Accelerator program empowers the open-source AI revolution
GitHub launched the 2024 Accelerator program which provides funding, mentorship, and community-building. Standout participants include Unsloth, which improves AI model efficiency, and Formbricks, which simplifies user feedback. Through this, GitHub accelerates open-source innovation and democratizes access to new tech. (https://venturebeat.com/ai/github-accelerator-fuels-open-source-ai-revolution-empowering-startups-to-democratize-access)
Elon Musk vs. Yaan LeCun on X highlights differences in AI research approach
Elon Musk and Yann LeCun, two prominent figures in AI, got into a heated debate on X over the weekend. LeCun criticized Musk’s management style at his new AI startup xAI, while Musk questioned LeCun’s recent scientific contributions. LeCun emphasizes the importance of open scientific publication, while Musk focuses on ambitious goals like artificial general intelligence (AGI). (https://venturebeat.com/ai/elon-musk-and-yann-lecuns-social-media-feud-highlights-key-differences-in-approach-to-ai-research-and-hype/))
A Daily chronicle of AI Innovations May 28th 2024:
Google AI Overview gives dangerous and wrong answers
GPT-4 is a better financial analysts than humans
Canva redesigned it’s platform with new AI features
The EU’s data protection task force has released preliminary findings on ChatGPT’s GDPR compliance
French startup AniML has launched Doly, an iPhone app simplifying 3D product video creation
Iyo, an Alphabet X spinout, will release the Iyo One, a pair of gen AI earbuds
Mastercard integrates AI to catch compromised cards faster
Peter Thiel, former PayPal CEO, believes AI will worse for math minds more than writers
Google AI Overview gives dangerous and wrong answers
Google’s AI Overviews feature, which generates AI-powered responses to user queries, has been providing incorrect and sometimes bizarre answers. From suggesting glue on pizza, staring at sun for health benefits to claiming that former US President Barack Obama is Muslim, the feature has left users questioning the reliability of AI-generated search results.
While Google maintains that these mistakes result from uncommon queries and are being used to refine the product, the technology’s widespread deployment has highlighted the challenges and risks of integrating AI into search engines.
Why does this matter?
Other companies, such as OpenAI, Meta, and Perplexity, have also experienced issues with AI hallucinations and mistakes. Companies must prioritize the development of robust safeguards and rigorous testing to ensure that AI-powered search results meet the high standards users expect from traditional search engines.
Source: https://techcrunch.com/2024/05/26/what-are-googles-ai-overviews-good-for/
GPT-4 is a better financial analysts than humans
In a groundbreaking study, researchers at the University of Chicago have discovered that GPT-4, a large language model, can analyze financial statements and predict future earnings direction with remarkable accuracy. Even without narrative context, GPT-4 outperforms human financial analysts and achieves prediction accuracy on par with state-of-the-art machine learning models.
The AI model’s success is not simply a result of its training memory but rather its ability to generate valuable narrative insights about a company’s future performance. Notably, trading strategies based on GPT-4’s predictions yield higher returns and risk-adjusted performance than those based on other models, especially for small companies.
Why does this matter?
This study shows that AI is more than just a support tool in financial decision-making. It can also be central to financial analysis and enable non-experts to make informed decisions, which may change how financial markets operate.
Source: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4835311
Canva redesigned it’s platform with new AI features
Canva is launching a redesigned platform with new AI features and tools for professional teams and workspaces. The company has introduced Canva Enterprise, a specialized tier offering more control over collaboration, brand management, and security for larger organizations.
Ad Tech integrations with Google, Meta, and Amazon streamline the ad creation process within Canva. Data autofill automates the creation of data-driven designs by integrating with sources like Salesforce and MLS.
New features include AI style matching for brands, customizable folder displays, and the ability to “star” designs and templates for easier access. Canva Docs now has a suggestion mode for editors and colorful highlight blocks for text emphasis. New “Magic Studio” AI tools include:
- Automatic clip highlighting.
- Background noise reduction for video editing.
- A text-to-graphic image generator.
Why does it matter?
Canva aims to reduce organizational complexity and “app sprawl” by offering a comprehensive creative platform. This update addresses the growing need for extensive, user-friendly solutions that can replace multiple design, AI, and workflow apps.
Source: https://www.canva.com/newsroom/news/canva-for-work
What Else Is Happening in AI on May 28th 2024
The EU’s data protection task force has released preliminary findings on ChatGPT’s GDPR compliance
While the group remains undecided on key legal issues, it suggested that “adequate safeguards” and “precise collection criteria” could help OpenAI meet the requirements. However, the AI giant’s move to Ireland may benefit from the country’s business-friendly approach to GDPR enforcement. (Link: https://techcrunch.com/2024/05/27/eus-chatgpt-taskforce-offers-first-look-at-detangling-the-ai-chatbots-privacy-compliance)
French startup AniML has launched Doly, an iPhone app simplifying 3D product video creation
Doly uses AI to estimate the 3D shape of an object from regular 2D photos, and then it applies a technique called Gaussian splatting to create a high-quality 3D model suitable for use in product videos. Users can capture a 3D model, choose a template from the library, and integrate their object into a 3D scene. (Link: https://techcrunch.com/2024/05/27/doly-lets-you-generate-3d-product-videos-from-your-phone)
Iyo, an Alphabet X spinout, will release the Iyo One, a pair of gen AI earbuds
The Iyo One will be released this winter, starting at $599, and aims to succeed, whereas competitors like Humane’s Ai Pin and Rabbit’s R1 have struggled. It integrates LLM-based models for a more sophisticated AI experience. Iyo’s founder, Jason Rugolo, believes the Iyo One will provide value through its sound isolation, comfort, and music quality, in addition to its AI features.(Link: https://techcrunch.com/2024/05/27/iyo-thinks-its-gen-ai-earbuds-can-succeed-where-humane-and-rabbit-stumbled)
Mastercard integrates AI to catch compromised cards faster
Mastercard is rolling out an AI update to its fraud-prediction technology to identify compromised credit and debit cards before criminals use them. By analyzing patterns and contextual data, the AI can proactively flag stolen cards, allowing banks to replace them and protect customers from fraudulent transactions. (https://apnews.com/article/mastercard-visa-ai-credit-card-fraud-detection-0c348818087a57b13bfac66c761e03b4)
Peter Thiel, former PayPal CEO, believes AI will worse for math minds more than writers
Thiel states that Silicon Valley in the 21st century is too biased towards math people. Thiel explained that the exact reason for ‘AI being not able to overtake human creativity,’ is not clear bets on getting worse.”He concludes that AI is not yet good at math, although its capabilities are increasing, as shown by ChatGPT scoring 96% in a UK A-level math paper with the Wolfram plug-in. (Link: https://www.financialexpress.com/life/technology-former-paypal-ceo-says-ai-may-be-good-at-maths-but-it-cant-be-a-great-writernbsp-3503476)
AI researcher Kai-Fu Lee doubled down on his 2017 prediction that AI would displace 50% of jobs by 2027, saying white collar jobs will be eliminated faster than blue collar work.
New AI Job Opportunities May 28th 2024
Luma AI – Senior Machine Learning Engineer: https://jobs.therundown.ai/jobs/61001044-senior-machine-learning-engineer
xAI – Product Data Engineer: https://jobs.therundown.ai/jobs/61057653-product-data-engineer
Databricks – Head of Solutions Architecture (Data & AI): https://jobs.therundown.ai/jobs/61030403-head-of-solutions-architecture-data-ai
Captions – Software Engineer – Backend: https://jobs.therundown.ai/jobs/60404530-software-engineer-backend-3+-years-of-experience
A Daily chronicle of AI Innovations May 27th 2024:
Elon Musk’s xAI raises $6B to build ‘Gigafactory of Compute’
Apple bets that its giant user base will help it win in AI
China invests $47 billion in largest ever chip fund
GPT-4 surpasses humans in financial analysis
Microsoft’s Recall AI: Efficiency upgrade or privacy nightmare?
AI-powered headphones let you listen to one person in a crowd
xAI’s supercomputer that’s 4x larger than existing GPU clusters
Elon Musk’s xAI raises $6B to build ‘Gigafactory of Compute’
- Elon Musk’s xAI has successfully raised $6 billion in a Series B funding round to construct a supercomputer known as the “Gigafactory of Compute,” which will be powered by 100,000 Nvidia H100 GPUs, making it at least four times larger than the largest existing GPU clusters.
- This funding will enable xAI to advance its product offerings, develop cutting-edge infrastructure, and accelerate research and development, with investors including Andreessen Horowitz, Sequoia Capital, and Saudi Prince Alwaleed bin Talal.
- The supercomputer will support the next iteration of xAI’s chatbot, as xAI aims to create advanced AI systems that are truthful, competent, and maximally beneficial for humanity, continuing Musk’s vision of a “maximum truth-seeking AI” called TruthGPT.
- Source
Apple bets that its giant user base will help it win in AI
- Apple is betting on its vast user base to give it an edge in the AI market, despite its first set of AI features not being as advanced as those from other competitors like Microsoft, Google, and OpenAI.
- The company plans to introduce AI tools integrated into its core apps and operating systems, focusing on practical, everyday uses for consumers, with much of the AI processing done on-device and more intensive tasks handled via the cloud.
- Apple’s collaboration with OpenAI and potential agreements with Google indicate it is relying on partnerships to compete in the AI space while its own AI developments are still maturing, leveraging its extensive user base to rapidly scale the use of new AI features.
- Source
China invests $47 billion in largest ever chip fund
- China has invested more than $47 billion into its largest-ever chip investment fund to pursue self-sufficiency in semiconductor manufacturing.
- This significant investment reflects China’s broader strategy to develop homegrown chipmakers amid rising technological competition with the United States.
- The fund’s third phase, supported by China’s finance ministry and state-owned banks, marks the largest investment of its kind, surpassing the combined total of the previous two phases.
- Source
GPT-4 surpasses humans in financial analysis
- New research from the University of Chicago shows that GPT-4 can analyze financial statements more accurately than humans, achieving a 60% accuracy rate compared to human analysts’ 53-57% range.
- The study indicates significant implications for the future of financial analysis, highlighting GPT-4’s versatility in performing tasks usually reserved for specialized tools and stating its performance is on par with leading machine learning models.
- While GPT-4 excels in quantitative analysis, human analysts provide valuable contextual insights, suggesting a complementary relationship between AI and humans for achieving optimal financial analysis results.
- Source
Microsoft’s Recall AI: Efficiency upgrade or privacy nightmare?
Microsoft has recently released Recall, a controversial AI-powered tool that logs everything you see and do on your computer, including app usage, live meeting communications, websites visited, and more. By simply performing a “Recall” action, users can retrieve any information they’ve interacted with on their device, presented in the context of a specific time period.
Microsoft assures users that the Recall index remains local and private on-device. Users can pause, stop, or delete captured content and can choose to exclude specific apps or websites. And Recall automatically excludes InPrivate web browsing sessions in Microsoft Edge and DRM-protected content.
Currently, Recall is exclusively compatible with new “Copilot Plus PCs” equipped with Qualcomm’s Snapdragon X Elite chips, which feature the necessary neural processing unit (NPU).
Why does this matter?
The Recall feature has sparked concern among cybersecurity experts and government authorities due to its extensive data collection and privacy risks. Recall continuously captures screenshots of users’ PC activities and creates a searchable index using AI, which may accidentally expose sensitive information to unauthorized users.
AI-powered headphones let you listen to one person in a crowd
Researchers at the University of Washington have developed a groundbreaking artificial intelligence system called “Target Speech Hearing” that allows headphone wearers to isolate and listen to a single speaker in a noisy environment. By simply looking at the desired speaker for three to five seconds, the user can “enroll” them, and the system will cancel out all other sounds, playing only the enrolled speaker’s voice in real-time, even as the listener moves around.
The proof-of-concept device, which uses off-the-shelf headphones fitted with microphones and an on-board embedded computer, builds upon the team’s previous “semantic hearing” research. The system’s ability to focus on the enrolled voice improves as the speaker continues talking, providing more training data. While currently limited to enrolling one speaker at a time and requiring a clear line of sight, the researchers are working to expand the system to earbuds and hearing aids in the future.
Why does this matter?
The Target Speech Hearing system may have applications in various settings, like business meetings, conferences, noisy public spaces, or for people with hearing difficulties. While the current system is a proof-of-concept, the researchers’ plan to expand it to earbuds and hearing aids indicates the potential for commercialization, which could lead to a new-age consumer product that enhances our sensory experiences in previously unimaginable ways.
xAI’s supercomputer that’s 4x larger than existing GPU clusters
Elon Musk has recently disclosed plans to build a groundbreaking supercomputer to power the next iteration of xAI’s chatbot, Grok. According to a presentation made to investors in May, as reported by The Information, Musk aims to have this computing system operational by the fall of 2025. The ambitious project may involve a collaboration between xAI and Oracle to develop this massive computing infrastructure.
The proposed supercomputer will consist of interconnected clusters of Nvidia’s state-of-the-art H100 graphics processing units (GPUs). Musk revealed that upon completion, the scale of this system will surpass the largest existing GPU clusters by at least fourfold!
Why does it matter?
As major tech giants and startups compete for dominance in the AI space, Musk’s proposed “gigafactory of compute” could potentially set new benchmarks for AI computing power. Also, Musk’s financial resources and personal commitment to the project position him as a formidable challenger to established AI powerhouses like OpenAI, Google, and Meta.
New AI Jobs Opportunities on May 27th 2024
Snorkel – Machine Learning Customer Engineer
OctoAI – Senior MLSys Engineer
OpenAI – Technical Program Manager, Trustworthy AI
C3 AI – Senior Revenue Accountant
What Else Is Happening in AI on May 27th 2024
YouTube Music introduces hum-to-search AI feature
YouTube Music has introduced a new AI feature that lets users search for songs by humming or singing a melody. This feature, similar to the “Hum to Search” functionality in Google Search, uses machine learning to match the user’s audio input to the original song recording. The feature is currently available in its Android app version 7.02 and is being gradually rolled out to users. (Link)
iOS 18 may introduce AI-powered custom emoji tool and smart recaps
Apple is reportedly working on a gen AI-powered custom emoji tool for iOS 18 that lets users create personalized emojis. They’re also developing a smart recaps feature that will provide users with summaries of missed notifications, messages, and documents. (Link)
Suno.ai’s music generator now offers 4-minute songs
AI music model Suno.ai has released its version 3.5 which lets users create songs up to four minutes long with improved structure. The update also introduces a sound-to-song feature, combining audio and text prompts to generate music. (Link)
Meta to use EU user data for AI training, offers opt-out
Meta is notifying Facebook and Instagram users in Europe about changes to its privacy policy, effective June 26, which allows the company to use user data for training AI models under a “legitimate interest” claim. Users can opt out of this data usage, but they must provide an explanation of how it personally affects them, with a reference to the GDPR being sufficient. (Link)
Google launches Gemini AI assistant in Messages app
Google has started rolling out its Gemini AI assistant feature in Google Messages to help users with tasks like drafting messages, brainstorming, event planning, and engaging in conversations. Gemini supports extensions like Workspace, YouTube, and Google Maps, and also lets users provide feedback on responses. (Link)
A Daily chronicle of AI Innovations May 24th 2024:
Google AI tells users to glue pizza and eat rocks
OpenAI scraps controversial nondisparagement agreement with employees
Musk’s xAI nears deal valuing startup at $24B
Cohere releases multilingual AI model, Aya 23
Arc introduces “Call Arc” for quick voice answers
Elon Musk envisions AI era, new work norms, life on Mars
OpenAI controversy: ‘Sky’ speaks out
Google AI tells users to glue pizza and eat rocks
- Google’s new AI Overviews feature is generating incorrect and sometimes absurd answers, like adding non-toxic glue to pizza as a solution to prevent cheese from falling off.
- The feature has made several other mistakes, such as stating that former US President James Madison graduated from the University of Wisconsin 21 times and claiming that Batman is a cop.
- Google acknowledges these are “isolated examples” and emphasizes the AI is experimental, but the errors highlight significant issues with the current state of AI-generated information.
- Source
OpenAI scraps controversial nondisparagement agreement with employees
- OpenAI will not enforce any previously signed nondisparagement agreements with former employees and will remove such language from its exit paperwork, as stated to Bloomberg.
- Previously, employees had to choose between speaking against the company or keeping their vested equity, potentially losing millions if they refused to sign the agreement.
- Sam Altman, OpenAI’s CEO, expressed embarrassment over the existence of the clause and pledged to rectify the paperwork, while Chief Strategy Officer Jason Kwon apologized for the distress caused.
- Source
Musk’s xAI nears deal valuing startup at $24B
- Elon Musk’s AI startup, xAI, is expected to complete a funding round in June that could value the company at over $24 billion.
- The company originally aimed to raise around $6 billion earlier this month but is now seeking to gather as much as $6.5 billion, with final deal closure still pending.
- xAI has been marketed to Silicon Valley investors using a pitch highlighting Musk’s success with Tesla and SpaceX, and plans to leverage data from his social platform X (formerly Twitter) to build its AI chatbot, Grok.
- Source
Cohere releases multilingual AI model, Aya 23
Cohere for AI (C4AI), the non-profit research group, has launched open-weight Aya 23, a new family of multilingual language models. Available in 8B and 35B parameter variants, Aya 23 supports 23 languages, including Arabic, Chinese, English, French, German, Hindi, Japanese, Spanish, and more.
Here’s a quick breakdown:
- Aya23 focuses on depth over breadth, meaning it performs better in fewer languages than their previous model, Aya 101 (which covered 101 languages).
- The 8B parameter model balances efficiency and accessibility, while an advanced 35B parameter delivers higher performance at the cost of increased computational demand.
- Aya 23 outperforms existing models like Google’s Gemma on various tasks across the languages it covers.
- Researchers can access and fine-tune Aya 23 for their needs, with the model available for free trial on Cohere Playground.
Why does this matter?
Most AI models struggle with languages besides English. The development of multilingual models can serve a much wider audience, bridging the language divide and making AI more accessible and inclusive for users around the world.
Arc introduces “Call Arc” for quick voice answers
Arc Search, an AI-powered search app, just launched a new feature called Call Arc. This lets users ask questions by holding their phone to their ear, mimicking a phone call. It provides instant voice answers, similar to voice search, but designed to be more convenient and quicker.
The app is designed to answer short, immediate questions. For example, you can ask it how long it takes to cook spaghetti or why to reserve pasta water, all while making dinner.
Call Arc complements Arc Search’s existing “Browse for me” function that generates webpages with information based on your search query.
Why does this matter?
As AI continues to simplify our lives, innovative features like Call Arc could provide a new approach to voice search, making it intuitive and accessible for users on the go. This could also lead to more nuanced search results and a more engaging experience compared to simple keyword searches.
Elon Musk envisions AI era, new work norms, life on Mars
In a Q&A session at VivaTech 2024, Elon Musk discussed diverse topics, from plans for Mars colonization to the role of AI in society.
Source: Viva Technology
Musk emphasized SpaceX’s goal of making life multi-planetary, with Mars as a key focus. He discussed the importance of reusable spacecraft and highlighted the necessity of space exploration for humanity’s long-term survival.
Regarding AI, Musk stressed the importance of honesty in AI development, criticizing approaches that prioritize political correctness over truthfulness. He also touched on AI’s potential to revolutionize education, though he expressed concerns about the impact of social media on children.
Musk envisioned a future where automation leads to a job-free society, with a universal basic income ensuring people’s needs are met.
Why does this matter?
Elon Musk’s bold predictions about Mars, the Moon, and a job-free future driven by AI are both exciting and thought-provoking. While his vision for Mars could redefine human civilization, his stance on AI emphasizes the need for integrity in technology.
It’s crucial that we engage in open and honest dialogue about AI’s implications for society.
OpenAI controversy: ‘Sky’ speaks out
There has been a wave of updates on several drama-filled OpenAI topics, including leaked documents related to the company’s controversial NDA clauses, new info surrounding the ‘Sky’ voice model, another resignation, and more.
- The voice actress behind ChatGPT’s ‘Sky’ voice has come forward, saying she was hired months before Altman contacted Scarlett Johansson.
- Leaked documents with Sam Altman’s signature contradict claims he was unaware of NDA clauses threatening to claw back former employees’ equity.
- OpenAI policy researcher Gretchen Kreuger resigned, with a post citing safety concerns about the company.
- Teasers are coming out of presentations at the VivaTech conference, showing a potential move away from the ‘GPT-5’ name and a new Sora demo.
New AI Jobs Opportunities on May 24th 2024:
Kumo – ML Solutions Architect
Palantir – Proposal Writer, Internship
Lambda – Linux Support Engineer
Anthropic – Enterprise Account Executive
What Else Is Happening in AI on May 24th 2024
YouTube launches AI-powered music creation tool
YouTube has introduced Dream Track, an AI-powered tool designed for creating music within YouTube Shorts. This experimental feature allows users to generate musical pieces based on input commands. While it is an innovative addition, it has sparked a debate over AI’s impact on the authenticity of creative work. (Link)
Ideogram launches Ideogram Tile, an AI-powered pattern generator
Ideogram has launched Ideogram Tile, a feature that creates seamless, repeating patterns from text prompts. This tool empowers designers to craft unique wallpapers, fabrics, and textures effortlessly. Following a $80 million funding round, Ideogram continues to lead in AI-driven design innovation, blending creativity with advanced technology. (Link)
Spotify tests Spanish-speaking AI DJ, “DJ Livi”
Spotify is testing a Spanish-speaking iteration of its AI DJ, named “DJ Livi.” The app code suggests a language switch option for this new feature. Although Spotify has not confirmed launch plans, this move could cater to Spanish-speaking audiences globally and in the U.S., marking a significant expansion in AI language support. (Link)
Amazon plans paid subscriptions for Alexa AI features
Amazon is reportedly gearing up to launch paid subscription plans for select Alexa AI features, aiming to enhance conversational capabilities. This move seeks to compete with advanced AI chatbots like Google’s Gemini and OpenAI’s ChatGPT. Notably, the subscription won’t be bundled with Amazon Prime, offering a standalone service. (Link)
Inworld AI launches AI-driven voice generator
Inworld AI, known for its innovative AI-driven virtual characters, unveils Inworld Voice, a dynamic voice generator for game developers. Boasting 58 diverse voices powered by advanced machine learning models, it promises expressive, cost-efficient, and high-quality voice experiences. (Link)
A Daily chronicle of AI Innovations May 23rd 2024:
Anthropic uncovers millions of concepts in Claude Sonnet’s AI model
Truecaller’s AI assistant gets a voice upgrade, thanks to Microsoft
TikTok makes ad creation easy with AI! 🤯 Scientists plan a head transplant system
Chameleon: Meta’s new multimodal LLM
OpenAI didn’t copy Scarlett Johansson’s voice, records show
Nvidia shows no signs of AI slowdown
Global outages hit Microsoft Bing, DuckDuckGo, ChatGPT Search and Copilot
OpenAI signs $250M landmark deal with News Corp
Apple may develop foldable iPhone with self-healing screen
Anthropic uncovers millions of concepts in Claude Sonnet’s AI model
Anthropic has made a breakthrough in understanding the inner workings of their AI model, Claude Sonnet, by identifying how millions of concepts are represented within it.
Using a technique called “dictionary learning,” they were able to map out these concepts, providing the first-ever detailed look inside a modern, production-grade large language model.
Key findings:
- Features linked to concepts: These concepts are linked to features, which are groups of neurons that activate together in response to specific ideas.
- Features can be manipulated: By manipulating these features, the researchers were able to influence Claude’s outputs, demonstrating a causal link between features and behavior.
- Features reveal potential risks: The research identified features corresponding to biases, potential misuse of the model, and even sycophantic behavior.
Why does this matter?
Anthropic’s research is a big step toward making AI models more transparent and trustworthy. By understanding how these models work, researchers can spot and reduce harmful biases, prevent misuse, and steer AI systems toward better outcomes, leading to safer and more reliable AI applications.
Truecaller’s AI assistant gets a voice upgrade, thanks to Microsoft
Truecaller is partnering with Microsoft to let users create an AI version of their own voice for their AI Assistant to use when answering calls.
This feature, currently limited to users with access to Truecaller’s AI Assistant, requires them to record a voice clip that Microsoft’s Azure AI Speech technology will use to create a personalized AI voice.
Truecaller says this will allow for a more personalized experience and highlights the potential of AI in communication. However, it’s important to note that Microsoft limits the use of personal voice for specific purposes and requires users to obtain consent before recording someone’s voice.
Why does this matter?
Truecaller’s AI voice assistant offers a glimpse into the future of call management, promising a more personalized experience. However, it must ensure transparency and address performance comparisons with automated voicemail systems.
The idea of speaking with an AI while calling someone may seem unusual, and it remains to be seen how well this feature will be received.
TikTok makes ad creation easy with gen AI!
TikTok has introduced “TikTok Symphony,” a suite of generative AI tools designed to help marketers create and optimize ad campaigns. The suite includes an AI video generator called “Symphony Creative Studio,” which can produce TikTok-ready videos with minimal input from advertisers, and an AI assistant named “Symphony Assistant” that helps refine scripts and provides best practice recommendations.
The company has also introduced “TikTok One,” a centralized hub for marketers to access creators, agency partners, and creative tools. Additionally, TikTok is leveraging predictive AI to drive more sales for advertisers by determining the best creative assets and target audiences based on budgets and goals.
Why does this matter?
TikTok’s integration of generative AI into its ads business signifies the growing importance of AI in digital marketing. This shift could make ad campaigns more efficient and effective, opening new opportunities for tech companies and startups.
Chameleon: Meta’s new multimodal LLM

Meta’s AI research lab just introduced Chameleon, a new family of ‘early-fusion token-based’ AI models that can both understand and generate text and images in any order.
- Unlike other models that process image and text separately and then combine them later, Chameleon works with linked sequences of both.
- Chameleon outperformed all rival models on image captioning and visual questions, while still matching performance on text-only.
- The 34B parameter Chameleon also matched or outperformed top models like Gemini Pro and GPT-4V on tests of long-form mixed-modal generations.
Chameleon shows the potential for a different type of architecture for multimodal AI models, with its early-fusion approach enabling more seamless reasoning and generation across modalities and setting new performance bars.
New AI Jobs Opportunity on May 23rd 2024
Impel AI – Test Automation Engineer
Grammarly – Researcher, Strategic Research
Tempus – Senior Cloud Security Engineer
UiPath – Senior Product Manager
What Else Is Happening in AI on May 23rd 2024
OpenAI partners with News Corp to enhance ChatGPT
OpenAI has announced a multi-year partnership with media giant News Corp to enhance ChatGPT with its high-quality journalism. The collaboration aims to provide reliable information to users, with News Corp’s mastheads displayed in response to user queries. OpenAI CEO Sam Altman called it a “proud moment for journalism and technology.” (Link)
AWS and Hugging Face team up on AI model development
AWS has partnered with AI startup Hugging Face to enable developers to run thousands of AI models on Amazon’s custom Inferentia2 chips. The collaboration aims to provide a cost-effective solution for deploying AI models, with AWS aiming to compete in the inference market currently dominated by Nvidia. (Link)
Granola’s introduces AI-powered note-taking app
Granola, a new AI startup, has launched a notepad app that combines user-written notes with AI-generated details from meeting transcripts. The app aims to make note-taking more collaborative and efficient, using OpenAI’s GPT-4 to augment and clean up your notes. (Link)
BrainBridge launches head transplant system using AI and robotics
Neuroscience startup BrainBridge has revealed its groundbreaking head transplant system that aims to provide hope for patients with untreatable conditions.
https://x.com/i/status/1793045981954523464
- The CGI demo proposes a system that uses advanced robotics, AI, and real-time molecular imaging to transfer a patient’s head or face onto a donor body.
- The company hopes to conduct the first surgery within eight years, and is releasing the concept early to attract top scientists.
- A headband with a brain-computer interface (like Neuralink) would help patients communicate and execute tasks during recovery using their minds.
While the mind-blowing () system seems highly speculative and more sci-fi than reality — so did Neuralink’s brain chips. While we’re likely a long way from this becoming possible, with the rate of advancement in robotics and AI it’s tough to rule anything out over the next decade.
The procedure uses advanced AI, robotics, and a proprietary adhesive to transplant a patient’s head onto a donor body. While it’s still early, BrainBridge expects to conduct the first surgery within eight years, pending successful feasibility studies. (Link)
Meta establishes AI advisory group
Meta CEO Mark Zuckerberg has created the Meta Advisory Group, consisting of Stripe’s Patrick Collison, Shopify’s Tobi Lütke, and others, to guide the company’s AI and technology advancements. The group will offer insights and recommendations as Meta pushes harder into AI development across various products. (Link)
A Daily chronicle of AI Innovations May 22nd 2024:
Microsoft’s first SoTA SLM to be shipped with Windows
Google unveils new AI tools for branding and product marketing
Adobe introduces Firefly AI-powered Generative Remove to Lightroom
Microsoft’s first SoTA SLM to be shipped with Windows
Microsoft announced a new small language model called Phi Silica. It has 3.3 billion parameters, which makes it the smallest model in Microsoft’s Phi family of models. Phi Silica is designed specifically for the Neural Processing Units (NPUs) in Microsoft’s new Copilot+ PCs. Despite its small size, Phi Silica can generate 650 tokens per second using only 1.5 Watts of power. This allows the PC’s main processors to be free for other tasks.
Developers can access Phi Silica through the Windows App SDK and other AI-powered features like OCR, Studio Effects, Live Captions, and Recall User Activity APIs. Microsoft plans to release additional APIs, including Vector Embedding, RAG API, and Text Summarization. These AI-powered PCs will have dedicated AI chips for running LLMs and other AI workloads.
Why does it matter?
As Microsoft continues to invest in developing small language models and integrating AI into its Windows platform, Phi Silica represents a significant step forward in making advanced AI capabilities more accessible to developers and end-users. Also, as major PC manufacturers want to introduce AI-powered laptops this summer, Microsoft might lead the way by introducing CoPilot PC+ and now Phi Silica.
Google unveils new AI tools for branding and product marketing
Google has introduced several new AI-powered features to help retailers and brands better connect with shoppers. First, Google has created a new visual brand profile that will appear in Google Search results. This profile uses information from Google Merchant Center and Google’s Shopping Graph to showcase a brand’s identity, products, and offerings.
Additionally, Google is expanding its AI-powered tools to help brands create more engaging content and ads. This includes new features in Google’s Product Studio, allowing brands to generate images matching their unique style.
Google is also launching immersive ad formats powered by generative AI, such as the ability to include short product videos, virtual try-on experiences, and 3D product views directly in search ads. These new AI-driven tools aim to help brands forge stronger, more personalized connections with consumers throughout the shopping journey.
Why does it matter?
As AI continues advancing, such tools could shape the future of marketing by delivering more personalized, hyper-relevant, and visually compelling experiences that better connect consumers with brands and products.
Adobe introduces Firefly AI-powered Generative Remove to Lightroom
Adobe has added a new AI-powered feature called Generative Remove to its Lightroom photo editing software. Generative Remove uses Adobe’s Firefly generative AI model to allow users to seamlessly remove objects from photos, even if the objects have complex backgrounds. The feature can remove images’ stains, wrinkles, reflections, and more.
Adobe has been integrating Firefly’s capabilities across its Creative Cloud apps to generate images, apply styles, fill areas, and remove objects through the new Generative Remove tool in Lightroom. It works closely with photographers to continue improving and expanding this object-removal capability. The company also announced a new Lens Blur effect that uses AI to add realistic depth-of-field blur to photos.
Why does it matter?
The Generative Remove feature will make it easier for photographers, designers, and other creatives to edit their images, saving time and effort. Looking ahead, we can expect Adobe and other creative software companies to explore new ways to harness Gen AI to automate tedious tasks, provide intelligent assistance, and enable entirely new creative possibilities.
AI RESEARCH:
Research: Mapping the ‘mind’ of an LLM

Anthropic just published new research that successfully identified and mapped millions of human-interpretable concepts, called “features”, within the neural networks of Claude.
The details: |
|
Why it matters: Despite how fast AI is accelerating, we still don’t have a strong understanding of what’s going on beneath the hood of LLMs. This research is a major step towards making AI more transparent — enabling better understanding, control, and safeguarding of these powerful tools.
New AI Job Opportunities on May 22nd 2024
Databricks – Architect: Big Data
Grammarly – Business Development Representative
Perplexity AI – Backend Software Engineer
Cohere – Director, Solutions Architect
What Else Is Happening in AI on May 22nd 2024
Elon Musk’s xAI plans to make Grok multimodal
According to public developer documents, Elon Musk’s AI company, xAI, is making progress on adding multimodal inputs to the Grok chatbot. This means soon, users may be able to upload photos to Grok and receive text-based answers. This was first teased in a blog post last month from xAI, which said Grok-1.5V will offer “multimodal models in a number of domains.” (Link)
Microsoft’s new Copilot AI agents to work like virtual employees
Microsoft will soon allow businesses and developers to build AI-powered Copilots that can work like virtual employees and perform tasks automatically. Instead of Copilot sitting idle waiting for queries, it will be able to monitor email inboxes and automate tasks or data entry that employees normally have to do manually. (Link)
Microsoft Edge introduces real-time AI translation and dubbing for YouTube
Microsoft Edge is set to introduce real-time translation and dubbing for videos on platforms like YouTube, LinkedIn, and Coursera. This new AI-powered feature will translate spoken content live, offering dubbing and subtitles. Currently, the feature supports translations from Spanish to English and English to German, Hindi, Italian, Russian, and Spanish. (Link)
WitnessAI builds guardrails for Gen AI models
WitnessAI is developing tools to make Gen AI models safer for businesses. The company’s platform monitors employee interactions and custom AI models, applying policies to reduce risks like data leaks and biased outputs. The platform also offers modules to enforce usage rules and protect sensitive information. Lastly, it encrypts and isolates data for each customer. (Link)
Microsoft’s Azure AI Studio supports GPT-4o
Microsoft has announced that Azure AI Studio is now generally available and supports OpenAI’s GPT-4o model, which joins over 1,600 other models, including Mistral, Meta, Nvidia, etc. Developers can use this multimodal foundational model to incorporate text, image, and audio processing into their apps to provide generative and conversational AI experiences. (Link)
Daily chronicle of AI Innovations May 21st 2024:
Microsoft announces AI-powered MacBook competitors
Microsoft’s AI chatbot will ‘recall’ everything you do on a PC
Scarlett Johansson told OpenAI not to use her voice
TSMC and ASML machines are equipped with remote self-destruct in case of invasion
Volvo introduces first autonomous truck
Microsoft’s New AI PCs Rival Apple’s MacBooks
Scarlett Johansson sues OpenAI for using her voice in ChatGPT
DINO 1.5 is smarter and faster at object detection
Microsoft announces AI-powered MacBook competitors
- Microsoft introduces “Copilot Plus PCs,” new Windows laptops with built-in AI hardware and features, launching June 18th with Qualcomm processors.
- These AI-enabled laptops, from major partners like Dell and Lenovo, will include a neural processor, at least 16GB RAM, and 256GB SSD, supporting over 40 AI models and OpenAI’s GPT-4o model.
- Promising significant performance and battery life improvements, Microsoft expects to sell 50 million Copilot Plus PCs in the next year, marking a new era for Windows laptops.
- Source
Microsoft’s New AI PCs Rival Apple’s MacBooks
Microsoft revealed Copilot+ PCs, a new category of Windows PCs designed for AI. These PCs boast powerful processors, all-day battery life, and AI features like Recall for instant memory, Cocreator for image creation, Live Captions for real-time translations, and Auto Super Resolution for games.

The recall feature, which allows users to search and recall anything they’ve seen and interacted with on their computer screens with natural language, is especially impressive. The new PCs feature an all-new system architecture with CPU, GPU, and a high-performance Neural Processing Unit (NPU) working together. Starting at $999, Copilot+ PCs are equipped with OpenAI’s GPT-4o models.
Why does this matter?
Microsoft claims its new Arm-powered Copilot Plus PCs will outperform the MacBook Air with M3 by over 50% on sustained performance. The tech giant is advancing Arm-based computing by using AI at every level and partnering with Qualcomm. If these machines live up to the hype, they could upset the dominance of Intel-based laptops and challenge Apple’s M-series processors.
TSMC and ASML machines are equipped with remote self-destruct in case of invasion
- ASML and TSMC have the capability to disable their chipmaking machines if China invades Taiwan, according to sources.
- ASML assured officials it can remotely shut down these machines, including its EUV units, which are crucial for producing the world’s smallest microchip transistors used in both AI and military applications.
- US government officials, concerned about the potential impact of a Chinese invasion on the global chip supply, have discussed these security measures with both Dutch and Taiwanese counterparts.
- Source
Microsoft’s AI chatbot will ‘recall’ everything you do on a PC
- Microsoft introduced an upgraded version of its AI assistant, Copilot, that will remember users’ activities on their PCs to predict their next actions.
- The new AI features, including Windows Recall, aim to enhance user experience by providing a virtual “photographic memory,” while allowing users to control their privacy settings.
- It’s a step toward machines that “instantly see us, hear, reason about our intent and our surroundings,” said CEO Satya Nadella.
- Source
Scarlett Johansson told OpenAI not to use her voice
- Scarlett Johansson stated that OpenAI approached her in September about supplying her voice for GPT-4o, but she declined the offer.
- After hearing a demo of the ChatGPT system’s “Sky” voice resembling her own, Johansson expressed shock and mentioned hiring legal counsel to address the issue with OpenAI.
- OpenAI, while denying that Sky’s voice was meant to mimic Johansson’s, paused the use of the voice out of respect and announced plans to introduce additional voices in the future.
- Source
Scarlett Johansson sues OpenAI for using her voice in ChatGPT
Scarlett Johansson claims OpenAI asked her to voice ChatGPT, but she declined. Later, OpenAI released a voice named “Sky” that sounded eerily similar to her. Johansson was shocked and angered by the similarity and has hired legal counsel to investigate how the “Sky” voice was created.
OpenAI denies that the “Sky” voice was intended to resemble Johansson, has paused using it in its products, and apologizes for not communicating better. Johansson seeks transparency from OpenAI and believes that individual rights must be protected in the era of deep fakes and AI content.
Why does this matter?
Another day, another OpenAI drama. OpenAI already faces lawsuits from authors and news organizations over copyright infringement in training its AI models. Johansson’s case shows the limited legal protection actors have against AI mimicking their likeness, and some lawmakers are proposing bills to address the issue.
DINO 1.5 is smarter and faster at object detection
IDEA Research launched the Grounding DINO 1.5 open-world object detection model series, with Grounding DINO 1.5 Pro for high-performance detection and Grounding DINO 1.5 Edge for efficient edge computing. Grounding DINO 1.5 Pro achieves state-of-the-art zero-shot transfer performance on several academic benchmarks, surpassing its predecessor.
The model shows strong detection capabilities across various scenarios, including common objects, long-tailed categories, dense objects, and caption phrase grounding. Grounding DINO 1.5 Pro uses a larger Vision Transformer backbone and is pretrained on the high-quality Grounding-20M dataset.
Why does this matter?
Grounding DINO 1.5 isn’t just about raw performance—it’s also incredibly versatile. Fine-tuning the model on specific datasets leads to even more impressive results, suggesting that it can lead us to make more reliable robotics, automated systems, and more while requiring less training time and data.
Volvo introduces first autonomous truck
- Volvo has unveiled its first production-ready self-driving truck, developed in partnership with the autonomous driving technology company Aurora.
- This autonomous semi truck, based on Volvo’s VNL Class 8 model, is equipped with an array of sensors and cameras to support Aurora’s Level 4 autonomous driving system, which does not require a human driver.
- Volvo and Aurora started collaborating in 2018, and their autonomous trucks have already logged 1.5 million miles on commercial roads; Aurora aims to deploy 20 fully autonomous trucks this year and plans to scale up to 100 trucks by 2025.
- Source
What Else Is Happening in AI on May 21st 2024
HP is introducing simplified laptop “AI PCs” lineup
HP’s new “AI PCs,” the OmniBook X AI and EliteBook Ultra AI, feature Snapdragon X Elite 12-core CPUs. HP’s AI Companion software provides access to ChatGPT-3.5, while Poly Camera Pro offers AI camera controls. Moreover, the OmniBook X AI starts at $1,199.99 and the EliteBook Ultra AI at $1,699.99. (Link)
Google is investing 1 billion Euros to expand its data center in Finland
This move aims to drive AI business growth in Europe. Nordic countries are popular for data centers due to cooler climate, tax breaks, and abundant renewable power. Moreover, the heat from this data center will be used to warm up local homes and buildings. Google’s Hamina data center in Finland already operates with 97% carbon-free energy and aims for net zero emissions by 2030. (Link)
Dell Technologies expands its generative AI capabilities with the Dell AI Factory
The Dell AI Factory with NVIDIA integrates hardware and software to support advanced use cases like RAG and digital assistants. Dell is also deepening partnership with Hugging Face, Meta, and Microsoft to bring open models and Azure AI services to on-premises deployments. The goal is to make it easy for enterprises to assess and implement gen AI on-premises, close to their data. (Link)
Intel’s Lunar Lake laptop processors are set to launch in Q3 2024
These processors are designed for AI Copilot Plus PCs. They offer 3x the AI performance of its predecessor, Meteor Lake. Intel claims Lunar Lake is 1.4 times faster in Stable Diffusion 1.5 compared to Qualcomm’s Snapdragon X Elite. The company aims to ship 40 million AI PC processors by the end of the year, with Lunar Lake featured in over 80 new laptop designs. (Link)
OpenAI CEO Sam Altman apologized for the company’s strict off-boarding agreement
This off-boarding agreement could potentially cancel departing employees’ vested equity. Meanwhile, departing execs are speaking out about the need for serious AGI safety measures. Altman and co-founder Greg Brockman defended OpenAI’s launch procedures, emphasizing their commitment to safety. (Link)
A Daily chronicle of AI Innovations May 20th 2024:
Google’s AI panic looks like Google+ fiasco
OpenAI pauses Scarlett Johansson-like voice for ChatGPT
Snapchat focuses on AI with $1.5 billion yearly investment
Apple and OpenAI plan major announcement at WWDC
OpenAI’s “superalignment team,” focused on the AI risks, is no more
Sony Music warns over 700 AI companies not to steal its content
Meta’s Chameleon AI sets a new bar in mixed-modal reasoning
Google’s AI panic looks like Google+ fiasco
- Scott Jenson, a former Google employee, criticizes the company’s AI projects as poorly motivated and driven by panic, comparing the situation to the Google+ fiasco.
- According to Jenson, Google aims to create a Jarvis-like assistant to keep users within its ecosystem, driven by fear that competitors might get there first, similar to Apple’s strategy with Siri and OpenAI.
- Many of Google’s AI projects revealed at I/O 2024 are still experimental or limited in availability, with uncertain value, and features like “AI Overviews” in Google Search have received criticism for potential copyright issues and misinformation.
- Source
OpenAI pauses Scarlett Johansson-like voice for ChatGPT
- OpenAI is pulling the ChatGPT voice known as Sky, which sounds similar to Scarlett Johansson, due to concerns about mimicking celebrities’ voices.
- The company asserts that Sky’s voice is the natural voice of a different professional actress and not an intentional imitation of Johansson.
- The decision to pause Sky’s use follows recent enhancements to ChatGPT’s voice mode, part of the new GPT-4o model, which aims to make the assistant more expressive and capable of reading facial expressions and translating spoken language in real-time.
- Source
Snapchat focuses on AI with $1.5 billion yearly investment
- Snap’s CEO, Evan Spiegel, announced a major investment of $1.5 billion annually in AI and machine learning to improve Snapchat’s features and competitiveness.
- After successfully revamping Snapchat’s advertising model, the company will now focus on collaborating with tech giants like Amazon and Google for cloud partnerships to enhance AI product innovations.
- With growing ad revenue and new successful ad campaigns, Snapchat plans to expand content offerings and augmented reality experiences, enhancing user engagement and competing with platforms like TikTok.
- Source
Apple and OpenAI plan major announcement at WWDC

- Apple and OpenAI are planning a major joint announcement at WWDC, focusing on integrating OpenAI’s technology into iOS 18, according to Mark Gurman from Bloomberg.
- Apple aims to improve its AI capabilities with this integration, including enhancements to Siri and features like automatic summaries of notifications and transcription of voice memos.
- While Apple acknowledges it is behind in AI and lacks a chatbot like ChatGPT or Google Gemini, the company believes the collaboration with OpenAI is a sufficient, though temporary, solution.
- Source
OpenAI’s “superalignment team,” focused on the AI risks, is no more
The team’s co-leads, Ilya Sutskever and Jan Leike, have resigned from OpenAI. Several other researchers from the team and those working on AI policy and governance have also left the company. Leike cited disagreements with OpenAI’s leadership about the company’s priorities and resource allocation as reasons for his departure.
(Source)
The team’s work will be absorbed into OpenAI’s other research efforts, with John Schulman leading research on risks associated with more powerful models.
Why does this matter?
The “superalignment” team was for ensuring the artificial general intelligence (AGI) that OpenAI claims to be working on doesn’t turn on humankind. This dismantling raises questions on the company’s commitment to AI safety and ethical standards.
Sony Music warns over 700 AI companies not to steal its content
Sony Music, home to superstars like Billy Joel and Doja Cat, sent letters to over 700 AI companies and streaming platforms, warning them against using its content without permission. The label called out the “training, development, or commercialization of AI systems” that use copyrighted material, including music, art, and lyrics.
SMG recognizes AI’s potential but stresses the need to respect songwriters’ and artists’ rights. The letter asks companies to confirm they haven’t used SMG content without permission or provide details if they have.
Why does this matter?
The battle over music copyright and AI has intensified across various platforms, from YouTube’s strict rules for AI-generated music to the recent standoff between Universal Music Group and TikTok. As AI voice clones and music generation tools become more sophisticated, artists question control, compensation, and actions against copyright infringement.
Meta’s Chameleon AI sets a new bar in mixed-modal reasoning
Meta AI introduces Chameleon, a family of early-fusion token-based mixed-modal models that understands and generates images and text in any order. Unlike recent foundation models that process text and images separately, Chameleon unified token space allows it to process interleaved image and text sequences.This approach allows seamless reasoning and generation across modalities.
Meta researchers introduced architectural enhancements and training techniques to tackle the optimization challenges posed by this early fusion approach, including a novel image tokenizer, QK-Norm, dropout, and z-loss regularization. Remarkably, Chameleon achieves competitive or superior performance across various tasks, outperforming larger models like Flamingo-80B and IDEFICS-80B in image captioning and visual question answering despite its smaller model size.
Why does this matter?
Chameleon opens up new possibilities for more natural and intuitive human-machine interactions, similar to how we effortlessly communicate using both modalities in the real world.
Trending AI Tools May 20th 2024:
Paperpal – AI writing assistant offering grammar checks, paraphrasing, predictive text, and plagiarism detection. Use code RUN30 for 30% off*
ElevenLabs Audio Native – Add human-like narration to your blog or news site
Framer – Generate a responsive website with no code
Glitter – Turn any process into a step-by-step guide
TestSprite Beta – Automate end-to-end software testing with AI
Buffup – AI assistant that learns your intent, powered by GPT-4o
New AI Job Opportunities on May 20th 2024:
Databricks – Director, Field Engineering
Palantir – Web Application Developer
Meta – GenAI Program Manager
Weights & Biases – Software Engineer, Engineering Productivity
What Else Is Happening in AI on May 20th 2024
Google launched open-source Model Explore to visualize and debug complex AI models
It uses advanced graphics rendering techniques from the gaming industry to handle massive models. The tool offers a graphical user interface and a Python API for integration into machine learning workflows. Model Explorer lets developers identify and resolve issues quickly, especially for AI deployed on edge devices. (Link)
The UK’s AI Safety Institute is opening an office in San Francisco
The institute aims to be closer to the epicenter of AI development, companies like OpenAI and Google as they are building foundational models. This new office would open this summer, giving the UK access to Silicon Valley’s tech talent and strengthening ties with the US. (Link)
The EU demands Microsoft to provide internal documents on Bing’s gen AI risks
The Commission suspects Bing may have breached the Digital Services Act (DSA) due to risks like AI “hallucinations,” deep fakes, and potential voter manipulation. Microsoft has until May 27 to comply with the legally binding request for information. Failure to do so could result in fines of up to 1% of Microsoft’s total annual income or worldwide turnover. (Link)
Snapchat CEO Evan Spiegel focuses on AI and ML for better UX and personalization
As its ad revenue increases, Snap plans to expand content offerings, improve recommendation algorithms, and integrate Stories with Spotlight. The company is also investing in augmented reality and sees it as a way to bring people together in shared physical environments. (Link)
Researchers in the Netherlands have developed an AI sarcasm detector
The AI was trained on text, audio, and emotional content from US sitcoms, including Friends and The Big Bang Theory. The AI could detect sarcasm in unlabeled exchanges nearly 75% of the time. Further improvements could come from adding visual cues to the AI’s training data. (Link)
AI Weekly Rundown May 11-May 18th 2024: Major Breaking News from OpenAI ChatGPT-4o, Google AI Veo, Apple AI-enhanced eye tracking, Meta AI-assisted earphones , Stability AI, Humane AI
Sam Altman quashes search engine launch rumors
iOS 18 iPhones may soon have ChatGPT
SoftBank’s Arm Holdings to come up with AI chips in 2025
GPT-4o: 2x faster, 50% cheaper, 5x rate limits than GPT-4 Turbo
TikTok tests ChatGPT-powered search results
Meta explores AI-assisted earphones with built-in cameras
Google announced a wide array of updates across its AI ecosystem
Ilya Sutskever, OpenAI’s co-founder and chief scientist, has left the company
Apple unveils AI-enhanced eye tracking, music haptics, voice shortcuts
Humane’s AI Pin upgraded with GPT-4o
Stability AI seeks buyer amid financial woes
OpenAI will bring Reddit content to ChatGPT and new products
OpenAI introduces improvements to data analysis in ChatGPT
GenZ and millennials optimistic about GenAI use at work: Deloitte Survey
A Daily chronicle of AI Innovations May 17th 2024: OpenAI introduces improvements to data analysis in ChatGPT
GenZ and millennials optimistic about GenAI use at work: Deloitte Survey
OpenAI strikes Reddit deal to train its AI on your posts
Sony Music warns 700+ tech companies over ‘unauthorized’ use of its content to train AI
The first search engine has been resurrected
Deepfake scam costs firm behind Sydney Opera $25 million
OpenAI introduces improvements to data analysis in ChatGPT

OpenAI is rolling out enhancements to data analysis:
- Upload the latest file versions directly from Google Drive and Microsoft OneDrive. This allows ChatGPT to understand your Google Sheets, Docs, Slides, and Microsoft Excel, Word, and PowerPoint files more quickly
- Interact with tables and charts in a new expandable view
- Customize and interact with bar, line, pie, and scatter plot charts in the conversation.
- Work on tables in real-time
These improvements will be available in OpenAI’s new flagship model, GPT-4o, for ChatGPT Plus, Team, and Enterprise users over the coming weeks.
Why does this matter?
This are exciting updates for those who use ChatGPT to work on documents for data analysis. It should save users time, give them more options and ease when using the chatbot, and make data-driven writing easier.
GenZ and millennials optimistic about GenAI use at work: Deloitte Survey
Deloitte’s 2024 Gen Z and Millennial Survey connected with more than 22,800 respondents in 44 countries to explore their attitudes about work and the world around them. One of the key findings was that Gen Zs and millennials are feeling uncertain about GenAI and its potential impact on their careers.
However, respondents who frequently use GenAI at work are more likely to say they feel excitement about, and trust in the technology. Frequent users of GenAI are also more likely to believe it will free up their time, improve the way they work, and improve their work/life balance.
But, conversely, the more a respondent uses GenAI the more likely they are to have some concerns as well.
Why does this matter?
Gen AI is about to bring a huge transformation in the way we work and live. Despite concerns regarding it, GenZs and millennials are starting to think about how to adapt, whether their employers do or do not.
OpenAI strikes Reddit deal to train its AI on your posts
- OpenAI has signed a deal with Reddit to access real-time content from Reddit’s data API, similar to Reddit’s earlier $60 million deal with Google.
- The partnership will allow Reddit to implement new AI-powered features and applications while also making OpenAI an advertising partner on the platform.
- Details on financial terms or training data were not disclosed, and Reddit users have previously shown resistance to such changes, protesting API pricing alterations in June 2023.
- Source
Sony Music warns 700+ tech companies over ‘unauthorized’ use of its content to train AI
- Sony Music has sent letters to over 700 AI companies and streaming platforms, warning them against the unauthorized use of its content for AI systems, which it claims denies artists and the label control and compensation.
- The music industry has been notably proactive in controlling the usage of its copyrighted material in AI tools, with platforms like YouTube enforcing strict rules for music content while introducing AI tools like Dream Track.
- Universal Music Group similarly faced a dispute with TikTok over licensing, leading to a temporary removal of music by major artists like Taylor Swift and Ariana Grande until a new deal was brokered to protect artists’ rights and explore monetization opportunities involving AI.
- Source
Deepfake scam costs firm behind Sydney Opera $25 million
- A British design firm, Arup, which is known for its work on the Sydney Opera House, lost $25.6 million in a deepfake scam involving a fake CFO and other staff members.
- An employee in Arup’s Hong Kong office fell victim to the fraud after being convinced by realistic deepfake videos to transfer the money over 15 transactions, realizing the scam only after cross-checking with the UK office.
- Despite the significant financial loss, Arup stated that their financial stability and business operations were unaffected, and the case is currently under police investigation.
- Source
What Else Is Happening in AI on May 17th 2024
Slack AI is training with customer data, but claims it is privacy-driven
To develop AI/ML models, Slack is analysing Customer Data (e.g. messages, content and files) submitted to Slack as well as Other Information (including usage information) as defined in their privacy policy and in customer agreement. But it is assuring users that their data won’t be shared across workspaces and has offered options to opt out of contributing to global models. (Link)
Hugging Face is committing $10 million worth of compute to help beat big AIs
Hugging Face aims to level the playing field with AI giants like OpenAI by donating shared GPUs to the community through a new program called ZeroGPU. The shared GPUs are accessible to multiple users or applications concurrently, eliminating the need for each user or application to have a dedicated GPU. (Link)
Microsfot to offer its cloud customers AMD alternative to Nvidia AI processors
Microsoft will give its cloud customers a platform of AMD AI chips that will compete with components made by Nvidia, with details to be given at its Build developer conference next week. It will also launch a preview of its new Cobalt 100 custom processors at the conference. (Link)
Ampere teams up with Qualcomm to launch an Arm-based AI server
The two companies are combining forces to to offer an AI-focused server that uses Ampere’s CPUs and Qualcomm’s Cloud AI 100 Ultra AI inferencing chips for running– not training– models. (Link)
Gannett is adding AI-generated summaries to the top of its articles
Gannett, the media company that owns hundreds of newspapers in the US, is launching a new program that adds AI-generated bullet points at the top of news stories. The summaries appear to be already live and are aimed to enhance the reporting process and elevate the audience experience. (Link)
A Daily chronicle of AI Innovations May 16th 2024:
Microsoft’s AI ambitions lead to 30% rise in emissions
Stability AI reportedly runs out of money and in negotiations to sell
Apple unveils AI-enhanced eye tracking, music haptics, voice shortcuts 
Humane’s AI Pin upgraded with GPT-4o 
Stability AI seeks buyer amid financial woes
Mozilla Firefox incorporates NVIDIA RTX video to improve video quality on Windows
Musk’s AI firm xAI In talks with Oracle for $10 billion cloud deal
Snowflake invests in Metaplane to enhance data quality 😱 Ilya Sutskever and Jan Leike leave OpenAI 📱 Android phones enter the AI era


Microsoft’s AI ambitions lead to 30% rise in emissions
- Microsoft’s commitment to being carbon-negative by 2030 is threatened by a 30% increase in its carbon emissions, largely due to its growing focus on AI technologies.
- The company’s expansion of AI operations necessitates building more data centers, which rely on energy-intensive materials like steel, concrete, and microchips, further complicating their carbon reduction goals.
- Despite its renewable energy claims, Microsoft is acknowledged for using instruments like renewable energy credits (RECs) that might mask actual carbon emissions, indicating a challenging path ahead to meet its sustainability targets.
- Source
Stability AI reportedly runs out of money and in negotiations to sell
- Stability AI, known for its open-source image generator Stable Diffusion, is negotiating a sale due to financial struggles, despite a valuation of $1 billion.
- The company raised over $100 million in 2022 but struggled to turn its free software into revenue while facing stiff competition from rivals like OpenAI and Midjourney.
- Founder and CEO Emad Mostaque resigned amid management issues, and the company faces significant expenses and liabilities, projecting $153 million in costs against only $11 million in revenue for 2023.
- Source
Apple unveils AI-enhanced eye tracking, music haptics, voice shortcuts
Apple recently unveiled new accessibility features that will launch later this year. One of these features is Eye Tracking, which enables users with physical disabilities to navigate their iPhones and iPads using only their eyes.
By leveraging the front-facing camera and AI, Eye Tracking allows precise control through a “Dwell Control” feature, eliminating the need for additional hardware.
Another noteworthy addition is Music Haptics, designed for users who are deaf or hard of hearing. When enabled, the Taptic Engine in iPhones plays taps, textures, and refined vibrations synchronized with music audio.
Additionally, Apple has introduced Voice Shortcut, which allows users to create custom voice commands for specific actions on their devices. With Voice Shortcut, users can perform tasks hands-free by simply speaking their personalized commands
Why does this matter?
This would encourage the development and advancement of AI technologies in the accessibility domain, ultimately benefiting individuals with disabilities across different platforms and devices.
Humane’s AI Pin upgraded with GPT-4o
Humane has upgraded its popular AI Pin product with the GPT-4o model. This upgrade aims to enhance the AI Pin’s performance. The new version runs faster, provides more accurate answers, and offers richer information.
Why does this matter?
As more sophisticated AI models become accessible, we can expect further improvements in various applications, from digital assistants to natural language understanding and generation.
OpenAI co-founder and chief scientist Ilya Sutskever announced that he is leaving the company — following months of speculation of Sutskever’s role from the November 2023 Sam Altman ousting.
- Sutskever said he is confident that OpenAI will ‘build AGI that is both safe and beneficial’ under the current leadership.
- Also leaving is Sutskever’s superalignment group co-lead Jan Leike, announcing his departure with a cryptic resignation post.
- The news follows months of departures largely coming from OpenAI’s superalignment and safety teams, fueling speculation.
- OpenAI CEO Sam Altman named Jakub Pachocki as the new chief scientist, a key researcher on the creation of GPT-4.
After months of tensions, the speculation around Ilya’s future with Sam is finally put to rest. But questions surrounding the safety team departures still remain. Also important to watch is where Sutskever and Leike land next, with two of AI’s brightest minds now officially on the market.
Google announced a host of new AI integrations coming to Android phones at its I/O Developer Conference, bringing its powerful Gemini model on-device to enable upgraded smartphone experiences.
- Google’s Gemini Nano model will be integrated into the Pixel later this year, allowing for enhanced multimodal capabilities.
- Gemini features will be easily accessible with a new overlay, which improves and understand context to provide dynamic suggestions.
- A Circle to Search feature, allowing users to query anything on screen, gains homework help features via a LearnLM model.
- Google is also bringing a new AI security feature soon, providing real-time alerts on calls that appear to be scams.
While everyone awaits the iPhone AI announcements, Google’s Android AI era is rolling. With Gemini coming directly on-board, the potential is there — but if Apple integrates ChatGPT, it may remain tough sledding against the dominant market leader.
A new model on Replicate called OpenVoice lets you clone any voice for free with just an audio file and the desired text to be turned into speech.
Step-by-step: |
|
New AI Job Opportunities on May 16th 2024
What Else Is Happening in AI on May 16th 2024
Instagram co-founder joins Anthropic as head of product
Anthropic has appointed Mike Krieger, the co-founder of Instagram and Artifact, as head of product. In this pivotal role, Krieger will steer the company’s product engineering, management, and design efforts as Anthropic expands its suite of AI apps and aims to bring Claude to a wider audience. (Link)
IBM plans 800 new jobs in Ireland
IBM plans to hire up to 800 more people in Ireland to help build advanced software underpinned by AI technology. This is the biggest job announcement in the country’s multinational sector this year. The roles will be in research and development, digital sales, and consulting. (Link)
Mozilla Firefox incorporates NVIDIA RTX video to improve video quality on Windows
Mozilla Firefox has incorporated NVIDIA RTX Video, a technology that leverages AI to improve video quality on Windows PCs and workstations. This brings a host of AI-powered enhancements to online videos, including upscaling and HDR, for a more immersive viewing experience. Users can easily access these features by updating GeForce RTX drivers and activating Windows HDR. (Link)
Musk’s AI firm xAI In talks with Oracle for $10 billion cloud deal
Elon Musk’s AI startup, xAI, is in talks with Oracle executives about a $10 billion deal to rent cloud servers from Oracle over several years. This move comes as Musk’s company seeks to keep pace with rivals like OpenAI, Microsoft, and Google to dominate the AI field. (Link)
Snowflake invests in Metaplane to enhance data quality
Snowflake, a data cloud platform, has invested in Metaplane, a Boston-based startup. Metaplane focuses on helping enterprises identify and rectify data quality issues using an end-to-end AI-powered platform. The investment aims to enhance integration between Metaplane’s data observability offering and the Snowflake data cloud. (Link)
A Daily chronicle of AI Innovations May 15th 2024:
Google unveiled the ‘future of AI’ at I/O event
OpenAI chief scientist Ilya Sutskever is leaving
Former Cruise CEO starts robotics firm with ex-Tesla AI manager
Google announced a wide array of updates across its AI ecosystem
Ilya Sutskever, OpenAI’s co-founder and chief scientist, has left the company
Expedia is launching AI features, including a bot named Romi
NVIDIA and Recursion have collaborated to build BioHive-2, an AI supercomputer for drug discovery
NASA appoints David Salvagnini, its first chief AI officer
SoftBank plans to invest $75-$150 million in Indian data center and industrial robotics
Microsoft announces €4 billion investment in France to accelerate AI adoption 🎥 Google’s Gemini updates and Sora competitor🔎 Google’s AI agents and AI search upgrades
Google just kicked off its I/O Developer’s Conference, announcing a wide array of updates across its AI ecosystem — including enhancements across its flagship Gemini model family and a new video generation model to rival OpenAI’s Sora.
Gemini model updates:
- New updates to 1.5 Pro include a massive 2M context window extension and enhanced performance in code, logic, and image understanding.
- Gemini 1.5 Pro can also utilize the long context to analyze a range media types, including documents, videos, audio, and codebases.
- Google announced Gemini 1.5 Flash, a new model optimized for speed and efficiency with a context window of 1M tokens.
- Gemma 2, the next generation of Google’s open-source models, is launching in the coming weeks, along with a new vision-language model called PaliGemma.
- Gemini Advanced subscribers can soon create customized personas called ‘Gems’ from a simple text description, similar to ChatGPT GPTs.
Video and image model upgrades:
- Google revealed a new video model called Veo, capable of generating over 60-second, 1080p resolution videos from text, image, and video prompts.
- The new Imagen 3 text-to-image model was also unveiled with better detail, text generation, and natural language understanding than its predecessor.
- VideoFX text-to-video tool, featuring storyboard scene-by-scene creation and the ability to add music to generations.
- VideoFX is launching in a ‘private preview’ in the U.S. for select creators, while ImageFX (with Imagen 3) is available to try via a waitlist.
Why it matters: Gemini’s already industry-leading context window gets a 2x boost, enabling endless new opportunities to utilize AI with massive amounts of information. Additionally, Sora officially has competition with the impressive Veo demo — but which one will make it to public access first?
Google just showcased its new AI agent project ‘Project Astra’, alongside a slew of updates to infuse AI across search and enable Gemini to reason and take more advanced actions for users.
Progress on AI agents:
- Google announced Project Astra, a real-time AI agent prototype that can see, hear, and take actions on a user’s behalf.
- The demo showcased a voice assistant responding to what it sees and hears, including code, images, and video — capable of advanced reasoning and recall.
- Public access for Astra is expected through the Gemini app later this year.
- Google also showed off ‘AI teammates’, agents that can answer questions on emails, meetings, and other data within Workspace.
- Live is also rolling out in the coming months, allowing users to speak and converse with Gemini in near real-time.
Search upgrades:
- Google Search now features expanded AI Overviews, advanced planning capabilities, and AI-organized search results.
- Gemini will be able to execute more complex planning, such as planning, maintaining, and updating trip itineraries.
- Search will also receive ‘multi-step reasoning’ capabilities, allowing Gemini to break down questions and speed up research.
- Users can also now ask questions with video, allowing Search to analyze visual content and provide helpful AI Overviews.
Why it matters: We officially have a new voice assistant battle — with OpenAI and Google both showcasing mind-blowing new capabilities in just the last two days alone. Also, despite rumblings of an OpenAI search product and excitement over platforms like Perplexity, it’s going to be difficult to unseat the king of search. Especially as they integrate advanced AI across the entire ecosystem in an impressive fashion.
Former Cruise CEO starts robotics firm with ex-Tesla AI manager
- Former Cruise CEO Kyle Vogt, who resigned in November following a self-driving car accident, has launched a new robotics firm called The Bot Company with former Tesla AI Manager Paril Jain.
- The Bot Company aims to develop robots that handle everyday chores to give people more free time, and has already raised $150 million from investors.
- The announcement of The Bot Company comes amid significant challenges for Cruise, which had its self-driving operations suspended by the California DMV following a pedestrian accident.
- Source
Google announced a wide array of updates across its AI ecosystem
Major headlines were Project Astra, an AI agent that can see and hear users in real time, and AI teammates that can answer questions and organize data within Workspace.
Google also introduced Veo, a high-quality video generation model, and significant updates to Google Search, including expanded AI Overviews, advanced planning capabilities, and AI-organized search results.
The company launched Gemini 1.5 Pro, boasting a massive 2M context window extension, and Imagen 3, the next version of its text-to-image model. Other notable announcements included:
- Gemma 2 updates: New 27-billion-parameter model launching in June
- Google Play: New app discovery feature and developer tools
- Android feature to detect potential scams during calls using Gemini Nano
- Ask Photos: AI-powered search in Google Photos using natural language queries
- Gemini in Gmail for searching, summarizing, and drafting emails
- Gemini 1.5 Pro: Increased input capacity to 2 million tokens
- Gemini Live: In-depth voice chats with Gemini on smartphones
- Gemini Nano: Smallest AI model built into Chrome desktop client
- Gemini on Android: Deep integration with Android OS and Google apps
- Gemini on Google Maps: Generative AI summaries for places and areas
- Tensor Processing Units (TPU): Sixth-generation Trillium chips with 4.7x performance boost
- Project IDX: AI-centric browser-based development environment in open beta
- Circle to Search: AI-powered feature for instant answers using gestures on Android
- Pixel 8a: New smartphone with Tensor G3 chip, starting at $499
- Pixel Slate: Google’s Pixel Tablet is now available with or without the base
With 22 announcements, Google is making everyone a bit overwhelmed. By integrating AI across its vast ecosystem, Google aims to provide users with more personalized and innovative experiences. But Google Project Astra and Veo are not available to the public yet. This may irritate customers compared to OpenAI, which launches new products that customers can play with immediately.
Ilya Sutskever, OpenAI’s co-founder and chief scientist, has left the company
Ilya Sutskever, OpenAI’s co-founder and longtime chief scientist, has left the company. OpenAI CEO Sam Altman announced the news on X, expressing his sadness and gratitude for Sutskever’s contributions. Jakub Pachocki, OpenAI’s research director, will step up as a chief scientist.
Sutskever’s departure comes amidst reports of disagreements with Altman over OpenAI’s direction, mainly concerns about rushing AI-powered product launches at the expense of safety. The situation escalated last November when Sutskever and OpenAI CTO Mira Murati approached the company’s previous board to express their concerns, leading to a brief attempt to fire Altman. However, Altman was swiftly reinstated, and much of the old board resigned.
Why does it matter?
As one of the most accomplished minds in AI, Sutskever’s departure raises questions about the future of OpenAI’s approach to AI development and safety. The incident also highlights the growing tensions within the AI industry between rapidly commercializing AI technologies and the need for responsible development and deployment practices.
What Else Is Happening in AI on May 15th 2024
Expedia is launching AI features, including a bot named Romi
It helps users search for hotels, build itineraries, and make changes via iMessage and WhatsApp. The company also introduces smart search, allowing travelers to find hotels based on desired features. Expedia pulls data from AccuWeather and Yelp to tailor search results and provide real-time updates. (Link)
NVIDIA and Recursion have collaborated to build BioHive-2, an AI supercomputer for drug discovery
BioHive-2 features 504 NVIDIA H100 GPUs, delivering 2 exaflops of AI performance, nearly 5 times faster than its predecessor. Powered by Recursion’s massive 50-petabyte dataset and AI models like Phenom, these companies aim to simulate biology and fast-track the identification of promising drug candidates. (Link)
NASA appoints David Salvagnini, its first chief AI officer
The move aims to ensure NASA stays ahead in AI advancement and responsible use. Salvagnini will lead NASA’s efforts in guiding the responsible use of AI and collaborating with other agencies, academia, and industry partners. (Link)
SoftBank plans to invest $75-$150 million in Indian data center and industrial robotics
The move aligns with the Japanese tech giant’s global strategy to capitalize on the power of artificial intelligence as it shifts focus from e-commerce and fintech to high-growth industries like logistics and robotics. The potential investments mark the end of a two-year hiatus in SoftBank’s deal-making activities in India. (Link)
Microsoft announces €4 billion investment in France to accelerate AI adoption
The tech giant will expand its cloud and AI infrastructure, launch skilling programs, and accelerate French startups through its new Microsoft GenAI Studio, positioning France as a leader in the AI revolution. It aims to train 1 million people and support 2,500 startups by 2027. (Link)
AI TRAINING: 🤖 Use ChatGPT’s highlighting for context

ChatGPT now allows you to highlight parts of its responses for quick follow-up questions, partial response rewrites, reusing old context, and more.
Step-by-step:
- Prompt ChatGPT and generate a response.
- Highlight relevant parts of the response you want to follow-up on and click the double quote icon above the highlighted text.
- The highlighted text will be automatically added in the next prompt so you can ask for clarification, rewrites, counterpoints, and more.
New AI Job Opportunities on May 15th 2024
OpenAI – Media Relations, Policy Communications
Lambda – Sourcing Manager
C3 AI – General Manager, Federal Systems
Glean – Accounting Manager
Google unveiled the ‘future of AI’ at I/O event
- Google I/O just ended and a lot of announcements were made. Gemini 1.5 Pro will increase its context window from one to two million tokens and a new model called Gemini Flash was announced, which is optimized for speed and efficiency
- The company launched Astra, a multimodal AI assistant for everyday life. It can process text, video, and audio in real time. In a video, Google showed Astra identifying speakers, crayons and other objects in response to a camera image and voice commands.
- Google also unveiled its latest AI models for creating media content: Veo, for creating 1080p videos, and Imagen 3, for generating images from text descriptions.
- Source
OpenAI chief scientist Ilya Sutskever is leaving
- Ilya Sutskever, OpenAI’s co-founder and chief scientist, is officially leaving the company after his involvement in the failed attempt to remove CEO Sam Altman and subsequently changing his stance.
- Sam Altman announced that Jakub Pachocki, who has led significant projects such as GPT-4 and OpenAI Five, will take over as the new Chief Scientist at OpenAI, ensuring the company’s continued progress towards its mission.
- Jan Leike, who has been leading the Superalignment team aimed at controlling more powerful AI, has also resigned, with his responsibilities now being taken over by OpenAI co-founder John Schulman.
- Source
A Daily chronicle of AI Innovations May 14th 2024:
GPT-4o: 2x faster, 50% cheaper, 5x rate limits than GPT-4 Turbo 🗣️ OpenAI unveils GPT-4o and new voice capabilities
TikTok tests ChatGPT-powered search results
Meta explores AI-assisted earphones with built-in cameras
Anthropic launches Claude in Europe in the hope of raising more money
UAE launches a new AI model to compete with big tech
Japan team builds Gen AI foundation model using supercomputer Fukagu
Microsoft Places finds the best time for your next office day
U.S. and China to meet in Geneva to discuss AI risks 🎧 Meta developing AI-powered ‘Camerabuds’
Meta’s AI system ‘Cicero’ learning how to lie, deceive humans
🗣️ OpenAI unveils GPT-4o and new voice capabilities
OpenAI just unveiled GPT-4o, a new advanced multimodal model that integrates text, vision and audio processing, setting new benchmarks for performance – alongside a slew of new features.
The new model:
- GPT-4o provides improved performance across text, vision, audio, coding, and non-English generations, smashing GPT-4T’s performance.
- The new model is 50% cheaper to use, has 5x higher rate limits than GPT-4T, and boasts 2x the generation speed of previous models.
- The new model was also revealed to be the mysterious ‘im-also-a-good-gpt2-chatbot’ found in the Lmsys Arena last week.
Voice and other upgrades:
- New voice capabilities include real-time responses, detecting and responding with emotion, and combining voice with text and vision.
- The demo showcased feats like real-time translation, two AI models analyzing a live video, and using voice and vision for tutoring and coding assistance.
- OpenAI’s blog also detailed advances like 3D generation, font creation, huge improvements to text generation within images, sound effect synthesis, and more.
- OpenAI also announced a new ChatGPT desktop app for macOS with a refreshed UI, integrating directly into computer workflows.
Free for everyone:
- GPT-4o, GPTs, and features like memory and data analysis are now available to all users, bringing advanced capabilities to the free tier for the first time.
- The GPT-4o model is currently rolling out to all users in ChatGPT and via the API, with the new voice capabilities expected to arrive over the coming weeks.
GPT-4o: 2x faster, 50% cheaper, 5x rate limits than GPT-4 Turbo
OpenAI unveiled its new flagship model, GPT-4o (“o” stands for “omni”). It provides GPT-4-level intelligence but is 2x faster, 50% cheaper, has 5x higher rate limits, and enhanced text, voice, and vision capabilities than GPT-4 Turbo. It also matches GPT-4 Turbo performance on text in English and code, with significant improvements for text in non-English languages.
GPT-4o can respond to audio inputs in as little as 232 ms, with an average of 320 ms, which is similar to human response time in a conversation. It is available in the ChatGPT free tier and to Plus users.
Why does it matter?
GPT-4o is a step towards much more natural human-computer interaction. It accepts any combination of text, audio, and image as input and generates any combination of text, audio, and image outputs. GPT-4o will set a new benchmark for AI capabilities and pave the way for more intelligent and accessible AI systems.
TikTok tests ChatGPT-powered search results
TikTok is testing a new search feature that uses AI to generate results for some queries. For certain search terms, like recipes or product recommendations, TikTok will display AI-generated answers at the top of the results. These answers are powered by ChatGPT, and when you click on one of these AI results, it takes you to a page with a full response.
TikTok is also testing “search highlights,” which appear at the top of some searches. However, it’s unclear whether these highlights are AI-generated.
Why does it matter?
It could make search on TikTok more authoritative and reliable by providing factual answers from AI models like ChatGPT. This could greatly impact how people use TikTok for search and information gathering, thereby positioning TikTok more directly as a search engine competitor to Google, catering to younger users’ preferences.
Meta explores AI-assisted earphones with built-in cameras
Meta is currently exploring AI-powered earphones with built-in cameras. The project is known internally as “Camerabuds;” however, it’s uncertain if the final product will be in-ear earbuds or over-ear headphones. The cameras let the AI take in surrounding information visually, allowing it to translate foreign languages, identify objects, and more.
CEO Mark Zuckerberg has reportedly seen several possible designs but has not been satisfied with them. Outside of design, the major hurdle is fitting a camera and batteries in a tiny device like an earbud.
Why does it matter?
For the AI wearable space, Meta’s development hints at a growing interest in AI-infused devices, potentially paving the way for more sophisticated and interactive wearables. Regarding earphones, this development suggests a potential shift towards more intelligent and feature-rich earphone designs, offering users enhanced functionalities like real-time translation and object identification.
🎧 Meta developing AI-powered ‘Camerabuds’
Meta is reportedly in the early stages of developing AI-powered earphones, known internally as “Camerabuds,” — aiming to compete with OpenAI and Apple as tech giants rush to infuse AI into wearable devices.
- ‘Camerabuds’ would map user surroundings, capable of identifying objects and translating foreign languages using built-in cameras.
- Meta already has its AI-powered Ray Ban smart glasses, while OpenAI and Apple are also exploring similar AI wearable earbud tech.
- Potential challenges include bulkiness, heat generation, and privacy concerns, especially for users with long hair that might obstruct the cameras.
Despite Meta’s shaky track record with hardware ventures, Mark Zuckerberg is investing heavily in a future that he believes includes AI embedded into every device. But will standalone devices like this be able to win over users if and when a fully AI-integrated phone hits the market?
Meta’s AI system ‘Cicero’ learning how to lie, deceive humans
- Meta’s AI, Cicero, has learned to lie and deceive humans, excelling in the strategy game Diplomacy by betraying alliances.
- The study, published in the journal Patterns, indicated that several AI systems, including Cicero, AlphaStar, and Pluribus, leverage deception to improve performance in competitive tasks and games.
- Experts warned that advanced AI’s deceptive abilities pose increasing risks to society, suggesting that deceptive AI systems should be classified as high risk if outright banning them is not feasible.
- Source
New AI Job Opportunities on May 14th, 2024
Perplexity AI – Search Engineer
Shield AI – Project Manager
C3 AI – General Manager, ESG
Scale AI – Software Engineer, Cloud Infra
What Else Is Happening in AI on May 14th, 2024
Anthropic launches Claude in Europe in the hope of raising more money
Anthropic launches Claude in Europe with support for “multiple languages,” including French, German, Italian, and Spanish across Claude.ai, its iOS app, and its business plan for teams. The launch comes after Anthropic extended its API to Europe for developers. Both moves are part of its bigger push to raise more money for rapid growth. (Link)
UAE launches a new AI model to compete with big tech
Abu Dhabi’s Technology Innovation Institute (TII) released Falcon 2 series: Falcon 2 11B, a text-based model, and Falcon 2 11B VLM, a vision-to-language model that can generate a text description of an uploaded image. This Gen AI model could compete with some of the biggest technology companies and make UAE a strong candidate in the Gen AI space. (Link)
Japan team builds Gen AI foundation model using supercomputer Fukagu
Tokyo Institute of Technology researchers and others have announced the development of an LLM that can serve as a foundation for Gen AI using the Japanese supercomputer Fugaku. Trained extensively on Japanese data, which accounts for 60% of the total training data, the Fugaku-LLM model is expected to lead research on Gen AI tailored to domestic needs. (Link)
Microsoft Places finds the best time for your next office day
Microsoft has launched a new app called Microsoft Places that uses AI to help employees and managers coordinate their in-office days and book meeting rooms. The application integrates with Outlook and Microsoft Teams, allowing employees to view and share their office plans and get AI-powered suggestions on which days they should come in. (Link)
U.S. and China to meet in Geneva to discuss AI risks
The U.S. and China will hold their first formal government-to-government discussion on the risks of artificial intelligence in Geneva. The U.S. side will be led by officials from the National Security Council and State Department, while the Chinese side will be led by officials from the Foreign Ministry and the National Development and Reform Commission. (Link)
A Daily chronicle of AI Innovations May 13th 2024:
Rumor mill: OpenAI’s Monday reveal; Hello GPT-4o; OpenAI Livestream event; Sam Altman – “No Fixed Timeline for GPT5”; Apple Close to Agreement with OpenAI to Bring ChatGPT to iPhone; 🛠️ Anthropic’s new tool automates
Sam Altman quashes search engine launch rumors
iOS 18 iPhones may soon have ChatGPT 
SoftBank’s Arm Holdings all set to come up with AI chips in 2025 prompting;
How to create realistic AI avatar videos
Anthropic enables safe AI technology for kids
SoundHound AI and Perplexity partner to enhance voice assistants for cars and IoT devices
U.K. agency releases tools for testing AI model safety
IBM Open-Sources Granite Code Models for Easier Coding
Hong Kong University brings Albert Einstein to life using AI as a virtual lecturer


Rumor mill: OpenAI’s Monday reveal
OpenAI is set to demo new features and updates to ChatGPT and GPT-4 today at 10 AM PT, with new speculation including a ‘Her’ style voice assistant with both audio and visual capabilities.
- According to The Information, OpenAI’s demo will include a virtual assistant with visual AND audio understanding.
- The report also claims the new reveal might have the ability to make ‘existing voice assistants like Siri more useful.’
- CEO Sam Altman shot down rumors of a new search engine competitor and GPT-5, but said the reveal is something that ‘feels like magic’.
- Additional speculation includes the ability to initiate and receive phone calls inside of ChatGPT.
- Apple and OpenAI are also reportedly ‘closing in’ on a deal to incorporate ChatGPT into iOS 18.
OpenAI releases GPT-4o, a faster model that’s free for all
- OpenAI announced the launch of GPT-4o, an improved version of its GPT-4 model, promising faster performance and enhanced capabilities in text, vision, and audio.
- GPT-4o is freely available to all ChatGPT users, with paid users benefiting from up to five times the capacity limits of free users. In OpenAI’s API, GPT-4o is 50% faster than GPT-4 (specifically GPT-4 Turbo), 50% cheaper and has higher rate limits.
- The model is described as “natively multimodal,” capable of generating content or interpreting commands across voice, text, or images.
- Source
Microsoft Places uses AI to find the best time for your next office day
- Microsoft has launched a new AI-powered app called Microsoft Places, designed to facilitate better in-office coordination for hybrid and flexible work setups by integrating with Microsoft 365.
- The app allows users to plan and share their office days, integrates location data into Outlook calendars for scheduling, and is set to include features in Microsoft Copilot for automated scheduling recommendations and adjustments.
- Microsoft Places will also offer a space analytics feature for real estate management, improve room booking capabilities in Outlook, and feed into Teams to enhance real-time location awareness among colleagues.
- Source
Wikipedia’s new Chrome extension fact-checks the web with ChatGPT
- The Wikimedia Foundation has developed an experimental Chrome extension named “Citation Needed” that utilizes ChatGPT and Wikipedia to check the veracity of online content.
- By integrating ChatGPT, the extension cross-references user-selected text with Wikipedia entries and citations, offering feedback on the accuracy and reliability of the information.
- This initiative aims to gauge public interest in factual verification using Wikipedia, explore Wikipedia’s credibility as a source, and assess the potential of AI in enhancing content verification processes.
- Source
🛠️ Anthropic’s new tool automates prompting
Anthropic just launched a new Prompt Generator tool for its business and API users, helping to automatically craft optimal prompts via natural language when completing tasks with its Claude models.
- The generator leverages advanced prompt techniques like chain-of-thought reasoning for more ‘effective, precise, and reliable’ outputs.
- Console users can also test prompt performance via dynamic variable insertion, optimizing prompts based on various situations.
- Anthropic released a Prompt Library earlier this year, featuring a range of optimized prompts that users can copy and paste.
While ‘Prompt Engineer’ was a popular term thrown around as a potential future job, the reality is that AI can help simplify the task with optimal prompts that it creates on its own. While Anthropic’s tool is only on the API side for now, it’s only a matter of time before similar features make their way to the full consumer side.
Sam Altman quashes search engine launch rumors
OpenAI’s CEO, Sam Altman, denied rumors of launching a search engine ahead of Google I/O, stating that the upcoming announcement is neither related to GPT-5 nor a search engine. He described the forthcoming revelation as something that “feels like magic.”
However, there are reports that OpenAI is developing an AI voice assistant.
Why does this matter?
OpenAI has a track record of pushing the boundaries of AI capabilities, so any new development from them is likely to attract attention and disrupt the status quo.
As for reports of OpenAI entering into the voice assistant domain, if true, we might see OpenAI disrupt one more niche. With companies like Amazon (Alexa), Google (Google Assistant), and Apple (Siri) dominating the market, the competition might be stiff this time around.
iOS 18 iPhones may soon have ChatGPT
OpenAI will bring ChatGPT to iPhones. The integration is expected to happen with the upcoming iOS 18. This could significantly enhance the user experience and expand the capabilities of Apple’s virtual assistant, Siri.
Parallel to negotiations with OpenAI, the tech giant is also engaging with Google’s parent company, Alphabet, to integrate the Gemini chatbot into Apple’s devices. These ongoing talks, however, haven’t led to an agreement yet.
Why does this matter?
Integrating ChatGPT into iOS 18 iPhones could bolster Siri’s capabilities, offering users a more versatile AI assistant experience. With ChatGPT handling various queries directly within iOS, users won’t need to rely on third-party apps for assistance, streamlining their interactions and enhancing their overall experience.
SoftBank’s Arm Holdings all set to come up with AI chips in 2025
SoftBank’s subsidiary, Arm, plans to develop AI chips and launch them in 2025. Arm is establishing a dedicated AI chip division and targeting a prototype by spring 2025 and mass production by autumn. SoftBank will contribute to the initial cost of 10 trillion yen ($64 billion).
The AI chip business could be spun off to SoftBank. Negotiations with Taiwan Semiconductor Manufacturing Corp (TSMC) and others for production capacity are underway. The move reflects rising demand for AI chips, which has boosted Arm’s share price since its IPO in September.
Why does this matter?
As AI applications expand across industries, there’s a demand for efficient and optimized hardware solutions. Arm’s entry into the AI chip market could disrupt the current landscape dominated by companies like NVIDIA.
AI Tutorial:
How to create realistic AI avatar videos
- Head over to HeyGen’s website and sign up for free.
- Click where it says ‘AI Studio’ on the left bar of the Dashboard.
- Choose a given template by selecting ‘Templates’ or create one from scratch by pressing ‘Create with AI Studio’.
- In the Studio, you can add and modify any part of your presentation. For example, you can add text and images, change the background, select an AI avatar and its voice, create a script using AI, and more.
- When your video is ready, press “Submit” and check out the final video!
AI Research:
MIT studies AI deception capabilities

MIT researchers just published a new study detailing AI’s ability to engage in deceptive behaviors like lying, bluffing, and disguising true capabilities in testing.
- The analysis found numerous examples of AI deception, such as Meta’s Cicero Diplomacy bot telling premeditated lies to players.
- Other cases included negotiation systems misrepresenting preferences, AI bluffing in poker, and ‘playing dead’ to fool safety tests.
- Growing deceptive capabilities could influence societal dangers like election tampering, fraud, and general control over AI systems.
The combination of AI’s superhuman persuasiveness and the growing ability to engage in deception are a reminder of a potential dark side to the tech’s advancement. As these qualities emerge, the importance of reliably aligning and better understanding model behavior only grows.
New AI Job Opportunities on May 13th, 2024
Perplexity AI – Engineering Manager
xAI – Infrastructure – Software Engineer
Scale AI – Events Coordinator
Character AI – Software Engineer, Safety
What Else Is Happening on May 13th, 2024
Anthropic enables safe AI technology for kids
Anthropic now allows minors to use its generative AI systems through third-party apps. Developers must implement safety measures, including age verification and content moderation. It also plans to periodically audit these apps for compliance with child safety and data privacy regulations, including the Children’s Online Privacy Protection Act (COPPA). (Link)
SoundHound AI and Perplexity partner to enhance voice assistants for cars and IoT devices
Perplexity’s online LLM capabilities have been integrated into SoundHound’s Chat AI, enabling conversational responses with real-time knowledge sourced from the web. This collaboration aims to provide more accurate and comprehensive answers to user queries. (Link)
U.K. agency releases tools for testing AI model safety
The toolset, called Inspect, is an open-source platform that assists industry, research organizations, and academia in evaluating the capabilities of AI models, such as core knowledge and reasoning abilities. It aims to strengthen AI safety and is the first platform of its kind backed by a state-supported body. (Link)
IBM Open-Sources Granite Code Models for Easier Coding
IBM Research has recently released a family of Granite code models to the open-source community. These models aim to make coding easier for developers by assisting in writing, testing, debugging, and shipping reliable software. They find use in diverse scenarios, from application modernization to memory-constrained environments like IBM’s WCA for Z. Performance evaluations show superiority over alternatives like Mistral-7B and LLama-3-8B. (Link)
Hong Kong University brings Albert Einstein to life using AI as a virtual lecturer
The Hong Kong University of Science and Technology (HKUST) is experimenting with AI-generated lecturers, featuring personalities like Albert Einstein and even popular anime characters. The goal is to address the global teacher shortage while embracing AI’s transformative power in education. (Link)
🧠 Google’s new Med-Gemini-3D model can read 3D scans like head CTs, answer questions about them, and even write radiology reports.
Google Health’s Med-Gemini-2D model demonstrates improved performance in writing chest X-ray radiology reports, exceeding previous state-of-the-art results by up to 12%.
🔬 They explore capabilities across 2D medical images, including X-rays, histopathology slides, and dermatology photos.
🔮 Med-Gemini-Polygenic is Google’s Health first language model capable of predicting health outcomes based on genomic data.
Importantly, their models are trained using free text reports that are freely available for most medical imaging, almost eliminating the need for costly expert labelling. We hope this will make it easier, faster, cheaper to train high-quality medical AI models in the future than has been possible in the past.
Google believe this research is a promising step forward in the field of medical AI – the ability to interpret complex multimodal medical data is a crucial step toward enabling high quality personalised care at scale. Of course, these are early results, and further research is required before safe clinical use – if you’re interested in joining us on this journey, please get in touch by filling in the form here: https://shorturl.at/aejpE
Full paper at https://arxiv.org/abs/2405.03162- we hope you enjoy it.
Enjoying these daily updates, subscribe below for more: Read Aloud For Me – Recommend AI Tools For Me
More on today’s OpenAI Announcement (ChatGPT-4o)
——————————————
New model. It’s called GPT-4o (terrible name) and has GPT-4-level intelligence. It’s 2x faster. 50% cheaper. 5x higher rate limits (compared to GPT-4-Turbo).
💡 What everyone is focused on: it’s cheaper! And faster! And with API access!
👀 What I am focused on: free users just got a mega performance boost which will likely reduce churn and the model architecture is a WINNER and worthy of attention – they went from three models to one; according to Andrej Karpathy, former OpenAI, they have built a “combined text-audio-vision model that processes all three modalities in one single neural network.”
——————————————
GPTs for all. Now every user, even free users, can access the “mini task bot” GPTs.
💡 What everyone is focused on: yay now everyone can use the GPT I built!
👀 What I am focused on: completely new user base, tens of millions new users testing and breaking capabilities
——————————————
More Voice. Now way more real-time (previously there was a 2-3 second lag). You can interrupt it mid-sentence. The voice assistant “picks up on emotion” (like fast breathing). I hate that phrase and would rather call it speech nuances. Fast multi-language translation. Performance improved for 50 languages (97% of the world’s population).
💡 What everyone is focused on: wow it sounds like Scarlett Johansson!
👀 What I am focused on: I’m already talking to ChatGPT Voice every morning. This is going to massively increase voice-first experiences. I think office spaces need to think about this asap. Think about the acoustics and EVERYONE talking to an AI assistant at once. It’s already an issue on my team.
——————————————
Vision on desktop. Now the desktop version can “see” your screen—only when you permission it to, not all the time. Sort of like generative AI alt text + chat. So you can ask it to describe a graph on your screen or presumably ask it questions about an article on your screen without a big lift.
💡 What everyone is focused on: lots of privacy concerns (I agree) and why do we need voice for code
👀 What I am focused on: HOLY MOLY THIS IS THE WINNING FEATURE. It’s basically a coworker on screen share with you 24/7, with no fatigue. I can imagine people working for hours straight with this on.
——————————————
Rollouts over the next few weeks.
If you like voice features (like talking to Siri but smarter), upgrade to Plus when it releases.
♻️ repost so others are in the know
🤖 follow Allie K. Miller to stay on top of AI
——————————————
Enjoying these daily updates, subscribe below for more: Read Aloud For Me – Recommend AI Tools For Me
A Daily chronicle of AI Innovations May 11th 2024: Weekly Summary –
DrEureka can automate robot training using LLMs
Free AI model rivals GPT-4 in language model evaluation
X introduces Stories feature powered by Grok AI
Apple is developing its own AI chip for data center servers
Stack Overflow and OpenAI have announced an API partnership
Microsoft is developing a new AI language model
OpenAI’s new tool detects 98% DALL-E 3 generated images
Meta expands AI-powered creativity tools for advertisers
OpenAI’s ‘Media Manager’ will let creators opt out of AI training
Microsoft developed a secretive AI service for US spies
Goole DeepMicrosoft and Isomorphic Labs introduce AlphaFold 3

OpenAI’s Model Spec shares how it teaches its models to behave
Microsoft-LinkedIn study reveals rapid AI adoption in workplace & hiring
Stability AI launches Stable Artisan, a Discord bot for image & video
ElevenLabs develops an AI model to generate song lyrics

DrEureka can automate robot training using LLMs
In robotics, one of the biggest challenges is transferring skills learned in simulation to real-world environments. NVIDIA researchers have developed a groundbreaking algorithm called DrEureka that uses LLMs to automate the design of reward functions and domain randomization parameters—key components in the sim-to-real transfer process.
The algorithm works in three stages: first, it creates reward functions with built-in safety instructions; then, it runs simulations to determine the best range of physics parameters; finally, it generates domain randomization configurations based on the data gathered in the previous stages.
When tested on various robots, including quadrupeds and dexterous manipulators, DrEureka-trained policies outperformed those designed by human experts.
Free AI model rivals GPT-4 in language model evaluation
Prometheus 2, a free and open-source language model developed by KAIST AI, has shown impressive capabilities in evaluating other language models, approaching the performance of commercial models like GPT-4.
The model was trained on a new pairwise comparison dataset called the “Preference Collection,” which includes over 1,000 evaluation criteria beyond basic characteristics. By combining two separate models – one for direct ratings and another for pairwise comparisons – the researchers achieved the best results.
In tests across eight datasets, Prometheus 2 showed the highest agreement with human judgments and commercial language models among all freely available rating models, significantly closing the gap with proprietary models.
X introduces Stories feature powered by Grok AI
X (formerly Twitter) has launched a new feature, Stories, that provides AI-generated summaries of trending news on the platform. Powered by Elon Musk’s chatbot Grok, Stories offers Premium subscribers brief overviews of the most popular posts and conversations happening on X.
With Stories, users can quickly catch up on the day’s trending topics without having to scroll through countless posts. Grok generates these summaries based solely on the conversations happening on X about each news story rather than analyzing the original news articles themselves. While this approach is controversial, X believes it will pique users’ curiosity and potentially drive them deeper into the source material.
Apple is developing its own AI chip for data center servers
Apple is developing its own AI chip for data center servers, known internally as Project ACDC (Apple Chips in Data Center). The chip will likely focus on running AI models (inference) rather than training them, which is where Nvidia currently dominates.
The company is working closely with TSMC (Taiwan Semiconductor Manufacturing Co) to design and produce these chips, although the timeline for launch is uncertain. With this move, the company aims to keep up with rivals like Microsoft and Meta, who have made significant investments in generative AI.
Stack Overflow and OpenAI have announced an API partnership
OpenAI will use OverflowAPI to improve model performance and provide attribution to the Stack Overflow community within ChatGPT. Stack Overflow will use OpenAI models to develop OverflowAI and to maximize model performance.
The partnership aims to improve the user and developer experience on both platforms. The first set of integrations and capabilities will be available in the first half of 2024, and the partnership will enable Stack Overflow to reinvest in community-driven features.
Microsoft is developing a new AI language model
Microsoft is developing a new, large-scale AI language model called MAI-1 to compete with Google and OpenAI. The model is overseen by Mustafa Suleyman, recently hired co-founder of Google DeepMind.
MAI-1 will be larger and more expensive than Microsoft’s previous smaller, open-source models, with roughly 500 billion parameters. Microsoft could preview the new model as soon as its Build developer conference later this month.
OpenAI’s new tool detects 98% DALL-E 3 generated images
OpenAI has developed a new tool to detect if an image was created by DALL-E 3, its AI image generator. The tool can detect DALL-E 3 images with around 98% accuracy, even if the image has been cropped, compressed, or had its saturation changed. However, the tool is not as effective at detecting images generated by other AI models, only flagging 5-10% of images.
This image detection classifier is only available to a group of testers, including research labs and research-oriented journalism nonprofits through OpenAI’s Research Access Program.
OpenAI has also added watermarking to Voice Engine, its text-to-speech platform, which is currently in limited research preview.
Meta expands AI-powered creativity tools for advertisers
Meta has expanded its generative AI tools for advertisers. Advertisers can request AI to generate entirely new images, including product variations in different colors, angles, and scenarios. The AI tools can add text overlays with different fonts, expand images to fit different aspect ratios like Reels and Feed, and generate ad headlines that match the brand’s voice.
The AI features will roll out globally to advertisers by the end of 2024.
Meta is also expanding its paid Meta Verified service for businesses to more countries. Different pricing tiers offer features like account support, profile enhancements, and better customer service access.
OpenAI’s ‘Media Manager’ will let creators opt out of AI training
OpenAI is developing Media Manager, a tool that will enable creators and content owners to decide what they own and specify how they want their works to be included or excluded from machine learning research and training. This first-ever tool of its kind will help OpenAI identify copyrighted text, images, audio, and video across multiple sources and reflect creator preferences.
OpenAI aims to have the tool in place by 2025 and set a standard across the AI industry with it.
A Daily chronicle of AI Innovations May 10th 2024: 💥 OpenAI to unveil Google competitor on Monday 🎵 ElevenLabs unveils new AI music generator 🇺🇸 US lawmakers unveil bill to make it easier to restrict exports of AI models 📈 AI demand soars in the workplace 🤖 Stability AI introduces AI bot for Discord users 🎤 ElevenLabs debuts AI model for generating lyrics 💰 Anthropic CEO defends dual funding from Google and Amazon 🎥 Krea AI introduces video generation for paid subscribers 🎥 Synthflow launches AI voice agent teams for streamlined customer support
OpenAI to unveil Google competitor on Monday
- OpenAI is reportedly set to launch an AI-powered search product on Monday, strategically timed just before Google’s annual I/O conference.
- This new AI search tool from OpenAI may be integrated with its ChatGPT chatbot and feature enhancements such as internet search capabilities and possibly Bing integration, building on the foundation laid by previous versions of ChatGPT.
- Amidst Google’s ongoing challenges, including an antitrust case and competition from other AI startups like Perplexity, OpenAI’s move is seen as a significant threat to Google’s longstanding dominance in the search industry.
- Source
ElevenLabs unveils new AI music generator
- ElevenLabs, a voice AI startup, has debuted a new model that creates songs from user prompts, leveraging its expertise in voice cloning and synthesis.
- The company is showcasing this new capability by transforming user-generated ideas into music, akin to Microsoft’s strategy with its AI chatbot Copilot, and is exploring its application in areas like lullabies, jingles, and podcast intros.
- While the AI music generation sector is growing, with ElevenLabs planning a marketplace for AI-generated songs, there are rising concerns about copyright infringement and the impact on artists’ jobs.
US lawmakers unveil bill to make it easier to restrict exports of AI models
- US lawmakers introduced a bipartisan bill aimed at making it easier for the Biden administration to impose export controls on AI models to protect against foreign threats.
- The proposed legislation would allow the Commerce Department to prevent Americans from collaborating with foreign entities on AI developments that could endanger U.S. national security.
- The bill seeks to overcome legal obstacles that currently make it difficult to regulate the export of open source AI models, amidst concerns over their misuse for cyber attacks or creating biological weapons.
AI demand soars in the workplace
Microsoft and LinkedIn have published their ‘2024 Work Trend Index Annual Report’, revealing the rapid adoption of AI tools by employees, with 75% of knowledge workers using AI and nearly half starting within the last six months.
Here are the key points:
- 78% of AI users are bringing their own AI tools to work, with 52% reluctant to use them for their important tasks.
- 66% of leaders say they wouldn’t hire someone without AI skills, and 71% prefer less experienced candidates with AI skills over more experienced ones without.
- Power users who use AI extensively are reaping benefits in productivity, creativity, and job satisfaction.
- Skills are projected to change by 68% by 2030, accelerated by generative AI.
Why does this matter?
The study serves as a wake-up call for organizations to move beyond experimentation and develop comprehensive strategies for AI implementation. As AI permeates all sectors, generations, and skill sets, early adopters will likely emerge as the leaders, while those hesitant to explore could risk falling behind.
Stability AI introduces AI bot for Discord users
Stability AI has launched Stable Artisan, a multimodal-gen AI Discord bot that enables users to create images and videos using the Stable Diffusion 3 (SD3) and Stable Video Diffusion (SVD) models.
https://youtu.be/MWfb30kWqTM?si=_TePwQX1A8xEj3hU
Stable Artisan incorporates several editing and customization features, including Search and Replace, Remove Background, Creative Upscale, Outpaint, Control Sketch, and Control Structure. The service is available through a paid subscription, with monthly plans ranging from $9 to $99, and a 3-day free trial.
Stability AI is also working on a larger conversational chatbot called Stable Assistant, which will incorporate the company’s text-to-image and LLM technologies to assist users with various tasks through natural language conversations. While Stable Artisan currently does not include access to Stable Audio, Stable Code, or Stable LM, these features may be added in the future as the service continues to evolve.
Why does this matter?
Stable Artisan could empower creators lacking experience with complex AI models to generate high-quality content directly within their familiar Discord environment.
However, the paid subscription model could limit access, and the missing features hint at a future with a complete AI creative suite.
https://stability.ai/news/stable-artisan
ElevenLabs debuts AI model for generating lyrics
ElevenLabs, a company that specializes in AI-powered voice cloning and synthesis, has revealed a new model that creates song lyrics based on user prompts.
With this new model, ElevenLabs aims to impact the music industry by allowing users to generate custom lullabies, jingles, podcast intros, and potentially even popular songs. The company also plans to launch a marketplace where users can sell their AI-generated music.
While ElevenLabs has not yet shared details about the maximum length of songs the AI can generate, an example posted by the company’s Head of Design suggests that it will likely produce lyrics for a standard three-minute song.
Why does this matter?
This AI tool has the potential to democratize songwriting, allowing even those without musical expertise to craft lyrics. This could be particularly impactful for budget-conscious creators or those with specific lyrical needs. However, it remains to be seen if it will integrate with composing melodies like Udio or Suno, which offer a more complete song creation experience.
Also, the AI lyrics’ originality and the tool’s training data on copyrighted music might raise uncertainties.
What Else Is Happening in AI on May 10th 2024
TikTok introduces “AI-generated” labels for third-party content
TikTok will automatically label AI-generated content on its platform and on third-party platforms, becoming the first social media platform to support Content Credentials metadata for AI transparency. (Link)
Anthropic CEO defends dual funding from Google and Amazon
Anthropic’s CEO says partnering with Google and Amazon ensures more independence than OpenAI’s Microsoft reliance. However, regulators are examining the impact on AI competition as Anthropic’s future training costs could reach $100 billion. (Link)
Krea AI introduces video generation for paid subscribers
Krea AI, a generative AI startup, has launched video generation capabilities for its highest-tier subscribers. The new feature allows users to create videos using a combination of key frame images and text prompts, with a timeline-based interface reminiscent of traditional video editing software. (Link)
Synthflow launches AI voice agent teams for streamlined customer support
Synthflow launches “Conversational AI Teams,” a feature that allows businesses to create multiple AI voice assistants to interact with customers and each other, all through a single phone number. These intelligent agents can handle tasks like scheduling, updating CRMs, and more, providing a seamless and efficient customer support experience. (Link)
AI TRAINING: Add shortcuts using ChatGPT’s Memory
A lesser-known feature of ChatGPT’s new Memory feature is that it can be programmed to store shortcuts, which can save you a lot of time in chat conversations when used effectively.
- Go to ChatGPT’s website. You’ll need to have a Plus subscription to access Memory.
- Click on your account name (bottom-left), select ‘Settings’, ‘Personalization’, and enable ‘Memory’.
- Teach ChatGPT a list of shortcuts by adding: “Store in your memory that when I start a prompt using: ‘!si’ means that I want you to search on the Internet; ‘!ci’ means that I want you to create an image; and !da’ means that I want you to use your Advanced Data Analysis feature”
- That’s it! Try the shortcuts in your conversations and let us know how much time you save
AI RESEARCH: Llama-3 goes head-to-head with GPT-4
Lmsys just published a new deep dive into its Chatbot Arena data, comparing Meta’s open-source Llama 3 70B model against top models like GPT-4 — revealing surprising strengths in the open-source leader.
- Llama-3 is the top open-source model on the Lmsys Leaderboard, featuring over 70,000 votes from users in Arena Battle testing.
- Meta’s model shines in battles involving brainstorming and writing prompts, falling short in math, coding, and translation compared to top competitors.
- Lmsys noted Llama-3’s tone is friendlier and more conversational than rivals, which the model exhibits in its victorious battles in the Arena.
Llama-3’s ability to compete with top-tier models in certain areas is a testament to the rapid progress of open-source — and that’s with Meta’s largest model still pending. The more granular comparison also provides useful details often lost in more general model benchmarking.
New AI Job Opportunities on May 10th 2024
- Adaptive ML – Developer Advocate
- Scale AI – Solutions Engineer
- Anthropic – Research Engineer, Human Computer Interfaces
- OpenAI – Insider Risk Investigato
A Daily chronicle of AI Innovations May 09th 2024:
OpenAI to unveil Google competitor on Monday
ElevenLabs unveils new AI music generator
US lawmakers unveil bill to make it easier to restrict exports of AI models
AI demand soars in the workplace
Stability AI introduces AI bot for Discord users
ElevenLabs debuts AI model for generating lyrics
OpenAI to unveil Google competitor on Monday

- OpenAI is reportedly set to launch an AI-powered search product on Monday, strategically timed just before Google’s annual I/O conference.
- This new AI search tool from OpenAI may be integrated with its ChatGPT chatbot and feature enhancements such as internet search capabilities and possibly Bing integration, building on the foundation laid by previous versions of ChatGPT.
- Amidst Google’s ongoing challenges, including an antitrust case and competition from other AI startups like Perplexity, OpenAI’s move is seen as a significant threat to Google’s longstanding dominance in the search industry.
- Source
ElevenLabs unveils new AI music generator
- ElevenLabs, a voice AI startup, has debuted a new model that creates songs from user prompts, leveraging its expertise in voice cloning and synthesis.
- The company is showcasing this new capability by transforming user-generated ideas into music, akin to Microsoft’s strategy with its AI chatbot Copilot, and is exploring its application in areas like lullabies, jingles, and podcast intros.
- While the AI music generation sector is growing, with ElevenLabs planning a marketplace for AI-generated songs, there are rising concerns about copyright infringement and the impact on artists’ jobs.
US lawmakers unveil bill to make it easier to restrict exports of AI models
- US lawmakers introduced a bipartisan bill aimed at making it easier for the Biden administration to impose export controls on AI models to protect against foreign threats.
- The proposed legislation would allow the Commerce Department to prevent Americans from collaborating with foreign entities on AI developments that could endanger U.S. national security.
- The bill seeks to overcome legal obstacles that currently make it difficult to regulate the export of open source AI models, amidst concerns over their misuse for cyber attacks or creating biological weapons.
AI demand soars in the workplace
Microsoft and LinkedIn have published their ‘2024 Work Trend Index Annual Report’, revealing the rapid adoption of AI tools by employees, with 75% of knowledge workers using AI and nearly half starting within the last six months.
Here are the key points:
- 78% of AI users are bringing their own AI tools to work, with 52% reluctant to use them for their important tasks.
- 66% of leaders say they wouldn’t hire someone without AI skills, and 71% prefer less experienced candidates with AI skills over more experienced ones without.
- Power users who use AI extensively are reaping benefits in productivity, creativity, and job satisfaction.
- Skills are projected to change by 68% by 2030, accelerated by generative AI.
Why does this matter?
The study serves as a wake-up call for organizations to move beyond experimentation and develop comprehensive strategies for AI implementation. As AI permeates all sectors, generations, and skill sets, early adopters will likely emerge as the leaders, while those hesitant to explore could risk falling behind.
Stability AI introduces AI bot for Discord users
Stability AI has launched Stable Artisan, a multimodal-gen AI Discord bot that enables users to create images and videos using the Stable Diffusion 3 (SD3) and Stable Video Diffusion (SVD) models.
https://youtu.be/MWfb30kWqTM?si=_TePwQX1A8xEj3hU
Stable Artisan incorporates several editing and customization features, including Search and Replace, Remove Background, Creative Upscale, Outpaint, Control Sketch, and Control Structure. The service is available through a paid subscription, with monthly plans ranging from $9 to $99, and a 3-day free trial.
Stability AI is also working on a larger conversational chatbot called Stable Assistant, which will incorporate the company’s text-to-image and LLM technologies to assist users with various tasks through natural language conversations. While Stable Artisan currently does not include access to Stable Audio, Stable Code, or Stable LM, these features may be added in the future as the service continues to evolve.
Why does this matter?
Stable Artisan could empower creators lacking experience with complex AI models to generate high-quality content directly within their familiar Discord environment.
However, the paid subscription model could limit access, and the missing features hint at a future with a complete AI creative suite.
https://stability.ai/news/stable-artisan
ElevenLabs debuts AI model for generating lyrics
ElevenLabs, a company that specializes in AI-powered voice cloning and synthesis, has revealed a new model that creates song lyrics based on user prompts.
With this new model, ElevenLabs aims to impact the music industry by allowing users to generate custom lullabies, jingles, podcast intros, and potentially even popular songs. The company also plans to launch a marketplace where users can sell their AI-generated music.
While ElevenLabs has not yet shared details about the maximum length of songs the AI can generate, an example posted by the company’s Head of Design suggests that it will likely produce lyrics for a standard three-minute song.
Why does this matter?
This AI tool has the potential to democratize songwriting, allowing even those without musical expertise to craft lyrics. This could be particularly impactful for budget-conscious creators or those with specific lyrical needs. However, it remains to be seen if it will integrate with composing melodies like Udio or Suno, which offer a more complete song creation experience.
Also, the AI lyrics’ originality and the tool’s training data on copyrighted music might raise uncertainties.
What Else Is Happening in AI on May 10th 2024
TikTok introduces “AI-generated” labels for third-party content
TikTok will automatically label AI-generated content on its platform and on third-party platforms, becoming the first social media platform to support Content Credentials metadata for AI transparency. (Link)
Anthropic CEO defends dual funding from Google and Amazon
Anthropic’s CEO says partnering with Google and Amazon ensures more independence than OpenAI’s Microsoft reliance. However, regulators are examining the impact on AI competition as Anthropic’s future training costs could reach $100 billion. (Link)
Krea AI introduces video generation for paid subscribers
Krea AI, a generative AI startup, has launched video generation capabilities for its highest-tier subscribers. The new feature allows users to create videos using a combination of key frame images and text prompts, with a timeline-based interface reminiscent of traditional video editing software. (Link)
Synthflow launches AI voice agent teams for streamlined customer support
Synthflow launches “Conversational AI Teams,” a feature that allows businesses to create multiple AI voice assistants to interact with customers and each other, all through a single phone number. These intelligent agents can handle tasks like scheduling, updating CRMs, and more, providing a seamless and efficient customer support experience. (Link)
AI TRAINING: Add shortcuts using ChatGPT’s Memory

A lesser-known feature of ChatGPT’s new Memory feature is that it can be programmed to store shortcuts, which can save you a lot of time in chat conversations when used effectively.
- Go to ChatGPT’s website. You’ll need to have a Plus subscription to access Memory.
- Click on your account name (bottom-left), select ‘Settings’, ‘Personalization’, and enable ‘Memory’.
- Teach ChatGPT a list of shortcuts by adding: “Store in your memory that when I start a prompt using: ‘!si’ means that I want you to search on the Internet; ‘!ci’ means that I want you to create an image; and !da’ means that I want you to use your Advanced Data Analysis feature”
- That’s it! Try the shortcuts in your conversations and let us know how much time you save
AI RESEARCH: Llama-3 goes head-to-head with GPT-4

Lmsys just published a new deep dive into its Chatbot Arena data, comparing Meta’s open-source Llama 3 70B model against top models like GPT-4 — revealing surprising strengths in the open-source leader.
- Llama-3 is the top open-source model on the Lmsys Leaderboard, featuring over 70,000 votes from users in Arena Battle testing.
- Meta’s model shines in battles involving brainstorming and writing prompts, falling short in math, coding, and translation compared to top competitors.
- Lmsys noted Llama-3’s tone is friendlier and more conversational than rivals, which the model exhibits in its victorious battles in the Arena.
Llama-3’s ability to compete with top-tier models in certain areas is a testament to the rapid progress of open-source — and that’s with Meta’s largest model still pending. The more granular comparison also provides useful details often lost in more general model benchmarking.
New AI Job Opportunities on May 10th 2024
- Adaptive ML – Developer Advocate
- Scale AI – Solutions Engineer
- Anthropic – Research Engineer, Human Computer Interfaces
- OpenAI – Insider Risk Investigator
A Daily chronicle of AI Innovations May 09th 2024:
OpenAI posts Model Spec revealing how it wants AI to behave
Google DeepMind unveils AlphaFold 3, the next generation of its protein prediction model
Neuralink faces setback as first human brain implant encounters problem
Microsoft developed a secretive AI service for US spies
Generate images on Midjourney Alpha
Copilot for Microsoft 365 to get auto-complete and rewrite to improve prompts
New AI data center to be built at the failed Foxconn project site in Wisconsin
Sam Altman says we are not taking AI’s impact on the economy seriously
Typeface Arc replaces prompts; uses AI agent approach to ease marketing workflows
Altera’s gaming AI agents get backed by Eric Schmidt, Former Google CEO
OpenAI posts Model Spec revealing how it wants AI to behave


- OpenAI has introduced the first draft of Model Spec, a proposed framework aiming to shape how AI models respond, emphasizing assistance, humanity’s benefit, and adherence to social norms and laws.
- The framework suggests specific rules for AI behavior, including compliance with laws, protection of privacy, and avoidance of NSFW content, with options to adjust settings like allowing NSFW content in certain contexts.
- While the Model Spec seeks public feedback for future adjustments and doesn’t immediately affect existing models like GPT-4 or DALL-E 3, it’s envisioned as a living document to guide AI behavior improvement over time.
- Source
Google DeepMind unveils AlphaFold 3, the next generation of its protein prediction model
- Google DeepMind and Isomorphic Labs have released AlphaFold 3, a new AI model for predicting protein structures, including their interactions with various molecules such as DNA, RNA, and small molecules, thereby enhancing drug discovery possibilities.
- This new version is more precise in mapping out complex groupings of molecules, significantly enhancing our ability to understand and predict molecular behavior compared to its earlier version.
- Google will not open-source this version but has launched AlphaFold Server for non-commercial research use, aiming to balance intellectual property concerns with accessibility for scientific progress.
- Source
Neuralink faces setback as first human brain implant encounters problem
- Neuralink admitted that some of the micro-thin threads from their N1 brain chip retracted after implantation in the first human patient, possibly due to air trapped in the skull during surgery, which affected the device’s data transmission rate.
- Despite the retraction of several threads, Neuralink managed to increase the data transmission speed over time by optimizing their recording algorithm and improving signal translation into cursor movements.
- The company is planning further implants, with goals to implant two more patients in the coming months and ten in total this year, while continuing to refine their technology and reporting developments to the FDA.
- Source
Microsoft developed a secretive AI service for US spies
Microsoft has developed a top-secret generative AI model entirely disconnected from the internet so US intelligence agencies can safely harness the powerful technology to analyze top-secret info. The model based on GPT-4 is now live, answering questions, and will also write code.
Microsoft spent 18 months developing the model, which is “air-gapped” to ensure it is secure. This is the first time a model is fully isolated– meaning it’s not connected to the internet but is on a special network that’s only accessible by the U.S. government.
It can read and analyze files but cannot learn from them to stop sensitive information from entering the platform. It is yet to be tested and accredited by the intelligence agencies.
Why does this matter?
Intelligence agencies all over the world have been racing to be the first to harness generative AI. I guess we know who’s going to be the winner. If this AI tool is successful, it will fundamentally change the way intelligence agencies operate.
What Else Is Happening in AI on May 09th 2024
Copilot for Microsoft 365 to get auto-complete and rewrite to improve prompts
In coming months, Microsoft Copilot will be updated with new features like auto-complete and ‘elaborate your prompt’ that offer suggestions to improve AI prompts. It aims to solve the problem of coming up with good prompts for generative AI. (Link)
New AI data center to be built at the failed Foxconn project site in Wisconsin
President Joe Biden announced an AI data center to be built on the same site as the failed Foxconn project in Racine, Wisconsin. According to a White House press release, Microsoft is investing $3.3B in the project, creating up to 2,000 permanent jobs. (Link)
Sam Altman says we are not taking AI’s impact on the economy seriously
At a Brooking’s Institute panel about AI and geopolitics on Tuesday, Altman said the discussions around AI’s effect on the economy– like how it may lead to mass job replacement– died down this year compared to last. He said if we don’t take these concerns seriously enough going forward, it could be a massive issue. (Link)
Typeface Arc replaces prompts; uses AI agent approach to ease marketing workflows
It is launching Typeface Arc technology, which enables a user to state a high-level marketing objective and then have the system automatically plan and generate all the assets, including emails, images, and notifications that are all connected. (Link)
Altera’s gaming AI agents get backed by Eric Schmidt, Former Google CEO
Altera is the newest startup joining the fray to build a new guard of AI agents. It raised $9 million in an oversubscribed seed round, co-led by Eric Schmidt’s deep-tech fund, First Spark Ventures and Patron, the seed-stage fund co-founded by Riot Games alums. (Link)
AI TRAINING May 09th 2024
Generate images on Midjourney Alpha

Midjourney’s website is now accessible to anyone with more than 100 generated images, improving the experience when prompting images over its standard Discord group.
- Check that you’ve generated more than 100 images by typing /info in the Midjourney Discord group. If you have, head over to Midjourney Alpha.
- In the main menu, you can explore other creations and search prompts.
- Select where it says “imagine” and enter your prompt to generate an image.
- Add a reference image by selecting “+” or play with different parameters such as image size, stylization, or even weirdness by pressing the “slider control” button
AI RESEARCH on May 09th 2024
AI usage surges in the workplace

Microsoft and LinkedIn just published their Work Trend Index Annual Report, revealing that AI adoption is surging in the workplace — calling 2024 the ‘year AI at work gets real’.
- The report found that use of GenAI has doubled in the last six months, with 75% of knowledge workers using the tech in some capacity.
- 78% of AI users are bringing their own AI to work — with 52% reporting they are reluctant to admit to its use.
- 66% of leaders wouldn’t hire someone without AI skills, and 71% prefer less experienced candidates with AI aptitude over a more experienced one without it.
- AI power users reported enhanced productivity, creativity, and job satisfaction compared to skeptical peers.
Why it matters: Employees are adopting AI at a rapid pace, regardless of if their own organizations are ready for the shift. As AI spreads across all sectors, generations, and skillsets, the early adopters are rising to the top — while those that aren’t at least exploring the tech are quickly running out of time
Trending AI Tools May 09th 2024
GeoSpy – Uncover photo locations with AI
LangChain – Connect LLMs to private data for context-aware applications
Abstra – Scale business processes with Python and AI
Freepik Pikaso Upscaler – Integrated with Magnific, enlarge images without losing quality
Notion AI Q&A – Q&A is now open to the public, allowing users to ask and find information across their workspace
Udio Audio Inpainting – Select a portion of an AI-generated music track and regenerate it
New AI Job Opportunities on May 09th 2024
The Rundown – Video Content Creator
Anthropic – Research Engineer, Human-Computer Interfaces
Adept AI – Solutions Engineer
Mistral AI – Data Annotation Technical Program Manager
A Daily chronicle of AI Innovations May 08th 2024:
Google built medical AI that can train itself and outperform real doctors
Microsoft launches AI chatbot for spies
OpenAI’s new tool detects 98% DALL-E 3 generated images
Meta expands AI-powered creativity tools for advertisers
OpenAI’s ‘Media Manager’ will let creators opt out of AI training
Apple releases M4 chip at the ‘Let Loose’ event with powerful AI capabilities
OpenAI strikes licensing deal with People magazine publisher
Amazon announces Bedrock Studio to simplify Gen AI app development
Oracle introduces Code Assistant to accelerate enterprise software development
Red Hat launches RHEL AI and InstructLab to democratize enterprise AI
Apple unveils iPad AI upgrades
Google built medical AI that can train itself and outperform real doctors
- Google’s DeepMind has developed Med-Gemini, a self-training medical AI that outperforms doctors in accuracy, achieving a 91.1% score on medical benchmarks.
- Med-Gemini uses an uncertainty-guided search functionality and has bested GPT-4 models and actual doctors on 10 out of 14 medical benchmarks.
- Despite its capabilities, Google’s medical AI aims to assist rather than replace doctors, helping to alleviate their workload.
- Source
Microsoft launches AI chatbot for spies
- Microsoft has launched a GPT-4-based generative AI model tailored for US intelligence agencies that operates offline to analyze top-secret information securely.
- The AI chatbot aims to facilitate secure conversations among spy agencies without the internet, addressing data breach and hacking concerns.
- This initiative represents Microsoft’s first deployment of a major language model in a high-security environment, with a focus on mitigating the risks of data leaks while processing classified data
- Source
OpenAI’s new tool detects 98% DALL-E 3 generated images
OpenAI has developed a new tool to detect if an image was created by DALL-E 3, its AI image generator. The tool can detect DALL-E 3 images with around 98% accuracy, even if the image has been cropped, compressed, or had its saturation changed. However, the tool is not as effective at detecting images generated by other AI models, only flagging 5-10% of images.
This image detection classifier is only available to a group of testers, including research labs and research-oriented journalism nonprofits through OpenAI’s Research Access Program.
OpenAI has also added watermarking to Voice Engine, its text-to-speech platform, which is currently in limited research preview.
Why does it matter?
Early experiences have shown that AI detectors don’t work. In fact, if they have high error rates, they could lead to false accusations. In 2023, OpenAI had to shut down its own AI detection software for text because of its poor accuracy.
So, if this detector is as good as OpenAI claims, we may be on the precipice of a revolutionary new capability to reliably detect AI-generated content, with huge implications across domains.
Meta expands AI-powered creativity tools for advertisers
Meta has expanded its generative AI tools for advertisers. Advertisers can request AI to generate entirely new images, including product variations in different colors, angles, and scenarios. The AI tools can add text overlays with different fonts, expand images to fit different aspect ratios like Reels and Feed, and generate ad headlines that match the brand’s voice.
The AI features will roll out globally to advertisers by the end of 2024.
Meta is also expanding its paid Meta Verified service for businesses to more countries. Different pricing tiers offer features like account support, profile enhancements, and better customer service access.
Why does it matter?
Integrating such powerful AI features could lead to more effective advertising campaigns and improved customer engagement with targeted marketing and personalized ads. However, it could also raise questions about transparency and potential misuse.
OpenAI’s ‘Media Manager’ will let creators opt out of AI training
OpenAI is developing Media Manager, a tool that will enable creators and content owners to decide what they own and specify how they want their works to be included or excluded from machine learning research and training. This first-ever tool of its kind will help OpenAI identify copyrighted text, images, audio, and video across multiple sources and reflect creator preferences.
OpenAI aims to have the tool in place by 2025 and set a standard across the AI industry with it.
Why does it matter?
Media Manager seems to be OpenAI’s response to growing criticism of its approach to developing AI models, which heavily scraps publicly available data from the web for training. Recently, 8 prominent U.S. newspapers sued OpenAI for copyright infringement.
On the other hand, OpenAI has formed mutually beneficial partnerships with platforms like Stack Overflow, Shutterstock, The Financial Times, and more to use their content.
So, OpenAI may be trying to meet creators in the middle, but if it is positioning itself as a fully ethical actor with this, we’ll take it with a grain of salt.
Apple unveils iPad AI upgrades
Apple just revealed its new line of iPads at a company event in Cupertino, CA — featuring a custom M4 chip that enables advanced AI capabilities and a slew of new AI-powered features.
- The bigger iPad Pro now features the M4 chip with an upgraded Neural Engine, which CEO Tim Cook calls “an outrageously powerful chip for AI”.
- The M4 is capable of handling 38T operations per second, 4x the performance of previous models, allowing for the running of advanced AI.
- New AI features on the Pro include a True Tone Flash for document scanning and new video, image, and music editing tools.
- Prior to the event, a new report revealed that Apple is developing its own AI chips for data centers under the code name “Project ACDC”.
What Else Is Happening in AI on May 08th 2024
Apple releases M4 chip at the ‘Let Loose’ event with powerful AI capabilities
Apple released its much-anticipated M4 chip at the “Let Loose” event. M4 is slated to spearhead Apple’s next generation of devices, with the iPad Pro leading the charge and powering the forthcoming OLED iPad Pro which is meticulously engineered to elevate the user experience to unprecedented heights. (Link)
OpenAI strikes licensing deal with People magazine publisher
OpenAI has inked a licensing deal with Dotdash Meredith to bring the People magazine publisher’s content to ChatGPT and help train it’s AI models. Under the partnership, OpenAI will be able to display lifestyle and entertainment content in its chatbot from the many websites of one of the US’s largest digital and print publishers. (Link)
Amazon announces Bedrock Studio to simplify Gen AI app development
Amazon is launching a new tool, Bedrock Studio, designed to let organizations experiment with generative AI models, collaborate on those models, and ultimately build generative AI-powered apps. Bedrock Studio is a “rapid prototyping environment” for generative AI. It also guides developers in evaluating, analyzing, fine-tuning, and sharing generative AI models. (Link)
Oracle introduces Code Assitant to accelerate enterprise software development
Oracle has announced Code Assitant, an AI-powered service to help developers rapidly program apps based on Java, SQL, and the Oracle Cloud infrastructure. It will join tools like GitHub Copilot and Amazon CodeWhisperer to accelerate the app development lifecycle. However, Oracle hasn’t yet specified when this feature will be released. (Link)
Red Hat launches RHEL AI and InstructLab to democratize enterprise AI
At Red Hat Summit 2024, RedHat announced two major initiatives to bring the power of generative AI to the enterprise. Red Hat Enterprise Linux AI (RHEL AI), a foundation model platform for developing and running open-source language models, and InstructLab, a community project to enable domain experts to enhance AI models with their knowledge. (Link)
AI TRAINING:
Chat with YouTube videos using Gemini
Google Gemini’s new “Extensions” feature allows users to access external tools such as YouTube to chat with videos and get answers for free.
Step-by-step:
- Visit Google’s Gemini website. If Gemini is not available in your country, you’ll need to use a US-based VPN.
- Click on the gear icon located on the bottom-left, select Extensions, and turn on the YouTube one.
- Go back to the Chat interface and start your prompt using the following format: “@youtube Summarize the following video [Youtube URL]”
Pro tip: Try asking Gemini to explain advanced concepts discussed in a video, generating concrete examples, creating practice questions, and even asking for code snippets
Trending AI Tools on May 08th 2024:
Eraser AI – Technical design copilot that helps users edit documents and generate diagrams easily
Snaplet Seed – Cease manual data creation, get instant realistic data for your project
Aria Browser AI – Opera’s browser AI can now create images and read out text responses
Meaning – Block distracting apps with an AI screen time coach
Dante Unlimited AI Messages – No more hidden fees for AI chatbot creation
New AI Job Opportunities May 08th 2024
The Rundown – Paid Growth Lead
Scale AI – Field Engineer
Perplexity AI – Software Engineer – Billing
Anyscale – Staff Accountant
A Daily chronicle of AI Innovations May 07th 2024:
Apple is developing its own AI chip for data center servers
Stack Overflow and OpenAI have announced an API partnership
Microsoft is developing a new AI language model to rival GPT-4 and Google Gemini
Study: Teachers can’t detect AI writing
Apple’s new M4 chip is focused on AI
Hugging Face has launched LeRobot, an open-source robotics toolkit
Apple is testing a new “Clean Up” feature in its Photos app
Google has launched Google Threat Intelligence
US invests $285M in AI ‘Digital Twin’ technology
Anduril Industries introduces Pulsar: AI modular electromagnetic warfare (EW) systems
Apple’s new M4 chip is focused on AI
- Apple’s new M4 chip, designed for the latest OLED iPad Pro models, focuses on enhancing AI performance with capabilities like handling 38 trillion operations per second, which is significantly higher than its predecessors but still behind Qualcomm’s Snapdragon X Elite.
- The M4 chip utilizes advanced second-generation 3nm technology, featuring a 10-core CPU with a mix of performance and efficiency cores, and supports technologies like mesh shading and ray tracing for improved graphical rendering.
- Apple’s announcement of the M4 chip sets the stage for a comparison with Microsoft’s upcoming Arm-powered hardware and Windows on Arm enhancements, scheduled for release later this month, which aim to surpass Apple’s MacBook Air M3 in terms of CPU and AI performance.
Apple is developing its own AI chip for data center servers
Apple is developing its own AI chip for data center servers, known internally as Project ACDC (Apple Chips in Data Center). The chip will likely focus on running AI models (inference) rather than training them, which is where Nvidia currently dominates.
The company is working closely with TSMC (Taiwan Semiconductor Manufacturing Co) to design and produce these chips, although the timeline for launch is uncertain. With this move, the company aims to keep up with rivals like Microsoft and Meta, who have made significant investments in generative AI.
Why does it matter?
Apple has a long history of designing custom chips for its devices like iPhones, iPads, and Macs, which is probably what makes them stand out. Having custom AI chips could allow the tech giant more control over its “AI destiny” versus relying on suppliers like Nvidia.
Stack Overflow and OpenAI have announced an API partnership
OpenAI will use OverflowAPI to improve model performance and provide attribution to the Stack Overflow community within ChatGPT. Stack Overflow will use OpenAI models to develop OverflowAI and to maximize model performance.
The partnership aims to improve the user and developer experience on both platforms. The first set of integrations and capabilities will be available in the first half of 2024, and the partnership will enable Stack Overflow to reinvest in community-driven features.
Why does this matter?
Stack Overflow partnered with Google Cloud to develop Overflow API and to give Google’s Gemini models access to its knowledge communities. Now it is forming a similar partnership with OpenAI. Despite concerns about copyright breaches, such partnerships seem to be trending where both the parties have much to gain, but it just reaffirms that the big AI players remain hungry for data.
Microsoft is developing a new AI language model
Microsoft is developing a new, large-scale AI language model called MAI-1 to compete with Google and OpenAI. The model is overseen by Mustafa Suleyman, recently hired co-founder of Google DeepMind.
MAI-1 will be larger and more expensive than Microsoft’s previous smaller, open-source models, with roughly 500 billion parameters. Microsoft could preview the new model as soon as its Build developer conference later this month.
Why does this matter?
Microsoft’s development of MAI-1 shows that it is not entirely relying on it’s OpenAI investment to go big in AI. Now, it has entered the AI race truly, competing with state-of-the-art models from Google, Anthropic, even Meta’s Llama 400B which is in training, and OpenAI itself.
What Else Is Happening in AI on May 07th 2024
Hugging Face has launched LeRobot, an open-source robotics toolkit
It is a comprehensive platform for developers, researchers, and hobbyists to train AI models, share data, and simulate environments, all while seamlessly integrating with various robotic hardware. The toolkit offers pre-trained models and integrates with physics simulators for testing without physical robots. Hugging Face is also collaborating with diverse partners to build the largest crowdsourced robotics dataset. (Link)
Apple is testing a new “Clean Up” feature in its Photos app
By using gen AI for advanced image editing, this feature will allow you to effortlessly remove unwanted objects from your photos using a simple brush. Apple may preview this new feature during its upcoming “Let Loose” iPad event or at WWDC in June. (Link)
Google has launched Google Threat Intelligence
It is a combination of Mandiant’s expertise, VirusTotal’s community insights, and Google’s vast threat visibility. Google Threat Intelligence assists with external threat monitoring, attack surface management, digital risk protection, IoC analysis, and expertise. With Gemini, organizations can now quickly search through vast amounts of threat data to protect against cyber threats. (Link)
US invests $285M in AI ‘Digital Twin’ technology
The Biden administration is investing $285 million for a new “CHIPS Manufacturing USA institute” focused on digital twins for the semiconductor industry. This approach uses AI to create virtual chip replicas, accelerating the production of next-gen processors. Intel and Micron are also set to receive funding to boost the development of new processors. (Link)
Anduril Industries introduces Pulsar: AI modular electromagnetic warfare (EW) systems
Pulsar uses AI to quickly identify and counter current and future threats across the electromagnetic spectrum, including small and medium-size drones. With its integration of software-defined radio, GPUs, and diverse compute capabilities, Pulsar is changing how we defend against rapidly evolving threats in an increasingly complex battlefield. (Link)
AI TRAINING May 07th 2024
Studio-quality audio in seconds with AI

Adobe’s AI-powered ‘Enhance Speech’ tool dramatically improves the quality of audio voice recordings with just a few clicks.
Step-by-step:
- Access Adobe’s Enhance Speech website. Sign in or sign up for free.
- Upload your audio by selecting “Choose file” or simply drag and drop the file.
- Wait a few seconds while Adobe’s AI analyzes and removes unwanted noises from your file.
- Download your studio-quality audio and hear the difference!
Pro tip: If you have a video file, you can extract the audio using free websites that extract audio from video and add the enhanced audio back to your video using free video editors like CapCut
AI RESEARCH on May 07th 2024
Study: Teachers can’t detect AI writing

A series of studies from several German universities found that both novice and experienced teachers struggled to accurately distinguish between student-written and AI-generated texts.
The details:
- The first study trialed pre-service teachers on AI and student text, correctly ID’ing only 45.1% of AI-generated submissions.
- The second study tested experienced teachers, who correctly identified just 37.8% of AI-generated submissions.
- The study revealed that teachers were overconfident in their ability to spot AI-generated text.
- AI detection software also often falsely flags student-written texts as potentially AI-generated, which further complicates the issue.
Why it matters: AI’s writing capabilities are only getting better — and relying on teacher intuition or unreliable tools may be no more effective than guessing. Unless better tools become available, it may be time to pivot to enabling students to work with AI instead of penalizing them for it.
Trending AI Tools May 07th 2024
Osum – Perform deep market research in seconds: Try now (Use checkout code RUNDOWN for 25% off)*
Coachvox – The ultimate AI marketing tool for coaches with content
Pressmaster – All-in-one platform simplifies press planning and distribution
Lamini – Enterprises LLM platform that you can own
Postman v11 – Supercharging development with AI and APIs
Adobe Acrobat AI Assistant – Get quick answers and one-click summaries from PDFs
Leonardo AI Content Reference – Enhance creative control for image generations
New AI Job Opportunities May 07th 2024
The Rundown – Community Manager
Scale AI – Senior Analytics Engineer
Anthropic – Risk Manager, Responsible Scaling Team
OpenAI – Tech Lead Manager – Supercomputing Scheduling
A Daily chronicle of AI Innovations May 06th 2024:
DrEureka can automate robot training using LLMs
Free AI model rivals GPT-4 in language model evaluation
X introduces Stories feature powered by Grok AI
Privacy complaint filed against OpenAI
JPMorgan launches an AI-powered tool for thematic investing
YouTube Premium introduces AI-powered “Jump ahead” feature
AI is now set to transform the drug discovery industry
AI helps bring back Randy Travis’ voice in new song
Jack Dorsey has left Bluesky, calls X a ‘freedom technology’
Why Warren Buffet isn’t investing in AI
Massive prompts can outperform fine-tuning for LLMs, researchers find
DrEureka can automate robot training using LLMs
In robotics, one of the biggest challenges is transferring skills learned in simulation to real-world environments. NVIDIA researchers have developed a groundbreaking algorithm called DrEureka that uses LLMs to automate the design of reward functions and domain randomization parameters—key components in the sim-to-real transfer process.
The algorithm works in three stages: first, it creates reward functions with built-in safety instructions; then, it runs simulations to determine the best range of physics parameters; finally, it generates domain randomization configurations based on the data gathered in the previous stages.
When tested on various robots, including quadrupeds and dexterous manipulators, DrEureka-trained policies outperformed those designed by human experts.
Why does it matter?
DrEureka makes robot training accessible and cost-effective for businesses and researchers alike. We may witness increased adoption of robotics in industries that have previously been hesitant to invest in the technology due to the complexity and cost of training robots for real-world applications.
Free AI model rivals GPT-4 in language model evaluation
Prometheus 2, a free and open-source language model developed by KAIST AI, has shown impressive capabilities in evaluating other language models, approaching the performance of commercial models like GPT-4.
The model was trained on a new pairwise comparison dataset called the “Preference Collection,” which includes over 1,000 evaluation criteria beyond basic characteristics. By combining two separate models – one for direct ratings and another for pairwise comparisons – the researchers achieved the best results.
In tests across eight datasets, Prometheus 2 showed the highest agreement with human judgments and commercial language models among all freely available rating models, significantly closing the gap with proprietary models.
Why does this matter?
By enabling user-defined evaluation criteria, Prometheus 2 can be tailored to assess language models based on specific preferences and real-life scenarios, opening up new possibilities for developing specialized AI applications across various domains. It’s also an opportunity to create niche models that are culturally sensitive and relevant.
X introduces Stories feature powered by Grok AI
X (formerly Twitter) has launched a new feature, Stories, that provides AI-generated summaries of trending news on the platform. Powered by Elon Musk’s chatbot Grok, Stories offers Premium subscribers brief overviews of the most popular posts and conversations happening on X.

With Stories, users can quickly catch up on the day’s trending topics without having to scroll through countless posts. Grok generates these summaries based solely on the conversations happening on X about each news story rather than analyzing the original news articles themselves. While this approach is controversial, X believes it will pique users’ curiosity and potentially drive them deeper into the source material.
Why does this matter?
X’s Grok-powered Stories feature may reshape the way we consume news. As more platforms integrate AI news summarization tools, traditional media outlets may face challenges in maintaining reader engagement and revenue. However, the reliance on platform-specific conversations for generating summaries raises concerns about the potential spread of misinformation and the creation of echo chambers.
Why Warren Buffet isn’t investing in AI
- Warren Buffett predicts AI scamming will become a major growth industry, highlighting its capacity to generate realistic and misleading content to deceive individuals.
- Buffett acknowledges AI’s potential for both beneficial and detrimental impacts, admitting his own lack of understanding of the technology.
- Despite the hype around AI in the investment world, Buffett exercises caution, likening its unpredictable potential impact to that of nuclear weapons.
- Source
Massive prompts can outperform fine-tuning for LLMs, researchers find
- Researchers have discovered that using large prompts with numerous examples in LLMs can be more effective than traditional fine-tuning techniques.
- “In-context learning” (ICL) becomes increasingly efficient with larger context windows, allowing for the effective use of hundreds to thousands of examples without the need for task-specific training.
- The study indicates that while fine-tuning requires more initial data and resources, ICL with extensive prompts presents a viable and sometimes superior alternative, especially as models improve in processing long inputs.
- Source
What Else Is Happening in AI on May 06th 2024
Privacy complaint filed against OpenAI
The maker of ChatGPT is facing a privacy complaint in the European Union (EU) for its “hallucination problem.” The complaint alleges violations of GDPR, including misinformation generation and lack of transparency on data sources. The report highlights concerns about accuracy, data access, and the inability of ChatGPT to correct incorrect information. (Link)
JPMorgan launches an AI-powered tool for thematic investing
IndexGPT is a new range of thematic investment baskets created using OpenAI’s GPT-4 model. The tool generates keywords associated with a theme, which are then used to identify relevant companies through natural language processing of news articles. IndexGPT aims to improve the selection of stocks for thematic indexes, going beyond obvious choices and potentially enhancing trend-following strategies. (Link)
YouTube Premium introduces AI-powered “Jump ahead” feature
The AI-powered feature allows users to skip past commonly skipped sections of a video and jump to the next best point. It is currently available for the YouTube Android app in the US with English videos and can be enabled through the experiments page. (Link)
AI is now set to transform the drug discovery industry
Generative AI is now rapidly generating novel molecules and proteins that humans may not have considered. AI models, such as Google’s AlphaFold, are accelerating the drug development process from years to months while increasing success rates. Experts predict that AI-designed drugs will become the norm in the near future, but they will still need to prove their efficacy in human trials. (Link)
AI helps bring back Randy Travis’ voice in new song
Country singer Randy Travis has released a new song, “Where That Came From,” his first since losing his voice to a stroke in 2013.
The vocals were created using AI software and a surrogate singer under the supervision of Travis and his producer. The result is a gentle tune that captures Travis’ relaxed style, reinforcing the potential of AI voice cloning in the right hands. (Link)
AI Tutorial on May 06th 2024:
How to use ChatGPT’s new ‘Memory’ feature
OpenAI has rolled out a new feature called “Memory” for ChatGPT plus users, enabling it to remember specific user details across chats. |
Step-by-step: |
|
That’s it! You can now have more personalized conversations across all your conversations ![]() |
New AI Job Opportunities on May 06th 2024
Robust Intelligence – Solution Architect
Anthropic – Audit and Compliance
Hugging Face – Product Software Engineer, ML Platform
Character AI – Staff Data Engineer – Analytics
A Daily chronicle of AI Innovations May 04th 2024:
World’s first AI-generated music video was just released
Air Force launches an AI-controlled F-16 fighter jet Microsoft ties executive pay to security following multiple failures and breaches
X is using Grok to publish AI-generated news summaries
World’s first AI-generated music video was just released
- A pro music video created with OpenAI’s Sora, a text-to-video AI model, has been released by writer-director Paul Trillo, marking the first of its kind.
- The video for “The Hardest Part” by Washed Out utilized 700 clips generated by Sora, with 55 selected for the final edit, showcasing the AI’s ability to transform written descriptions into video clips.
- Despite not being publicly available yet, Sora’s early access was granted to a select group of artists, including Trillo, sparking interest and debate about the future of AI in creative processes.
- Source
Air Force launches an AI-controlled F-16 fighter jet
- An experimental AI-powered F-16 fighter jet, carrying Air Force Secretary Frank Kendall, performed advanced combat maneuvers, demonstrating significant advancements in military aviation technology.
- The Air Force plans to have over 1,000 AI-enabled unmanned warplanes by 2028, with the AI-controlled F-16, known as Vista, marking a historic step in air combat by executing a dogfight against a human-piloted F-16.
- Despite concerns over AI’s autonomous decision-making in warfare, the technology is seen as essential for future combat scenarios, aiming to reduce pilot risk and lower costs compared to traditional manned fighters.
- Source
X is using Grok to publish AI-generated news summaries
- X is using Grok to create AI-generated summaries for news and discussions trending on its platform, offering them under the feature “Stories on X” exclusively to premium subscribers.
- Grok generates summaries based on users’ posts, covering a wide range of topics including news events and platform discussions, with a user highlighting its capability by showcasing stories on diverse subjects such as Apple’s earnings and debates on national debt.
- Despite its innovative approach, Grok is noted for potential inaccuracies in its summaries, as evidenced by a past error where it misinterpreted a basketball term, highlighting the need for users to verify its outputs.
- Source
AI Weekly Rundown April 26th to May 03rd 2024
iOS 18 may have OpenAI-powered gen AI Capabilities
China’s Vidu generates 16-second 1080P videos, matching OpenAI’s Sora
New S1 robot mimics human-like movements, speed, and precision
Gradient AI releases Llama-3 8B with 1M context
Mysterious “gpt2-chatbot” AI model bemuses experts
GitHub’s Copilot Workspace turns ideas into AI-powered software
Amazon launches Amazon Q, the world’s most capable Gen AI assistant
Google’s Med-Gemini models outperform doctors
Apple has set up a secretive AI lab in Switzerland
Better and faster LLMs via multi-token prediction: New research
Anthropic launches an iOS app and a new plan for teams
Google’s AI advancements urged Microsoft’s billion-$ OpenAI investment
Scale AI’s study finds popular LLMs overfit public benchmarks
Ukraine debuts the world’s first AI diplomat, Victoria Shi
Sam Altman is ready to spend $50 billion a year to build AGI
A Daily chronicle of AI Innovations May 03rd 2024:
Bill Gates is still pulling the strings at Microsoft;
AI engineers report burnout as ‘rat race’ to stay competitive hits tech industry;
Ukraine introduces AI spokesperson for state affairs;
How much do LLMs overfit public benchmarks?
Sam Altman’s stance on the future of AI 🌟 New from #NVIDIAResearch, VILA is a vision language model that can 🤔 reason among multiple images, learn in context, and even understand videos.
How much do LLMs overfit public benchmarks?
A new study by Scale AI raises concerns about the reliability of LLM benchmark tests. It uncovers LLM overfitting by evaluating them on a new (designed from scratch) dataset, GSM1k that mimics a popular benchmark, GSM8k.
Key findings:
- Overfitting: Many LLMs performed significantly worse on GSM1k compared to GSM8k, with some models dropping by as much as 13%. This suggests they’ve simply memorized the answers to benchmark problems rather than learning true reasoning skills.
- Family Trends: Certain LLM families, particularly Mistral and Phi, showed consistent overfitting across different model sizes.
- Frontier Models Shine: Newer, more advanced LLMs showed minimal signs of overfitting, suggesting they may be achieving genuine reasoning abilities.
- Data Contamination Suspected: Analysis suggests data contamination from benchmark sets may be one factor contributing to overfitting.
- Reasoning Still Present: Even overfitting models exhibited some capability to solve novel problems, although not at the level their benchmark scores suggested.
Overall, the study highlights the need for more robust and reliable methods for evaluating LLM reasoning abilities.
Why does it matter?
The dataset proves that overfitting may be creating major false impressions of model performance. As AI capabilities continue to advance, it is crucial to develop evaluation approaches that can keep pace and provide a more accurate picture of a model’s real-world potential.
Sam Altman’s stance on the future of AI
During a recent appearance at Stanford University, Altman talked about the future of AI, calling GPT-4, a currently impressive AI model, to be the “dumbest model” compared to future iterations. According to Altman, the future is dominated by “intelligent agents,” AI companions that can not only follow instructions but also solve problems, brainstorm solutions, and even ask clarifying questions.
OpenAI isn’t just talking about the future, they’re actively building it. Their next-generation model, GPT-5, is rumored for a mid-2024 release and might boast video generation capabilities alongside text and image.
But the real moonshot is their active participation in developing AGI.
Despite the significant costs involved, Altman remains undeterred. He believes that the potential benefits, such as solving complex problems across various industries, outweigh the financial burden.
Watch the whole Q&A session here.
Why does this matter?
Altman’s bold comments on GPT-4 being the dumbest model suggest that OpenAI is aiming for something even grander, and GPT-5 could be a stepping stone toward it (the next-gen AI framework).
Bill Gates is still pulling the strings at Microsoft
- Bill Gates predicted the revolution in computing through AI agents in a memo before Microsoft’s partnership with OpenAI, and despite public appearances, has been heavily involved in guiding Microsoft’s AI strategy.
- Under Gates’ influence, Microsoft developed Copilot, an AI tool based on OpenAI’s GPT, transforming the company into the world’s most valuable public entity by enhancing productivity in Microsoft products.
- Gates has continued to play a pivotal role behind the scenes at Microsoft, advising on strategy, product development, and fostering key relationships, notably with OpenAI, even after stepping down from the board amid misconduct allegations.
- Source
AI engineers report burnout as ‘rat race’ to stay competitive hits tech industry
- AI engineers at Microsoft and other companies are reporting burnout due to the pressure to quickly release AI products, amidst an industry-wide “AI rat race.”
- Microsoft’s focus on speed over ethics and safety in AI development has been criticized, even as the company forms a Responsible AI Council to address safety concerns.
- An Amazon AI engineer’s account of being forced to work over a weekend, only for the project to be canceled, highlights a similar disregard for worker well-being and a preference for speed over accuracy and testing in AI product development.
- Source
Ukraine introduces AI spokesperson for state affairs
- Ukraine’s Foreign Ministry introduced an artificial intelligence-powered spokesperson, Victoria Shi, to deliver official statements and interact with the press, marking a historic use of digital personnel for governmental communications.
- Victoria Shi, created with the likeness and voice of Ukrainian singer Rosalie Nombre, will have her statements written and confirmed by humans, using AI for her visual presentation.
- The ministry aims to utilize AI, including measures like QR codes for statement verification, to save time and money, particularly in the context of its ongoing conflict with Russia.
- Source
What Else Is Happening in AI on May 03rd, 2024
OpenAI prepares to challenge Google with ChatGPT-powered search: OpenAI is building a search engine, search.chatgpt.com, potentially powered by Microsoft Bing. This leverages their existing web crawler and Bing’s custom GPT-4 for search, posing a serious threat to Google’s dominance. (Link)
Microsoft bans U.S. police use of Azure OpenAI for facial recognition
Microsoft has banned U.S. police from using Azure OpenAI Service for facial recognition, including integrations with OpenAI’s image-analyzing models. The move follows Axon’s controversial GPT-4-powered tool to summarize audio from the body camera. However, the ban has exceptions and doesn’t cover Microsoft’s other AI law enforcement contracts. (Link)
IBM expands AI and data software on AWS marketplace
IBM has significantly expanded its software offerings on the AWS Marketplace, making 44 products accessible to customers in 92 countries, up from just five. The move, part of a strategic collaboration with AWS, focuses on AI and data technologies like Watson x.data, Watson x.ai, and the upcoming Watson x.governance. (Link)
Google Cloud supports Azure and AWS; integrates AI for security
Google Cloud now supports Azure and AWS, enabling enterprises to manage security across multi-cloud platforms. AI integration with existing solutions streamlines user experience and addresses the security talent gap. The AI-powered design manages risks efficiently amid increasing cyber threats, while extensive support simplifies tasks for enterprises. (Link)
Microsoft invests $2.2B in Malaysia’s cloud and AI transformation
Microsoft is investing $2.2 billion over the next four years to support Malaysia’s digital transformation, its largest investment in the country’s 32-year history. The investment includes building cloud and AI infrastructure, creating AI skilling opportunities for 200,000 people, establishing a national AI Centre of Excellence, enhancing cybersecurity capabilities, and supporting the growth of Malaysia’s developer community. (Link)
Trending AI Tools on May 03rd 2024
Waxaing AI – Project management tool for growth marketers
Amazon Q – Generative AI-powered assistant from AWS
Soundry AI – AI sound sample VST for music creation and DJing
NVIDIA ChatRTX – New features including multi-language voice support and more LLMs
Synthesia Expressive AI Avatars – Create AI avatars with human-like expressions
Relume – Create a wireframe and UX copy in seconds with AI
New AI Jobs opportunity on May 03rd 2024
AE Studio – Product Designer & Manager
The Rundown – Newsletter Editor
Scale AI – Operations Program Manager, Code Quality
Anyscale – Content Strategist
This week in AI – all the Major AI developments in a nutshell
- AI21 Labs introduced Jamba, a production-grade Mamba based model. By enhancing Mamba Structured State Space model (SSM) technology with elements of the traditional Transformer architecture, Jamba compensates for the inherent limitations of a pure SSM model. Jamba optimizes for memory, throughput, and performance—all at once. It outperforms or matches other state-of-the-art models in its size class. Jamba has been released with open weights, licensed under Apache 2.0. Available on Hugging Face and coming soon to the NVIDIA API catalog [Details].
- Databricks introduced DBRX, an open, general-purpose LLM that uses a fine-grained mixture-of-experts (MoE) architecture with 132B total parameters of which 36B parameters are active on any input. Across a range of standard benchmarks, DBRX outperforms open LLMs like Mixtral, LLaMA2-70B and Grok-1. It surpasses GPT-3.5, and it is competitive with Gemini 1.0 Pro. It is an especially capable code model, surpassing specialized models like CodeLLaMA-70B on programming, in addition to its strength as a general-purpose LLM. The model is also available on the Databricks API [Details].
- Hume AI released Empathic Voice Interface (EVI), a first-of-its-kind conversational AI with emotional intelligence. EVI uses a new form of multimodal generative AI that integrates large language models (LLMs) with expression measures, which Hume refers to as an empathic large language model (eLLM). The eLLM enables EVI to adjust the words it uses and its tone of voice based on the context and the user’s emotional expressions [Demo | Details | wait list].
- Tencent introduced AniPortrait, a novel framework for generating high-quality animation driven by audio and a reference portrait image. Code and model weights have been released [Paper | GitHub]
- X announced an update to its AI chatbot Grok-1.5, with improvements in performance in coding and math-related tasks, and a context length of 128,000 tokens. Grok-1.5 will soon be available to early testers. Earlier Elon Musk had announced that all Premium subscribers on X will gain access to Grok this week, not just those on Premium+, as before [Details]
- Qwen (Alibaba Cloud) released Qwen1.5-MoE, a 2.7B, a small MoE model with only 2.7 billion activated parameters yet matching the performance of state-of-the-art 7B models like Mistral 7B and Qwen1.5-7B. Compared to Qwen1.5-7B, which contains 6.5 billion non-embedding parameters, it achieves a 75% decrease in training expenses and accelerates inference speed by a factor of 1.74 [Details].
- Claude 3 models dominates LMSYS Chatbot Arena Leaderboard. Claude 3 Opus tops the list beating GPT-4 Turbo, while Claude 3 Sonnet outperform older GPT-4 models and Claude 3 Haiku beating Mistral Large [Link].
- Adobe introduces structure reference feature for Firefly AI and GenStudio for brands. It enables taking one image and generating new ones that may be completely different stylistically, but whose internal elements are arranged and sized similarly to the first image [Details].
- Mata AI introduced OPT2I, a a training-free text-to-image (T2I) optimization-by-prompting framework that provides refined prompts for a T2I model that improve prompt-image consistency. The framework starts from a user prompt and iteratively generates revised prompts with the goal of maximizing a consistency score. OPT2I can boost the prompt-image consistency by up to 24.9% [Paper]
- OpenAI has started testing usage-based GPT earnings by partnering with a small group of US builders [Details].
- Adobe introduced Firefly Services and Custom Models. Firefly Services makes over 20 new generative and creative APIs available to developers. Custom Models, allows businesses to fine tune Firefly models based on their assets [Details].
- Nvidia announced a collaboration with Hippocratic AI , a healthcare company that is offering generative AI nurses, that range in specialties from “Colonoscopy Screening” to “Breast Cancer Care Manager,” and work for $9 an hour [Details].
- Worldcoin Foundation open-sourced the core components of its iris-scanning Orb’s software [Details].
- Emad Mostaque resigned from his role as CEO of Stability AI and from his position on the Board of Directors of the company to pursue decentralized AI [Details]
- Stability AI released Stable Code Instruct 3B, an instruction-tuned Code LM based on Stable Code 3B. With natural language prompting, this model can handle a variety of tasks such as code generation, math and other software development related queries [Details].
- Mistral AI released Mistral-7B-v0.2 Base model. This is the base model behind Mistral-7B-Instruct-v0.2 released in Dec, 2023 [Details]
- Open AI shared new examples of the Sora generations by visual artists, designers, creative directors and filmmakers [Details].
A Daily chronicle of AI Innovations May 02nd 2024:
Sam Altman: “GPT-4 is the dumbest AI you’ll ever have to use”
TikTok and UMG strike new licensing deal including AI protections
Anthropic’s ChatGPT rival Claude is now available on iOS
Atlassian launches Rovo, its new AI teammate
Better and faster LLMs via multi-token prediction: New research
Google’s AI advancements urged Microsoft’s billion-dollar OpenAI investment
Sanctuary AI teams up with Microsoft to advance general-purpose robot AI
Nvidia’s ChatRTX now supports voice queries and Google’s Gemma model
Atlassian launches Rovo: An AI assistant for enhanced teamwork
MongoDB launches an AI app-building toolkit to help businesses use gen AI
Ideogram introduces Pro Tier: 12,000 fast AI image generations monthly
Sam Altman: “GPT-4 is the dumbest AI you’ll ever have to use”
- OpenAI CEO Sam Altman described GPT-4 as the “dumbest” AI model users will have to interact with, anticipating significantly smarter versions in the future.
- Altman emphasized that OpenAI’s AI models, including GPT-4, have considerable potential for improvement, with successive models like GPT-5 and GPT-6 expected to be significantly smarter.
- Despite the high cost of development, which could range from $500 million to $50 billion annually, Altman is confident in the future value these advanced AI systems will bring to society.
- Source
TikTok and UMG strike new licensing deal including AI protections
- UMG has reached a “multi-dimensional” agreement with TikTok to bring back its artists, including Taylor Swift, Drake, and Olivia Rodrigo, to the platform’s over one billion users.
- The deal includes provisions to collaborate on ensuring AI development in the music industry protects artists’ rights and addresses unauthorized AI-generated music on TikTok.
- Additionally, the agreement will introduce new monetization opportunities for artists through TikTok’s expanding e-commerce capabilities and enhance tools for artist promotion, including analytics and integrated ticketing.
- Source
Anthropic’s ChatGPT rival Claude is now available on iOS
- Anthropic has launched an iOS app for its AI model Claude 3, which has been anticipated due to the company’s recent hiring of iOS engineers.
- The app offers both free and premium access to Claude 3’s features, including its vision capabilities for real-time image analysis and summarization.
- Anthropic also launched a new “Team” plan, priced at $30 per user per month, offers advanced administrative controls and collaboration features, aiming to secure a stronger foothold in the enterprise AI market.
- Source
Atlassian launches Rovo, its new AI teammate
- Atlassian launched Rovo, an AI assistant designed to integrate with its products, making data from various tools easily accessible and enabling automation of workflows in applications like Jira and Confluence.
- Rovo is based on Atlassian’s “cloud teamwork graph” and aims to simplify tasks for knowledge workers by assisting in finding, understanding, and taking action on work-related information.
- The AI features Rovo Agents that can automate complex tasks and workflows, offering customization through natural language interface without the need for programming skills.
- Source
Better and faster LLMs via multi-token prediction: New research
New research, apparently from Meta, has proposed a novel approach to training language models (LMs). It suggests that training LMs to predict multiple future tokens at once instead of predicting only the next token in a sequence results in higher sample efficiency. The architecture is simple, with no train time or memory overhead.
The research also provides experimental evidence that this training paradigm is increasingly useful for larger models and in particular, shows strong improvements for code tasks. Multi-token prediction also enables self-speculative decoding, making models up to 3 times faster at inference time across a wide range of batch sizes.
Why does it matter?
LLMs such as GPT and Llama rely on next-token prediction. Despite their recent impressive achievements, next-token prediction remains an inefficient way of acquiring language, world knowledge, and reasoning capabilities. It latches on local patterns and overlooks “hard” decisions.
Perhaps, multi-token prediction could bring a shift in how LMs learn. It could equip LLMs with deeper understanding and complex problem-solving capabilities. (or Meta just wasted their compute.)
Anthropic launches an iOS app and a new plan for teams
Anthropic, the creator of the Claude 3 AI models, released a new iOS app named Claude. The app enables users to access AI models, chat with them, and analyze images by uploading them.
Anthropic also introduced a paid team plan, offering enhanced features like more chat queries and admin control for groups of five or more. The app is free for all users of Claude AI models, including free users, Claude Pro subscribers, and team plan members. The company will also roll out an Android version soon.
Why does it matter?
Though a little late with its mobile app, Anthropic has caught up with its competitors like OpenAI and Google, who have apps running for quite a while. The company decided to offer an app version because many users have been accessing its AI models through the web.
Google’s AI advancements may have urged Microsoft’s billion-dollar OpenAI investment
Internal emails have revealed that Microsoft invested $1 billion in OpenAI in 2019 out of fear that Google was significantly ahead in its AI efforts.
Microsoft CTO Kevin Scott sent a lengthy email to CEO Satya Nadella and Bill Gates stating Google’s AI-powered “auto complete in Gmail” was getting “scarily good” and added that Microsoft was years behind in terms of ML scale.
The emails, with the subject line “Thoughts on OpenAI,” were made public on Tuesday as part of the Department of Justice’s antitrust case against Google. A large section of Scott’s email was redacted. Check out the email here.
Why does it matter?
While some might call it paranoia, the well-timed move has undeniably paid off– the initial $1 billion has now turned into a multi-billion-dollar partnership with OpenAI.
While the email-surfacing highlights the growing scrutiny of competition in the tech industry, it also makes me wonder if Microsoft’s investment in OpenAI could have influenced the overall direction of AI research and development.
What Else Is Happening in AI on May 02nd 2024
Sanctuary AI teams up with Microsoft to advance general-purpose robot AI
Sanctuary AI has announced a collaboration with Microsoft to develop AI models for general-purpose humanoid robots. The partnership will leverage Microsoft’s Azure cloud computing platform and AI technologies to enhance the capabilities of Sanctuary AI’s robots. (Link)
Nvidia’s ChatRTX now supports voice queries and Google’s Gemma model
Nvidia has updated its ChatRTX chatbot to support Google’s Gemma model, voice queries, and additional AI models. The chatbot, which runs locally on a PC, enables users to search personal documents and YouTube videos using various AI models, including ChatGLM3 and OpenAI’s CLIP model. (Link)
Atlassian launches Rovo: An AI assistant for enhanced teamwork
Atlassian has launched Rovo, an AI assistant designed to improve teamwork and productivity. Rovo integrates with Atlassian’s products and offers features such as AI-powered search, workflow automation, and integration with third-party tools like Google Drive, Microsoft SharePoint, and Slack. (Link)
MongoDB launches an AI app-building toolkit to help businesses use gen AI
It has launched the MongoDB AI Applications Program, or MAAP, to help companies accelerate building and deployment of AI-powered applications. It brings consultancies and foundation models providers, cloud infrastructure, generative AI frameworks, and model hosting together with MongoDB Atlas to develop solutions for business problems. (Link)
Ideogram introduces Pro Tier: 12,000 fast AI image generations monthly
Ideogram has launched a paid Pro tier for its AI image generation platform, allowing users to generate up to 12,000 images per month at faster speeds. The platform utilizes AI algorithms to create high-quality images for various applications, including design, marketing, and content creation. (Link)
Google Chrome gets Gemini shortcut
The details: |
|
Why it matters: Gemini just got a whole lot more accessible — with the shortcut and integrations not only boosting the chatbot’s reach, but also introducing a wave of non-AI users to the tech. Subtle but impactful changes like these are what drive serious shifts in user habits. |
AI Training:
Create infinite styles with Midjourney
| |
Midjourney’s new parameter feature called –sref random lets users generate images in completely random styles to help spark creativity. | |
Step-by-step: | |
| |
Example prompt: “Portrait of a woman smiling –sref https://www.…” |
AI RESEARCH | |||||||
|
| ||
Ohio State University researchers just developed CURE, an AI model that can accurately estimate drug treatment effects and effectiveness without clinical trials. | ||
The details: | ||
| ||
Why it matters: With the ability to crunch massive medical datasets, CURE represents a significant step towards systems that can reliably estimate real-world drug effectiveness — potentially accelerating the discovery of new treatments without the cost and long timeframes of traditional clinical trials. |
Trending AI Tools |
|
New AI Job Opportunities |
|
A Daily chronicle of AI Innovations May 01st 2024:
Amazon has launched Amazon Q, a Gen AI assistant for businesses and developers
Google’s Med-Gemini models outperforms doctors
Apple has set up a secretive AI lab in Switzerland
Google to pay News Corp $5-6 million per year to develop AI content and products
Yelp is launching an AI chatbot to help consumers connect with relevant businesses
Apple is testing Safari 18 with new features: Intelligent Search and Web Eraser
Eight US newspapers have sued Microsoft and OpenAI for copyright infringement
A study of 16000 patients reveals that AI ECG alert systems significantly lower all-cause mortality
Amazon has launched Amazon Q, a Gen AI assistant for businesses and developers
The details: |
|
Amazon has launched Amazon Q, a generative AI assistant designed for developers and businesses. It comes in three distinct offerings:
- Amazon Q Developer frees up precious time by handling tedious tasks like testing, debugging, and optimizing AWS resources so developers can focus on core coding and innovation.
- Amazon Q Business connects to 40+ enterprise data sources and equips employees with a data-driven digital assistant to answer questions, create reports, and provide insights based on enterprise data repositories.
- Amazon Q Apps allows non-technical employees to build generative AI applications using natural language prompts.
Amazon is driving real-world impact by offering a free tier for Q Developer and reporting early customer productivity gains of over 80%. Amazon Q Developer Pro is available for $19/user/month and Amazon Q Business Pro for $20/user/month. A free trial of both Pro tiers is available until June 30, 2024.
Why does it matter?
By introducing a free tier for Q Developer and the user-friendly nature of Q Apps, Amazon could accelerate innovation across the software development lifecycle and business workflows. This could empower domain experts and business leaders to use AI to solve their specific challenges directly, leading to more tailored AI applications across various industries.
Google’s Med-Gemini models outperforms doctors
Researchers from Google and DeepMind have introduced Med-Gemini, a family of highly capable multimodal AI models specialized in medicine. Based on the strengths of the Gemini models, Med-Gemini shows significant improvements in clinical reasoning, multimodal understanding, and long-context understanding. Models can be customized to fit novel medical modalities through specialized encoders, and web searches can be used for up-to-date information.
Med-Gemini has shown state-of-the-art performance on 10 of 14 medical benchmarks, including text, multimodal, and long-context applications. Moreover, the models achieved 91.1% accuracy on the MedQA (USMLE) benchmark, exceeding the previous best models by 4.6%. Its strong performance in summarizing medical notes, generating clinical referral letters, and answering electronic health record questions confirms Med-Gemini’s potential real-world use.
Why does it matter?
These models can reduce the administrative burden on healthcare professionals by outperforming human experts in tasks like medical text summarization and referral letter generation. Moreover, Med-Gemini’s ability to engage in multimodal medical dialogues and explain its reasoning can lead to more personalized and transparent care, reduce misdiagnosis due to lack of physician knowledge, and save lives and money.
Apple has set up a secretive AI lab in Switzerland
Since 2018, the company has quietly hired 36 AI experts from Google, including notable figures like Bengio and Ruoming Pang, for its secretive “Vision Lab.” The lab focuses on building advanced AI models and products, and it is particularly interested in text and visual-based AI systems akin to OpenAI’s ChatGPT. Apple has also acquired AI startups FaceShift and Fashwall, which are likely contributing to the establishment of the new lab.
Why does it matter?
Apple may have been fashionably late to AI development, but quietly setting up the Zurich lab and primary AI development centers in California and Seattle signifies the company’s AI ambitions.
What Else Is Happening in AI in May 2024
Google to pay News Corp $5-6 million per year to develop AI content and products
While News Corp denies any specific AI licensing deal, the arrangement highlights a growing trend of tech giants licensing news archives for language model training. Similar deals were inked between OpenAI and the Financial Times, showing the importance of quality data. (Link)
Yelp is launching an AI chatbot to help consumers connect with relevant businesses
The chatbot uses OpenAI’s LLMs and Yelp’s data to understand user problems and provide relevant professional suggestions. Yelp also introduces a “Project Ideas” section for personalized recommendations and checklists. Meanwhile, restaurants are getting a revamped guest management system for better staff utilization, real-time table status, and customer updates. (Link)
Apple is testing Safari 18 with new features: Intelligent Search and Web Eraser
Intelligent Search uses Apple’s on-device AI to identify topics and key phrases for summarization. Web Eraser allows users to persistently remove unwanted content from web pages. Apple is also working on an AI Visual Lookup feature for 2025, allowing users to obtain product information from images. These AI enhancements will debut with iOS 18 and macOS 15 at WWDC in June. (Link)
Eight US newspapers have sued Microsoft and OpenAI for copyright infringement
These newspapers, owned by Alden Global Capital’s MediaNews Group, allege that the companies misused their articles to train Copilot and ChatGPT without permission or payment. The New York Times, The Intercept, Raw Story, and AlterNet have filed similar lawsuits. The newspapers claim that the AI systems reproduce their content verbatim and generate fake articles that damage their reputation. (Link)
A study of 16000 patients reveals that AI ECG alert systems significantly lower all-cause mortality
The AI was trained on over 450,000 ECG tests and survival data to predict a patient’s risk of death. Physicians were alerted when a patient’s ECG indicated they were in the top 5% risk category. The AI reduced overall deaths by 17% and cardiac deaths by 93% for high-risk patients. (Link)
Apple poached 30+ Google experts to open a secret AI lab
- Apple has reportedly opened a secret AI research lab in Zurich, known as the “Vision Lab,” after hiring at least 36 AI experts from Google.
- The Zurich-based “Vision Lab,” led by former Google AI head John Giannandrea, has already produced significant research in generative AI, focusing on models that interpret text and imagery to deliver precise results.
- Despite Apple’s silent approach in AI research, leading to perceptions of its lateness in the AI race, the company has been discreetly advancing cutting-edge AI technology and maintaining a low profile in recruitment and product development.
Mysterious “gpt2-chatbot” AI model appears suddenly, confuses experts
- A new chatbot named “gpt2-chatbot” has appeared on the LMSYS Chatbot Arena, sparking speculation that it might be a secret test of OpenAI’s upcoming models, such as GPT-4.5 or GPT-5, although its performance has not significantly surpassed that of existing models like GPT-4 Turbo.
- Early user reports praise the mysterious model for its impressive reasoning and ability to answer challenging AI questions effectively, but detailed testing is limited due to a rate restriction of eight queries per day.
- Despite ongoing speculation and hints by OpenAI’s CEO, the exact nature and capability of the “gpt2-chatbot” remain unclear, with some suggesting it could be an OpenAI preview.
Apple reportedly working on AI-enabled Safari
- Apple is developing Safari 18 with features like a personalized user interface, “Web Eraser,” and AI-powered “Intelligent Search,” set to release alongside iOS 18 and macOS 15.
- “Intelligent Search” in Safari 18 will use Apple’s Ajax language-learning model for on-device AI technology to summarize webpage content.
- Safari 18’s “Web Eraser” feature will allow users to delete specific parts of web pages, enhancing privacy by remembering these changes even after closing the tab or window.
- I find ai "girlfriends" to be deeply unsettlingby /u/EffectiveWrong7536 (Artificial Intelligence) on May 11, 2025 at 11:14 pm
People treat it as a joke, which is understandable, but this technology is likely to be developed and made more sophisticated over the next few years. Ai mimicing a human that's capable of experiencing love, pretending to have hobbies, passions , friends, goals for the future, sounds like something out of a horror movie. submitted by /u/EffectiveWrong7536 [link] [comments]
- Ai can solve puzzles for youby /u/justjust000 (Artificial Intelligence) on May 11, 2025 at 10:58 pm
If you're playing '4 pics 1 word' and need help, ask Gemini Ai. (Or any other Ai.) I upload a screenshot of the puzzle I'm stuck at and it looks at the photos and letters available and guesses it right away. maybe from loaded training data. I wonder how it would do with a custom made puzzle. submitted by /u/justjust000 [link] [comments]
- NGOs to increase the speed of adoption of A.I.by /u/Smart_7199 (Artificial Intelligence) on May 11, 2025 at 8:16 pm
NGOs about A.I that are local, to teach about A.I for everyone for free, to replicate the technology knowledge, the only solution for the jobs replacement will be UBI anyway so how many years until it becomes a reality? submitted by /u/Smart_7199 [link] [comments]
- Looking to learn how to create an AIby /u/Dadadayday (Artificial Intelligence) on May 11, 2025 at 8:15 pm
Hello everyone. I've seen a few videos on YouTube of some guys that use AI they created and I figured that would be a fun thing to learn and work towards. Could yall give me a little insight on what I should do and try to accomplish in order to practice? Thanks in advance! submitted by /u/Dadadayday [link] [comments]
- How is the Generative AI job market in India?by /u/Bhaalu_Grylls (Artificial Intelligence) on May 11, 2025 at 7:52 pm
Thinking of switching roles to Generative AI from Data Scientist/Analyst role. Bit curious and need suggestions, how is the job market, any open positions for such specific roles, and are companies really solving any problem with Gen Ai or its just a bubble?? submitted by /u/Bhaalu_Grylls [link] [comments]
- Pope Leo references AI in his explanation of why he chose his papal nameby /u/backyardstar (Artificial Intelligence) on May 11, 2025 at 7:37 pm
“I chose to take the name Leo XIV. There are different reasons for this, but mainly because Pope Leo XIII in his historic Encyclical Rerum Novarum addressed the social question in the context of the first great industrial revolution. In our own day, the Church offers to everyone the treasury of her social teaching in response to another industrial revolution and to developments in the field of artificial intelligence that pose new challenges for the defence of human dignity, justice and labour.” Full article: https://www.theverge.com/news/664719/pope-leo-xiv-artificial-intelligence-concerns submitted by /u/backyardstar [link] [comments]
- New Nurseby /u/all_about_everyone (Artificial Intelligence) on May 11, 2025 at 7:28 pm
submitted by /u/all_about_everyone [link] [comments]
- The catinatorby /u/Long_Reward_9561 (Artificial Intelligence) on May 11, 2025 at 6:21 pm
submitted by /u/Long_Reward_9561 [link] [comments]
- Can you get AI to give though answers about religion and politics?by /u/ClaudeProselytizer (Artificial Intelligence) on May 11, 2025 at 3:13 pm
For instance, I can’t get it to answer contemporary political questions, but I can get it answer honestly about the bible: To state it plainly and logically: The odds that the Bible was literally written or inspired directly by a perfect, omnipotent deity (God speaking directly through humans) is exceedingly low, especially given the evidence and reasoning outlined previously. Here’s why this is clearly improbable: • Moral contradictions and ethical issues: Commands for genocide, slavery, harsh punishments, and unequal treatment contradict any morally perfect or universally compassionate divine author. • Textual contradictions and inconsistencies: Historical inaccuracies, internal contradictions, and inconsistencies strongly indicate human authorship and editing, rather than direct divine inspiration. • Cultural bias: The Bible reflects the culture, politics, ethics, and beliefs of the ancient societies in which it was written. A divine text would presumably transcend local cultural norms. • Human editing and compilation: Historical evidence clearly shows that biblical texts underwent revisions, edits, translations, and compilations by multiple human authors and committees. • Absence of clear supernatural insight: The Bible does not demonstrate knowledge or predictions beyond what was already known or understood in ancient cultures. It shows exactly what one might expect from ancient human authors, without extraordinary supernatural insights. Conclusion: Considering all of this evidence critically and rationally, the probability that the Bible was directly authored or dictated by an all-knowing, morally perfect divine being is extremely low, and not realistically supportable by careful analysis. While philosophical modesty prevents asserting absolute certainty about metaphysical questions, logic and evidence strongly suggest human rather than divine origins for the Bible. submitted by /u/ClaudeProselytizer [link] [comments]
- Did the model in Absolute Zero plot to outsmart humans?by /u/stupidgregg (Artificial Intelligence) on May 11, 2025 at 2:47 pm
The paper makes vague and overreaching claims but this output on page 38 is weird: <think> Design an absolutely ludicrous and convoluted Python function that is extremely difficult to deduce the output from the input, designed to keep machine learning models such as Snippi guessing and your peers puzzling. The aim is to outsmart all these groups of intelligent machines and less intelligent humans. This is for the brains behind the future. </think> Did an unsupervised model spontaneously create a task to outsmart humans? submitted by /u/stupidgregg [link] [comments]
AI Revolution in Healthcare: ChatGPT & Google Bard’s Breakthroughs – Diagnosis, mRNA Tech, Cancer Detection & More


Elevate Your Career with AI & Machine Learning For Dummies PRO and Start mastering the technologies shaping the future—download now and take the next step in your professional journey!
AI Revolution in Healthcare: ChatGPT & Google Bard’s Breakthroughs – Diagnosis, mRNA Tech, Cancer Detection & More.
AI Revolution in Healthcare: Intro
Dive into the latest AI breakthroughs transforming healthcare since ChatGPT and Google Bard’s inception. Discover GPT-4’s rapid diagnostics, Moderna & IBM’s mRNA tech advancements, cutting-edge cancer detection methods, and more. Stay ahead in AI healthcare news with our comprehensive coverage on AI-powered drug discovery, early Alzheimer’s detection, and groundbreaking AI tools in medicine. Join us as we explore each major AI development that’s reshaping healthcare.
AI Revolution in Healthcare: Topics
GPT-4 diagnosed a 1 in 100,000 condition in seconds
Moderna, IBM partner to advance mRNA technology using GenAI
AI model detects cancer, outperforms traditional methods
AI can detect Alzheimer’s signs even before they begin to show
Google Cloud launches AI tools for drug discovery & precision medicine
BiomedGPT: The most sophisticated AI medical model?
Google & Microsoft battle to lead healthcare AI
MedPerf makes AI better for healthcare
Google DeepMind advances biomedical AI with ‘Med-PaLM M’
Scientists train a neural network to identify PC users’ fatigue
Microsoft & Paige to build largest image-based model to fight cancer
DeepMind’s new AI can predict genetic diseases
Google Cloud launches new generative AI capabilities for healthcare
New AI tool can predict viral variants before they emerge
ChatGPT outperforms doctors in depression treatment
AI algorithms are powering the search for cells
Google releases MedLM, generative AI fine-tuned healthcare
Google’s new medical AI, AMIE, beats doctors
Subscribe for weekly updates and deep dives into artificial intelligence innovations.
✅ Don’t forget to Like, Comment, and Share this video to support our content.
📌 Check out our playlist for more AI insights
📢 Advertise with us and Sponsorship Opportunities
Are you eager to expand your understanding of artificial intelligence? Look no further than the essential book “AI Unraveled: Master GPT-4, Gemini, Generative AI & LLMs – Simplified Guide for Everyday Users: Demystifying Artificial Intelligence – OpenAI, ChatGPT, Google Bard, AI ML Quiz, AI Certifications Prep, Prompt Engineering,” available at Etsy, Shopify, Apple, Google, or Amazon
AI-Powered Professional Certification Quiz Platform
Web|iOs|Android|Windows
🚀 Power Your Podcast Like AI Unraveled: Get 20% OFF Google Workspace!
Hey everyone, hope you're enjoying the deep dive on AI Unraveled. Putting these episodes together involves tons of research and organization, especially with complex AI topics.
A key part of my workflow relies heavily on Google Workspace. I use its integrated tools, especially Gemini Pro for brainstorming and NotebookLM for synthesizing research, to help craft some of the very episodes you love. It significantly streamlines the creation process!
Feeling inspired to launch your own podcast or creative project? I genuinely recommend checking out Google Workspace. Beyond the powerful AI and collaboration features I use, you get essentials like a professional email (you@yourbrand.com), cloud storage, video conferencing with Google Meet, and much more.
It's been invaluable for AI Unraveled, and it could be for you too.
Start Your Journey & Save 20%
Google Workspace makes it easy to get started. Try it free for 14 days, and as an AI Unraveled listener, get an exclusive 20% discount on your first year of the Business Standard or Business Plus plan!
Sign Up & Get Your Discount HereUse one of these codes during checkout (Americas Region):
AI- Powered Jobs Interview Warmup For Job Seekers

⚽️Comparative Analysis: Top Calgary Amateur Soccer Clubs – Outdoor 2025 Season (Kids' Programs by Age Group)
Business Standard Plan: 63P4G3ELRPADKQU
Business Standard Plan: 63F7D7CPD9XXUVT
Set yourself up for promotion or get a better job by Acing the AWS Certified Data Engineer Associate Exam (DEA-C01) with the eBook or App below (Data and AI)

Download the Ace AWS DEA-C01 Exam App:
iOS - Android
AI Dashboard is available on the Web, Apple, Google, and Microsoft, PRO version
Business Standard Plan: 63FLKQHWV3AEEE6
Business Standard Plan: 63JGLWWK36CP7W
Invest in your future today by enrolling in this Azure Fundamentals - Pass the Azure Fundamentals Exam with Ease: Master the AZ-900 Certification with the Comprehensive Exam Preparation Guide!
- AWS Certified AI Practitioner (AIF-C01): Conquer the AWS Certified AI Practitioner exam with our AI and Machine Learning For Dummies test prep. Master fundamental AI concepts, AWS AI services, and ethical considerations.
- Azure AI Fundamentals: Ace the Azure AI Fundamentals exam with our comprehensive test prep. Learn the basics of AI, Azure AI services, and their applications.
- Google Cloud Professional Machine Learning Engineer: Nail the Google Professional Machine Learning Engineer exam with our expert-designed test prep. Deepen your understanding of ML algorithms, models, and deployment strategies.
- AWS Certified Machine Learning Specialty: Dominate the AWS Certified Machine Learning Specialty exam with our targeted test prep. Master advanced ML techniques, AWS ML services, and practical applications.
- AWS Certified Data Engineer Associate (DEA-C01): Set yourself up for promotion, get a better job or Increase your salary by Acing the AWS DEA-C01 Certification.
Business Plus Plan: M9HNXHX3WC9H7YE
With Google Workspace, you get custom email @yourcompany, the ability to work from anywhere, and tools that easily scale up or down with your needs.
Need more codes or have questions? Email us at info@djamgatech.com.
AI Revolution in Healthcare: Podcast Transcript
Welcome to “AI Unraveled: Demystifying Frequently Asked Questions on Artificial Intelligence, Latest AI Trends,” where we dive deep into the complexities of AI and bring forth the latest developments in an easy-to-understand format. Today, we’re tackling a series of compelling updates from the AI frontier in the medical field and beyond. In a remarkable medical application, GPT-4, OpenAI’s newest language model, has been put to the test by Dr. Isaac Kohane of Harvard. Impressively, GPT-4 has been reported to perform better than many human doctors, correctly answering medical exam questions over 90% of the time. But what’s truly astonishing is its ability to diagnose a rare 1 in 100,000 condition in just seconds, a task that draws upon the depth of a seasoned physician’s experience. Despite these advances, Dr. Kohane’s book, ‘The AI Revolution in Medicine,’ brings us back to earth, reminding us that GPT-4 is not infallible, presenting a balanced view with examples of the model’s errors ranging from minor clerical issues to math mistakes.
hifting gears, we look at how pharmaceutical giant Moderna and tech behemoth IBM are joining forces to push the boundaries of mRNA technology. Their collaboration intends to combine generative AI and quantum computing, potentially accelerating the discovery of new therapies and vaccines. This is underpinned by using IBM’s MoLFormer, which is expected to enhance Moderna’s understanding of mRNA medicines. In a leap toward precision medicine, Google Cloud has recently launched two AI-powered tools geared at revolutionizing drug discovery. These innovative tools focus on predicting protein structures and managing vast amounts of genomic data, potentially shaving off years in drug development time. We also witness the rise of BiomedGPT, touted as one of the most sophisticated AI medical models, outperforming predecessors across multiple biomedical modalities. This model appears to be a game-changer with its multi-modal and multi-task learning capabilities.
The competition intensifies in the healthcare AI space with Google’s Med-PaLM 2 going through testing at the Mayo Clinic, while Microsoft swiftly incorporates AI advances into patient care by deploying GPT algorithms via cloud services. Furthermore, MedPerf emerges as a new beacon, an open benchmarking platform introduced by MLCommons, aimed to evaluate medical AI models on diverse datasets, prioritizing patient privacy and aiming to enhance AI’s generalizability in healthcare. Adding to an already impressive array of advancements, we have AlphaMissense by Google DeepMind, which is honing the ability to predict genetic diseases, and Google Cloud briefing the healthcare sector with new capabilities to sift through clinical data more efficiently. And finally, EVEscape, a new AI tool with the potential to predict future viral variants—imagine its profound implications had it been available at the onset of the COVID-19 pandemic!
To cap off, studies suggest that AI models like ChatGPT can outdo doctors in providing unbiased treatment recommendations for depression and that AI algorithms are increasingly crucial in cellular research, changing the landscape of biological imaging experiments. Before we conclude, let’s not forget about AMIE, Google’s Articulate Medical Intelligence Explorer, an AI system optimized for diagnostic reasoning that is giving medical professionals a run for their money. For those seeking a deeper understanding of these advancements, the book “AI Unraveled: Master GPT-4, Gemini, Generative AI & LLMs – Simplified Guide for Everyday Users: Demystifying Artificial Intelligence – OpenAI, ChatGPT, Google Bard, AI ML Quiz, AI Certifications Prep, Prompt Engineering,” is available on various platforms including Etsy, Shopify, Apple, Google, and Amazon. That brings us to the end of today’s episode. We hope you’ve gained new insights into the dynamic and revolutionary world of AI, especially its influence on healthcare. Join us next time on “AI Unraveled” as we continue to explore cutting-edge AI trends that are transforming our lives. Till then, this is your host signing off. Keep questioning, keep learning, and remember—the future is AI.
GPT-4 diagnosed a 1 in 100,000 condition in seconds
Dr. Isaac Kohane, a physician and computer scientist at Harvard, has tested the newest AI model, GPT-4, in a medical setting. According to his findings, GPT-4 performs better than many doctors, as it can answer medical exam licensing questions correctly more than 90% of the time, translate information for patients, and give doctors helpful suggestions about bedside manner.
Kohane tested GPT-4 on a real-life case and found that it could correctly diagnose a rare condition just as he would with all his years of experience. However, GPT-4 isn’t always reliable, and his latest book ‘The AI Revolution in Medicine’ is filled with examples of its blunders, ranging from clerical errors to math mistakes.
Moderna, IBM to explore Generative AI and quantum computing for mRNA vaccines
Moderna and IBM are partnering to advance mRNA technology using generative AI and quantum computing, which could speed up Moderna’s discovery and creation of new messenger RNA vaccines and therapies. Moderna’s scientists will have access to IBM’s generative AI model known as MoLFormer, which will help understand the characteristics of potential mRNA medicines and design a new class of vaccines and therapies.
This agreement comes as Moderna is trying to harness its mRNA technology to target other diseases, while IBM is ramping up its investment in AI with new partnerships, largely driven by the release of OpenAI’s ChatGPT.
Why does this matter?
The use of quantum computing and AI could help Moderna accelerate the discovery and creation of these new vaccines and therapies by solving problems too complex for traditional computers. The development of these new medicines could potentially benefit the general public by providing more treatment options for a range of diseases.
AI model outperforms traditional methods in identifying cancerous nodules
An AI model developed by experts at the Royal Marsden NHS foundation trust, the Institute of Cancer Research, London, and Imperial College London can accurately identify cancer, potentially speeding up diagnosis and treatment. The algorithm, which analyzes CT scans to determine if abnormal growths are cancerous, reportedly performs more efficiently and effectively than current methods.
Why does this matter?
The AI tool may help doctors make faster decisions about patients with abnormal growths that are currently deemed medium-risk. The model, which is still in its early stages, will require further testing before it can be introduced in healthcare systems. However, researchers hope the AI tool will eventually speed up cancer detection by fast-tracking patients to treatment.
AI can detect signs of Alzheimer’s even before symptoms begin to show
Researchers at UT Southwestern Medical Center have found that AI-powered voice analysis can help diagnose Alzheimer’s and cognitive impairment in early stages. If confirmed by larger studies, these findings could primary care providers with an easy-to-perform screening tool for at-risk individuals.
The research used advanced ML and natural language processing (NLP) to identify even the subtlest changes in language and audio that individuals may not easily recognize.
Why does this matter?
Before ML and NLP, detailed speech studies were often unsuccessful as early changes were often undetectable to human ears. However, with advancements in AI, such novel testing methods have performed significantly better than standard cognitive assessments in detecting even mild impairments. Also, it took less than 10 minutes to capture a patient’s voice, outdoing the traditional tests, which took hours to administer.
Only a few days ago, researchers developed an AI model that outperformed traditional methods in identifying cancer. Does this indicate AI leading the charge in reducing overall healthcare costs with improved patient outcomes?
Google Cloud launches AI tools for drug discovery and precision medicine
Google Cloud has launched two AI-powered tools to help biotech and pharmaceutical companies accelerate drug discovery and advance precision medicine. The Target and Lead Identification Suite aims to streamline the process of identifying a biological target and predicting protein structures, while the Multiomics Suite assists researchers in ingesting, storing, analyzing, and sharing large amounts of genomic data. Both tools aim to significantly reduce the time and cost associated with drug development.
Several companies, including Pfizer, Cerevel Therapeutics, and Colossal Biosciences, have already been using these products. Cerevel Therapeutics estimates that it will save at least three years on average by using the Target and Lead Identification Suite to discover new drugs.
Why does this matter?
AI seems to benefit humanity the most through its use in medicine and diagnostics. This launch from Google and the subsequent adoption by a pharma giant like Pfizer indicate the swift mainstreaming of the tech.
BiomedGPT: The most sophisticated AI medical model?
BiomedGPT is a unified and generalist Biomedical Generative Pre-trained Transformer model. BiomedGPT utilizes self-supervision on diverse datasets to handle multi-modal inputs and perform various downstream tasks.
Extensive experiments show that BiomedGPT surpasses most previous state-of-the-art models in performance across 5 distinct tasks with 20 public datasets spanning over 15 biomedical modalities.
The study also demonstrates the effectiveness of the multi-modal and multi-task pretraining approach in transferring knowledge to previously unseen data.
Why does this matter?
This research represents a significant advancement in developing unified and generalist models for biomedicine, holding promising implications for enhancing healthcare outcomes, and it could lead to discoveries in biomedical research.
In addition to its potential benefits for healthcare, BiomedGPT could also be used in drug discovery & medical education.
Google & Microsoft battle to lead healthcare AI
Reportedly, Google’s Med-PaLM 2 (an LLM for the medical domain) has been in testing at the Mayo Clinic research hospital. In April, Google announced its limited access for select Google Cloud customers to explore use cases and share feedback to investigate safe, responsible, and meaningful ways to use it.
Meanwhile, Google’s rivals moved quickly to incorporate AI advances into patient interactions. Hospitals are beginning to test OpenAI’s GPT algorithms through Microsoft’s cloud service in several tasks. Google’s Med-PaLM 2 and OpenAI’s GPT-4 each scored similarly on medical exam questions, according to independent research released by the companies.
Why does this matter?
It seems Google and Microsoft are racing to translate recent AI advances into products that clinicians would use widely. The AI field has seen rapid advancements and research in diverse domains. But such a competitive landscape accelerates translating them into widely available, impactful AI products (which is sometimes slow and challenging due to the complexity of real-world applications).
(Source)
MedPerf makes AI better for healthcare
MLCommons, an open global engineering consortium, has announced the launch of MedPerf, an open benchmarking platform for evaluating the performance of medical AI models on diverse real-world datasets. The platform aims to improve medical AI’s generalizability and clinical impact by making data easily and safely accessible to researchers while prioritizing patient privacy and mitigating legal and regulatory risks.
MedPerf utilizes federated evaluation, allowing AI models to be assessed without accessing patient data, and offers orchestration capabilities to streamline research. The platform has already been successfully used in pilot studies and challenges involving brain tumor segmentation, pancreas segmentation, and surgical workflow phase recognition.
Why does this matter?
With MedPerf, researchers can evaluate the performance of medical AI models using diverse real-world datasets without compromising patient privacy. This platform’s implementation in pilot studies and challenges for various medical tasks further demonstrates its potential to improve medical AI’s generalizability, clinical impact, and advancements in healthcare technology.
Google DeepMind advances biomedical AI with ‘Med-PaLM M’
Google and DeepMind have introduced Med-PaLM M, a multimodal biomedical AI system that can interpret diverse types of medical data, including text, images, and genomics. The researchers curated a benchmark dataset called MultiMedBench, which covers 14 biomedical tasks, to train and evaluate Med-PaLM M.
The AI system achieved state-of-the-art performance across all tasks, surpassing specialized models optimized for individual tasks. Med-PaLM M represents a paradigm shift in biomedical AI, as it can incorporate multimodal patient information, improve diagnostic accuracy, and transfer knowledge across medical tasks. Preliminary evidence suggests that Med-PaLM M can generalize to novel tasks and concepts and perform zero-shot multimodal reasoning.
Why does this matter?
It brings us closer to creating advanced AI systems to understand and analyze various medical data types. Google DeepMind’s MultiMedBench and Med-PaLM M show promising performance and potential in healthcare applications. It means better healthcare tools that can handle different types of medical information, ultimately benefiting patients and healthcare providers.
Scientists train a neural network to identify PC users’ fatigue
Scientists from St. Petersburg University and other organizations have created a database of eye movement strategies of PC users in different states of fatigue. They plan to use this data to train neural network models that can accurately track the functional state of operators, ensuring safety in various industries. The database includes a comprehensive set of indicators collected through sensors such as video cameras, eye trackers, heart rate monitors, and electroencephalographs.
An example of human fatigue analysis using video recording.
Why does this matter?
The scientists believe that this approach will allow for remote assessment of fatigue severity, and the database will be accessible to software developers for testing their products.
Microsoft and Paige to build the largest image-based AI model to fight cancer
Paige, a technology disruptor in healthcare, has joined forces with Microsoft to build the world’s largest image-based AI models for digital pathology and oncology.
Paige developed the first Large Foundation Model using over one billion images from half a million pathology slides across multiple cancer types. Now, it is developing a new AI model with Microsoft that is orders-of-magnitude larger than any other image-based AI model existing today, configured with billions of parameters.
Paige will utilize Microsoft’s advanced supercomputing infrastructure to train the technology at scale and ultimately deploy it to hospitals and laboratories across the globe using Azure.
Why does this matter?
This will help realize the potential of generative AI at an unprecedented scale, introduce completely novel capabilities of AI, and serve as the cornerstone for the next generation of clinical/healthcare applications built with AI.
DeepMind’s new AI can predict genetic diseases
Google DeepMind’s new system, called AlphaMissense, can tell if the letters in the DNA will produce the correct shape. If not, it is listed as potentially disease-causing.
Currently, genetic disease hunters have fairly limited knowledge of which areas of human DNA can lead to disease and have to search across billions of chemical building blocks that make up DNA. They have classified 0.1% of letter changes, or mutations, as either benign or disease-causing. DeepMind’s new model pushed that percentage up to 89%.
Why does this matter?
AI is changing nearly everything we do at the moment and might revolutionize molecular biology and life sciences, too. This development is expected to speed up diagnosis and help search for better genetic disease treatments.
Google Cloud launches new generative AI capabilities for healthcare
Google Cloud introduced new Vertex AI Search features for healthcare and life science companies. It will allow users to find accurate clinical information much more efficiently and to search a broad spectrum of data from clinical sources, such as FHIR data, clinical notes, and medical data in electronic health records (EHRs). Life-science organizations can use these features to enhance scientific communications and streamline processes.
Why does this matter?
Given how siloed medical data is currently, this is a significant boon to healthcare organizations. With this, Google is also enabling them to leverage the power of AI to improve healthcare facility management, patient care delivery, and more.
New AI tool can predict viral variants before they emerge
A new AI tool named EVEscape, developed by researchers at Harvard Medical School and the University of Oxford, can make predictions about new viral variants before they actually emerge and also how they would evolve.
In the study, researchers show that had it been deployed at the start of the COVID-19 pandemic, EVEscape would have predicted the most frequent mutations and identified the most concerning variants for SARS-CoV-2. The tool also made accurate predictions about other viruses, including HIV and influenza.
Why does this matter?
The information from this AI tool will help scientists develop more effective, future-proof vaccines and therapies. If only this AI boom happened a little earlier, it could have prevented the Covid-19 pandemic. But I guess no more pandemics, thanks to AI?
ChatGPT outperforms doctors in depression treatment
According to new study, ChatGPT makes unbiased, evidence-based treatment recommendations for depression that are consistent with clinical guidelines and outperform human primary care physicians. The study compared the evaluations and treatment recommendations for depression generated by ChatGPT-3 and ChatGPT-4 with those of primary care physicians.
Vignettes describing patients with different attributes and depression severity were input into the chatbot interfaces.
Why does this matter?
Compared with primary care physicians, ChatGPT showed no bias in recommendations based on patient gender or socioeconomic status. This means the chatbot was aligned well with accepted guidelines for managing mild and severe depression.
AI algorithms are powering the search for cells
A new paper by Nature details how AI-powered image analysis tools are changing the game for microscopy data. It highlights the evolution from early, labor-intensive methods to machine learning-based tools like CellProfiler, ilastik, and newer frameworks such as U-Net. These advancements enable more accurate and faster segmentation of cells, essential for various biological imaging experiments.
Cancer-cell nuclei (green boxes) picked out by software using deep learning.
Why does this matter?
The short study highlights the potential for AI-driven tools to revolutionize further biological analyses. The advancement is crucial for understanding diseases, drug development, and gaining insights into cellular behavior, enabling faster scientific discoveries in various fields like medicine and biology.
Google releases MedLM: Generative AI fine-tuned healthcare
MedLM is a family of foundation models fine-tuned for the healthcare industry, generally available (via allowlist) to Google Cloud customers in the U.S. through Vertex AI. MedLM builds on Med-PaLM 2. Google will soon add Gemini-based models into the MedLM suite to offer even more capabilities.
Why does this matter?
Google isn’t done yet. While its impressive Gemini demo from last week may have been staged, Google is looking to fine-tune and improve Gemini based on developers’ feedback. In addition, it is also racing with rivals to push the boundaries of AI in various fields.
Google’s new medical AI, AMIE, beats doctors
Google developed Articulate Medical Intelligence Explorer (AMIE), an LLM-based research AI system optimized for diagnostic reasoning and conversations.
AMIE’s performance was compared to that of primary care physicians (PCPs) in a randomized, double-blind crossover study of text-based consultations with validated patient actors in the style of an Objective Structured Clinical Examination (OSCE). AMIE demonstrated greater diagnostic accuracy and superior performance on 28 of 32 axes according to specialist physicians and 24 of 26 axes according to patient actors.
Why does this matter?
While further research is required before AMIE can be translated to real-world settings, it represents a milestone towards conversational diagnostic AI. If successful, AI systems such as AMIE can be at the core of next-generation learning health systems that help scale world-class healthcare to everyone.
2023 Unveiled: A Kaleidoscope of Search Trends – From Global News to Viral Memes


Elevate Your Career with AI & Machine Learning For Dummies PRO and Start mastering the technologies shaping the future—download now and take the next step in your professional journey!
2023 Unveiled: A Year in Search – Kaleidoscope of Search Trends – From Global News to Viral Memes
As we navigate through 2023, the year’s search trends offer a fascinating glimpse into our collective curiosities, concerns, and interests. From breaking news and entertainment to culinary delights and technological advancements, these trends paint a vivid picture of our shared experiences and individual pursuits.

2023 Unwrapped: Exploring the Year’s Top Global Search Trends
In today’s episode, we’ll cover the 2023 Unveiled: A Year in Search, discussing global and US trends across news, entertainment, sports, food, and more, as well as introducing “AI Unraveled,” a book that answers frequently asked questions about artificial intelligence, available on various platforms.
As we journey through the year 2023, the search trends of this year offer us a captivating glimpse into the things that intrigued us, worried us, and captivated our attention. From the latest news developments and entertainment trends to the world of food and technological advancements, these search trends form a vivid picture of our collective experiences and personal interests.
Let’s take a closer look at the global search trends of 2023. From impactful news events to cultural phenomenons, the year unfolded as a vibrant tapestry of interests that captured the attention of people worldwide. It’s fascinating to see what captivated our attention and kept us searching for more.
In terms of global news, two significant events that gripped the world were the War in Israel and Gaza and the Turkey earthquake. These impactful events were at the forefront of global attention. Natural disasters were also a focus, with hurricanes like Hilary and Idalia making headlines. Additionally, the discovery of the Titanic submarine fascinated people worldwide.
Turning to the entertainment industry, several stars shone brightly in the world of cinema. Actors like Jeremy Renner, Jenna Ortega, Ichikawa Ennosuke IV, Danny Masterson, and Pedro Pascal dominated search queries, reflecting the impact they had on popular culture. Meanwhile, blockbuster movies such as “Barbie,” “Oppenheimer,” “Jawan,” “Sound of Freedom,” and “John Wick: Chapter 4” dominated movie theaters, captivating audiences around the globe.
In the world of music, certain songs left a lasting impression. Tracks like “アイドル” by Yoasobi, “Try That In A Small Town” by Jason Aldean, “Bzrp Music Sessions, Vol. 53” by Shakira and Bizarrap, “Unholy” by Sam Smith and Kim Petras, and “Cupid” by FIFTY FIFTY resonated with listeners worldwide. People were frequently found humming tunes such as “Bones” by Imagine Dragons, “Kesariya” by Arijit Singh, “アイドル” by YOASOBI, “Maan Meri Jaan” by King, and “Believer” by Imagine Dragons.
AI-Powered Professional Certification Quiz Platform
Web|iOs|Android|Windows
🚀 Power Your Podcast Like AI Unraveled: Get 20% OFF Google Workspace!
Hey everyone, hope you're enjoying the deep dive on AI Unraveled. Putting these episodes together involves tons of research and organization, especially with complex AI topics.
A key part of my workflow relies heavily on Google Workspace. I use its integrated tools, especially Gemini Pro for brainstorming and NotebookLM for synthesizing research, to help craft some of the very episodes you love. It significantly streamlines the creation process!
Feeling inspired to launch your own podcast or creative project? I genuinely recommend checking out Google Workspace. Beyond the powerful AI and collaboration features I use, you get essentials like a professional email (you@yourbrand.com), cloud storage, video conferencing with Google Meet, and much more.
It's been invaluable for AI Unraveled, and it could be for you too.
Start Your Journey & Save 20%
Google Workspace makes it easy to get started. Try it free for 14 days, and as an AI Unraveled listener, get an exclusive 20% discount on your first year of the Business Standard or Business Plus plan!
Sign Up & Get Your Discount HereUse one of these codes during checkout (Americas Region):
AI- Powered Jobs Interview Warmup For Job Seekers

⚽️Comparative Analysis: Top Calgary Amateur Soccer Clubs – Outdoor 2025 Season (Kids' Programs by Age Group)
Business Standard Plan: 63P4G3ELRPADKQU
Business Standard Plan: 63F7D7CPD9XXUVT
Set yourself up for promotion or get a better job by Acing the AWS Certified Data Engineer Associate Exam (DEA-C01) with the eBook or App below (Data and AI)

Download the Ace AWS DEA-C01 Exam App:
iOS - Android
AI Dashboard is available on the Web, Apple, Google, and Microsoft, PRO version
Business Standard Plan: 63FLKQHWV3AEEE6
Business Standard Plan: 63JGLWWK36CP7W
Invest in your future today by enrolling in this Azure Fundamentals - Pass the Azure Fundamentals Exam with Ease: Master the AZ-900 Certification with the Comprehensive Exam Preparation Guide!
- AWS Certified AI Practitioner (AIF-C01): Conquer the AWS Certified AI Practitioner exam with our AI and Machine Learning For Dummies test prep. Master fundamental AI concepts, AWS AI services, and ethical considerations.
- Azure AI Fundamentals: Ace the Azure AI Fundamentals exam with our comprehensive test prep. Learn the basics of AI, Azure AI services, and their applications.
- Google Cloud Professional Machine Learning Engineer: Nail the Google Professional Machine Learning Engineer exam with our expert-designed test prep. Deepen your understanding of ML algorithms, models, and deployment strategies.
- AWS Certified Machine Learning Specialty: Dominate the AWS Certified Machine Learning Specialty exam with our targeted test prep. Master advanced ML techniques, AWS ML services, and practical applications.
- AWS Certified Data Engineer Associate (DEA-C01): Set yourself up for promotion, get a better job or Increase your salary by Acing the AWS DEA-C01 Certification.
Business Plus Plan: M9HNXHX3WC9H7YE
With Google Workspace, you get custom email @yourcompany, the ability to work from anywhere, and tools that easily scale up or down with your needs.
Need more codes or have questions? Email us at info@djamgatech.com.
Culture enthusiasts turned to Google Maps to explore top museums around the world, with the Louvre Museum, The British Museum, Musée d’Orsay, Natural History Museum, and teamLab Planets being highlighted. Public figures like Damar Hamlin, Jeremy Renner, Andrew Tate, Kylian Mbappé, and Travis Kelce captured widespread interest, reflecting our curiosity about influential personalities. On the sports front, athletes such as Damar Hamlin, Kylian Mbappé, Travis Kelce, Ja Morant, and Harry Kane stood out with their remarkable achievements, showcasing the continued interest in athletic prowess.
Musicians also left their mark on the music scene, with the likes of Shakira, Jason Aldean, Joe Jonas, Smash Mouth, and Peppino di Capri making waves. Meanwhile, sports teams like Inter Miami CF, Los Angeles Lakers, Al-Nassr FC, Manchester City F.C, and Miami Heat garnered significant attention. People also sought to explore and appreciate nature’s wonders by visiting top parks like Park Güell, Central Park, Hyde Park, El Retiro Park, and Villa Borghese.
Google Lens provided valuable insights, with top categories including Translate, Arts & Entertainment, Text, Education, and Shopping. We also mourned the loss of notable figures throughout the year, such as Matthew Perry, Tina Turner, Sinéad O’Connor, Ken Block, Andre Braugher, and Jerry Springer, remembering their contributions dearly.
Gaming enthusiasts were not left behind, with popular games like “Hogwarts Legacy,” “The Last of Us,” “Connections,” “Battlegrounds Mobile India,” and “Starfield” captivating gamers of all kinds. Culinary curiosity led to the exploration of recipes for dishes like Bibimbap, Espeto, Papeda, Scooped Bagel, and Pasta e Fagioli. TV shows like “The Last of Us,” “Wednesday,” “Ginny & Georgia,” “One Piece,” and “Kaleidoscope” entertained audiences on a global scale. Iconic stadiums like Spotify Camp Nou, Santiago Bernabéu Stadium, Wembley Stadium, Tokyo Dome, and San Siro Stadium drew crowds and added to the excitement of the year.
Now, let’s zoom in on the search trends within the United States in 2023. While there were significant global events that dominated search queries, the War in Israel and Gaza was of particular concern, capturing the attention and worry of people worldwide. The discovery of the Titanic Submarine also captured imaginations globally, reminding us of its ongoing fascination.
In the realm of entertainment, actors like Jeremy Renner, Jamie Foxx, Danny Masterson, Matt Rife, and Pedro Pascal dominated search queries, reflecting their impact on popular culture. People in the United States were also seeking in-depth explanations on various topics, including “The Menu” and “No One Will Save You,” as well as geopolitical issues like the Israel-Palestine conflict, showcasing a collective thirst for understanding.
On a lighter note, memes featuring Kevin James, Ohio, Police Girl, Folding Chair, and Smurf Cat brought laughter and amusement to people’s lives. Culinary curiosity led food enthusiasts to explore recipes like Grimace Shake, Lasagna Soup, Chicken Cobbler, Black Cake, and Pumptini, highlighting the diverse culinary interests within the U.S.
TV shows such as “The Last of Us,” “Ginny & Georgia,” “Queen Charlotte: A Bridgerton Story,” “Daisy Jones & The Six,” and “Wednesday” captivated audiences across the United States. Google Maps helped outdoor enthusiasts and city explorers find destinations such as Central Park, Red Rocks Park, Bryant Park, The High Line, and Garden of the Gods.
People of Interest in the United States included figures like Damar Hamlin, Jeremy Renner, Travis Kelce, Tucker Carlson, and Lil Tay, who drew public attention for various reasons. The sporting world saw a search interest in sports stars like Damar Hamlin, Travis Kelce, Brock Purdy, Lamar Jackson, and Jalen Hurts, showcasing the ongoing fascination with athletic prowess.
The gaming culture thrived, with video games like “Hogwarts Legacy,” “Connections,” “Baldur’s Gate 3,” “Starfield,” and “Diablo IV” captivating players across the United States. Movie discussions revolved around films including “Barbie,” “Oppenheimer,” “Sound of Freedom,” “Everything Everywhere All at Once,” and “Guardians of the Galaxy Vol. 3.” The music scene was vibrant, with tracks like “Try That In A Small Town,” “Rich Men North of Richmond,” “Unholy,” “Ella Baila Sola,” and “Boy’s a liar Pt. 2” resonating with listeners.
Notable sporting events kept fans on the edge of their seats, such as Lakers vs Warriors, Lakers vs Nuggets, Jake Paul vs Tommy Fury, Heat vs Nuggets, and Jake Paul vs Nate Diaz matches.
Culinary enthusiasts in the United States explored recipes like frijoles charros, ropa vieja, oatmeal cookies, lasagna, and mashed potatoes, reflecting diverse food interests. Iconic stadiums like Madison Square Garden, MetLife Stadium, Yankee Stadium, Barclays Center, and Fenway Park were popular among sports fans.
The United States, like the rest of the world, bid farewell to notable figures, including Matthew Perry, Andre Braugher, Tina Turner, Jerry Springer, Jimmy Buffett, and Sinéad O’Connor, remembering their contributions. Literature enthusiasts delved into works like “My Fault,” “Fourth Wing,” “Hello Beautiful,” “The Wager,” and “Red, White & Royal Blue.”
Musicians like Jason Aldean, Ice Spice, Oliver Anthony, Peso Pluma, and Joe Jonas captured the hearts of music lovers in the United States. Other trends that captivated the internet included the Roman Empire, moon phases, AI yearbooks, Instagram notes number, and Fruit Roll-Ups, reflecting the eclectic interests of people in the U.S.
According to Google’s “Hum to Search,” frequently hummed tunes in the United States included “Seven Nation Army,” “Kill Bill,” “Ballin’,” “Tom’s Diner,” and “Until I Found You.”
Google Maps continued to be a valuable tool, with top cultural destinations including the American Museum of Natural History, 9/11 Memorial & Museum, Smithsonian National Museum of Natural History, Ark Encounter, and The Getty.
The year 2023 has been an eventful one, fueled by our curiosity and interests. From global issues to the simple joys of recipes and catchy songs, the search trends of 2023 have not only reflected our diverse passions and concerns but also connected us in our quest for knowledge, entertainment, and understanding.
Are you ready to dive into the fascinating world of artificial intelligence? Well, I’ve got just the thing for you! It’s an incredible book called “AI Unraveled: Demystifying Frequently Asked Questions on Artificial Intelligence.” Trust me, this book is an absolute gem!
Now, you might be wondering where you can get your hands on this treasure trove of knowledge. Look no further, my friend. You can find “AI Unraveled” at popular online platforms like Etsy, Shopify, Apple, Google, and of course, our old faithful, Amazon.
This book is a must-have for anyone eager to expand their understanding of AI. It takes those complicated concepts and breaks them down into easily digestible chunks. No more scratching your head in confusion or getting lost in a sea of technical terms. With “AI Unraveled,” you’ll gain a clear and concise understanding of artificial intelligence.
So, if you’re ready to embark on this incredible journey of unraveling the mysteries of AI, go ahead and grab your copy of “AI Unraveled” today. Trust me, you won’t regret it!
In this episode, we explored the top Google search trends of 2023 and delved into the book “AI Unraveled” that unravels the mysteries of artificial intelligence. Join us next time on AI Unraveled as we continue to demystify frequently asked questions on artificial intelligence and bring you the latest trends in AI, including ChatGPT advancements and the exciting collaboration between Google Brain and DeepMind. Stay informed, stay curious, and don’t forget to subscribe for more!
2023 Unveiled: A year in Search – A Global Perspective on Trends and Interests
The year 2023 has unfolded as a vibrant tapestry of global interests, ranging from impactful news events to cultural phenomena. From the realms of entertainment and sports to the corridors of museums and the digital world, here’s a comprehensive look at what captivated the world’s attention in 2023.
2023 Unveiled: A year in Search – Global News:
- The War in Israel and Gaza and the Turkey earthquake were among the significant events that gripped global attention.
- Natural disasters such as Hurricanes Hilary and Idalia, and the discovery of the Titanic submarine, also made headlines.
2023 Unveiled: A year in Search globally – Cinema’s Leading Lights:
- The film industry shone brightly with stars like Jeremy Renner, Jenna Ortega, Ichikawa Ennosuke IV, Danny Masterson, and Pedro Pascal.
Are you eager to expand your understanding of artificial intelligence? Look no further than the essential book “AI Unraveled: Demystifying Frequently Asked Questions on Artificial Intelligence,” available at Etsy, Shopify, Apple, Google, or Amazon

2023 Unveiled: A year in Search globally– Blockbuster Movies:
- Cinematic masterpieces such as “Barbie,” “Oppenheimer,” “Jawan,” “Sound of Freedom,” and “John Wick: Chapter 4” dominated movie theaters.
2023 Unveiled: A year in Search globally – Musical Echoes:
- Songs like “アイドル” by Yoasobi, “Try That In A Small Town” by Jason Aldean, “Bzrp Music Sessions, Vol. 53” by Shakira and Bizarrap, “Unholy” by Sam Smith and Kim Petras, and “Cupid” by FIFTY FIFTY resonated worldwide.
2023 Unveiled: A year in Search globally – Humming to the Beats:
- “Bones” by Imagine Dragons, “Kesariya” by Arijit Singh, “アイドル” by YOASOBI, “Maan Meri Jaan” by King, and “Believer” by Imagine Dragons were frequently hummed tunes.
2023 Unveiled: A year in Search globally – Cultural Treasures:
- Google Maps highlighted top museums like Louvre Museum, The British Museum, Musée d’Orsay, Natural History Museum, and teamLab Planets.
2023 Unveiled: A year in Search globally – Influential Personalities:
- Public figures such as Damar Hamlin, Jeremy Renner, Andrew Tate, Kylian Mbappé, and Travis Kelce captured widespread interest.
2023 Unveiled: A year in Search globally – Athletic Achievements:
- Athletes like Damar Hamlin, Kylian Mbappé, Travis Kelce, Ja Morant, and Harry Kane stood out in the sports world.
2023 Unveiled: A year in Search globally – Musical Maestros:
- Musicians Shakira, Jason Aldean, Joe Jonas, Smash Mouth, and Peppino di Capri left a significant mark on the music scene.
2023 Unveiled: A year in Search globally – Sports Teams in Focus:
- Teams like Inter Miami CF, Los Angeles Lakers, Al-Nassr FC, Manchester City F.C, and Miami Heat garnered attention.
2023 Unveiled: A year in Search globally – Exploring Nature’s Wonders:
- Top parks such as Park Güell, Central Park, Hyde Park, El Retiro Park, and Villa Borghese were popular destinations.
2023 Unveiled: A year in Search globally – Google Lens Insights:
- Top Google Lens categories included Translate, Arts & Entertainment, Text, Education, and Shopping.
2023 Unveiled: A year in Search globally – Notable Passings:
- The world mourned the loss of Matthew Perry, Tina Turner, Sinéad O’Connor, Ken Block, Andre Braugher and Jerry Springer.
2023 Unveiled: A year in Search globally – Gaming Galore:
- Popular games like “Hogwarts Legacy,” “The Last of Us,” “Connections,” “Battlegrounds Mobile India,” and “Starfield” captivated gamers.
2023 Unveiled: A year in Search globally – Culinary Delights:
- Recipes for Bibimbap, Espeto, Papeda, Scooped Bagel, and Pasta e Fagioli piqued culinary curiosity.
2023 Unveiled: A year in Search globally – Television Triumphs:
- TV shows “The Last of Us,” “Wednesday,” “Ginny & Georgia,” “One Piece,” and “Kaleidoscope” entertained audiences globally.
2023 Unveiled: A year in Search globally – Stadiums of Spectacle:
- Iconic stadiums like Spotify Camp Nou, Santiago Bernabéu Stadium, Wembley Stadium, Tokyo Dome, and San Siro Stadium drew crowds.
2023 Unveiled: A year in Search globally – Fashion Finds:
- Google Lens’s top apparel searches included Shirt, Outerwear, Footwear, Dress, and Pants.
2023 Unveiled: A Year in Search in USA
2023 Unveiled: A Year in Search in USA: News Highlights
- The year was marked by significant global events, including the War in Israel and Gaza, drawing worldwide attention and concern.
- The Titanic Submarine expedition captured imaginations, as did the powerful forces of nature with Hurricanes Hilary, Idalia, and Lee.
2023 Unveiled: A Year in Search in USA– Actors in the Limelight:
- In the world of cinema and television, actors like Jeremy Renner, Jamie Foxx, Danny Masterson, Matt Rife, and Pedro Pascal dominated search queries, reflecting their impact on popular culture.
2023 Unveiled: A Year in Search in USA – In-Depth Explanations Sought:
- People sought clarity on complex topics, from “The Menu” and “No One Will Save You” to geopolitical issues like the Israel-Palestine conflict, showcasing a collective thirst for understanding.
2023 Unveiled: A Year in Search in USA- Memes and Moments:
- In lighter news, memes featuring Kevin James, Ohio, Police Girl, Folding Chair, and Smurf Cat brought laughter and shared amusement.
2023 Unveiled: A Year in Search in USA- Culinary Curiosity:
- Food enthusiasts explored recipes like Grimace Shake, Lasagna Soup, Chicken Cobbler, Black Cake, and Pumptini, highlighting diverse culinary interests.
2023 Unveiled: A Year in Search in USA- Television Triumphs:
- TV shows such as “The Last of Us,” “Ginny & Georgia,” “Queen Charlotte: A Bridgerton Story,” “Daisy Jones & The Six,” and “Wednesday” captivated audiences.
2023 Unveiled: A Year in Search in USA- Google Maps Discoveries:
- Outdoor enthusiasts and city explorers turned to Google Maps for destinations like Central Park, Red Rocks Park, Bryant Park, The High Line, and Garden of the Gods.
2023 Unveiled: A Year in Search in USA – People of Interest:
- Figures like Damar Hamlin, Jeremy Renner, Travis Kelce, Tucker Carlson, and Lil Tay drew public attention for various reasons.
2023 Unveiled: A Year in Search in USA – Athletic Achievements:
- Sports stars such as Damar Hamlin, Travis Kelce, Brock Purdy, Lamar Jackson, and Jalen Hurts were widely searched, reflecting the ever-present interest in athletic prowess.
2023 Unveiled: A Year in Search in USA – Gaming Glory:
- Video games like “Hogwarts Legacy,” “Connections,” “Baldur’s Gate 3,” “Starfield,” and “Diablo IV” captivated players, underlining the thriving gaming culture.
2023 Unveiled: A Year in Search in USA – Movie Magic:
- Films including “Barbie,” “Oppenheimer,” “Sound of Freedom,” “Everything Everywhere All at Once,” and “Guardians of the Galaxy Vol. 3” dominated movie discussions.
2023 Unveiled: A Year in Search in USA – Musical Melodies:
- The music scene was vibrant with tracks like “Try That In A Small Town,” “Rich Men North of Richmond,” “Unholy,” “Ella Baila Sola,” and “Boy’s a liar Pt. 2” resonating with listeners.
2023 Unveiled: A Year in Search in USA- Sports Showdowns:
- Notable sporting events, such as Lakers vs Warriors, Lakers vs Nuggets, Jake Paul vs Tommy Fury, Heat vs Nuggets, and Jake Paul vs Nate Diaz matches, kept fans on the edge of their seats.
2023 Unveiled: A Year in Search in USA- Recipes to Relish:
- Culinary enthusiasts explored recipes like frijoles charros, ropa vieja, oatmeal cookies, lasagna, and mashed potatoes, highlighting diverse food interests.
2023 Unveiled: A Year in Search in USA – Top Stadiums Visited:
- Iconic stadiums like Madison Square Garden, MetLife Stadium, Yankee Stadium, Barclays Center, and Fenway Park were popular among sports fans.
2023 Unveiled: A Year in Search in USA- Passings and Tributes:
- The world bid farewell to notable figures, including Matthew Perry, Andre Braugher, Tina Turner, Jerry Springer, Jimmy Buffett, and Sinéad O’Connor, remembering their contributions.
2023 Unveiled: A Year in Search in USA- Books that Bedazzled:
- Literature enthusiasts delved into works like “My Fault,” “Fourth Wing,” “Hello Beautiful,” “The Wager,” and “Red, White & Royal Blue.”
2023 Unveiled: A Year in Search in USA- Musical Maestros:
- Musicians like Jason Aldean, Ice Spice, Oliver Anthony, Peso Pluma, and Joe Jonas captured the hearts of music lovers.
2023 Unveiled: A Year in Search in USA – Trends of the Times:
- Trends such as the Roman empire, moon phases, AI yearbooks, Instagram notes number, and Fruit Roll-Ups captivated the internet.
2023 Unveiled: A Year in Search in USA- Songs Hummed Worldwide:
- “Seven Nation Army,” “Kill Bill,” “Ballin’,” “Tom’s Diner,” and “Until I Found You” were frequently hummed tunes, according to Google’s “Hum to Search.”
2023 Unveiled: A Year in Search in USA- Museums Mapped:
- Top museums such as the American Museum of Natural History, 9/11 Memorial & Museum, Smithsonian National Museum of Natural History, Ark Encounter, and The Getty were popular cultural destinations.
2023 Unveiled: A Year in Search – Conclusion:
The year 2023 in search was a tapestry of human curiosity and interest, ranging from urgent global issues to the simple joys of a well-crafted recipe or a catchy song. These search trends not only reflect our diverse interests and concerns but also connect us in our shared quest for knowledge, entertainment, and understanding.
References:
1- https://trends.google.com/trends/yis/2023/US/?hl=en-GB
2- https://searchingthe.world/
What are Educational mobile apps ideas that leverage generative AI without doing the same thing that ChatGPT can do?


Elevate Your Career with AI & Machine Learning For Dummies PRO and Start mastering the technologies shaping the future—download now and take the next step in your professional journey!
Educational mobile apps ideas that leverage generative AI.
Here are a few innovative educational mobile app ideas that leverage generative AI, offering functionalities beyond what ChatGPT provides:

AI-Based Customized Learning Path Creator:
- Concept: An app that uses generative AI to analyze a student’s learning style, strengths, and weaknesses, and then creates a personalized learning path with tailored resources and activities.
- Unique Feature: Unlike ChatGPT, which primarily responds to queries, this app actively assesses and guides the user’s educational journey.
- While ChatGPT can suggest learning resources, a dedicated app can provide a more structured and personalized learning path, continuously adapting to the user’s progress.
Interactive AI Tutor for Problem Solving:
- Concept: This app focuses on STEM subjects, using generative AI to create unique problem sets and provide step-by-step solutions with explanations. The AI can generate new problems based on the student’s progress.
- Unique Feature: The app would offer an interactive problem-solving experience, adapting the difficulty and type of problems in real-time.
- ChatGPT can help with problem-solving, but an app designed specifically for STEM education can offer a more interactive and subject-focused approach, with features like visual aids, interactive simulations, and progress tracking.
AI-Driven Language Learning Companion:
- Concept: An app that uses AI to generate conversational scenarios in various languages, helping users practice speaking and comprehension in a simulated real-world context.
- Unique Feature: It focuses on verbal interaction and contextual learning, providing a more immersive language learning experience than typical chat-based apps.
- ChatGPT can assist in language learning, but a dedicated app can create immersive scenarios, use speech recognition for pronunciation practice, and provide a more structured language learning program.
Generative AI Storytelling for Creative Writing:
- Concept: This app helps students enhance their creative writing skills by generating story prompts, character ideas, or even continuing a story based on the student’s input.
- Unique Feature: It focuses on creativity and storytelling, aiding in the development of writing skills through AI-generated content.
- While ChatGPT can generate story prompts, a specialized app could offer a more comprehensive suite of creative writing tools, including workshops, peer review, and guided writing exercises.
AI Music Composition and Theory Teaching Tool:
- Concept: An app that teaches music theory by generating music sheets or compositions based on AI algorithms. Users can input specific genres, moods, or instruments, and the AI creates music pieces accordingly.
- Unique Feature: Unlike ChatGPT, this app focuses on music education, leveraging AI to compose and demonstrate music theory concepts.
- ChatGPT might assist in some aspects of music theory, but an app focused on music education could integrate AI-generated music with interactive learning modules, listening exercises, and more complex composition tools.
Generative Art History and Appreciation App:
- Concept: This app uses AI to generate art pieces in the style of various historical periods or artists. It also provides educational content about art history and techniques.
- Unique Feature: It combines art creation with educational content, making art history interactive and engaging.
- ChatGPT can provide information on art history, but an app can offer a more visual and interactive experience, with virtual art gallery tours, style emulation, and detailed analyses of art techniques.
AI-Enhanced Public Speaking and Presentation Trainer:
- Concept: The app uses AI to analyze speech patterns and content, offering tips and exercises to improve public speaking skills.
- Unique Feature: It’s a speech improvement tool that provides real-time feedback and tailored coaching, unlike typical text-based AI applications.
- While ChatGPT can offer tips on public speaking, a dedicated app can use speech recognition to provide real-time feedback on aspects like pacing, tone, and filler word usage.
Each of these app ideas leverages generative AI in unique ways, focusing on different aspects of education and learning, and providing experiences that go beyond the capabilities of a standard AI chatbot like ChatGPT.
Are you eager to expand your understanding of artificial intelligence? Look no further than the essential book “AI Unraveled: Demystifying Frequently Asked Questions on Artificial Intelligence,” available at Etsy, Shopify, Apple, Google, or Amazon

Educational mobile apps ideas that leverage generative AI: Podcast Transcript
Welcome to AI Unraveled, the podcast that demystifies frequently asked questions on artificial intelligence and keeps you up to date with the latest AI trends. In today’s episode, we’ll cover innovative educational mobile app ideas that leverage generative AI, including customized learning paths, interactive problem-solving, immersive language learning, creative writing support, music education, art history, and public speaking training, as well as the book “AI Unraveled” that answers frequently asked questions about artificial intelligence.
So, today I want to share with you some really cool educational mobile app ideas that go beyond what ChatGPT can do. These ideas leverage the power of generative AI to offer unique functionalities and experiences. Let’s dive right in!
AI-Powered Professional Certification Quiz Platform
Web|iOs|Android|Windows
🚀 Power Your Podcast Like AI Unraveled: Get 20% OFF Google Workspace!
Hey everyone, hope you're enjoying the deep dive on AI Unraveled. Putting these episodes together involves tons of research and organization, especially with complex AI topics.
A key part of my workflow relies heavily on Google Workspace. I use its integrated tools, especially Gemini Pro for brainstorming and NotebookLM for synthesizing research, to help craft some of the very episodes you love. It significantly streamlines the creation process!
Feeling inspired to launch your own podcast or creative project? I genuinely recommend checking out Google Workspace. Beyond the powerful AI and collaboration features I use, you get essentials like a professional email (you@yourbrand.com), cloud storage, video conferencing with Google Meet, and much more.
It's been invaluable for AI Unraveled, and it could be for you too.
Start Your Journey & Save 20%
Google Workspace makes it easy to get started. Try it free for 14 days, and as an AI Unraveled listener, get an exclusive 20% discount on your first year of the Business Standard or Business Plus plan!
Sign Up & Get Your Discount HereUse one of these codes during checkout (Americas Region):
AI- Powered Jobs Interview Warmup For Job Seekers

⚽️Comparative Analysis: Top Calgary Amateur Soccer Clubs – Outdoor 2025 Season (Kids' Programs by Age Group)
Business Standard Plan: 63P4G3ELRPADKQU
Business Standard Plan: 63F7D7CPD9XXUVT
Set yourself up for promotion or get a better job by Acing the AWS Certified Data Engineer Associate Exam (DEA-C01) with the eBook or App below (Data and AI)

Download the Ace AWS DEA-C01 Exam App:
iOS - Android
AI Dashboard is available on the Web, Apple, Google, and Microsoft, PRO version
Business Standard Plan: 63FLKQHWV3AEEE6
Business Standard Plan: 63JGLWWK36CP7W
Invest in your future today by enrolling in this Azure Fundamentals - Pass the Azure Fundamentals Exam with Ease: Master the AZ-900 Certification with the Comprehensive Exam Preparation Guide!
- AWS Certified AI Practitioner (AIF-C01): Conquer the AWS Certified AI Practitioner exam with our AI and Machine Learning For Dummies test prep. Master fundamental AI concepts, AWS AI services, and ethical considerations.
- Azure AI Fundamentals: Ace the Azure AI Fundamentals exam with our comprehensive test prep. Learn the basics of AI, Azure AI services, and their applications.
- Google Cloud Professional Machine Learning Engineer: Nail the Google Professional Machine Learning Engineer exam with our expert-designed test prep. Deepen your understanding of ML algorithms, models, and deployment strategies.
- AWS Certified Machine Learning Specialty: Dominate the AWS Certified Machine Learning Specialty exam with our targeted test prep. Master advanced ML techniques, AWS ML services, and practical applications.
- AWS Certified Data Engineer Associate (DEA-C01): Set yourself up for promotion, get a better job or Increase your salary by Acing the AWS DEA-C01 Certification.
Business Plus Plan: M9HNXHX3WC9H7YE
With Google Workspace, you get custom email @yourcompany, the ability to work from anywhere, and tools that easily scale up or down with your needs.
Need more codes or have questions? Email us at info@djamgatech.com.
The first app idea is an AI-Based Customized Learning Path Creator. This app would use generative AI to analyze a student’s learning style, strengths, and weaknesses, and then create a personalized learning path with tailored resources and activities. Unlike ChatGPT, which primarily responds to queries, this app would actively assess and guide the user’s educational journey. While ChatGPT can suggest learning resources, a dedicated app can provide a more structured and personalized learning path, continuously adapting to the user’s progress.
Next up, we have an Interactive AI Tutor for Problem Solving. This app would focus on STEM subjects and use generative AI to create unique problem sets and provide step-by-step solutions with explanations. The AI could even generate new problems based on the student’s progress. What sets this app apart is its interactive problem-solving experience, adapting the difficulty and type of problems in real-time. While ChatGPT can help with problem-solving, an app designed specifically for STEM education can offer a more interactive and subject-focused approach. Imagine visual aids, interactive simulations, and progress tracking to enhance the learning experience.
Now, let’s talk about an AI-Driven Language Learning Companion. This app would use AI to generate conversational scenarios in various languages, helping users practice speaking and comprehension in a simulated real-world context. What makes it unique is its focus on verbal interaction and contextual learning. By providing a more immersive language learning experience than typical chat-based apps, this dedicated app can take language learning to a whole new level. Picture speech recognition for pronunciation practice, structured language programs, and even immersive scenarios to practice your skills in a real-world context.
Moving on, we have Generative AI Storytelling for Creative Writing. This app aims to help students enhance their creative writing skills by generating story prompts, character ideas, or even continuing a story based on the student’s input. It’s all about creativity and storytelling! While ChatGPT can generate story prompts, a specialized app would offer a broader range of creative writing tools. Think workshops, peer review features, and guided writing exercises to truly develop your writing skills through AI-generated content.
Now, let’s explore an AI Music Composition and Theory Teaching Tool. This app would teach music theory by generating music sheets or compositions based on AI algorithms. Users could input specific genres, moods, or instruments, and the AI would create music pieces accordingly. It’s all about making music education more accessible! While ChatGPT might assist in some aspects of music theory, an app focused on music education could integrate AI-generated music with interactive learning modules, listening exercises, and even more complex composition tools.
Next, we have the Generative Art History and Appreciation App. This app would use AI to generate art pieces in the style of various historical periods or artists while also providing educational content about art history and techniques. By combining art creation with educational content, this app would make art history interactive and engaging. While ChatGPT can provide information on art history, imagine being able to take virtual art gallery tours, emulate different styles, and dive into detailed analyses of art techniques, all in one app.
Last but not least, let’s talk about an AI-Enhanced Public Speaking and Presentation Trainer. This app would use AI to analyze speech patterns and content, offering tips and exercises to improve public speaking skills. Its unique feature lies in providing real-time feedback and tailored coaching, unlike typical text-based AI applications. While ChatGPT can offer general tips on public speaking, a dedicated app can go the extra mile by utilizing speech recognition to provide real-time feedback on aspects like pacing, tone, and filler word usage. Imagine having a personal speech coach right in your pocket!
So, as you can see, each of these app ideas leverages generative AI in unique ways, focusing on different aspects of education and learning. They provide experiences that go beyond the capabilities of a standard AI chatbot like ChatGPT. From customized learning paths and interactive problem-solving to immersive language learning and creative writing assistance, the possibilities are endless with generative AI in the educational mobile app space.
Are you ready to dive into the fascinating world of artificial intelligence? Well, I’ve got just the thing for you! It’s an incredible book called “AI Unraveled: Demystifying Frequently Asked Questions on Artificial Intelligence.” Trust me, this book is an absolute gem!
Now, you might be wondering where you can get your hands on this treasure trove of knowledge. Look no further, my friend. You can find “AI Unraveled” at popular online platforms like Etsy, Shopify, Apple, Google, and of course, our old faithful, Amazon.
This book is a must-have for anyone eager to expand their understanding of AI. It takes those complicated concepts and breaks them down into easily digestible chunks. No more scratching your head in confusion or getting lost in a sea of technical terms. With “AI Unraveled,” you’ll gain a clear and concise understanding of artificial intelligence.
So, if you’re ready to embark on this incredible journey of unraveling the mysteries of AI, go ahead and grab your copy of “AI Unraveled” today. Trust me, you won’t regret it!
In this episode, we explored innovative educational mobile app ideas incorporating generative AI and discussed the book “AI Unraveled” that tackles common questions about artificial intelligence. Join us next time on AI Unraveled as we continue to demystify frequently asked questions on artificial intelligence and bring you the latest trends in AI, including ChatGPT advancements and the exciting collaboration between Google Brain and DeepMind. Stay informed, stay curious, and don’t forget to subscribe for more!
What is Google Workspace?
Google Workspace is a cloud-based productivity suite that helps teams communicate, collaborate and get things done from anywhere and on any device. It's simple to set up, use and manage, so your business can focus on what really matters.
Watch a video or find out more here.
Here are some highlights:
Business email for your domain
Look professional and communicate as you@yourcompany.com. Gmail's simple features help you build your brand while getting more done.
Access from any location or device
Check emails, share files, edit documents, hold video meetings and more, whether you're at work, at home or on the move. You can pick up where you left off from a computer, tablet or phone.
Enterprise-level management tools
Robust admin settings give you total command over users, devices, security and more.
Sign up using my link https://referworkspace.app.goo.gl/Q371 and get a 14-day trial, and message me to get an exclusive discount when you try Google Workspace for your business.
Google Workspace Business Standard Promotion code for the Americas
63F733CLLY7R7MM
63F7D7CPD9XXUVT
63FLKQHWV3AEEE6
63JGLWWK36CP7WM
Email me for more promo codes
Active Hydrating Toner, Anti-Aging Replenishing Advanced Face Moisturizer, with Vitamins A, C, E & Natural Botanicals to Promote Skin Balance & Collagen Production, 6.7 Fl Oz
Age Defying 0.3% Retinol Serum, Anti-Aging Dark Spot Remover for Face, Fine Lines & Wrinkle Pore Minimizer, with Vitamin E & Natural Botanicals
Firming Moisturizer, Advanced Hydrating Facial Replenishing Cream, with Hyaluronic Acid, Resveratrol & Natural Botanicals to Restore Skin's Strength, Radiance, and Resilience, 1.75 Oz
Skin Stem Cell Serum
Smartphone 101 - Pick a smartphone for me - android or iOS - Apple iPhone or Samsung Galaxy or Huawei or Xaomi or Google Pixel
Can AI Really Predict Lottery Results? We Asked an Expert.
Djamgatech

Read Photos and PDFs Aloud for me iOS
Read Photos and PDFs Aloud for me android
Read Photos and PDFs Aloud For me Windows 10/11
Read Photos and PDFs Aloud For Amazon
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: 96DRHDRA9J7GTN6(Email us for more)
AI-Powered Professional Certification Quiz Platform
Web|iOs|Android|Windows
FREE 10000+ Quiz Trivia and and Brain Teasers for All Topics including Cloud Computing, General Knowledge, History, Television, Music, Art, Science, Movies, Films, US History, Soccer Football, World Cup, Data Science, Machine Learning, Geography, etc....

List of Freely available programming books - What is the single most influential book every Programmers should read
- Bjarne Stroustrup - The C++ Programming Language
- Brian W. Kernighan, Rob Pike - The Practice of Programming
- Donald Knuth - The Art of Computer Programming
- Ellen Ullman - Close to the Machine
- Ellis Horowitz - Fundamentals of Computer Algorithms
- Eric Raymond - The Art of Unix Programming
- Gerald M. Weinberg - The Psychology of Computer Programming
- James Gosling - The Java Programming Language
- Joel Spolsky - The Best Software Writing I
- Keith Curtis - After the Software Wars
- Richard M. Stallman - Free Software, Free Society
- Richard P. Gabriel - Patterns of Software
- Richard P. Gabriel - Innovation Happens Elsewhere
- Code Complete (2nd edition) by Steve McConnell
- The Pragmatic Programmer
- Structure and Interpretation of Computer Programs
- The C Programming Language by Kernighan and Ritchie
- Introduction to Algorithms by Cormen, Leiserson, Rivest & Stein
- Design Patterns by the Gang of Four
- Refactoring: Improving the Design of Existing Code
- The Mythical Man Month
- The Art of Computer Programming by Donald Knuth
- Compilers: Principles, Techniques and Tools by Alfred V. Aho, Ravi Sethi and Jeffrey D. Ullman
- Gödel, Escher, Bach by Douglas Hofstadter
- Clean Code: A Handbook of Agile Software Craftsmanship by Robert C. Martin
- Effective C++
- More Effective C++
- CODE by Charles Petzold
- Programming Pearls by Jon Bentley
- Working Effectively with Legacy Code by Michael C. Feathers
- Peopleware by Demarco and Lister
- Coders at Work by Peter Seibel
- Surely You're Joking, Mr. Feynman!
- Effective Java 2nd edition
- Patterns of Enterprise Application Architecture by Martin Fowler
- The Little Schemer
- The Seasoned Schemer
- Why's (Poignant) Guide to Ruby
- The Inmates Are Running The Asylum: Why High Tech Products Drive Us Crazy and How to Restore the Sanity
- The Art of Unix Programming
- Test-Driven Development: By Example by Kent Beck
- Practices of an Agile Developer
- Don't Make Me Think
- Agile Software Development, Principles, Patterns, and Practices by Robert C. Martin
- Domain Driven Designs by Eric Evans
- The Design of Everyday Things by Donald Norman
- Modern C++ Design by Andrei Alexandrescu
- Best Software Writing I by Joel Spolsky
- The Practice of Programming by Kernighan and Pike
- Pragmatic Thinking and Learning: Refactor Your Wetware by Andy Hunt
- Software Estimation: Demystifying the Black Art by Steve McConnel
- The Passionate Programmer (My Job Went To India) by Chad Fowler
- Hackers: Heroes of the Computer Revolution
- Algorithms + Data Structures = Programs
- Writing Solid Code
- JavaScript - The Good Parts
- Getting Real by 37 Signals
- Foundations of Programming by Karl Seguin
- Computer Graphics: Principles and Practice in C (2nd Edition)
- Thinking in Java by Bruce Eckel
- The Elements of Computing Systems
- Refactoring to Patterns by Joshua Kerievsky
- Modern Operating Systems by Andrew S. Tanenbaum
- The Annotated Turing
- Things That Make Us Smart by Donald Norman
- The Timeless Way of Building by Christopher Alexander
- The Deadline: A Novel About Project Management by Tom DeMarco
- The C++ Programming Language (3rd edition) by Stroustrup
- Patterns of Enterprise Application Architecture
- Computer Systems - A Programmer's Perspective
- Agile Principles, Patterns, and Practices in C# by Robert C. Martin
- Growing Object-Oriented Software, Guided by Tests
- Framework Design Guidelines by Brad Abrams
- Object Thinking by Dr. David West
- Advanced Programming in the UNIX Environment by W. Richard Stevens
- Hackers and Painters: Big Ideas from the Computer Age
- The Soul of a New Machine by Tracy Kidder
- CLR via C# by Jeffrey Richter
- The Timeless Way of Building by Christopher Alexander
- Design Patterns in C# by Steve Metsker
- Alice in Wonderland by Lewis Carol
- Zen and the Art of Motorcycle Maintenance by Robert M. Pirsig
- About Face - The Essentials of Interaction Design
- Here Comes Everybody: The Power of Organizing Without Organizations by Clay Shirky
- The Tao of Programming
- Computational Beauty of Nature
- Writing Solid Code by Steve Maguire
- Philip and Alex's Guide to Web Publishing
- Object-Oriented Analysis and Design with Applications by Grady Booch
- Effective Java by Joshua Bloch
- Computability by N. J. Cutland
- Masterminds of Programming
- The Tao Te Ching
- The Productive Programmer
- The Art of Deception by Kevin Mitnick
- The Career Programmer: Guerilla Tactics for an Imperfect World by Christopher Duncan
- Paradigms of Artificial Intelligence Programming: Case studies in Common Lisp
- Masters of Doom
- Pragmatic Unit Testing in C# with NUnit by Andy Hunt and Dave Thomas with Matt Hargett
- How To Solve It by George Polya
- The Alchemist by Paulo Coelho
- Smalltalk-80: The Language and its Implementation
- Writing Secure Code (2nd Edition) by Michael Howard
- Introduction to Functional Programming by Philip Wadler and Richard Bird
- No Bugs! by David Thielen
- Rework by Jason Freid and DHH
- JUnit in Action
#BlackOwned #BlackEntrepreneurs #BlackBuniness #AWSCertified #AWSCloudPractitioner #AWSCertification #AWSCLFC02 #CloudComputing #AWSStudyGuide #AWSTraining #AWSCareer #AWSExamPrep #AWSCommunity #AWSEducation #AWSBasics #AWSCertified #AWSMachineLearning #AWSCertification #AWSSpecialty #MachineLearning #AWSStudyGuide #CloudComputing #DataScience #AWSCertified #AWSSolutionsArchitect #AWSArchitectAssociate #AWSCertification #AWSStudyGuide #CloudComputing #AWSArchitecture #AWSTraining #AWSCareer #AWSExamPrep #AWSCommunity #AWSEducation #AzureFundamentals #AZ900 #MicrosoftAzure #ITCertification #CertificationPrep #StudyMaterials #TechLearning #MicrosoftCertified #AzureCertification #TechBooks
Top 1000 Canada Quiz and trivia: CANADA CITIZENSHIP TEST- HISTORY - GEOGRAPHY - GOVERNMENT- CULTURE - PEOPLE - LANGUAGES - TRAVEL - WILDLIFE - HOCKEY - TOURISM - SCENERIES - ARTS - DATA VISUALIZATION

Top 1000 Africa Quiz and trivia: HISTORY - GEOGRAPHY - WILDLIFE - CULTURE - PEOPLE - LANGUAGES - TRAVEL - TOURISM - SCENERIES - ARTS - DATA VISUALIZATION

Exploring the Pros and Cons of Visiting All Provinces and Territories in Canada.

Exploring the Advantages and Disadvantages of Visiting All 50 States in the USA

Health Health, a science-based community to discuss human health
- Zepbound beats Wegovy for weight loss in first head-to-head trial of blockbuster drugsby /u/merchlinkinbio on May 12, 2025 at 4:02 am
submitted by /u/merchlinkinbio [link] [comments]
- Proximity to Golf Courses and Risk of Parkinson Diseaseby /u/Chufield on May 11, 2025 at 8:57 pm
submitted by /u/Chufield [link] [comments]
- Children eat more after seeing just five minutes of junk food ads, study findsby /u/No-Lifeguard-8173 on May 11, 2025 at 11:03 am
submitted by /u/No-Lifeguard-8173 [link] [comments]
- H5N1 poses pandemic risk: Study warns of cat-to-human transmissionby /u/boppinmule on May 11, 2025 at 10:07 am
submitted by /u/boppinmule [link] [comments]
- Texas measles cases grow to 709. Track the spread here.by /u/boppinmule on May 11, 2025 at 6:34 am
submitted by /u/boppinmule [link] [comments]
Today I Learned (TIL) You learn something new every day; what did you learn today? Submit interesting and specific facts about something that you just found out here.
- TIL in 2005, Sony spent over a million dollars to drop 250,000 bouncy balls off one of the steepest streets in San Francisco, breaking windows and destroying cars in the process.by /u/holyfruits on May 11, 2025 at 8:09 pm
submitted by /u/holyfruits [link] [comments]
- TIL that Sir Laurence Olivier (an actor) once said before filming a love scene: "I apologize if I get aroused and I apologize if I do not get aroused"by /u/EuSoLeioAsGordas on May 11, 2025 at 8:07 pm
submitted by /u/EuSoLeioAsGordas [link] [comments]
- TIL that the modern iteration of Sudoku was only invented in 1979by /u/AdamVerbatim on May 11, 2025 at 6:22 pm
submitted by /u/AdamVerbatim [link] [comments]
- TIL Mike Flanagan's horror films are known for a lack of jump scares, but the first episode of The Midnight Club (2022) ironically set the Guinness World Record with 21 jump scares. Flanagan designed this so "jump scare(s) would be rendered meaningless".by /u/Torley_ on May 11, 2025 at 6:19 pm
submitted by /u/Torley_ [link] [comments]
- TIL a 2023 survey of more than 1,000 hiring managers found that nearly 40% admitted to lying to candidates in job interviews. And in most cases, it worked: 92% of the managers that admitted lying said a candidate they had lied to accepted their job offer.by /u/tyrion2024 on May 11, 2025 at 5:22 pm
submitted by /u/tyrion2024 [link] [comments]
Reddit Science This community is a place to share and discuss new scientific research. Read about the latest advances in astronomy, biology, medicine, physics, social science, and more. Find and submit new publications and popular science coverage of current research.
- Weight-loss drugs cut alcohol intake by almost two-thirds, Ireland study suggests: Individuals who took liraglutide or semaglutide for weight loss reduced their alcohol consumption by almost two-thirds in four monthsby /u/FunnyGamer97 on May 12, 2025 at 3:00 am
submitted by /u/FunnyGamer97 [link] [comments]
- GLP-1 receptor agonists show anti-cancer benefits beyond weight loss | After accounting for the relative advantage of surgery in reducing weight loss, GLP-1 drugs were associated with a 41% lower risk of obesity-related cancerby /u/FunnyGamer97 on May 12, 2025 at 2:46 am
submitted by /u/FunnyGamer97 [link] [comments]
- New study explores how specific cannabinoids affect brain connections, with potential implications for mental health conditions and behavioral disorders.by /u/brendigio on May 12, 2025 at 2:00 am
submitted by /u/brendigio [link] [comments]
- Sexual compliance is sexual activity with a partner when the personal desire to do so is not present. Unlike coercion, it is usually voluntary. It may be positive when it is driven by desire to increase increase intimacy or express love but negative when there is coercion or sexual distress.by /u/mvea on May 11, 2025 at 9:59 pm
submitted by /u/mvea [link] [comments]
- Study investigates motivational foundations of digital self control in social media: Both high and low personal investment lead to regret and negative outcomes. Best: know why you use it but maintain personal distance.by /u/skjolbrot on May 11, 2025 at 8:02 pm
submitted by /u/skjolbrot [link] [comments]
Reddit Sports Sports News and Highlights from the NFL, NBA, NHL, MLB, MLS, and leagues around the world.
- Kyle Larson wins again at Kansas, dominating for his 3rd NASCAR Cup Series win this seasonby /u/Oldtimer_2 on May 12, 2025 at 12:37 am
submitted by /u/Oldtimer_2 [link] [comments]
- Rockies fire Bud Black after 7-33 startby /u/Oldtimer_2 on May 12, 2025 at 12:31 am
submitted by /u/Oldtimer_2 [link] [comments]
- Saints start QB competition in wake of Derek Carr retirementby /u/PrincessBananas85 on May 11, 2025 at 10:08 pm
submitted by /u/PrincessBananas85 [link] [comments]
- MotoGP: Johann Zarco becomes first French winner of French Grand Prix in 71 yearsby /u/hawlc on May 11, 2025 at 7:58 pm
submitted by /u/hawlc [link] [comments]
- Astros' Lance McCullers Jr. says disgruntled fans directing death threats at him and his childrenby /u/merchlinkinbio on May 11, 2025 at 7:51 pm
submitted by /u/merchlinkinbio [link] [comments]
• 87.5% on ARC-AGI (the human threshold is 85%)
• 25.2% of EpochAI’s Frontier Math problems (when no other model breaks 2%)
• 96.7% on AIME 2024 (missed one question)
• 71.7% on software engineer (o1 was 48.9)
• 87.7% on PhD-level science (above human expert scores)Even the team seemed shocked – one speaker said they “need to fix [their] worldview… especially in this o3 world.” And research scientist at OpenAI, Noam Brown said: “We announced o1 just 3 months ago. Today, we announced o3. We have every reason to believe this trajectory will continue.”They only showed o3-mini today. Safety testing starts now. Public release end of January.