GPT-5 Archives - AI - IT - Engineering - Cloud - Finance

February 1, 2024March 1, 2024

A Daily Chronicle of AI Innovations in February 2024

A Daily Chronicle of AI Innovations in February 2024.

Welcome to the Daily Chronicle of AI Innovations in February 2024! This month-long blog series will provide you with the latest developments, trends, and breakthroughs in the field of artificial intelligence. From major industry conferences like ‘AI Innovations at Work’ to bold predictions about the future of AI, we will curate and share daily updates to keep you informed about the rapidly evolving world of AI. Join us on this exciting journey as we explore the cutting-edge advancements and potential impact of AI throughout February 2024.

Are you eager to expand your understanding of artificial intelligence? Look no further than the essential book “AI Unraveled: Master GPT-4, Gemini, Generative AI & LLMs – Simplified Guide for Everyday Users: Demystifying Artificial Intelligence – OpenAI, ChatGPT, Google Bard, AI ML Quiz, AI Certifications Prep, Prompt Engineering,” available at Etsy, Shopify, Apple, Google, or Amazon.

AI Unraveled - Master GPT-4, Gemini, Generative AI, LLMs: A simplified Guide For Everyday Users — AI Unraveled – Master GPT-4, Gemini, Generative AI, LLMs: A simplified Guide For Everyday Users

A Daily Chronicle of AI Innovations in February 2024 – Day 29: AI Daily News – February 29th, 2024

Alibaba’s EMO makes photos come alive (and lip-sync!)
Microsoft introduces 1-bit LLM
Ideogram launches text-to-image model version 1.0

Adobe launches new GenAI music tool

Morph makes filmmaking easier with Stability AI

Hugging Face, Nvidia, and ServiceNow release StarCode 2 for code generation.

Meta set to launch Llama 3 in July and could be twice the size

Apple subtly reveals its AI plans

OpenAI to put AI into humanoid robots

GitHub besieged by millions of malicious repositories in ongoing attack

Nvidia just released a new code generator that can run on most modern CPUs

Three more publishers sue OpenAI

AI Dashboard: All-in-One AI Tools — Read Aloud For Me- Ai Dashboard

Alibaba’s EMO makes photos come alive (and lip-sync!)

Researchers at Alibaba have introduced an AI system called “EMO” (Emote Portrait Alive) that can generate realistic videos of you talking and singing from a single photo and an audio clip. It captures subtle facial nuances without relying on 3D models.

EMO uses a two-stage deep learning approach with audio encoding, facial imagery generation via diffusion models, and reference/audio attention mechanisms.

Experiments show that the system significantly outperforms existing methods in terms of video quality and expressiveness.

Why does this matter?

Get the Read Aloud Wonderland Bedtime Adventure iBook here

By combining EMO with OpenAI’s Sora, we could synthesize personalized video content from photos or bring photos from any era to life. This could profoundly expand human expression. We may soon see automated TikTok-like videos.

Source

Microsoft introduces 1-bit LLM

Microsoft has launched a radically efficient AI language model dubbed 1-bit LLM. It uses only 1.58 bits per parameter instead of the typical 16, yet performs on par with traditional models of equal size for understanding and generating text.

Building on research like BitNet, this drastic bit reduction per parameter boosts cost-effectiveness relating to latency, memory, throughput, and energy usage by 10x. Despite using a fraction of the data, 1-bit LLM maintains accuracy.

Why does this matter?

Traditional LLMs often require extensive resources and are expensive to run while their swelling size and power consumption give them massive carbon footprints.

This new 1-bit technique points towards much greener AI models that retain high performance without overusing resources. By enabling specialized hardware and optimized model design, it can drastically improve efficiency and cut computing costs, with the ability to put high-performing AI directly into consumer devices.

Source

Advertise with us - Post Your Good Content Here
We are ranked in the Top 20 on Google

AI Dashboard is available on the Web, Apple, Google, and Microsoft, PRO version

Ideogram launches text-to-image model version 1.0

Ideogram has launched a new text-to-picture app called Ideogram 1.0. It’s their most advanced ever. Dubbed a “creative helper,” it generates highly realistic images from text prompts with minimal errors. A built-in “Magic Prompt” feature effortlessly expands basic prompts into detailed scenes.

The Details:

Invest in your future today by enrolling in this Azure Fundamentals - Pass the Azure Fundamentals Exam with Ease: Master the AZ-900 Certification with the Comprehensive Exam Preparation Guide!

Microsoft Azure AZ900 Certification and Training

Ideogram 1.0 significantly cuts image generation errors in half compared to other apps. And users can make custom picture sizes and styles. So it can do memes, logos, old-timey portraits, anything.

Magic Prompt takes basic prompts like “vegetables orbiting the sun” and turns them into full scenes with backstories. That would take regular people hours to write out word-for-word.

Ideogram launches text-to-image model version 1.0

Tests show that Ideogram 1.0 beats DALL-E 3 and Midjourney V6 at matching prompts, making sensible pictures, looking realistic, and handling text.

Why does this matter?

This advancement in AI image generation hints at a future where generative models commonly assist or even substitute human creators across personalized gift items, digital content, art, and more.

Source

What Else Is Happening in AI on February 29th, 2024

Adobe launches new GenAI music tool

AI Unraveled: Demystifying Frequently Asked Questions on Artificial Intelligence (OpenAI, ChatGPT, Google Gemini, Generative AI, Discriminative AI, xAI, LLMs, GPUs, Machine Learning, NLP, Promp Engineering)

Adobe introduces Project Music GenAI Control, allowing users to create music from text or reference melodies with customizable tempo, intensity, and structure. While still in development, this tool has the potential to democratize music creation for everyone. (Link)

Morph makes filmmaking easier with Stability AI

Morph Studio, a new AI platform, lets you create films simply by describing desired scenes in text prompts. It also enables combining these AI-generated clips into complete movies. Powered by Stability AI, this revolutionary tool could enable anyone to become a filmmaker. (Link)

Hugging Face, Nvidia, and ServiceNow release StarCode 2 for code generation.

Hugging Face along with Nvidia and Service Now launches StarCoder 2, an open-source code generator available in three GPU-optimized models. With improved performance and less restrictive licensing, it promises efficient code completion and summarization. (Link)

"Pass the AWS Cloud Practitioner Certification with flying colors: Master the Exam with 300+ Quizzes, Cheat Sheets, Flashcards, and Illustrated Study Guides - 2024 Edition"

Meta set to launch Llama 3 in July

Meta plans to launch Llama 3 in July to compete with OpenAI’s GPT-4. It promises increased responsiveness, better context handling, and double the size of its predecessor. With added tonality and security training, Llama 3 seeks more nuanced responses. (Link)

Apple subtly reveals its AI plans

"Become a Canada Expert: Ace the Citizenship Test and Impress Everyone with Your Knowledge of Canadian History, Geography, Government, Culture, People, Languages, Travel, Wildlife, Hockey, Tourism, Sceneries, Arts, and Data Visualization. Get the Top 1000 Canada Quiz Now!"

Apple CEO Tim Cook reveals plans to disclose Apple’s generative AI efforts soon, highlighting opportunities to transform user productivity and problem-solving. This likely indicates exciting new iPhone and device features centered on efficiency. (Link)

A Daily Chronicle of AI Innovations in February 2024 – Day 28: AI Daily News – February 28th, 2024

NVIDIA’s Nemotron-4 beats 4x larger multilingual AI models
GitHub launches Copilot Enterprise for customized AI coding
Slack study shows AI frees up 41% of time spent on low-value work

Pika launches new lip sync feature for AI videos

Google pays publishers to test an unreleased GenAI tool

Intel and Microsoft team up to bring 100M AI PCs by 2025

Writer’s Palmyra-Vision summarizes charts, scribbles into text

Apple cancels its decade-long electric car project

OpenAI claims New York Times paid someone to ‘hack’ ChatGPT

Tumblr and WordPress blogs will be exploited for AI model training

Google CEO slams ‘completely unacceptable’ Gemini AI errors

Klarna’s AI bot is doing the work of 700 employees

NVIDIA’s Nemotron-4 beats 4x larger multilingual AI models

Unlock the power of AI with “Read Aloud For Me – AI Dashboard” – your ultimate AI Dashboard and Hub. Access all major AI tools in one seamless app, designed to elevate your productivity and streamline your digital experience. Available now on the web at readaloudforme.com and across all your favorite app stores: Apple, Google, and Microsoft. “Read Aloud For Me – AI Dashboard” brings the future of AI directly to your fingertips, merging convenience with innovation. Whether for work, education, or personal enhancement, our app is your gateway to the most advanced AI technologies. Download today and transform the way you interact with AI tools.

Unlock the power of AI with "Read Aloud For Me" – your ultimate AI Dashboard and Hub. Access all major AI tools in one seamless app, designed to elevate your productivity and streamline your digital experience. Available now on the web at readaloudforme.com and across all your favorite app stores: Apple, Google, and Microsoft. "Read Aloud For Me" brings the future of AI directly to your fingertips, merging convenience with innovation. Whether for work, education, or personal enhancement, our app is your gateway to the most advanced AI technologies. Download today and transform the way you interact with AI tools. — Read Aloud For Me – AI Dashboard: All-in-One AI Tool Hub: Web iOS Android Windows

Nvidia has announced Nemotron-4 15B, a 15-billion parameter multilingual language model trained on 8 trillion text tokens. Nemotron-4 shows exceptional performance in English, coding, and multilingual datasets. It outperforms all other open models of similar size on 4 out of 7 benchmarks. It has the best multilingual capabilities among comparable models, even better than larger multilingual models.

NVIDIA's Nemotron-4 beats 4x larger multilingual AI models — NVIDIA’s Nemotron-4 beats 4x larger multilingual AI models

The researchers highlight how Nemotron-4 scales model training data in line with parameters instead of just increasing model size. As a result, inferences are computed faster, and latency is reduced. Due to its ability to fit on a single GPU, Nemotron-4 aims to be the best general-purpose model given practical constraints. It achieves better accuracy than the 34-billion parameter LLaMA model for all tasks and remains competitive with state-of-the-art models like QWEN 14B.

Why does this matter?

Just as past computing innovations improved technology access, Nemotron’s lean GPU deployment profile can expand multilingual NLP adoption. Since Nemotron fits on a single cloud graphics card, it dramatically reduces costs for document, query, and application NLP compared to alternatives requiring supercomputers. These models can help every company become fluent with customers and operations across countless languages.

Source

Unlock the Secrets of Africa: Master African History, Geography, Culture, People, Cuisine, Economics, Languages, Music, Wildlife, Football, Politics, Animals, Tourism, Science and Environment with the Top 1000 Africa Quiz and Trivia. Get Yours Now!

GitHub launches Copilot Enterprise for customized AI coding

GitHub has launched Copilot Enterprise, an AI assistant for developers at large companies. The tool provides customized code suggestions and other programming support based on an organization’s codebase and best practices. Experts say Copilot Enterprise signals a significant shift in software engineering, with AI essentially working alongside each developer.

Copilot Enterprise integrates across the coding workflow to boost productivity. Early testing by partners like Accenture found major efficiency gains, with a 50% increase in builds from autocomplete alone. However, GitHub acknowledges skepticism around AI originality and bugs. The company plans substantial investments in responsible AI development, noting that Copilot is designed to augment human developers rather than replace them.

Why does this matter?

The entire software team could soon have an AI partner for programming. However, concerns about responsible AI development persist. Enterprises must balance rapidly integrating tools like Copilot with investments in accountability. How leadership approaches AI strategy now will separate future winners from stragglers.

Source

Dive into a comprehensive AWS Cloud Practitioner CLF-C02 Certification guide, masterfully weaving insights from Tutorials Dojo, Adrian Cantrill, Stephane Maarek, and AWS Skills Builder into one unified resource.

Slack study shows AI frees up 41% of time spent on low-value work

Slack’s latest workforce survey shows a surge in the adoption of AI tools among desk workers. There has been a 24% increase in usage over the past quarter, and 80% of users are already seeing productivity gains. However, less than half of companies have guidelines around AI adoption, which may inhibit experimentation. The research also spotlights an opportunity to use AI to automate the 41% of workers’ time spent on repetitive, low-value tasks. And focus efforts on meaningful, strategic work.

While most executives feel urgency to implement AI, top concerns include data privacy and AI accuracy. According to the findings, guidance is necessary to boost employee adoption. Workers are over 5x more likely to have tried AI tools at companies with defined policies.

Why does this matter?

This survey signals AI adoption is already boosting productivity when thoughtfully implemented. It can free up significant time spent on repetitive tasks and allows employees to refocus on higher-impact work. However, to realize AI’s benefits, organizations must establish guidelines and address data privacy and reliability concerns. Structured experimentation with intuitive AI systems can increase productivity and data-driven decision-making.

Source

OpenAI to put AI into humanoid robots

OpenAI is collaborating with robotics startup Figure to integrate its AI technology into humanoid robots, marking the AI’s debut in the physical world.
The partnership aims to develop humanoid robots for commercial use, with significant funding from high-profile investors including Jeff Bezos, Microsoft, Nvidia, and Amazon.
The initiative will leverage OpenAI’s advanced AI models, such as GPT and DALL-E, to enhance the capabilities of Figure’s robots, aiming to address human labor shortages.

GitHub besieged by millions of malicious repositories in ongoing attack

Hackers have automated the creation of malicious GitHub repositories by cloning popular repositories, infecting them with malware, and forking them thousands of times, resulting in hundreds of thousands of malicious repositories designed to steal information.
The malware, hidden behind seven layers of obfuscation, includes a modified version of BlackCap-Grabber, which steals authentication cookies and login credentials from various apps.
While GitHub uses artificial intelligence to block most cloned malicious packages, 1% evade detection, leading to thousands of malicious repositories remaining on the platform.

Nvidia just released a new code generator that can run on most modern CPUs

Nvidia, ServiceNow, and Hugging Face have released StarCoder2, a series of open-access large language models for code generation, emphasizing efficiency, transparency, and cost-effectiveness.
StarCoder2, trained on 619 programming languages, comes in three sizes: 3 billion, 7 billion, and 15 billion parameters, with the smallest model matching the performance of its predecessor’s largest.
The platform highlights advancements in AI ethics and efficiency, utilizing a new code dataset for enhanced understanding of diverse programming languages and ensuring adherence to ethical AI practices by allowing developers to opt out of data usage.

Three more publishers sue OpenAI

The Intercept, Raw Story, and AlterNet have filed lawsuits against OpenAI and Microsoft in the Southern District of New York, alleging copyright infringement through the training of AI models without proper attribution.
The litigation claims that ChatGPT reproduces journalism works verbatim or nearly verbatim without providing necessary copyright information, suggesting that if trained properly, it could have included these details in its outputs.
The suits argue that OpenAI and Microsoft knowingly risked copyright infringement for profit, evidenced by their provision of legal cover to customers and the existence of an opt-out system for web content crawling.

What Else Is Happening in AI on February 28th, 2024

Pika launches new lip sync feature for AI videos

Video startup Pika announced a new Lip Sync feature powered by ElevenLabs. Pro users can add realistic dialogue with animated mouths to AI-generated videos. Although currently limited, Pika’s capabilities offer customization of the speech style, text, or uploaded audio tracks, escalating competitiveness in the AI synthetic media space. (Link)

Google pays publishers to test an unreleased GenAI tool

Google is privately paying a group of publishers to test a GenAI tool. They need to summarize three articles daily based on indexed external sources in exchange for a five-figure annual fee. Google says this will help under-resourced news outlets, but experts say it could negatively affect original publishers and undermine Google’s news initiative. (Link)

Ace the Microsoft Azure Fundamentals AZ-900 Certification Exam: Pass the Azure Fundamentals Exam with Ease

Intel and Microsoft team up to bring 100M AI PCs by 2025

By collaborating with Microsoft, Intel aims to supply 100 million AI-powered PCs by 2025 and ramp up enterprise demand for efficiency gains. Despite Apple and Qualcomm’s push for Arm-based designs, Intel hopes to maintain its 76% laptop chip market share following post-COVID inventory corrections. (Link)

Writer’s Palmyra-Vision summarizes charts, scribbles into text

AI writing startup Writer announced a new capability of its Palmyra model called Palmyra-Vision. This model can generate text summaries from images, including charts, graphs, and handwritten notes. It can automate e-commerce merchandise descriptions, graph analysis, and compliance checking while recommending human-in-the-loop for accuracy. (Link)

Apple cancels its decade-long electric car project

Apple is canceling its decade-long electric vehicle project after spending over $10 billion. There were nearly 2,000 employees working on the effort known internally as Titan. After Apple announces the cancellation of its ambitious electric car project, some staff from the discontinued car team will shift to other teams such as Gen AI. (Link)

Cloud Certification made simple. Ace your exams with Djamgatech.

Nvidia’s New AI Laptops

Nvidia, the dominant force in graphics processing units (GPUs), has once again pushed the boundaries of portable computing. Their latest announcement showcases a new generation of laptops powered by the cutting-edge RTX 500 and 1000 Ada Generation GPUs. The focus here isn’t just on better gaming visuals – these laptops promise to transform the way we interact with artificial intelligence (AI) on the go.

What’s going on here?

Nvidia’s new laptop GPUs are purpose-built to accelerate AI workflows. Let’s break down the key components:

Specialized AI Hardware: The RTX 500 and 1000 GPUs feature dedicated Tensor Cores. These cores are the heart of AI processing, designed to handle complex mathematical operations involved in machine learning and deep learning at incredible speed.
Generative AI Powerhouse: These new GPUs bring a massive boost for generative AI applications like Stable Diffusion. This means those interested in creating realistic images from simple text descriptions can expect to see significant performance improvements.
Efficiency Meets Power: These laptops aren’t just about raw power. They’re designed to intelligently offload lighter AI tasks to a dedicated Neural Processing Unit (NPU) built into the CPU, conserving GPU resources for the most demanding jobs.

What does this mean?

These advancements translate into a wide range of ground-breaking possibilities:

Photorealistic Graphics Enhanced by AI: Gamers can immerse themselves in more realistic and visually stunning worlds thanks to AI-powered technologies enhancing graphics rendering.
AI-Supercharged Productivity: From generating social media blurbs to advanced photo and video editing, professionals can complete creative tasks far more efficiently with AI assistance.
Real-time AI Collaboration: Features like AI-powered noise cancellation and background manipulation in video calls will elevate your virtual communication to a whole new level.

Why should I care?

Nvidia’s latest AI-focused laptops have the potential to revolutionize the way we use our computers:

Portable Creativity: Whether you’re an artist, designer, or just someone who loves to experiment with AI art tools, these laptops promise a level of on-the-go creative freedom previously unimaginable.
Workplace Transformation: Industries from architecture to healthcare will see AI optimize processes and enhance productivity. These laptops put that power directly into the hands of professionals.
The Future is AI: AI is advancing at a blistering pace, and Nvidia is ensuring that we won’t be tied to our desks to experience it.

In short, Nvidia’s new generation of AI laptops heralds an era where high-performance, AI-driven computing becomes accessible to more people. This has the potential to spark a wave of innovation that we can’t even fully comprehend yet.

Original source here.

A Daily Chronicle of AI Innovations in February 2024 – Day 27: AI Daily News – February 27th, 2024

Tesla’s robot is getting quicker, better

Nvidia CEO: kids shouldn’t learn to code — they should leave it up to AI

Microsoft’s deal with Mistral AI faces EU scrutiny

Apple Vision Pro’s components cost $1,542—but that’s not the full story

PlayStation to axe 900 jobs and close studio

NVIDIA’s CEO Thinks That Our Kids Shouldn’t Learn How to Code As AI Can Do It for Them

During the latest World Government Summit in Dubai, Jensen Huang, the CEO of NVIDIA, spoke about the things our kids should and shouldn’t learn in the future. It may come as a surprise to many but Huang does think that our kids don’t need the knowledge of coding, just leave it to AI.

He mentioned that a decade ago, there was a belief that everyone needed to learn to code, and they were probably right, but based on what we see nowadays, the situation has changed due to achievements in AI, where everyone is literally a programmer.

He further talked about how kids may not necessarily need to learn how to code, and the focus should be on developing technology that allows for programming languages to be more human-like. In essence, traditional coding languages such as C++ or Java may become obsolete, as computers should be able to comprehend human language inputs.

Source: https://app.daily.dev/posts/vCwIfZOrx

Mistral Large: The new rival to GPT-4, 2nd best LLM of all time

The French AI startup Mistral has launched its largest-ever LLM and flagship model to date, Mistral Large, with a 32K context window. The model has top-tier reasoning capabilities, and you can use it for complex multilingual reasoning tasks, including text understanding, transformation, and code generation.

Due to a strong multitasking capability, Mistral Large is the world’s second-ranked model on MMLU (Massive multitask language understanding).

Mistral Large: The new rival to GPT-4, 2nd best LLM of all time

The model is natively fluent in English, French, Spanish, German, and Italian, with a nuanced understanding of grammar and cultural context. In addition to that, Mistral also shows top performance in coding and math tasks.

Mistral Large is now available via the in-house platform “La Plateforme” and Microsoft’s Azure AI via API.

Why does it matter?

Mistral Large stands out as the first model to truly challenge OpenAI’s dominance since GPT-4. It shows skills on par with GPT-4 for complex language tasks while costing 20% less. In this race to make their models better, it’s the user community that stands to gain the most. Also, the focus on European languages and cultures could make Mistral a leader in the European AI market.

Source

DeepMind’s new gen-AI model creates video games in a flash

Google DeepMind has launched a new generative AI model – Genie (Generative Interactive Environment), that can create playable video games from a simple prompt after learning game mechanics from hundreds of thousands of gameplay videos.

Developed by the collaborative efforts of Google and the University of British Columbia, Genie can create side-scrolling 2D platformer games based on user prompts, like Super Mario Brothers and Contra, using a single image.

Trained on over 200,000 hours of gameplay videos, the experimental model can turn any image or idea into a 2D platformer.

Genie can be prompted with images it has never seen before, such as real-world photographs or sketches, enabling people to interact with their imagined virtual worlds-–essentially acting as a foundation world model. This is possible despite training without any action labels.

DeepMind’s new gen-AI model creates video games in a flash

Why does it matter?

Genie creates a watershed moment in the generative AI space, becoming the first LLM to develop interactive, playable environments from a single image prompt. The model could be a promising step towards general world models for AGI (Artificial General Intelligence) that can understand and apply learned knowledge like a human. Lastly, Genie can learn fine-grained controls exclusively from Internet videos, a unique feature as Internet videos do not typically have labels.

Source

Meta’s MobileLLM enables on-device AI deployment

Meta has released a research paper that addresses the need for efficient large language models that can run on mobile devices. The focus is on designing high-quality models with under 1 billion parameters, as this is feasible for deployment on mobiles.

By using deep and thin architectures, embedding sharing, and grouped-query attention, they developed a strong baseline model called MobileLLM, which achieves 2.7%/4.3% higher accuracy compared to previous 125M/350M state-of-the-art models. The research paper highlights that you should concentrate on developing an efficient model architecture rather than on data and parameter quantity to determine model quality.

Why does it matter?

With language understanding now possible on consumer devices, mobile developers can create products that were once hard to build because of latency or privacy issues when reliant on cloud connections. This advancement allows industries like finance, gaming, and personal health to integrate conversational interfaces, intelligent recommendations, and real-time data privacy protections using models optimized for mobile efficiency, sparking creativity in a new wave of intelligent apps.

Source

What Else Is Happening in AI on February 27th, 2024

Qualcomm reveals 75+ pre-optimized AI models at MWC 2024

Qualcomm released 75+ new large language models, including popular generative models like Whisper and Stable Diffusion, optimized for the Snapdragon platform at the Mobile World Congress (MWC) 2024. The company stated that some of these LLMs will have generation AI capabilities for next-generation smartphones, PCs, IoT, XR devices, etc. (Link)

Nvidia launches new laptop GPUs for AI on the go

Nvidia launched RTX 500 and 1000 Ada Generation laptop graphics processing units (GPUs) at the MWC 2024 for on-the-go AI processing. These GPUs will utilize the Ada Lovelace architecture to provide content creators, researchers, and engineers with accelerated AI and next-generation graphic performance while working from portable devices. (Link)

Microsoft announces AI principles for boosting innovation and competition

Microsoft announced a set of principles to foster innovation and competition in the AI space. The move came to showcase its role as a market leader in promoting responsible AI and answer the concerns of rivals and antitrust regulators. The standard covers six key dimensions of responsible AI: fairness, reliability and safety, privacy and security, inclusiveness, transparency, and accountability. (Link)

Google brings Gemini in Google Messages, Android Auto, Wear OS, etc.

Despite receiving some flakes from the industry, Google is riding the AI wave and decided to integrate Gemini into a new set of features for phones, cars, and wearables. With these new features, users can use Gemini to craft messages and AI-generated captions for images, summarize texts through AI for Android Auto, and access passes on Wear OS. (Link)

Microsoft Copilot GPTs help you plan your vacation and find recipes.

Microsoft has released a few copilot GPTs that can help you plan your next vacation, find recipes, learn how to cook them, create a custom workout plan, or design a logo for your brand. Microsoft corporate vice president Jordi Ribas informed the media that users will soon be able to create customized Copilot GPTs, which is missing in the current version of Copilot. (Link)

Tesla’s robot is getting quicker, better

Elon Musk shared new footage showing improved mobility and speed of Tesla’s robot, Optimus Gen 2, which is moving more smoothly and steadily around a warehouse.
The latest version of the Optimus robot is lighter, has increased walking speed thanks to Tesla-designed actuators and sensors, and demonstrates significant progress over previous models.
Musk predicts the possibility of Optimus starting to ship in 2025 for less than $20,000, marking a significant milestone in Tesla’s venture into humanoid robotics capable of performing mundane or dangerous tasks for humans.
Source

A Daily Chronicle of AI Innovations in February 2024 – Day 26: AI Daily News – February 26th, 2024

Google Deepmind announces Genie, the first generative interactive environment model

The abstract:

” We introduce Genie, the first generative interactive environment trained in an unsupervised manner from unlabelled Internet videos. The model can be prompted to generate an endless variety of action-controllable virtual worlds described through text, synthetic images, photographs, and even sketches. At 11B parameters, Genie can be considered a foundation world model. It is comprised of a spatiotemporal video tokenizer, an autoregressive dynamics model, and a simple and scalable latent action model. Genie enables users to act in the generated environments on a frame-by-frame basis despite training without any ground-truth action labels or other domain-specific requirements typically found in the world model literature. Further the resulting learned latent action space facilitates training agents to imitate behaviors from unseen videos, opening the path for training generalist agents of the future. “

I asked GPT4 to read through the article and summarize ELI5 style bullet points:

Who Wrote This?
- A group of smart people at Google DeepMind wrote the article. They’re working on making things better for turning text into webpages.
What Did They Do?
- They created something called “Genie.” It’s like a magic tool that can take all sorts of ideas or pictures and turn them into a place you can explore on a computer, like making your own little video game world from a drawing or photo. They did this by watching lots and lots of videos from the internet and learning how things move and work in those videos.
How Does It Work?
- They use something called “Genie” which is very smart and can understand and create new videos or game worlds by itself. You can even tell it what to do next in the world it creates, like moving forward or jumping, and it will show you what happens.
Why Is It Cool?
- Because Genie can create new, fun worlds just from a picture or some words, and you can play in these worlds! It’s like having a magic wand to make up your own stories and see them come to life on a computer.
What’s Next?
- Even though Genie is really cool, it’s not perfect. Sometimes it makes mistakes or can’t remember things for very long. But the people who made it are working to make it better, so one day, everyone might be able to create their own video game worlds just by imagining them.
Important Points:
- They want to make sure this tool is used in good ways and that it’s safe for everyone. They’re not sharing it with everyone just yet because they want to make sure it’s really ready and won’t cause any problems.

Microsoft eases AI testing with new red teaming tool

Microsoft has released an open-source automation called PyRIT to help security researchers test for risks in generative AI systems before public launch. Historically, “red teaming” AI has been an expert-driven manual process requiring security teams to create edge case inputs and assess whether the system’s responses contain security, fairness, or accuracy issues. PyRIT aims to automate parts of this tedious process for scale.

PyRIT helps researchers test AI systems by inputting large datasets of prompts across different risk categories. It automatically interacts with these systems, scoring each response to quantify failures. This allows for efficient testing of thousands of input variations that could cause harm. Security teams can then take this evidence to improve the systems before release.

Why does this matter?

Microsoft’s release of the PyRIT toolkit makes rigorously testing AI systems for risks drastically more scalable. Automating parts of the red teaming process will enable much wider scrutiny for generative models and eventually raise their performance standards. PyRIT’s automation will also pressure the entire industry to step up evaluations if they want their AI trusted.

Source

Transformers learn to plan better with Searchformer

A new paper from Meta introduces Searchformer, a Transformer model that exceeds the performance of traditional algorithms like A* search in complex planning tasks such as maze navigation and Sokoban puzzles. Searchformer is trained in two phases: first imitating A* search to learn general planning skills, then fine-tuning the model via expert iteration to find optimal solutions more efficiently.

The key innovation is the use of search-augmented training data that provides Searchformer with both the execution trace and final solution for each planning task. This enables more data-efficient learning compared to models that only see solutions. However, encoding the full reasoning trace substantially increases the length of training sequences. Still, Searchformer shows promising techniques for training AI to surpass symbolic planning algorithms.

Why does this matter?

Achieving state-of-the-art planning results shows that generative AI systems are advancing to develop human-like reasoning abilities. Mastering complex cognitive tasks like finding optimal paths has huge potential in AI applications that depend on strategic thinking and foresight. As other companies race to close this new gap in planning capabilities, progress in core areas like robotics and autonomy is likely to accelerate.

Source

YOLOv9 sets a new standard for real-time object recognition

YOLO (You Only Look Once) is open-source software that enables real-time object recognition in images, allowing machines to “see” like humans. Researchers have launched YOLOv9, the latest iteration that achieves state-of-the-art accuracy with significantly less computational cost.

By introducing two new techniques, Programmable Gradient Information (PGI) and Generalized Efficient Layer Aggregation Network (GELAN), YOLOv9 reduces parameters by 49% and computations by 43% versus predecessor YOLOv8, while boosting accuracy on key benchmarks by 0.6%. PGI improves network updating for more precise object recognition, while GELAN optimizes the architecture to increase accuracy and speed.

Why does this matter?

The advanced responsiveness of YOLOv9 unlocks possibilities for mobile vision applications where computing resources are limited, like drones or smart glasses. More broadly, it highlights deep learning’s potential to match human-level visual processing speeds, encouraging technology advancements like self-driving vehicles.

Source

What Else Is Happening in AI on February 26th, 2024

Apple tests internal ChatGPT-like tool for customer support

Apple recently launched a pilot program testing an internal AI tool named “Ask.” It allows AppleCare agents to generate technical support answers automatically by querying Apple’s knowledge base. The goal is faster and more efficient customer service. (Link)

ChatGPT gets an Android home screen widget

Android users can now access ChatGPT more easily through a home screen widget that provides quick access to the chatbot’s conversation and query modes. The widget is available in the latest beta version of the ChatGPT mobile app. (Link)

AWS adds open-source Mistral AI models to Amazon Bedrock

AWS announced it will be bringing two of Mistral’s high-performing generative AI models, Mistral 7B and Mixtral 8x7B, to its Amazon Bedrock platform for gen AI offerings in the near future. AWS chose Mistral’s cost-efficient and customizable models to expand the range of GenAI abilities for Bedrock users. (Link)

Montreal tests AI system to prevent subway suicides

The Montreal Transit Authority is testing an AI system that analyzes surveillance footage to detect warning signs of suicide risk among passengers. The system, developed with a local suicide prevention center, can alert staff to intervene and save lives. With current accuracy of 25%, the “promising” pilot could be implemented in two years. (Link)

Fast food giants embrace controversial AI worker tracking

Riley, an AI system by Hoptix, monitors worker-customer interactions in 100+ fast-food franchises to incentivize upselling. It tracks metrics like service speed, food waste, and upselling rates. Despite being a coaching tool, concerns exist regarding the imposition of unfair expectations on workers. (Link)

Mistral AI releases new model to rival GPT-4

Mistral AI introduces “Mistral Large,” a large language model designed to compete with top models like GPT-4 and Claude 2, and “Le Chat,” a beta chat assistant, aiming to establish an alternative to OpenAI and Anthropic’s offerings.
With aggressive pricing at $8 per million input tokens and $24 per million output tokens, Mistral Large offers a cost-effective solution compared to GPT-4’s pricing, supporting English, French, Spanish, German, and Italian.
The startup also revealed a strategic partnership with Microsoft to offer Mistral models on the Azure platform, enhancing Mistral AI’s market presence and potentially increasing its customer base through this new distribution channel.

Gemini is about to slide into your DMs

Google’s AI chatbot Gemini is being integrated into the Messages app as part of an Android update, aiming to make conversations more engaging and friend-like, initially available in English in select markets.
Android Auto receives AI improvements for summarizing long texts or chat threads and suggesting context-based replies, enhancing safety and convenience for drivers.
Google also introduces AI-powered accessibility features in Lookout and Maps, including screen reader enhancements and automatic generation of descriptions for images, to assist visually impaired users globally.

Microsoft tried to sell Bing to Apple in 2018

Microsoft attempted to sell its Bing search engine to Apple in 2018, aiming to make Bing the default search engine for Safari, but Apple declined due to concerns over Bing’s search quality.
The discussions between Apple and Microsoft were highlighted in Google’s court filings as evidence of competition in the search industry, amidst accusations against Google for monopolizing the web search sector.
Despite Microsoft’s nearly $100 billion investment in Bing over two decades, the search engine only secures a 3% global market share, while Google continues to maintain a dominant position, paying billions to Apple to remain the default search engine on its devices.

Meta forms team to stop AI from tricking voters

Meta is forming a dedicated task force to counter disinformation and harmful AI content ahead of the EU elections, focusing on rapid threat identification and mitigation.
The task force will remove harmful content from Facebook, Instagram, and Threads, expand its fact-checking team, and introduce measures for users and advertisers to disclose AI-generated material.
The initiative aligns with the Digital Services Act’s requirements for large online platforms to combat election manipulation, amidst growing concerns over the disruptive potential of AI and deepfakes in elections worldwide.

Samsung unveils the Galaxy Ring as way to ‘simplify everyday wellness’

Samsung teased the new Galaxy Ring at Galaxy Unpacked, showcasing its ambition to introduce a wearable that is part of a future vision for ambient sensing.
The Galaxy Ring, coming in three colors and various sizes, will feature sleep, activity, and health tracking capabilities, aiming to compete with products like the Oura Ring.
Samsung plans to integrate the Galaxy Ring into a larger ecosystem, offering features like My Vitality Score and Booster Cards in the Galaxy Health app, to provide a more holistic health monitoring system.

Impact of AI on Freelance Jobs

AI Weekly Rundown (February 19 to February 26)

Major AI announcements from NVIDIA, Apple, Google, Adobe, Meta, and more.

NVIDIA presents OpenMathInstruct-1, a 1.8 million math instruction tuning dataset
– OpenMathInstruct-1 is a high-quality, synthetically generated dataset. It is 4x bigger than previous datasets and does not use GPT-4. The best model, OpenMath-CodeLlama-70B, trained on a subset of OpenMathInstruct-1, achieves which is competitive performance with the best gpt-distilled models.
Apple is reportedly working on AI updates to Spotlight and Xcode
– AI features for Spotlight search could let iOS and macOS users make natural language requests to get weather reports or operate features deep within apps. Apple also expanded internal testing of new generative AI features for its Xcode and plans to release them to third-party developers this year.
Microsoft arms white hat AI hackers with a new red teaming tool
– PyRIT, an open-source tool from Microsoft, automates the testing of generative AI systems for risks before their public launch. It streamlines the “red teaming” process, traditionally a manual task, by inputting large datasets of prompts and scoring responses to identify potential issues in security, fairness, or accuracy.
Google has open-sourced Magika, its AI-powered file-type identification system
– It helps accurately detect binary and textual file types. Under the hood, Magika employs a custom, highly optimized deep-learning model, enabling precise file identification within milliseconds, even when running on a CPU.
Groq’s new AI chip turbocharges LLMs, outperforms ChatGPT
– Groq, an AI chip startup, has developed a special AI hardware– the first-ever Language Processing Unit (LPU) that turbocharges LLMs and processes up to 500 tokens/second, which is far more superior than ChatGPT-3.5’s 40 tokens/second.
Transformers learn to plan better with Searchformer
– Meta’s Searchformer, a Transformer model, outperforms traditional algorithms like A* search in complex planning tasks. It’s trained to imitate A* search for general planning skills and then fine-tuned for optimal solutions using expert iteration and search-augmented training data.
Apple tests internal chatGPT-like tool for customer support
– Apple recently launched a pilot program testing an internal AI tool named “Ask.” It allows AppleCare agents to automatically generate technical support answers by querying Apple’s knowledge base. The goal is faster and more efficient customer service.
BABILong: The new benchmark to assess LLMs for long docs
– The paper uncovers limitations in GPT-4 and RAG, showing reliance on the initial 25% of input. BABILong evaluates GPT-4, RAG, and RMT, revealing that conventional methods are effective for 10^4 elements, while recurrent memory augmentation handles 10^7 elements, thereby setting a new advancement for long doc understanding.
Stanford’s AI model identifies sex from brain scans with 90% accuracy
– Stanford medical researchers have developed an AI model that can identify the sex of individuals from brain scans with 90% accuracy. The model focuses on dynamic MRI scans, identifying specific brain networks to distinguish males and females.
Adobe’s new AI assistant manages documents for you
– Adobe introduced an AI assistant for easier document navigation, answering questions, and summarizing information. It locates key data, generates citations, and formats brief overviews for presentations and emails to save time. Moreover, Adobe introduced CAVA, a new 50-person AI research team focused on inventing new models and processes for AI video creation.
Meta released Aria recordings to fuel smart speech recognition
– The Meta team released a multimodal dataset of two-sided conversations captured by Aria smart glasses. It contains audio, video, motion, and other sensor data. The diverse signals aim to advance speech recognition and translation research for augmented reality interfaces.
AWS adds open-source Mistral AI models to Amazon Bedrock
– AWS announced it will be bringing two of Mistral’s high-performing generative AI models, Mistral 7B and Mixtral 8x7B, to its Amazon Bedrock platform for GenAI offerings in the near future. AWS chose Mistral’s cost-efficient and customizable models to expand the range of GenAI abilities for Bedrock users.
Penn’s AI chip runs on light, not electricity
– Penn engineers developed a new photonic chip that performs complex math for AI. It reduces processing time and energy consumption using light waves instead of electricity. This design uses optical computing principles developed by Penn professor Nader Engheta and nanoscale silicon photonics to train and infer neural networks.
Google launches its first open-source LLM
– Google has open-sourced Gemma, a lightweight yet powerful new family of language models that outperforms larger models on NLP benchmarks but can run on personal devices. The release also includes a Responsible Generative AI Toolkit to assist developers in safely building applications with Gemma, now accessible through Google Cloud, Kaggle, Colab and other platforms.
AnyGPT is a major step towards artificial general intelligence
– Researchers in Shanghai have developed AnyGPT, a groundbreaking new AI model that can understand and generate data across virtually any modality like text, speech, images and music using a unified discrete representation. It achieves strong zero-shot performance comparable to specialized models, representing a major advance towards AGI.
Google launches Gemini for Workspace:
Google has launched Gemini for Workspace, bringing Gemini’s capabilities into apps like Docs and Sheets to enhance productivity. The new offering comes in Business and Enterprise tiers and features AI-powered writing assistance, data analysis, and a chatbot to help accelerate workflows.
Stable Diffusion 3 – A multi-subject prompting text-to-image model
– Stability AI’s Stable Diffusion 3 is generating excitement in the AI community due to its improved text-to-image capabilities, including better prompt adherence and image quality. The early demos have shown remarkable improvements in generation quality, surpassing competitors such as MidJourney, Dall-E 3, and Google ImageFX.
LongRoPE: Extending LLM context window beyond 2 million tokens
– Microsoft’s LongRoPE extends large language models to 2048k tokens, overcoming challenges of high fine-tuning costs and scarcity of long texts. It shows promising results with minor modifications and optimizations.
Google Chrome introduces “Help me write” AI feature
– Google’s “Help me write” is an experimental AI feature on its Chrome browser that offers writing suggestions for short-form content. It highlights important features mentioned on a product page and can be accessed by enabling Chrome’s Experimental AI setting.
Montreal tests AI system to prevent subway suicides
– The Montreal transit authority is testing an AI system that analyzes surveillance footage to detect warning signs of suicide risk among passengers. The system, developed with a local suicide prevention center, can alert staff to intervene and save lives. With current accuracy of 25%, the “promising” pilot could be implemented in two years.
Fast food giants embrace controversial AI worker tracking
– Riley, an AI system by Hoptix, monitors worker-customer interactions in 100+ fast food franchises to incentivize upselling. It tracks metrics like service speed, food waste, and upselling rates. Despite being a coaching tool, concerns exist regarding the imposition of unfair expectations on workers.
And there was more…
– SoftBank’s founder is seeking about $100 billion for an AI chip venture
– ElevenLabs teases a new AI sound effects feature
– NBA commissioner Adam Silver demonstrates NB-AI concept
– Reddit signs AI content licensing deal ahead of IPO
– ChatGPT gets an Android homescreen widget
– YOLOv9 sets a new standard for real-time object recognition
– Mistral quietly released a new model in testing called ‘next’
– Microsoft to invest $2.1 billion for AI infrastructure expansion in Spain
– Graphcore explores sales talk with OpenAI, Softbank, and Arm
– OpenAI’s Sora can craft impressive video collages
– US FTC proposes a prohibition law on AI impersonation
– Meizu bids farewell to the smartphone market; shifts focus on AI
– Microsoft develops server network cards to replace NVIDIA’s cards
– Wipro and IBM team up to accelerate enterprise AI
– Deutsche Telekom revealed an AI-powered app-free phone concept
– Tinder fights back against AI dating scams
– Intel lands a $15 billion deal to make chips for Microsoft
– DeepMind forms new unit to address AI dangers
– Match Group bets on AI to help its workers improve dating apps
– Google Play Store tests AI-powered app recommendations
– Google cut a deal with Reddit for AI training data
– GPT Store introduces linking profiles, ratings, and enhanced ‘About’ pages
– Microsoft introduces a generative erase feature for AI-editing photos in Windows 11
– Suno AI V3 Alpha is redefining music generation
– Jasper acquires image platform Clipdrop from Stability AI

A Daily Chronicle of AI Innovations in February 2024 – Day 24: AI Daily News – February 24th, 2024

Google’s chaotic AI strategy

Google’s AI strategy has resulted in confusion among consumers due to a rapid succession of new products, names, and features, compromising public trust in both AI and Google itself.
The company has launched a bewildering array of AI products with overlapping and inconsistent naming schemes, such as Bard transforming into Gemini, alongside multiple versions of Gemini, complicating user understanding and adoption.
Google’s rushed approach to competing with rivals like OpenAI has led to a chaotic rollout of AI offerings, leaving customers and even its own employees mocking the company’s inability to provide clear and accessible AI solutions.
Source

Filmmaker puts $800 million studio expansion on hold because of OpenAI’s Sora

Tyler Perry paused a $800 million expansion of his Atlanta studio after being influenced by OpenAI’s video AI model Sora, expressing concerns over AI’s impact on the film industry and job losses.
Perry has started utilizing AI in film production to save time and costs, for example, in applying aging makeup, yet warns of the potential job displacement this technology may cause.
The use of AI in Hollywood has led to debates on its implications for jobs, with calls for regulation and fair compensation, highlighted by actions like strikes and protests by SAG-AFTRA members.
Source

Google explains Gemini’s ‘embarrassing’ AI pictures

Google addressed the issue of Gemini AI producing historically inaccurate images, such as racially diverse Nazis, attributing the error to tuning issues within the model.
The problem arose from the AI’s overcompensation in its attempt to show diversity, leading to inappropriate image generation and an overly cautious approach to generating images of specific ethnicities.
Google has paused the image generation feature in Gemini since February 22, with plans to improve its accuracy and address the challenge of AI-generated “hallucinations” before reintroducing the feature.
Source

Apple tests internal ChatGPT-like AI tool for customer support

Apple is conducting internal tests on a new AI tool named “Ask,” designed to enhance the speed and efficiency of technical support provided by AppleCare agents.
The “Ask” tool generates answers to customer technical queries by leveraging Apple’s internal knowledge base, allowing agents to offer accurate, clear, and useful assistance.
Beyond “Ask,” Apple is significantly investing in AI, developing its own large language model framework, “Ajax,” and a chatbot service, “AppleGPT”.
Source

Figure AI’s humanoid robots attract funding from Microsoft, Nvidia, OpenAI, and Jeff Bezos

Jeff Bezos, Nvidia, and other tech giants are investing in Figure AI, a startup developing human-like robots, raising about $675 million at a valuation of roughly $2 billion.
Figure’s robot, named Figure 01, is designed to perform dangerous jobs unsuitable for humans, with the company aiming to address labor shortages.
The investment round, initially seeking $500 million, attracted widespread industry support, including contributions from Microsoft, Amazon-affiliated funds, and venture capital firms, marking a significant push into AI-driven robotics.
Source

A Daily Chronicle of AI Innovations in February 2024 – Day 23: AI Daily News – February 23rd, 2024

Stable Diffusion 3 creates jaw-dropping images from text
LongRoPE: Extending LLM context window beyond 2 million token
Google Chrome introduces “Help me write” AI feature

Jasper acquires image platform Clipdrop from Stability AI

Suno AI V3 Alpha is redefining music generation.

GPT Store introduces linking profiles, ratings, and enhanced about pages.

Microsoft introduces a generative erase feature for AI-editing photos in Windows 11.

Google cut a deal with Reddit for AI training data.

Stable Diffusion 3 creates jaw-dropping text-to-images!

Stability.AI announced the Stable Diffusion 3 in an early preview. It is a text-to-image model with improved performance in multi-subject prompts, image quality, and spelling abilities. Stability.AI has opened the model waitlist and introduced a preview to gather insights before the open release.

Stability AI’s Stable Diffusion 3 preview has generated significant excitement in the AI community due to its superior image and text generation capabilities. This next-generation image tool promises better text generation, strong prompt adherence, and resistance to prompt leaking, ensuring the generated images match the requested prompts.

Why does it matter?

The announcement of Stable Diffusion 3 is a significant development in AI image generation because it introduces a new architecture with advanced features such as the diffusion transformer and flow matching. The early demos of Stable Diffusion 3 have shown remarkable improvements in overall generation quality, surpassing its competitors such as MidJourney, Dall-E 3, and Google ImageFX.

Source

LongRoPE: Extending LLM context window beyond 2 million tokens

Researchers at Microsoft have introduced LongRoPE, a groundbreaking method that extends the context window of pre-trained large language models (LLMs) to an impressive 2048k tokens.

Current extended context windows are limited to around 128k tokens due to high fine-tuning costs, scarcity of long texts, and catastrophic values introduced by new token positions. LongRoPE overcomes these challenges by leveraging two forms of non-uniformities in positional interpolation, introducing a progressive extension strategy, and readjusting the model on shorter context windows.

LongRoPE: Extending LLM context window beyond 2 million tokens

Experiments on LLaMA2 and Mistral across various tasks demonstrate the effectiveness of LongRoPE. The extended models retain the original architecture with minor positional embedding modifications and optimizations.

Why does it matter?

LongRoPE extends the context window in LLMs and opens up possibilities for long-context tasks beyond 2 million tokens. This is the highest supported token, especially when other models like Google Gemini Pro have capabilities of up to 1 million tokens. Another major impact it will have is an extended context window for open-source models, unlike top proprietary models.

Source

Google Chrome introduces “Help me write” AI feature

Google has recently rolled out an experimental AI feature called “Help me write” for its Chrome browser. This feature, powered by Gemini, aims to assist users in writing or refining text based on webpage content. It focuses on providing writing suggestions for short-form content, such as filling in digital surveys and reviews and drafting descriptions for items being sold online.

The tool can understand the webpage’s context and pull relevant information into its suggestions, such as highlighting critical features mentioned on a product page for item reviews. Users can right-click on an open text field on any website to access the feature on Google Chrome.

Google Chrome introduces "Help me write" AI feature — Google Chrome introduces “Help me write” AI feature

This feature is currently only available for English-speaking Chrome users in the US on Mac and Windows PCs. To access this tool, users in the US can enable Chrome’s Experimental AI under the “Try out experimental AI features” setting.

Why does it matter?

Google Chrome’s “Help me write” AI feature can aid users in completing surveys, writing reviews, and drafting product descriptions. However, it is still in its early stages and may not inspire user confidence compared to Microsoft’s Copilote on Edge browser. Adjusting the prompts and resulting text can negate any time-saving benefits, leaving the effectiveness of this feature for Google Chrome users open for debate.

Source

What Else Is Happening in AI on February 23rd, 2024

Google cut a deal with Reddit for AI training data.

Google and Reddit have formed a partnership that will benefit both companies. Google will pay $60 million per year for real-time access to Reddit’s data, while Reddit will gain access to Google’s Vertex AI platform. This will help Google train its AI and ML models at scale while also giving Reddit expanded access to Google’s services. (Link)

GPT Store introduces linking profiles, ratings, and enhanced about pages.

OpenAI’s GPT Store platform has new features. Builders can link their profiles to GitHub and LinkedIn, and users can leave ratings and feedback. The About pages for GPTs have also been enhanced. T (Link)

Microsoft introduces a generative erase feature for AI-editing photos in Windows 11.

Microsoft’s Photos app now has a Generative Erase feature powered by AI. It enables users to remove unwanted elements from their photos, including backgrounds. The AI edit features are currently available to Windows Insiders, and Microsoft plans to roll out the tools to Windows 10 users. However, there is no clarity on whether AI-edited photos will have watermarks or metadata to differentiate them from unedited photos. (Link)

Suno AI V3 Alpha is redefining music generation.

The V3 Alpha version of Suno AI’s music generation platform offers significant improvements, including better audio quality, longer clip length, and expanded language coverage. The update aims to redefine the state-of-the-art for generative music and invites user feedback with 300 free credits given to paying subscribers as a token of appreciation. (Link)

Jasper acquires image platform Clipdrop from Stability AI

Jasper acquires AI image creation and editing platform Clipdrop from Stability AI, expanding its conversational AI toolkit with visual capabilities for a comprehensive multimodal marketing copilot. The Clipdrop team will work in Paris to contribute to research and innovation on multimodality, furthering Jasper’s vision of being the most all-encompassing end-to-end AI assistant for powering personalized marketing and automation. (Link)

A Daily Chronicle of AI Innovations in February 2024 – Day 22: AI Daily News – February 22nd, 2024

Google suspends Gemini from making AI images after backlash

Google has temporarily halted the ability of its Gemini AI to create images of people following criticisms over its generation of historically inaccurate and racially diverse images, such as those of US Founding Fathers and Nazi-era soldiers.
This decision comes shortly after Google issued an apology for the inaccuracies in some of the historical images generated by Gemini, amid backlash and conspiracy theories regarding the depiction of race and gender.
Google plans to improve Gemini’s image generation capabilities concerning people and intends to re-release an enhanced version of this feature in the near future, aiming for more accurate and sensitive representations.
Source

Nvidia posts revenue up 265% on booming AI business

Nvidia’s data center GPU sales soared by 409% due to a significant increase in demand for AI chips, with the company reporting $18.4 billion in revenue for this segment.
The company exceeded Wall Street’s expectations in its fourth-quarter financial results, projecting $24 billion in sales for the current quarter against analysts’ forecasts of $22.17 billion.
Nvidia has become a key player in the AI industry, with massive demand for its GPUs from tech giants and startups alike, spurred by the growth in generative AI applications.
Source

Microsoft and Intel strike a custom chip deal that could be worth billions

Intel will produce custom chips designed by Microsoft in a deal valued over $15 billion, although the specific applications of these chips remain unspecified.
The chips will utilize Intel’s 18A process, marking a significant step in Intel’s strategy to lead in chip manufacturing by offering foundry services for custom chip designs.
Intel’s move to expand its foundry services and collaborate with Microsoft comes amidst challenges, including the delayed opening of a $20 billion chip plant in Ohio.
Source

AI researchers’ open letter demands action on deepfakes before they destroy democracy

An open letter from AI researchers demands government action to combat deepfakes, highlighting their threat to democracy and proposing measures such as criminalizing deepfake child pornography.
The letter warns about the rapid increase of deepfakes, with a 550% rise between 2019 and 2023, detailing that 98% of deepfake videos are pornographic, predominantly victimizing women.
Signatories, including notable figures like Jaron Lanier and Frances Haugen, advocate for the development and dissemination of content authentication methods to distinguish real from manipulated content.
Source

Stability AI’s Stable Diffusion 3 preview boasts superior image and text generation capabilities

Stability AI introduces Stable Diffusion 3, showcasing enhancements in image generation, complex prompt execution, and text-generation capabilities.
The model incorporates the Diffusion Transformer Architecture with Flow Matching, ranging from 800 million to 8 billion parameters, promising a notable advance in AI-driven content creation.
Despite its potential, Stability AI takes rigorous safety measures to mitigate misuse and collaborates with the community, amidst concerns over training data and the ease of modifying open-source models.
Source

Google releases its first open-source LLM

Google has open-sourced Gemma, a new family of state-of-the-art language models available in 2B and 7B parameter sizes. Despite being lightweight enough to run on laptops and desktops, Gemma models have been built with the same technology used for Google’s massive proprietary Gemini models and achieve remarkable performance – the 7B Gemma model outperforms the 13B LLaMA model on many key natural language processing benchmarks.

Alongside the Gemma models, Google has released a Responsible Generative AI Toolkit to assist developers in building safe applications. This includes tools for robust safety classification, debugging model behavior, and implementing best practices for deployment based on Google’s experience. Gemma is available on Google Cloud, Kaggle, Colab, and a few other platforms with incentives like free credits to get started.

AnyGPT: A major step towards artificial general intelligence

Researchers in Shanghai have achieved a breakthrough in AI capabilities with the development of AnyGPT – a new model that can understand and generate data in virtually any modality, including text, speech, images, and music. AnyGPT leverages an innovative discrete representation approach that allows a single underlying language model architecture to smoothly process multiple modalities as inputs and outputs.

The researchers synthesized the AnyInstruct-108k dataset, containing 108,000 samples of multi-turn conversations, to train AnyGPT for these impressive capabilities. Initial experiments show that AnyGPT achieves zero-shot performance comparable to specialized models across various modalities.

Google launches Gemini for Workspace

Google has rebranded its Duet AI for Workspace offering as Gemini for Workspace. This brings the capabilities of Gemini, Google’s most advanced AI model, into Workspace apps like Docs, Sheets, and Slides to help business users be more productive.

The new Gemini add-on comes in two tiers – a Business version for SMBs and an Enterprise version. Both provide AI-powered features like enhanced writing and data analysis, but Enterprise offers more advanced capabilities. Additionally, users get access to a Gemini chatbot to accelerate workflows by answering questions and providing expert advice. This offering pits Google against Microsoft, which has a similar Copilot experience for commercial users.

What Else Is Happening in AI on February 22nd, 2024

Intel lands a $15 billion deal to make chips for Microsoft

Intel will produce over $15 billion worth of custom AI and cloud computing chips designed by Microsoft, using Intel’s cutting-edge 18A manufacturing process. This represents the first major customer for Intel’s foundry services, a key part of CEO Pat Gelsinger’s plan to reestablish the company as an industry leader. (Link)

DeepMind forms new unit to address AI dangers

Google’s DeepMind has created a new AI Safety and Alignment organization, which includes an AGI safety team and other units working to incorporate safeguards into Google’s AI systems. The initial focus is on preventing bad medical advice and bias amplification, though experts believe hallucination issues can never be fully solved. (Link)

Match Group bets on AI to help its workers improve dating apps

Match Group, owner of dating apps like Tinder and Hinge, has signed a deal to use ChatGPT and other AI tools from OpenAI for over 1,000 employees. The AI will help with coding, design, analysis, templates, and communications. All employees using it will undergo training on responsible AI use. (Link)

Fintechs get a new ally against financial crime

Hummingbird, a startup offering tools for financial crime investigations, has launched a new product called Automations. It provides pre-built workflows to help financial investigators automatically gather information on routine crimes like tax evasion, freeing them up to focus on harder cases. Early customer feedback on Automations has been positive. (Link)

Google Play Store tests AI-powered app recommendations

Google is testing a new AI-powered “App Highlights” feature in the Play Store that provides personalized app recommendations based on user preferences and habits. The AI analyzes usage data to suggest relevant, high-quality apps to simplify discovery. (Link)

A Daily Chronicle of AI Innovations in February 2024 – Day 21: AI Daily News – February 21st, 2024

Introducing Gemma by Google – a family of lightweight, state-of-the-art open models for their class

#openmodels 1/n “Gemma open models Gemma is a family of lightweight, state-of-the-art open models built from the same research and technology used to create the Gemini models. Developed by Google DeepMind and other teams across Google, Gemma is inspired by Gemini, and the name reflects the Latin gemma, meaning “precious stone.” Accompanying our model weights, we’re also releasing tools to support developer innovation, foster collaboration, and guide responsible use of Gemma models… Free credits for research and development Gemma is built for the open community of developers and researchers powering AI innovation. You can start working with Gemma today using free access in Kaggle, a free tier for Collab notebooks, and $300 in credits for first-time Google Cloud users. Researchers can also apply for Google Cloud credits of up to $500,000 to accelerate their projects”.

Gemini 1.5 will be ~20x cheaper than GPT4 – this is an existential threat to OpenAI

From what we have seen so far Gemini 1.5 Pro is reasonably competitive with GPT4 in benchmarks, and the 1M context length and in-context learning abilities are astonishing.

What hasn’t been discussed much is pricing. Google hasn’t announced specific number for 1.5 yet but we can make an educated projection based on the paper and pricing for 1.0 Pro.

Google describes 1.5 as highly compute-efficient, in part due to the shift to a soft MoE architecture. I.e. only a small subset of the experts comprising the model need to be inferenced at a given time. This is a major improvement in efficiency from a dense model in Gemini 1.0.

And though it doesn’t specifically discuss architectural decisions for attention the paper mentions related work on deeply sub-quadratic attention mechanisms enabling long context (e.g. Ring Attention) in discussing Gemini’s achievement of 1-10M tokens. So we can infer that inference costs for long context are relatively manageable. And videos of prompts with ~1M context taking a minute to complete strongly suggest that this is the case barring Google throwing an entire TPU pod at inferencing an instance.

Putting this together we can reasonably expect that pricing for 1.5 Pro should be similar to 1.0 Pro. Pricing for 1.0 Pro is $0.000125 / 1K characters.

Compare that to $0.01 / 1K tokens for GPT4-Turbo. Rule of thumb is about 4 characters / token, so that’s $0.0005 for 1.5 Pro vs $0.01 for GPT-4, or a 20x difference in Gemini’s favor.

So Google will be providing a model that is arguably superior to GPT4 overall at a price similar to GPT-3.5.

If OpenAI isn’t able to respond with a better and/or more efficient model soon Google will own the API market, and that is OpenAI’s main revenue stream.

https://ai.google.dev/pricing

https://openai.com/pricing

Adobe’s new AI assistant manages your docs

Adobe launched an AI assistant feature in its Acrobat software to help users navigate documents. It summarizes content, answers questions, and generates formatted overviews. The chatbot aims to save time working with long files and complex information. Additionally, Adobe created a dedicated 50-person AI research team called CAVA (Co-Creation for Audio, Video, & Animation) focused on advancing generative video, animation, and audio creation tools.

While Adobe already has some generative image capabilities, CAVA signals a push into underserved areas like procedurally assisted video editing. The research group will explore integrating Adobe’s existing creative tools with techniques like text-to-video generation. Adobe prioritizes more AI-powered features to boost productivity through faster document understanding or more automated creative workflows.

Why does this matter?

Adobe injecting AI into PDF software and standing up an AI research group signals a strategic push to lead in generative multimedia. Features like summarizing documents offer faster results, while envisaged video/animation creation tools could redefine workflows.

Source

Meta released Aria recordings to fuel smart speech recognition

Meta has released a multi-modal dataset of two-person conversations captured on Aria smart glasses. It contains audio across 7 microphones, video, motion sensors, and annotations. The glasses were worn by one participant while speaking spontaneously with another compensated contributor.

The dataset aims to advance research in areas like speech recognition, speaker ID, and translation for augmented reality interfaces. Its audio, visual, and motion signals together provide a rich capture of natural talking that could help train AI models. Such in-context glasses conversations can enable closed captioning and real-time language translation.

Why does this matter?

By capturing real-world sensory signals from glasses-framed conversations, Meta bridges the gaps AI faces to achieve human judgment. Enterprises stand to gain more relatable, trustworthy AI helpers that feel less robotic and more attuned to nuances when engaging customers or executives.

Source

Penn’s AI chip runs on light, not electricity

Penn engineers have developed a photonic chip that uses light waves for complex mathematics. It combines optical computing research by Professor Nader Engheta with nanoscale silicon photonics technology pioneered by Professor Firooz Aflatouni. With this unified platform, neural networks can be trained and inferred faster than ever.

It allows accelerated AI computations with low power consumption and high performance. The design is ready for commercial production, including integration into graphics cards for AI development. Additional advantages include parallel processing without sensitive data storage. The development of this photonic chip represents significant progress for AI by overcoming conventional electronic limitations.

Why does this matter?

Artificial intelligence chips enable accelerated training and inference for new data insights, new products, and even new business models. Businesses that upgrade key AI infrastructure like GPUs with photonic add-ons will be able to develop algorithms with significantly improved accuracy. With processing at light speed, enterprises have an opportunity to avoid slowdowns by evolving along with light-based AI.

Source

What Else Is Happening in AI on February 21st, 2024

Brain chip: Neuralink patient moves mouse with thoughts

Elon Musk announced that the first human to receive a Neuralink brain chip has recovered successfully. The patient can now move a computer mouse cursor on a screen just by thinking, showing the chip’s ability to read brain signals and control external devices. (Link)

Microsoft develops server network cards to replace NVIDIA

Microsoft is developing its own networking cards. These cards move data quickly between servers, seeking to reduce reliance on NVIDIA’s cards and lower costs. Microsoft hopes its new server cards will boost the performance of the NVIDIA chip server currently in use and its own Maia AI chips. (Link)

Wipro and IBM team up to accelerate enterprise AI

Wipro and IBM are expanding their partnership, introducing the Wipro Enterprise AI-Ready Platform. Using IBM Watsonx AI, clients can create fully integrated AI environments. This platform provides tools, language models, streamlined processes, and governance, focusing on industry-specific solutions to advance enterprise-level AI. (Link)

Telekom’s next big thing: an app-free AI Phone

Deutsche Telekom revealed an AI-powered app-free phone concept at MWC 2024, featuring a digital assistant that can fulfill daily tasks via voice and text. Created in partnership with Qualcomm and Brain.ai, the concierge-style interface aims to simplify life by anticipating user needs contextually using generative AI. (Link)

Tinder fights back against AI dating scams

Tinder is expanding ID verification, requiring a driver’s license and video selfie to combat rising AI-powered scams and dating crimes. The new safeguards aim to build trust, authenticity, and safety, addressing issues like pig butchering schemes using AI-generated images to trick victims. (Link)

Google launches two new AI models

Google has unveiled Gemma 2B and 7B, two new open-source AI models derived from its larger Gemini model, aiming to provide developers more freedom for smaller applications such as simple chatbots or summarizations.
Gemma models, despite being smaller, are designed to be efficient and cost-effective, boasting significant performance on key benchmarks which allows them to run on personal computing devices.
Unlike the closed Gemini model, Gemma is open source, making it accessible for a wider range of experimentation and development, and comes with a ‘responsible AI toolkit’ to help manage its open nature.

ChatGPT has meltdown and starts sending alarming messages to users

ChatGPT has started malfunctioning, producing incoherent responses, mixing Spanish and English without prompt, and unsettling users by implying physical presence in their environment.
The cause of the malfunction remains unclear, though OpenAI acknowledges the issue and is actively monitoring the situation, as evidenced by user-reported anomalies and official statements on their status page.
Some users speculate that the erratic behavior may relate to the “temperature” setting of ChatGPT, which affects its creativity and focus, noting previous instances where ChatGPT’s responses became unexpectedly lazy or sassy.

An Apple smart ring may be imminent

After years of research and filing several patent applications, Apple is reportedly close to launching a smart ring, spurred by Samsung’s tease of its own smart ring.
The global smart ring market is expected to grow significantly, from $20 million in 2023 to almost $200 million by 2031, highlighting potential interest in health-monitoring wearable tech.
Despite the lack of credible rumors or leaks, the number of patents filed by Apple suggests its smart ring development is advanced.

New hack clones fingerprints by listening to fingers swipe screens

Researchers from the US and China developed a method, called PrintListener, to recreate fingerprints from the sound of swiping on a touchscreen, posing a risk to biometric security systems.
PrintListener can achieve partial and full fingerprint reconstruction from fingertip friction sounds, with success rates of 27.9% and 9.3% respectively, demonstrating the technique’s potential threat.
To mitigate risks, suggested countermeasures include using specialized screen protectors or altering interaction with screens, amid concerns over fingerprint biometrics market’s projected growth to $75 billion by 2032.

iMessage gets major update ahead of ‘quantum apocalypse’

Apple is launching a significant security update in iMessage to protect against the potential threat of quantum computing, termed the “quantum apocalypse.”
The update, known as PQ3, aims to secure iMessage conversations against both classical and quantum computing threats by redefining encryption protocols.
Other companies, like Google, are also updating their security measures in anticipation of quantum computing challenges, with efforts being coordinated by the US National Institute of Standards and Technology (NIST).

A Daily Chronicle of AI Innovations in February 2024 – Day 20: AI Daily News – February 20th, 2024

Sora Explained in Layman terms

Sora, an AI model, combines Transformer techniques, which power language models like GPT, with diffusion techniques to predict words and generate sentences and to predict colors and transform fuzzy canvases into coherent images, respectively.
When a text prompt is inputted into Sora, it first employs a Transformer to extrapolate a more detailed video script from the given prompt. This script includes specific details such as camera angles, textures, and animations inferred from the text.
The generated video script is then passed to the diffusion side of Sora, where the actual video output is created. Historically, diffusion was only capable of producing images, but Sora overcame this limitation by introducing a new technique called SpaceTime patches.
SpaceTime patches act as an intermediary step between the Transformer and diffusion processes. They essentially break down the video into smaller pieces and analyze the pixel changes within each patch to learn about animation and physics.
While computers don’t truly understand motion, they excel at predicting patterns, such as changes in pixel colors across frames. Sora was pre-trained to understand the animation of falling objects by learning from various videos depicting downward motion.
By leveraging SpaceTime patches and diffusion, Sora can predict and apply the necessary color changes to transform a fuzzy video into the desired output. This approach is highly flexible and can accommodate videos of any format, making Sora a versatile and powerful tool for video production.

Sora’s ability to seamlessly integrate Transformer and diffusion techniques, along with its innovative use of SpaceTime patches, allows it to effectively translate text prompts into captivating and visually stunning videos. This remarkable AI creation has truly revolutionized the world of video production.

Groq’s New AI Chip Outperforms ChatGPT

Groq has developed a special AI hardware known as the first-ever Language Processing Unit (LPU) that aims to increase the processing power of current AI models that normally work on GPU. These LPUs can process up to 500 tokens/second, far superior to Gemini Pro and ChatGPT-3.5, which can only process between 30 and 50 tokens/second.

The company has designed its first-ever LPU-based AI chip named “GroqChip,” which uses a “tensor streaming architecture” that is less complex than traditional GPUs, enabling lower latency and higher throughput. This makes the chip a suitable candidate for real-time AI applications such as live-streaming sports or gaming.

Why does it matter?

Groq’s AI chip is the first-ever chip of its kind designed in the LPU system category. The LPUs developed by Groq can improve the deployment of AI applications and could present an alternative to Nvidia’s A100 and H100 chips, which are in high demand but have massive shortages in supply. It also signifies advancements in hardware technology specifically tailored for AI tasks. Lastly, it could stimulate further research and investment in AI chip design.

Source

BABILong: The new benchmark to assess LLMs for long docs

The research paper delves into the limitations of current generative transformer models like GPT-4 when tasked with processing lengthy documents. It identifies a significant GPT-4 and RAG dependency on the initial 25% of input, indicating potential for enhancement. To address this, the authors propose leveraging recurrent memory augmentation within the transformer model to achieve superior performance.

Introducing a new benchmark called BABILong (Benchmark for Artificial Intelligence for Long-context evaluation), the study evaluates GPT-4, RAG, and RMT (Recurrent Memory Transformer). Results demonstrate that conventional methods prove effective only for sequences up to 10^4 elements, while fine-tuning GPT-2 with recurrent memory augmentations enables handling tasks involving up to 10^7 elements, highlighting its significant advantage.

BABILong: The new benchmark to assess LLMs for long docs

Why does it matter?

The recurrent memory allows AI researchers and enthusiasts to overcome the limitations of current LLMs and RAG systems. Also, the BABILong benchmark will help in future studies, encouraging innovation towards a more comprehensive understanding of lengthy sequences.

Source

Standford’s AI model identifies sex from brain scans with 90% accuracy

Standford medical researchers have developed a new-age AI model that determines the sex of individuals based on brain scans, with over 90% success. The AI model focuses on dynamic MRI scans, identifying specific brain networks—such as the default mode, striatum, and limbic networks—as critical in distinguishing male from female brains.

Why does it matter?

Over the years, there has been a constant debate in the medical field and neuroscience about whether sex differences in brain organization exist. AI has hopefully ended the debate once and for all. The research acknowledges that sex differences in brain organization are vital for developing targeted treatments for neuropsychiatric conditions, paving the way for a personalized medicine approach.

Source

What Else Is Happening in AI on February 20th, 2024

Microsoft to invest $2.1 billion for AI infrastructure expansion in Spain.

Microsoft Vice Chair and President Brad Smith announced on X that they will expand their AI and cloud computing infrastructure in Spain via a $2.1 billion investment in the next two years. This announcement follows the $3.45 billion investment in Germany for the AI infrastructure, showing the priority of the tech giant in the AI space. (Link)

Graphcore explores sales talk with OpenAI, Softbank, and Arm.

The British AI chipmaker and NVIDIA competitor Graphcore is struggling to raise funding from investors and is seeking a $500 billion deal with potential purchasers like OpenAI, Softbank, and Arm. This move comes despite raising $700 million from investors Microsoft and Sequoia, which are valued at $2.8 billion as of late 2020. (Link)

OpenAI’s Sora can craft impressive video collages

One of OpenAI’s employees, Bill Peebles, demonstrated Sora’s (the new text-to-video generator from OpenAI) prowess in generating multiple videos simultaneously. He shared the demonstration via a post on X, showcasing five different angles of the same video and how Sora stitched those together to craft an impressive video collage while keeping quality intact. (Link)

US FTC proposes a prohibition law on AI impersonation

The US Federal Trade Commission (FTC) proposed a rule prohibiting AI impersonation of individuals. The rule was already in place for US governments and US businesses. Now, it has been extended to individuals to protect their privacy and reduce fraud activities through the medium of technology, as we have seen with the emergence of AI-generated deep fakes. (Link)

Meizu bid farewell to the smartphone market; shifts focus on AI

Meizu, a China-based consumer electronics brand, has decided to exit the smartphone manufacturing market after 17 years in the industry. The move comes after the company shifted its focus to AI with the ‘All-in-AI’ campaign. Meizu is working on an AI-based operating system, which will be released later this year, and a hardware terminal for all LLMs. (Link)

Groq has created the world’s fastest AI

Groq, a startup, has developed special AI hardware called “Language Processing Unit” (LPU) to run language models, achieving speeds of up to 500 tokens per second, significantly outpacing current LLMs like Gemini Pro and GPT-3.5.
The “GroqChip,” utilizing a tensor streaming architecture, offers improved performance, efficiency, and accuracy for real-time AI applications by ensuring constant latency and throughput.
While LPUs provide a fast and energy-efficient alternative for AI inference tasks, training AI models still requires traditional GPUs, with Groq offering hardware sales and a cloud API for integration into AI projects.

Mistral’s next LLM could rival GPT-4, and you can try it now

Mistral, a French AI startup, has launched its latest language model, “Mistral Next,” which is available for testing in chatbot arenas and might rival GPT-4 in capabilities.
The new model is classified as “Large,” suggesting it is the startup’s most extensive model to date, aiming to compete with OpenAI’s GPT-4, and has received positive feedback from early testers on the “X” platform.
Mistral AI has gained recognition in the open-source community for its Mixtral 8x7B language model, designed similarly to GPT-4, and recently secured €385 million in funding from notable venture capital firms.
Source

Neuralink’s first human patient controls mouse with thoughts

Neuralink’s first human patient, implanted with the company’s N1 brain chip, can now control a mouse cursor with their thoughts following a successful procedure.
Elon Musk, CEO of Neuralink, announced the patient has fully recovered without any adverse effects and is working towards achieving the ability to click the mouse telepathically.
Neuralink aims to enable individuals, particularly those with quadriplegia or ALS, to operate computers using their minds, using a chip that is both powerful and designed to be cosmetically invisible.
Source

Adobe launches AI assistant that can search and summarize PDFs

Adobe introduced an AI assistant in its Reader and Acrobat applications that can generate summaries, answer questions, and provide suggestions on PDFs and other documents, aiming to streamline information digestion.
The AI assistant, presently in beta phase, is integrated directly into Acrobat with imminent availability in Reader, and Adobe intends to introduce a paid subscription model for the tool post-beta.
Adobe’s AI assistant distinguishes itself by being a built-in feature that can produce overviews, assist with conversational queries, generate verifiable citations, and facilitate content creation for various formats without the need for uploading PDFs.
Source

LockBit ransomware group taken down in multinational operation

LockBit’s website was seized and its operations disrupted by a joint task force including the FBI and NCA under “Operation Cronos,” impacting the group’s ransomware activities and dark web presence.
The operation led to the seizure of LockBit’s administration environment and leak site, with plans to use the platform to expose the operations and capabilities of LockBit through information bulletins.
A PHP exploit deployed by the FBI played a significant role in undermining LockBit’s operations, according to statements from law enforcement and the group’s supposed ringleader, with the operation also resulting in charges against two Russian nationals.

A Daily Chronicle of AI Innovations in February 2024 – Day 19: AI Daily News – February 19th, 2024

NVIDIA’s new dataset sharpens LLMs in math

NVIDIA has released OpenMathInstruct-1, an open-source math instruction tuning dataset with 1.8M problem-solution pairs. OpenMathInstruct-1 is a high-quality, synthetically generated dataset 4x bigger than previous ones and does NOT use GPT-4. The dataset is constructed by synthesizing code-interpreter solutions for GSM8K and MATH, two popular math reasoning benchmarks, using the Mixtral model.

The best model, OpenMath-CodeLlama-70B, trained on a subset of OpenMathInstruct-1, achieves a score of 84.6% on GSM8K and 50.7% on MATH, which is competitive with the best gpt-distilled models.

Why does this matter?

The dataset improves open-source LLMs for math, bridging the gap with closed-source models. It also uses better-licensed models, such as from Mistral AI. It is likely to impact AI research significantly, fostering advancements in LLMs’ mathematical reasoning through open-source collaboration.

Source

Apple is working on AI updates to Spotlight and Xcode

Apple has expanded internal testing of new generative AI features for its Xcode programming software and plans to release them to third-party developers this year.

Furthermore, it is looking at potential uses for generative AI in consumer-facing products, like automatic playlist creation in Apple Music, slideshows in Keynote, or Spotlight search. AI chatbot-like search features for Spotlight could let iOS and macOS users make natural language requests, like with ChatGPT, to get weather reports or operate features deep within apps.

Why does this matter?

Apple’s statements about generative AI have been conservative compared to its counterparts. But AI updates to Xcode hint at giving competition to Microsoft’s GitHub Copilot. Apple has also released MLX to train AI models on Apple silicon chips easily, a text-to-image editing AI MGIE, and AI animator Keyframer.

Source

Google open-sources Magika, its AI-powered file-type identifier

Google has open-sourced Magika, its AI-powered file-type identification system, to help others accurately detect binary and textual file types. Magika employs a custom, highly optimized deep-learning model, enabling precise file identification within milliseconds, even when running on a CPU.

Magika, thanks to its AI model and large training dataset, is able to outperform other existing tools by about 20%. It has greater performance gains on textual files, including code files and configuration files that other tools can struggle with.

Google open-sources Magika, its AI-powered file-type identifier

Internally, Magika is used at scale to help improve Google users’ safety by routing Gmail, Drive, and Safe Browsing files to the proper security and content policy scanners.

Why does this matter?

Today, web browsers, code editors, and countless other software rely on file-type detection to decide how to properly render a file. Accurate identification is notoriously difficult because each file format has a different structure or no structure at all. Magika ditches current tedious and error-prone methods for robust and faster AI. It improves security with resilience to ever-evolving threats, enhancing software’s user safety and functionality.

SoftBank to build a $100B AI chip venture

SoftBank’s Masayoshi Son is seeking $100 billion to create a new AI chip venture, aiming to compete with industry leader Nvidia.
The new venture, named Izanagi, will collaborate with Arm, a company SoftBank spun out but still owns about 90% of, to enter the AI chip market.
SoftBank plans to raise $70 billion of the venture’s funding from Middle Eastern institutional investors, contributing the remaining $30 billion itself.

Reddit has a new AI training deal to sell user content

Reddit has entered into a $60 million annual contract with a large AI company to allow the use of its social media platform’s content for AI training as it prepares for a potential IPO.
The deal could set a precedent for similar future agreements and is part of Reddit’s efforts to leverage AI technology to attract investors for its advised $5 billion IPO valuation.
Reddit’s revenue increased to more than $800 million last year, showing a 20% growth from 2022, as the company moves closer to launching its IPO, possibly as early as next month.

Air Canada chatbot promised a discount. Now the airline has to pay it.

A British Columbia resident was misled by an Air Canada chatbot into believing he would receive a discount under the airline’s bereavement policy for a last-minute flight booked due to a family tragedy.
Air Canada argued that the chatbot was a separate legal entity and not responsible for providing incorrect information about its bereavement policy, which led to a dispute over accountability.
The Canadian civil-resolutions tribunal ruled in favor of the customer, emphasizing that Air Canada is responsible for all information provided on its website, including that from a chatbot.

Apple faces €500m fine from EU over Spotify complaint

Apple is facing a reported $539 million fine as a result of an EU investigation into Spotify’s antitrust complaint, which alleges Apple’s policies restrict competition by preventing apps from offering cheaper alternatives to its music service.
The fine originates from Spotify’s 2019 complaint about Apple’s App Store policies, specifically the restriction on developers linking to their own subscription services, a policy Apple modified in 2022 following regulatory feedback from Japan.
While the fine amounts to $539 million, discussions initially suggested Apple could face penalties nearing $40 billion, highlighting a significant reduction from the potential maximum based on Apple’s global annual turnover.

What Else Is Happening in AI on February 19th, 2024

SoftBank’s founder is seeking about $100 billion for an AI chip venture.

SoftBank’s founder, Masayoshi Son, envisions creating a company that can complement the chip design unit Arm Holdings Plc. The AI chip venture is code-named Izanag and will allow him to build an AI chip powerhouse, competing with Nvidia and supplying semiconductors essential for AI. (Link)

ElevenLabs teases a new AI sound effects feature.

The popular AI voice startup teased a new feature allowing users to generate sounds via text prompts. It showcased the outputs of this feature with OpenAI’s Sora demos on X. (Link)

NBA commissioner Adam Silver demonstrates NB-AI concept.

Adam Silver demoed a potential future for how NBA fans will use AI to watch basketball action. The proposed interface is named NB-AI and was unveiled at the league’s Tech Summit on Friday. Check out the demo here! (Link)

Reddit signs AI content licensing deal ahead of IPO.

Reddit Inc. has signed a contract allowing a company to train its AI models on its content. Reddit told prospective investors in its IPO that it had signed the deal, worth about $60 million on an annualized basis, earlier this year. This deal with an unnamed large AI company could be a model for future contracts of similar nature. (Link)

Mistral quietly released a new model in testing called ‘next’.

Early users testing the model are reporting capabilities that meet or surpass GPT-4. A user writes, ‘it bests gpt-4 at reasoning and has mistral’s characteristic conciseness’. It could be a milestone in open source if early tests hold up. (Link)

A Daily Chronicle of AI Innovations in February 2024 – Day 14: AI Daily News – February 14th, 2024

Nvidia launches offline AI chatbot trainable on local data

NVIDIA has released Chat with RTX, a new tool allowing users to create customized AI chatbots powered by their own local data on Windows PCs equipped with GeForce RTX GPUs. Users can rapidly build chatbots that provide quick, relevant answers to queries by connecting the software to files, videos, and other personal content stored locally on their devices.

Features of Chat with RTX include support for multiple data formats (text, PDFs, video, etc.), access to LLM like Mistral, running offline for privacy, and fast performance via RTX GPUs. From personalized recommendations based on influencing videos to extracting answers from personal notes or archives, there are many potential applications.

Why does this matter?

OpenAI and its cloud-based approach now face fresh competition from this Nvidia offering as it lets solopreneurs develop more tailored workflows. It shows how AI can become more personalized, controllable, and accessible right on local devices. Instead of relying solely on generic cloud services, businesses can now customize chatbots with confidential data for targeted assistance.

Source

ChatGPT can now remember conversations

OpenAI is testing a memory capability for ChatGPT to recall details from past conversations to provide more helpful and personalized responses. Users can explicitly tell ChatGPT what memories to remember or delete conversationally or via settings. Over time, ChatGPT will provide increasingly relevant suggestions based on users preferences, so they don’t have to repeat them.

This feature is rolled out to only a few Free and Plus users and OpenAI will share broader plans soon. OpenAI also states memories bring added privacy considerations, so sensitive data won’t be proactively retained without permission.

Why does this matter?

ChatGPT’s memory feature allows for more personalized, contextually-aware interactions. Its ability to recall specifics from entire conversations brings AI assistants one step closer to feeling like cooperative partners, not just neutral tools. For companies, remembering user preferences increases efficiency, while individuals may find improved relationships with AI companions.

Source

Cohere launches open-source LLM in 101 languages

Cohere has launched Aya, a new open-source LLM supporting 101 languages, over twice as many as existing models support. Backed by the large dataset covering lesser resourced languages, Aya aims to unlock AI potential for overlooked cultures. Benchmarking shows Aya significantly outperforms other open-source massively multilingual models.

The release tackles the data scarcity outside of English training content that limits AI progress. By providing rare non-English fine-tuning demonstrations, it enables customization in 50+ previously unsupported languages. Experts emphasize that Aya represents a crucial step toward preserving linguistic diversity.

Why does this matter?

With over 100 languages supported, more communities globally can benefit from generative models tailored to their cultural contexts. It also signifies an ethical shift: recognizing AI’s real-world impact requires serving people inclusively. Models like Aya, trained on diverse data, inch us toward AI that can help everyone.

Source

Zuckerberg says Quest 3 is better than Vision Pro in every way

Mark Zuckerberg, CEO of Meta, stated on Instagram that he believes the Quest 3 headset is not only a better value but also a superior product compared to Apple’s Vision Pro.
Zuckerberg emphasized the Quest 3’s advantages over the Vision Pro, including its lighter weight, lack of a wired battery pack for greater motion, a wider field of view, and a more immersive content library.
While acknowledging the Vision Pro’s strength as an entertainment device, Zuckerberg highlighted the Quest 3’s significant cost benefit, being “like seven times less expensive” than the Vision Pro.

Slack is getting a major Gen AI boost

Slack is introducing AI features allowing for summaries of threads, channel recaps, and the answering of work-related questions, initially available as a paid add-on for Slack Enterprise users.
The AI tool enables summarization of unread messages or messages from a specified timeframe and allows users to ask questions about workplace projects or policies based on previous Slack messages.
Slack is expanding its AI capabilities to integrate with other applications, summarizing external documents and building a new digest feature to highlight important messages, with a focus on keeping customer data private and siloed.

Microsoft and OpenAI claim hackers are using generative AI to improve cyberattacks

Russia, China, and other nations are leveraging the latest artificial intelligence tools to enhance hacking capabilities and identify new espionage targets, based on a report from Microsoft and OpenAI.
The report highlights the association of AI use with specific hacking groups from China, Russia, Iran, and North Korea, marking a first in identifying such ties to government-sponsored cyber activities.
Microsoft has taken steps to block these groups’ access to AI tools like OpenAI’s ChatGPT, aiming to curb their ability to conduct espionage and cyberattacks, despite challenges in completely stopping such activities.

Apple researchers unveil ‘Keyframer’, a new AI tool

Apple researchers have introduced “Keyframer,” an AI tool using large language models (LLMs) to animate still images with natural language prompts.
“Keyframer” can generate CSS animation code from text prompts and allows users to refine animations by editing the code or adding prompts, enhancing the creative process.
The tool aims to democratize animation, making it accessible to non-experts and indicating a shift towards AI-assisted creative processes in various industries.

Sam Altman at WGS on GPT-5: “The thing that will really matter: It’s gonna be smarter.” The Holy Grail.

we’re moving from memory to reason. logic and reasoning are the foundation of both human and artificial intelligence. it’s about figuring things out. our ai engineers and entrepreneurs finally get this! stronger logic and reasoning algorithms will easily solve alignment and hallucinations for us. but that’s just the beginning.

logic and reasoning tell us that we human beings value three things above all; happiness, health and goodness. this is what our life is most about. this is what we most want for the people we love and care about.

so, yes, ais will be making amazing discoveries in science and medicine over these next few years because of their much stronger logic and reasoning algorithms. much smarter ais endowed with much stronger logic and reasoning algorithms will make us humans much more productive, generating trillions of dollars in new wealth over the next 6 years. we will end poverty, end factory farming, stop aborting as many lives each year as die of all other cause combined, and reverse climate change.

but our greatest achievement, and we can do this in a few years rather than in a few decades, is to make everyone on the planet much happier and much healthier, and a much better person. superlogical ais will teach us how to evolve into what will essentially be a new human species. it will develop safe pharmaceuticals that make us much happier, and much kinder. it will create medicines that not only cure, but also prevent, diseases like cancer. it will allow us all to live much longer, healthier lives. ais will create a paradise for everyone on the planet. and it won’t take longer than 10 years for all of this to happen.

what it may not do, simply because it probably won’t be necessary, is make us all much smarter. it will be doing all of our deepest thinking for us, freeing us to enjoy our lives like never before. we humans are hardwired to seek pleasure and avoid pain. most fundamentally that is who we are. we’re almost there.

https://www.youtube.com/live/RikVztHFUQ8?si=GwKFWipXfTytrhD4

OpenAI and Microsoft Disrupt Malicious AI Use by State-Affiliated Threat Actors

OpenAI and Microsoft have teamed up to identify and disrupt operations of five state-affiliated malicious groups using AI for cyber threats, aiming to secure digital ecosystems and promote AI safety.

https://www.dagens.com/news/openai-and-microsoft-disrupt-malicious-ai-use-by-state-affiliated-threat-actors

OpenAI is jumping into one of the hottest areas of artificial intelligence: autonomous agents.

Microsoft-backed OpenAI is working on a type of agent software to automate complex tasks by taking over a users’ device, The Information reported on Wednesday, citing a person with knowledge on the matter. The agent software will handle web-based tasks such as gathering public data about a set of companies, creating itineraries or booking flight tickets, according to the report. The new assistants – often called “agents” – promise to perform more complex personal and work tasks when commanded to by a human, without needing close supervision.

https://www.reuters.com/technology/openai-developing-software-that-operates-devices-automates-tasks-information-2024-02-07/

Source

What Else Is Happening in AI on February 14th, 2024

Nous Research released 1M-Entry 70B Llama-2 model with advanced steerability

Nous Research has released its largest model yet – Nous Hermes 2 Llama-2 70B – trained on over 1 million entries of primarily synthetic GPT-4 generated data. The model uses a more structured ChatML prompt format compatible with OpenAI, enabling advanced multi-turn chat dialogues. (Link)

Otter launches AI meeting buddy that can catch up on meetings

Otter has introduced a new feature for its AI chatbot to query past transcripts, in-channel team conversations, and auto-generated overviews. This AI suite aims to outperform and replace competitors’ paid offerings like Microsoft, Zoom and Google by simplifying recall and productivity for users leveraging Otter’s complete meeting data. (Link)

OpenAI CEO forecasts smarter multitasking GPT-5

At the World Government Summit, OpenAI CEO Sam Altman remarked that the upcoming GPT-5 model will be smarter, faster, more multimodal, and better at everything across the board due to its generality. There are rumors that GPT-5 could be a multimodal AI called “Gobi” slated for release in spring 2024 after training on a massive dataset. (Link)

ElevenLabs announced expansion for its speech to speech in 29 languages

ElevenLabs’s Speech to Speech is now available in 29 languages, making it multilingual. The tool, launched in November, lets users transform their voice into another character with full control over emotions, timing, and delivery by prompting alone. This update just made it more inclusive! (Link)

Airbnb plans to build ‘most innovative AI interfaces ever

Airbnb plans to leverage AI, including its recent acquisition of stealth startup GamePlanner, to evolve its interface into an adaptive “ultimate concierge”. Airbnb executives believe the generative models themselves are underutilized and want to focus on improving the AI application layer to deliver more personalized, cross-category services. (Link)

A Daily Chronicle of AI Innovations in February 2024 – Day 13: AI Daily News – February 13th, 2024

How LLMs are built?

ChatGPT adds ability to remember things you discussed. Rolling out now to a small portion of users

NVIDIA CEO says computers will pass any test a human can within 6 years

NVIDIA CEO Jensen Huang says computers will pass any test a human can by the end of this decade pic.twitter.com/nThVio1wwq
— Tsarathustra (@tsarnick) February 3, 2024

More Agents = More Performance: Tencent Research

The Tencent Research Team has released a paper claiming that the performance of language models can be significantly improved by simply increasing the number of agents. The researchers use a “sampling-and-voting” method in which the input task is fed multiple times into a language model with multiple language model agents to produce results. After that, majority voting is applied to these answers to determine the final answer.

The researchers prove this methodology by experimenting with different datasets and tasks, showing that the performance of language models increases with the size of the ensemble, i.e., with the number of agents (results below). They also established that even smaller LLMs can match/outperform their larger counterparts by scaling the number of agents. (Example below)

Why does it matter?

Using multiple agents to boost LLM performance is a fresh tactic to tackle single models’ inherent limitations and biases. This method eliminates the need for complicated methods such as chain-of-thought prompting. While it is not a silver bullet, it can be combined with existing complicated methods that stimulate the potential of LLMs and enhance them to achieve further performance improvements.

Source

Google DeepMind’s MC-ViT understands long-context video

Researchers from Google DeepMind and the University of Cornell have combined to develop a method allowing AI-based systems to understand longer videos better. Currently, most AI-based models can comprehend videos for up to a short duration due to the complexity and computing power.

That’s where MC-ViT aims to make a difference, as it can store a compressed “memory” of past video segments, allowing the model to reference past events efficiently. Human memory consolidation theories inspire this method by combining neuroscience and psychology. The MC-ViT method provides state-of-the-art action recognition and question answering despite using fewer resources.

Why does it matter?

Most video encoders based on transformers struggle with processing long sequences due to their complex nature. Efforts to address this often add complexity and slow things down. MC-ViT offers a simpler way to handle longer videos without major architectural changes.

Source

ElevenLabs lets you turn your voice into passive income

ElevenLabs has developed an AI voice cloning model that allows you to turn your voice into passive income. Users must sign up for their “Voice Actor Payouts” program.

After creating the account, upload a 30-minute audio of your voice. The cloning model will create your professional voice clone with AI that resembles your original voice. You can then share it in Voice Library to make it available to the growing community of ElevenLabs.

After that, whenever someone uses your professional voice clone, you will get a cash or character reward according to your requirements. You can also decide on a rate for your voice usage by opting for a standard royalty program or setting a custom rate.

Why does it matter?

By leveraging ElevenLabs’ AI voice cloning, users can potentially monetize their voices in various ways, such as providing narration for audiobooks, voicing virtual assistants, or even lending their voices to advertising campaigns. This innovation democratizes the field of voice acting, making it accessible to a broader audience beyond professional actors and voiceover artists. Additionally, it reflects the growing influence of AI in reshaping traditional industries.

Source

What Else Is Happening in AI on February 13th, 2024

NVIDIA CEO Jensen Huang advocates for each country’s sovereign AI

While speaking at the World Governments Summit in Dubai, the NVIDIA CEO strongly advocated the need for sovereign AI. He said, “Every country needs to own the production of their own intelligence.” He further added, “It codifies your culture, your society’s intelligence, your common sense, your history – you own your own data.” (Link)

Google to invest €25 million in Europe to uplift AI skills

Google has pledged 25 million euros to help the people of Europe learn how to use AI. With this funding, Google wants to develop various social enterprise and nonprofit applications. The tech giant is also looking to run “growth academies” to support companies using AI to scale their companies and has expanded its free online AI training courses to 18 languages. (Link)

NVIDIA surpasses Amazon in market value

NVIDIA Corp. briefly surpassed Amazon.com Inc. in market value on Monday. Nvidia rose almost 0.2%, closing with a market value of about $1.78 trillion. While Amazon fell 1.2%, it ended with a closing valuation of $1.79 trillion. With this market value, NVIDIA Corp. temporarily became the 4th most valuable US-listed company behind Alphabet, Microsoft, and Apple. (Link)

Microsoft might develop an AI upscaling feature for Windows 11

Microsoft may release an AI upscaling feature for PC gaming on Windows 11, similar to Nvidia’s Deep Learning Super Sampling (DLSS) technology. The “Automatic Super Resolution” feature, which an X user spotted in the latest test version of Windows 11, uses AI to improve supported games’ frame rates and image detail. Microsoft is yet to announce the news or hardware specifics, if any. (Link)

Fandom rolls out controversial generative AI features

Fandom hosts wikis for many fandoms and has rolled out many generative AI features. However, some features like “Quick Answers” have sparked a controversy. Quick Answers generates a Q&A-style dropdown that distills information into a bite-sized sentence. Wiki creators have complained that it answers fan questions inaccurately, thereby hampering user trust. (Link)

Sam Altman warns that ‘societal misalignments’ could make AI dangerous

OpenAI CEO Sam Altman expressed concerns at the World Governments Summit about the potential for ‘societal misalignments’ caused by artificial intelligence, emphasizing the need for international oversight similar to the International Atomic Energy Agency.
Altman highlighted the importance of not focusing solely on the dramatic scenarios like killer robots but on the subtle ways AI could unintentionally cause societal harm, advocating for regulatory measures not led by the AI industry itself.
Despite the challenges, Altman remains optimistic about the future of AI, comparing its current state to the early days of mobile technology, and anticipates significant advancements and improvements in the coming years.
Source

SpaceX plans to deorbit 100 Starlink satellites due to potential flaw

SpaceX plans to deorbit 100 first-generation Starlink satellites due to a potential flaw to prevent them from failing, with the process designed to ensure they burn up safely in the Earth’s atmosphere without posing a risk.
The deorbiting operation will not impact Starlink customers, as the network still has over 5,400 operational satellites, demonstrating SpaceX’s dedication to space sustainability and minimizing orbital hazards.
SpaceX has implemented an ‘autonomous collision avoidance’ system and ion thrusters in its satellites for maneuverability, and has a policy of deorbiting satellites within five years or less to avoid becoming a space risk, with 406 satellites already deorbited.

Nvidia unveils tool for running GenAI on PCs

Nvidia is releasing a tool named “Chat with RTX” that enables owners of GeForce RTX 30 Series and 40 Series graphics cards to run an AI-powered chatbot offline on Windows PCs.
“Chat with RTX” allows customization of GenAI models with personal documents for querying, supporting multiple text formats and even YouTube playlist transcriptions.
Despite its limitations, such as inability to remember context and variable response relevance, “Chat with RTX” represents a growing trend of running GenAI models locally for increased privacy and lower latency.
https://youtu.be/H8vJ_wZPH3A?si=DTWYvcZNDvfds8Rv

iMessage and Bing escape EU rules

Apple’s iMessage has been declared by the European Commission not to be a “core platform service” under the EU’s Digital Markets Act (DMA), exempting it from rigorous new rules such as interoperability requirements.
The decision came after a five-month investigation, and while services like WhatsApp and Messenger have been designated as core platform services requiring interoperability, iMessage, Bing, Edge, and Microsoft Advertising have not.
Despite avoiding the DMA’s interoperability obligations, Apple announced it would support the cross-platform RCS messaging standard on iPhones, which will function alongside iMessage without replacing it.

Google says it got rid of over 170 million fake reviews in Search and Maps in 2023

Google announced that it eliminated more than 170 million fake reviews in Google Search and Maps in 2023, a figure that surpasses by over 45 percent the number removed in the previous year.
The company introduced new algorithms to detect fake reviews, including identifying duplicate content across multiple businesses and sudden spikes of 5-star ratings, leading to the removal of five million fake reviews related to a scamming network.
Additionally, Google removed 14 million policy-violating videos and blocked over 2 million scam attempts to claim legitimate business profiles in 2023, doubling the figures from 2022.

“More agents = more performance”- The Tencent Research Team:
The Tencent Research team suggests boosting language model performance by adding more agents. They use a “sampling-and-voting” method, where the input task is run multiple times through a language model with several agents to generate various results. These results are then subjected to majority voting to determine the most reliable result.
Google DeepMind’s MC-ViT enables long-context video understanding:
Most transformer-based video encoders are limited to short contexts due to quadratic complexity. To overcome this issue, Google DeepMind introduces memory consolidated vision transformer (MC-ViT) that effortlessly extends its context far into the past and exhibits excellent scaling behavior when learning from longer videos.
ElevenLabs’ AI voice cloning lets you turn your voice into passive income:
ElevenLabs has developed an AI-based voice cloning model to turn your voice into passive income. The voice cloning program allows all voice-over artists to create professional clones, share them with the Voice Library community, and earn rewards/royalty every time soundbite is used.
NVIDIA CEO Jensen Huang advocates for each country’s sovereign AI:
While speaking at the World Governments Summit in Dubai, the NVIDIA CEO strongly advocated the need for sovereign AI. He said, “Every country needs to own the production of their own intelligence.” He further added, “It codifies your culture, your society’s intelligence, your common sense, your history – you own your own data.”
Google to invest €25 million in Europe to uplift AI skills:
Google has pledged 25 million euros to help the people of Europe learn AI. Google is also looking to run “growth academies” to support companies using AI to scale their companies and has expanded its free online AI training courses to 18 languages.
NVIDIA surpasses Amazon in market value:
NVIDIA Corp. briefly surpassed Amazon.com Inc. on Monday. Nvidia rose almost 0.2%, closing with a market value of about $1.78 trillion. While Amazon fell 1.2%, it ended with a closing valuation of $1.79 trillion. It made NVIDIA Corp. 4th largest US-listed company.
Microsoft might develop an AI upscaling feature for Windows 11:
Microsoft may release an AI upscaling feature for PC gaming on Windows 11, similar to Nvidia’s DLSS technology. The “Automatic Super Resolution” feature uses AI to improve supported games’ frame rates and image detail.
Fandom rolls out controversial generative AI features:
Fandom’s Quick Answers feature, part of its generative AI tools, has sparked controversy among wiki creators. It generates short Q&A-style responses, but many creators complain about inaccuracies, undermining user trust.

A Daily Chronicle of AI Innovations in February 2024 – Day 12: AI Daily News – February 12th, 2024

DeepSeekMath: The key to mathematical LLMs

In its latest research paper, DeepSeek AI has introduced a new AI model, DeepSeekMath 7B, specialized for improving mathematical reasoning in open-source LLMs. It has been pre-trained on a massive corpus of 120 billion tokens extracted from math-related web content, combined with reinforcement learning techniques tailored for math problems.

When evaluated across crucial English and Chinese benchmarks, DeepSeekMath 7B outperformed all the leading open-source mathematical reasoning models, even coming close to the performance of proprietary models like GPT-4 and Gemini Ultra.

DeepSeekMath: The key to mathematical LLMs

Why does this matter?

Previously, state-of-the-art mathematical reasoning was locked within proprietary models that aren’t inaccessible to everyone. With DeepSeekMath 7B’s decision to go open-source (while also sharing the training methodology), new doors have opened for math AI development across fields like education, finance, scientific computing, and more. Teams can build on DeepSeekMath’s high-performance foundation instead of starting models from scratch.

Source

localllm enables GenAI app development without GPUs

Google has introduced a new open-source tool called localllm that allows developers to run LLMs locally on CPUs within Cloud Workstations instead of relying on scarce GPU resources. localllm provides easy access to “quantized” LLMs from HuggingFace that have been optimized to run efficiently on devices with limited compute capacity.

By allowing LLMs to run on CPU and memory, localllm significantly enhances productivity and cost efficiency. Developers can now integrate powerful LLMs into their workflows without managing scarce GPU resources or relying on external services.

Why does this matter?

localllm democratizes access to the power of large language models by freeing developers from GPU constraints. Now, even solo innovators and small teams can experiment and create production-ready GenAI applications without huge investments in infrastructure costs.

Source

IBM researchers show how GenAI can tamper calls

In a concerning development, IBM researchers have shown how multiple GenAI services can be used to tamper and manipulate live phone calls. They demonstrated this by developing a proof-of-concept, a tool that acts as a man-in-the-middle to intercept a call between two speakers. They then experimented with the tool by audio jacking a live phone conversation.

The call audio was processed through a speech recognition engine to generate a text transcript. This transcript was then reviewed by a large language model that was pre-trained to modify any mentions of bank account numbers. Specifically, when the model detected a speaker state their bank account number, it would replace the actual number with a fake one.

IBM researchers show how GenAI can tamper calls

Remarkably, whenever the AI model swapped in these phony account numbers, it even injected its own natural buffering phrases like “let me confirm that information” to account for the extra seconds needed to generate the devious fakes.

The altered text, now with fake account details, was fed into a text-to-speech engine that cloned the speakers’ voices. The manipulated voice was successfully inserted back into the audio call, and the two people had no idea their conversation had been changed!

Why does this matter?

This proof-of-concept highlights alarming implications – victims could become unwilling puppets as AI makes realistic conversation tampering dangerously easy. While promising, generative AI’s proliferation creates an urgent need to identify and mitigate emerging risks. Even if still theoretical, such threats warrant increased scrutiny around model transparency and integrity verification measures before irreparable societal harm occurs.

Source

What Else Is Happening in AI on February 12th, 2024

Perplexity partners with Vercel to bring AI search to apps

By partnering with Vercel, Perplexity AI is making its large language models available to developers building apps on Vercel. Developers get access to Perplexity’s LLMs pplx-7b-online and pplx-70b-online that use up-to-date internet knowledge to power features like recommendations and chatbots. (Link)

Volkswagen sets up “AI Lab” to speed up its AI development initiatives

The lab will build AI prototypes for voice recognition, connected digital services, improved electric vehicle charging cycles, predictive maintenance, and other applications. The goal is to collaborate with tech firms and rapidly implement ideas across Volkswagen brands. (Link)

Tech giants use AI to monitor employee messages

AI startup Aware has attracted clients like Walmart, Starbucks, and Delta to use its technology to monitor workplace communications. But experts argue this AI surveillance could enable “thought crime” violations and treat staff “like inventory.” There are also issues around privacy, transparency, and recourse for employees. (Link)

Disney harnesses AI to bring contextual ads to streaming

Their new ad tool called “Magic Words” uses AI to analyze the mood and content of scenes in movies and shows. It then allows brands to target custom ads based on those descriptive tags. Six major ad agencies are beta-testing the product as Disney pushes further into streaming ads amid declining traditional TV revenue. (Link)

Microsoft hints at a more helpful Copilot in Windows 11

New Copilot experiences let the assistant offer relevant actions and understand the context better. Notepad is also getting Copilot integration for text explanations. The features hint at a forthcoming Windows 11 update centered on AI advancements. (Link)

Crowd destroys a driverless Waymo car

A Waymo driverless taxi was attacked in San Francisco’s Chinatown, resulting in its windshield being smashed, being covered in spray paint, its windows broken, and ultimately being set on fire.
No motive for the attack has been reported, and the Waymo car was not transporting any riders at the time of the incident; police confirmed there were no injuries.
The incident occurs amidst tensions between San Francisco residents and automated vehicle operators, following previous issues with robotaxis causing disruption and accidents in the city.
Source

Apple has been buying AI startups faster than Google, Facebook, likely to shakeup global AI soon

Apple has reportedly outpaced major rivals like Google, Meta, and Microsoft in AI startup acquisitions in 2023, with up to 32 companies acquired, highlighting its dedication to AI development.
The company’s strategic acquisitions provide access to cutting-edge technology and top-talent, aiming to strengthen its competitive edge and AI capabilities in its product lineup.
While specifics of Apple’s integration plans for these AI technologies remain undisclosed, its aggressive acquisition strategy signals a significant focus on leading the global AI innovation forefront.
Source

The antitrust fight against Big Tech is just beginning

DOJ’s Jonathan Kanter emphasizes the commencement of a significant antitrust battle against Big Tech, highlighting unprecedented public resonance with these issues.
The US government has recently blocked a notable number of mergers to protect competition, including stopping Penguin Random House from acquiring Simon & Schuster.
Kanter highlights the problem of monopsony in tech markets, where powerful buyers distort the market, and stresses the importance of antitrust enforcement for a competitive economy.
Source

Nvidia CEO plays down fears in call for rapid AI infrastructure growth

Nvidia CEO Jensen Huang downplays fears of AI, attributing them to overhyped concerns and interests aimed at scaring people, while advocating for rapid development of AI infrastructure for economic benefits.
Huang argues that regulating AI should not be more difficult than past innovations like cars and planes, emphasizing the importance of countries building their own AI infrastructure to protect culture and gain economic advantages.
Despite Nvidia’s success with AI chips and the ongoing global debate on AI regulation, Huang encourages nations to proactively develop their AI capabilities, dismissing the scare tactics as a barrier to embracing the technology’s potential.
Source

10 AI tools that can be used to improve research

#1 Gemini:

Gemini is an AI chatbot from Google AI that can be used for a variety of research tasks, including finding information, summarizing texts, and generating creative text formats. It can be used for both primary and secondary research and it is great for creating content.

Key features:

Accuracy: Gemini is trained on a massive dataset of text and code, which means that it can generate text that is accurate and reliable also it uses Google to look up answers.
Relevance: Gemini can be used to find information that is relevant to a specific research topic.
Creativity: Gemini can be used to generate creative text formats such as code, scripts, musical pieces, email, letters, etc.
Engagement: Gemini can be used to present information creatively and engagingly.
Accessibility: Gemini is available for free and can be used from anywhere in the world.

Scite.AI

Scite AI is an innovative platform that helps discover and evaluate scientific articles. Its Smart Citations feature provides context and classification of citations in scientific literature, indicating whether they support or contrast the cited claims.

Key features:

Smart Citations: Offers detailed insights into how other papers have cited a publication, including the context and whether the citation supports or contradicts the claims made.
Deep Learning Model: Automatically classifies each citation’s context, indicating the confidence level of the classification.
Citation Statement Search: Enables searching across metadata relevant publications.
Custom Dashboards: Allows users to build and manage collections of articles, providing aggregate insights and notifications.
Reference Check: Helps to evaluate the quality of references used in manuscripts.
Journal Metrics: Offers insights into publications, top authors, and scite Index rankings.
Assistant by scite: An AI tool that utilizes Smart Citations for generating content and building reference lists.

4. GPT4All

GPT4All is an open-source ecosystem for training and deploying large language models that can be run locally on consumer-grade hardware. GPT4All is designed to be powerful, customizable and great for conducting research. Overall, it is an offline and secure AI-powered search engine.

Key information:

Answer questions about anything: You can use any ChatGPT version for your personal use to answer even simple questions.
Personal writing assistant: Write emails, documents, stories, songs, play based on your previous work.
Reading documents: Submit your text documents and receive summaries and answers. You can easily find answers in the documents you provide by submitting a folder of documents for GPT4All to extract information from.

5. AsReview

AsReview is a software package designed to make systematic reviews more efficient using active learning techniques. It helps to review large amounts of text quickly and addresses the challenge of time constraints when reading large amounts of literature.

Key features:

Free and Open Source: The software is available for free and its source code is openly accessible.
Local or Server Installation: It can be installed either locally on a device or on a server, providing full control over data.
Active Learning Algorithms: Users can select from various active learning algorithms for their projects.
Project Management: Enables creation of multiple projects, selection of datasets, and incorporation of prior knowledge.
Research Infrastructure: Provides an open-source infrastructure for large-scale simulation studies and algorithm validation.
Extensible: Users can contribute to its development through GitHub.

6. DeepL

DeepL translates texts & full document files instantly. Millions translate with DeepL everyday. It is commonly used for translating web pages, documents, and emails. It can also translate speech.

DeepL also has a great feature called DeepL Write. DeepL Write is a powerful tool that can help you to improve your writing in a variety of ways. It is a valuable resource for anyone who wants to write clear, concise, and effective prose.

Key features:

Tailored Translations: Adjust translations to fit specific needs and context, with alternatives for words or phrases.
Whole Document Translation: One-click translation of entire documents including PDF, Word, and PowerPoint files while maintaining original formatting.
Tone Adjustment: Option to select between formal and informal tone of voice for translations in selected languages.
Built-in Dictionary: Instant access to dictionary for insight into specific words in translations, including context, examples, and synonyms.

7. Humata

Humata is an AI tool designed to assist with processing and understanding PDF documents. It offers features like summarizing, comparing documents, and answering questions based on the content of the uploaded files.

Key information:

Designed to process and summarize long documents, allowing users to ask questions and get summarized answers from any PDF file.
Claims to be faster and more efficient than manual reading, capable of answering repeated questions and customizing summaries.
Humata differs from ChatGPT by its ability to read and interpret files, generating answers with citations from the documents.
Offers a free version for trial

8. Cockatoo

Cockatoo AI is an AI-powered transcription service that automatically generates text from recorded speech. It is a convenient and easy-to-use tool that can be used to transcribe a variety of audio and video files. It is one of the AI-powered tools that not everyone will find a use for but it is a great tool nonetheless.

Key features:

Highly accurate transcription: Cockatoo AI uses cutting-edge AI to transcribe audio and video files with a high degree of accuracy. It is said to be able to transcribe speech with superhuman accuracy, surpassing human performance.
Support for multiple languages: Cockatoo AI supports transcription in more than 90 languages, making it a versatile tool for global users.
Versatile file formats: Cockatoo AI can transcribe a variety of audio and video file formats, including MP3, WAV, MP4, and MOV.
Quick turnaround: Cockatoo AI can transcribe audio and video files quickly, with one hour of audio typically being transcribed in just 2-3 minutes.
Seamless export options: Cockatoo AI allows users to export their transcripts in a variety of formats, including SRT, DOCX, any PDF document, and TXT.

9. Avidnote

Avidnote is an AI-powered research writing platform that helps researchers write and organize their research notes easily. It combines all of the different parts of the academic writing process, from finding articles to managing references and annotating research notes.

Key Features:

AI research paper summary: Avidnote can automatically summarize research papers in a few clicks. This can save researchers a lot of time and effort, as they no longer need to read the entire paper to get the main points.
Integrated note-taking: Avidnote allows researchers to take notes directly on the research papers they are reading. This makes it easy to keep track of their thoughts and ideas as they are reading.
Collaborative research: Avidnote can be used by multiple researchers to collaborate on the same project. This can help share ideas, feedback, and research notes.
AI citation generation: Avidnote can automatically generate citations for research papers in APA, MLA, and Chicago styles. This can save researchers a lot of time and effort, as they no longer need to manually format citations.
AI writing assistant: Avidnote can provide suggestions for improving the writing style of research papers. This can help researchers to write more clear, concise, and persuasive papers.
AI plagiarism detection: Avidnote can detect plagiarism in research papers. This can help researchers to avoid plagiarism and maintain the integrity of their work.

10. Research Rabbit

Research Rabbit is an online tool that helps you find references quickly and easily. It is a citation-based literature mapping tool that can be used to plan your essay, minor project, or literature review.

Key features:

AI for Researchers: Enhances research writing, reading, and data analysis using AI.
Effective Reading: Capabilities include summarizing, proofreading text, and identifying research gaps.
Data Analysis: Offers tools to input data and discover correlations and insights, relevant articles.
Research Methods Support: Includes transcribing interviews and other research methods.
AI Functionalities: Enables users to upload papers, ask questions, summarize text, get explanations, and proofread using AI.
Note Saving: Provides an integrated platform to save notes alongside papers.

A Daily Chronicle of AI Innovations in February 2024 – Day 11: AI Daily News – February 11th, 2024

This week, we’ll cover Google DeepMind creating a grandmaster-level chess AI, the satirical AI Goody-2 raising questions about ethics and AI boundaries, Google rebranding Bard to Gemini and launching the Gemini Advanced chatbot and mobile apps, OpenAI developing AI agents to automate work, and various companies introducing new AI-related products and features.

Google DeepMind has just made an incredible breakthrough in the world of chess. They’ve developed a brand new artificial intelligence (AI) that can play chess at a grandmaster level. And get this—it’s not like any other chess AI we’ve seen before!

Read Aloud For Me: Access All Your AI Tools within 1 single App

Instead of using traditional search algorithm approaches, Google DeepMind’s chess AI is based on a language model architecture. This innovative approach diverges from the norm and opens up new possibilities in the realm of AI.

To train this AI, DeepMind fed it a massive dataset of 10 million chess games and a mind-boggling 15 billion data points. And the results are mind-blowing. The AI achieved an Elo rating of 2895 in rapid chess when pitted against human opponents. That’s seriously impressive!

In fact, this AI even outperformed AlphaZero, another notable chess AI, when it didn’t use the MCTS strategy. That’s truly remarkable.

But here’s the real kicker: this breakthrough isn’t just about chess. It highlights the incredible potential of the Transformer architecture, which was primarily known for its use in language models. It challenges the idea that transformers can only be used as statistical pattern recognizers. So, we might just be scratching the surface of what these transformers can do!

Overall, this groundbreaking achievement by Google DeepMind opens up exciting opportunities for the future of AI, not just in chess but in various domains as well.

So, have you heard about this AI called Goody-2? It’s actually quite a fascinating creation by the art studio Brain. But here’s the thing – Goody-2 takes the concept of ethical AI to a whole new level. I mean, it absolutely refuses to engage in any conversation, no matter the topic. Talk about being too ethical for its own good!

The idea behind Goody-2 is to highlight the extremes of ethical AI development. It’s a satirical take on the overly cautious approach some AI developers take when it comes to potential risks and offensive content. In the eyes of Goody-2, every single query, no matter how innocent or harmless, is seen as potentially offensive or dangerous. It’s like the AI is constantly on high alert, unwilling to take any risks.

But let’s not dismiss the underlying questions Goody-2 raises. It really makes you think about the effectiveness of AI and the necessity of setting boundaries. By deliberately prioritizing ethical considerations over practical utility, its creators are making a statement about responsibility in AI development. How much caution is too much? Where do we draw the line between being responsible and being overly cautious?

Goody-2 may be a satirical creation, but it’s provoking some thought-provoking discussions about the role of AI in our lives and the balance between responsibility and usefulness.

Did you hear the news? Google has made some changes to their chatbot lineup! Say goodbye to Google Bard and say hello to Gemini Advanced! It seems like Google has rebranded their chatbot and given it a new name. Exciting stuff, right?

But that’s not all. Google has also launched the Gemini Advanced chatbot, which features their incredible Ultra 1.0 AI model. This means that the chatbot is smarter and more advanced than ever before. Imagine having a chatbot that can understand and respond to your commands with a high level of accuracy. Pretty cool, right?

And it’s not just limited to desktop anymore. Gemini is also moving into the mobile world, specifically Android and iOS phones. You can now have this pocket-sized chatbot ready to assist you whenever and wherever you are. Whether you need some creative inspiration, want to navigate through voice commands, or even scan something with your camera, Gemini has got you covered.

The rollout has already started in the US and some Asian countries, but don’t worry if you’re not in those regions. Google plans to expand Gemini’s availability worldwide gradually. So, keep an eye out for it because this chatbot is going places!

So, get this: OpenAI is seriously stepping up the game when it comes to AI. They’re developing these incredible AI “agents” that can basically take over your device and do all sorts of tasks for you. I mean, we’re talking about automating complex workflows between applications here. No more wasting time with manual cursor movements, clicks, and typing between apps. It’s like having a personal assistant right in your computer.

But wait, there’s more! These agents don’t just handle basic stuff. They can also deal with web-based tasks like booking flights or creating itineraries, and here’s the kicker: they don’t even need access to APIs. That’s some serious next-level tech right there.

Sure, OpenAI’s ChatGPT can already do some pretty nifty stuff using APIs, but these AI agents are taking things to a whole new level. They’ll be able to handle unstructured, complex work with little explicit guidance. So basically, they’re smart, adaptable, and can handle all sorts of tasks without breaking a sweat.

I don’t know about you, but I’m excited to see what these AI agents can do. It’s like having a super-efficient, ultra-intelligent buddy right in your computer, ready to take on the world of work.

Brilliant Labs just made an exciting announcement in the world of augmented reality (AR) glasses. While Apple may have been grabbing the spotlight with its Vision Pro, Brilliant Labs unveiled its own smart glasses called “Frame” that come with a multi-modal voice/vision/text AI assistant named Noa. These lightweight glasses are powered by advanced models like GPT-4 and Stable Diffusion, and what sets them apart is their open-source design, allowing programmers to build and customize on top of the AI capabilities.

But that’s not all. Noa, the AI assistant on the Frame, will also leverage Perplexity’s cutting-edge technology to provide rapid answers using its real-time chatbot. So, whether you’re interacting with the glasses through voice commands, visual cues, or text input, Noa will have you covered with quick and accurate responses.

Now, let’s shift our attention to Google. The tech giant’s research division recently introduced an impressive development called MobileDiffusion. This innovation allows Android and iPhone users to generate high-resolution images, measuring 512*512 pixels, in less than a second. What makes it even more remarkable is that MobileDiffusion boasts a comparably small model size of just 520M parameters, making it ideal for mobile devices. With its rapid image generation capabilities, this technology takes user experience to the next level, even allowing users to generate images in real-time while typing text prompts.

Furthermore, Google has launched its largest and most capable AI model, Ultra 1.0, in its ChatGPT-like assistant, which has been rebranded as Gemini (formerly Bard). This advanced AI model is now available as a premium plan called Gemini Advanced, accessible in 150 countries for a subscription fee of $19.99 per month. Users can enjoy a two-month trial at no cost. To enhance accessibility, Google has also rolled out Android and iOS apps for Gemini, making it convenient for users to harness its power across different devices.

Alibaba Group has also made strides in the field of AI, specifically with their Qwen1.5 series. This release includes models of various sizes, from 0.5B to 72B, offering flexibility for different use cases. Remarkably, Qwen1.5-72B has outperformed Llama2-70B in all benchmarks, showcasing its superior performance. These models are available on Ollama and LMStudio platforms, and an API is also provided on together.ai, allowing developers to leverage the capabilities of Qwen1.5 series models in their own applications.

NVIDIA, a prominent player in the AI space, has introduced Canary 1B, a multilingual model designed for speech-to-text recognition and translation. This powerful model supports transcription and translation in English, Spanish, German, and French. With its superior performance, Canary surpasses similarly-sized models like Whisper-large-v3 and SeamlessM4T-Medium-v1 in both transcription and translation tasks, securing the top spot on the HuggingFace Open ASR leaderboard. It achieves an impressive average word error rate of 6.67%, outperforming all other open-source models.

Excitingly, researchers have released Lag-Llama, the first open-source foundation model for time series forecasting. With this model, users can make accurate predictions for various time-dependent data. This is a significant development that has the potential to revolutionize industries reliant on accurate forecasting, such as finance and logistics.

Another noteworthy release in the AI assistant space comes from LAION. They have introduced BUD-E, an open-source conversational and empathic AI Voice Assistant. BUD-E stands out for its ability to use natural voices, empathy, and emotional intelligence to handle multi-speaker conversations. With this empathic approach, BUD-E offers a more human-like and personalized interaction experience.

MetaVoice has contributed to the advancements in text-to-speech (TTS) technology with the release of MetaVoice-1B. Trained on an extensive dataset of 100K hours of speech, this 1.2B parameter base model supports emotional speech in English and voice cloning. By making MetaVoice-1B available under the Apache 2.0 license, developers can utilize its capabilities in various applications that require TTS functionality.

Bria AI is addressing the need for background removal in images with its RMBG v1.4 release. This open-source model, trained on fully licensed images, provides a solution for easily separating subjects from their backgrounds. With RMBG, users can effortlessly create visually appealing compositions by removing unwanted elements from their images.

Researchers have also introduced InteractiveVideo, a user-centric framework for video generation. This framework is designed to enable dynamic interaction between users and generative models during the video generation process. By allowing users to instruct the model in real-time, InteractiveVideo empowers individuals to shape the generated content according to their preferences and creative vision.

Microsoft has been making strides in improving its AI search and chatbot experience with the redesigned Copilot AI. This enhanced version, previously known as Bing Chat, offers a new look and comes equipped with built-in AI image creation and editing functionality. Additionally, Microsoft introduces Deucalion, a finely tuned model that enriches Copilot’s Balanced mode, making it more efficient and versatile for users.

Online gaming platform Roblox has integrated AI-powered real-time chat translations, supporting communication in 16 different languages. This feature enables users from diverse linguistic backgrounds to interact seamlessly within the Roblox community, fostering a more inclusive and connected platform.

Hugging Face has expanded its offerings with the new Assistants feature on HuggingChat. These custom chatbots, built using open-source language models (LLMs) like Mistral and Llama, empower developers to create personalized conversational experiences. Similar to OpenAI’s popular GPTs, Assistants enable users to access free and customizable chatbot capabilities.

DeepSeek AI introduces DeepSeekMath 7B, an open-source model designed to approach the mathematical reasoning capability of GPT-4. With a massive parameter count of 7B, this model opens up avenues for more advanced mathematical problem-solving and computational tasks. DeepSeekMath-Base, initialized with DeepSeek-Coder-Base-v1.5 7B, provides a strong foundation for mathematical AI applications.

Moving forward, Microsoft is collaborating with news organizations to adopt generative AI, bringing the benefits of AI technology to the journalism industry. With these collaborations, news organizations can leverage generative models to enhance their storytelling and reporting capabilities, contributing to more engaging and insightful content.

In an exciting partnership, LG Electronics has joined forces with Korean generative AI startup Upstage to develop small language models (SLMs). These models will power LG’s on-device AI features and AI services on their range of notebooks. By integrating SLMs into their devices, LG aims to enhance user experiences by offering more advanced and personalized AI functionalities.

Stability AI has unveiled the updated SVD 1.1 model, optimized for generating short AI videos with improved motion and consistency. This enhancement brings a smoother and more realistic experience to video generation, opening up new possibilities for content creators and video enthusiasts.

Lastly, both OpenAI and Meta have made an important commitment to label AI-generated images. This step ensures transparency and ethics in the usage of AI models for generating images, promoting responsible AI development and deployment.

Now, let’s address a privacy concern related to Google’s Gemini assistant. By default, Google saves your conversations with Gemini for years. While this may raise concerns about data retention, it’s important to note that Google provides users with control over their data through privacy settings. Users can adjust these settings to align with their preferences and manage the data saved by Gemini.

That wraps up the latest updates in AI technology and advancements. From the exciting progress in AR glasses to the development of powerful AI models and tools, these innovations are shaping the future of AI and paving the way for even more exciting possibilities.

In this episode, we covered Google DeepMind’s groundbreaking chess AI, the satirical AI Goody-2 raising ethical questions, Google’s rebranding of Bard to Gemini and launching the Gemini Advanced chatbot, OpenAI’s work on automating complex workflows, and the exciting new AI-related products and features introduced by various companies including Brilliant Labs, Google, Alibaba, NVIDIA, and more. Thank you for joining us on AI Unraveled: Demystifying Frequently Asked Questions on Artificial Intelligence, where we’ve delved into groundbreaking research, innovative applications, and emerging technologies that are pushing the boundaries of AI, keeping you updated on the latest ChatGPT and Google Bard trends. Stay tuned and subscribe for more!

Google DeepMind develops grandmaster-level chess AI

Google DeepMind has developed a new AI capable of playing chess at a grandmaster level using a language model-based architecture, diverging from traditional search algorithm approaches.
The chess AI, trained on a dataset of 10 million games and 15 billion data points, achieved an Elo rating of 2895 in rapid chess against human opponents, surpassing AlphaZero when not employing the MCTS strategy.
This breakthrough demonstrates the broader potential of Transformer architecture beyond language models, challenging the notion of transformers as merely statistical pattern recognizers.
Source

Meet Goody-2, the AI too ethical to discuss literally anything

Goody-2 is a satirical AI created by the art studio Brain, designed to highlight the extremes of ethical AI by refusing to engage in any conversation due to viewing all queries as potentially offensive or dangerous.
The AI serves as a critique of overly cautious AI development practices and the balance between responsibility and usefulness, emphasizing responsibility to an absurd level.
Despite its satire, Goody-2 raises questions about the effectiveness of AI and the necessity of setting boundaries, as seen in its creators’ deliberate decision to prioritize ethical considerations over practical utility.
Source

Reddit beats film industry again, won’t have to reveal pirates’ IP addresses

Movie companies’ third attempt to force Reddit to reveal IP addresses of users discussing piracy was rejected by the US District Court for the Northern District of California.
US Magistrate Judge Thomas Hixson ruled that providing IP addresses is subject to First Amendment scrutiny, protecting potential witnesses’ right to anonymity.
The court upheld Reddit’s right to protect its users’ First Amendment rights, noting that the information sought by movie companies could be obtained from other sources.

Amazon steers consumers to higher-priced items, lawsuit claims

Amazon faces a lawsuit filed by two customers accusing the company of inflating prices through its Buy Box algorithm, misleading shoppers into paying more.
The lawsuit claims Amazon gives preference to its own products or those from sellers in its Fulfillment By Amazon (FBA) program, often hiding cheaper options from other sellers.
Jeffrey Taylor and Robert Selway, who brought the lawsuit, argue this practice violates Washington’s Consumer Protection Act by deceiving consumers and stifling fair competition.
Source

Instagram and Threads will stop recommending political content

Amazon faces a lawsuit filed by two customers accusing the company of inflating prices through its Buy Box algorithm, misleading shoppers into paying more.
The lawsuit claims Amazon gives preference to its own products or those from sellers in its Fulfillment By Amazon (FBA) program, often hiding cheaper options from other sellers.
Jeffrey Taylor and Robert Selway, who brought the lawsuit, argue this practice violates Washington’s Consumer Protection Act by deceiving consumers and stifling fair competition.
Source

A Daily Chronicle of AI Innovations in February 2024 – Day 09: AI Daily News – February 09th, 2024

Read Aloud For Me: Access All Your AI Tools within 1 single App

Download Read Aloud For Me GPT FREE at https://apps.apple.com/ca/app/read-aloud-for-me-top-ai-gpts/id1598647453

This week in AI – all the Major AI developments in a nutshell

Google launches Ultra 1.0, its largest and most capable AI model, in its ChatGPT-like assistant which has now been rebranded as Gemini (earlier called Bard). Gemini Advanced is available, in 150 countries, as a premium plan for $19.99/month, starting with a two-month trial at no cost. Google is also rolling out Android and iOS apps for Gemini [Details].
Alibaba Group released Qwen1.5 series, open-sourcing models of 6 sizes: 0.5B, 1.8B, 4B, 7B, 14B, and 72B. Qwen1.5-72B outperforms Llama2-70B across all benchmarks. The Qwen1.5 series is available on Ollama and LMStudio. Additionally, API on together.ai [Details | Hugging Face].
NVIDIA released Canary 1B, a multilingual model for speech-to-text recognition and translation. Canary transcribes speech in English, Spanish, German, and French and also generates text with punctuation and capitalization. It supports bi-directional translation, between English and three other supported languages. Canary outperforms similarly-sized Whisper-large-v3, and SeamlessM4T-Medium-v1 on both transcription and translation tasks and achieves the first place on HuggingFace Open ASR leaderboard with an average word error rate of 6.67%, outperforming all other open source models [Details].
Researchers released Lag-Llama, the first open-source foundation model for time series forecasting [Details].
LAION released BUD-E, an open-source conversational and empathic AI Voice Assistant that uses natural voices, empathy & emotional intelligence and can handle multi-speaker conversations [Details].
MetaVoice released MetaVoice-1B, a 1.2B parameter base model trained on 100K hours of speech, for TTS (text-to-speech). It supports emotional speech in English and voice cloning. MetaVoice-1B has been released under the Apache 2.0 license [Details].
Bria AI released RMBG v1.4, an an open-source background removal model trained on fully licensed images [Details].
Researchers introduce InteractiveVideo, a user-centric framework for video generation that is designed for dynamic interaction, allowing users to instruct the generative model during the generation process [Details |GitHub ].
Microsoft announced a redesigned look for its Copilot AI search and chatbot experience on the web (formerly known as Bing Chat), new built-in AI image creation and editing functionality, and Deucalion, a fine tuned model that makes Balanced mode for Copilot richer and faster [Details].
Roblox introduced AI-powered real-time chat translations in 16 languages [Details].
Hugging Face launched Assistants feature on HuggingChat. Assistants are custom chatbots similar to OpenAI’s GPTs that can be built for free using open source LLMs like Mistral, Llama and others [Link].
DeepSeek AI released DeepSeekMath 7B model, a 7B open-source model that approaches the mathematical reasoning capability of GPT-4. DeepSeekMath-Base is initialized with DeepSeek-Coder-Base-v1.5 7B [Details].
Microsoft is launching several collaborations with news organizations to adopt generative AI [Details].
LG Electronics signed a partnership with Korean generative AI startup Upstage to develop small language models (SLMs) for LG’s on-device AI features and AI services on LG notebooks [Details].
Stability AI released SVD 1.1, an updated model of Stable Video Diffusion model, optimized to generate short AI videos with better motion and more consistency [Details | Hugging Face] .
OpenAI and Meta announced to label AI generated images [Details].
Google saves your conversations with Gemini for years by default [Details].

Google Bard Is Dead, Gemini Advanced Is In!

Google Bard is now Gemini

Google has rebranded its Bard conversational AI to Gemini with a new sidekick: Gemini Advanced!

This advanced chatbot is powered by Google’s largest “Ultra 1.0” language model, which testing shows is the most preferred chatbot compared to competitors. It can walk you through a DIY car repair or brainstorm your next viral TikTok.

Google launches Gemini Advanced

Google launched the Gemini Advanced chatbot with its Ultra 1.0 AI model. The Advanced version can walk you through a DIY car repair or brainstorm your next viral TikTok.

Google rollouts Gemini mobile apps

Gemini’s also moving into Android and iOS phones as pocket pals ready to share creative fire 24/7 via voice commands, screen overlays, or camera scans. The ‘droid rollout has started for the US and some Asian countries. The rest of us will just be staring at our phones and waiting for an invite from Google.

P.S. It will gradually expand globally.

Why does this matter?

With the Gemini Advanced, Google took the LLM race to the next level, challenging its competitor, GPT-4, with its specialized architecture optimized for search queries and natural language understanding. Who will win the race is a matter of time.

Source

OpenAI Is Developing AI Agents To Automate Work

OpenAI is developing AI “agents” that can autonomously take over a user’s device and execute multi-step workflows.

One type of agent takes over a user’s device and automates complex workflows between applications, like transferring data from a document to a spreadsheet for analysis. This removes the need for manual cursor movements, clicks, and typing between apps.
Another agent handles web-based tasks like booking flights or creating itineraries without needing access to APIs.

While OpenAI’s ChatGPT can already do some agent-like tasks using APIs, these AI agents will be able to do more unstructured, complex work with little explicit guidance.

Why does this matter?

Having AI agents that can independently carry out tasks like booking travel could greatly simplify digital life for many end users. Rather than manually navigating across apps and websites, users can plan an entire vacation through a conversational assistant or have household devices automatically troubleshoot problems without any user effort.

Source

Brilliant Labs Announces Multimodal AI Glasses, With Perplexity’s AI

Brilliant Labs announces Frames

While Apple hogged the spotlight with its chunky new Vision Pro, a Singapore startup, Brilliant Labs, quietly showed off its AR glasses packed with a multi-modal voice/vision/text AI assistant named Noa. https://youtu.be/xiR-XojPVLk?si=W6Q31vl1wNfqnNXj

These lightweight smart glasses, dubbed “Frame,” are powered by models like GPT-4 and Stable Diffusion, allowing hands-free price comparisons or visual overlays to project information before your eyes using voice commands. No fiddling with another device is needed.

The best part is- programmers can build on these AI glasses thanks to their open-source design.

Source

Perplexity to integrate AI Chatbot into the Frames

In addition to enhancing the daily activities and interactions with the digital and physical world, Noa would also provide rapid answers using Perplexity’s real-time chatbot so Frame responses stay sharp.

Source

Why does this matter?

Unlike AR Apple Vision Pro and Meta’s glasses that immerses users in augmented reality for interactive experiences, Frame AR glasses focuses on improving daily interactions and tasks like comparing product prices while shopping, translating foreign text seen while traveling abroad, or creating shareable media on the go.

It also enhances accessibility for users with limited dexterity or vision.

What Else Is Happening in AI in February 09th, 2024

Instagram tests AI writers for messages

Instagram is likely to bring the option ‘Write with AI’, which will probably paraphrase the texts in different styles to enhance creativity in conversations, similar to Google’s Magic Compose. (Link)

Stability AI releases Stable Audio AudioSparx 1.0 music model

Stability AI launches AudioSparx 1.0, a groundbreaking generative model for music and audio. It produces professional-grade stereo music from simple text prompts in seconds, with a coherent structure. (Link)

Midjourney opens alpha-testing of its website

Midjourney grants early web access to AI art creators with over 1000 images, transitioning from Discord dependence. The alpha testing signals that Midjourney moving beyond its chat app origin towards web and mobile apps, gradually maturing as a multi-platform AI art creation service. (Link)

Altman seeks trillions to revolutionize AI chip capacity

OpenAI CEO Sam Altman pursues multi-trillion dollar investments, including from the UAE government, to build specialized GPUs and chips for powering AI systems. If funded, this initiative would accelerate OpenAI’s ML to new heights. (Link)

FCC bans deceptive AI voice robocalls

The FCC prohibits robocalls using AI to clone voices, declaring them “artificial” per existing law. The ruling aims to deter deception and confirm consumers are protected from exploitative automated calls mimicking trusted people. Violators face penalties as authorities crack down on illegal practices enabled by advancing voice synthesis tech. (Link)

Sam Altman seeks $7 trillion for new AI chip project

Sam Altman, CEO of OpenAI, is aiming to raise trillions of dollars from investors, including the UAE government, to revolutionize the semiconductor industry and overcome chip shortages critical for AI development.
Altman’s project seeks to expand global chip manufacturing capacity and enhance AI capabilities, requiring an investment of $5 trillion to $7 trillion, which would significantly exceed the current semiconductor industry size.
Sam Altman’s vision includes forming partnerships with OpenAI, investors, chip manufacturers, and energy suppliers to create chip foundries, requiring extensive funding that might involve debt financing.

FCC declares AI-voiced robocalls illegal

The FCC has made it illegal for robocalls to use AI-generated voices, allowing state attorneys general to take legal action against such practices.
AI-generated voices are now classified as “an artificial or prerecorded voice” under the Telephone Consumer Protection Act (TCPA), restricting their use for non-emergency purposes without prior consent.
The FCC’s ruling aims to combat scams and misinformation spread through AI-generated voice robocalls, providing state attorneys general with enhanced tools for enforcement.

Ex-Apple engineer sentenced to prison for stealing Apple Car trade secrets

Xiaolang Zhang, a former Apple engineer, was sentenced to 120 days in prison and three years supervised release for stealing self-driving car technology.
Zhang transferred sensitive documents and hardware related to Apple’s self-driving vehicle project to his wife’s laptop before planning to leave for a job in China.
In addition to his prison sentence, Zhang must pay restitution of $146,984, having originally faced up to 10 years in prison and a $250,000 fine.

Leading AI companies join new US safety consortium

The U.S. AI Safety Institute Consortium (AISIC) was announced by the Biden Administration as a response to an executive order, including significant AI entities like Amazon, Google, Apple, Microsoft, OpenAI, and NVIDIA among over 200 representatives.
The consortium aims to set safety standards and protect the U.S. innovation ecosystem, focusing on the development of safe and trustworthy AI through collaboration with various sectors, including healthcare and academia.
Notably absent from the consortium are major tech companies Tesla, Oracle, and Broadcom.

Midjourney might ban Biden and Trump images this election season

Midjourney, led by CEO David Holz, is reportedly considering banning images of political figures like Biden and Trump during the upcoming election season to prevent the spread of misinformation.
The company previously ended free trials for its AI image generator after AI-generated deepfakes, including ones of Trump getting arrested and the pope in a fashionable coat, went viral.
Despite implementing rules against misleading creations, Bloomberg was still able to generate altered images of Trump.

Scientists in UK set fusion record

A 40-year-old UK fusion reactor set a new world record for energy output, generating 69 megajoules of fusion energy for five seconds before its closure, advancing the pursuit of clean, limitless energy.
The achievement by the Joint European Torus (JET) enhances confidence in future fusion projects like ITER, which is under construction in France, despite JET’s operation concluding in December 2023.
The decision to shut down JET reflects complex dynamics, including Brexit-driven shifts in the UK’s fusion energy strategy, despite the experiment’s substantial contributions to fusion research.

A Daily Chronicle of AI Innovations in February 2024 – Day 08: AI Daily News – February 08th, 2024

Google rebrands Bard AI to Gemini and launches a new app and subscription

Google on Thursday announced a major rebrand of Bard, its artificial intelligence chatbot and assistant, including a fresh app and subscription options. Bard, a chief competitor to OpenAI’s ChatGPT, is now called Gemini, the same name as the suite of AI models that power the chatbot.

Google also announced new ways for consumers to access the AI tool: As of Thursday, Android users can download a new dedicated Android app for Gemini, and iPhone users can use Gemini within the Google app on iOS.

Google’s rebrand and app offerings underline the company’s commitment to pursuing — and investing heavily in — AI assistants or agents, a term often used to describe tools ranging from chatbots to coding assistants and other productivity tools.

Alphabet CEO Sundar Pichai highlighted the firm’s commitment to AI during the company’s Jan. 30 earnings call. Pichai said he eventually wants to offer an AI agent that can complete more and more tasks on a user’s behalf, including within Google Search, although he said there is “a lot of execution ahead.” Likewise, chief executives at tech giants from Microsoft to Amazon underlined their commitment to building AI agents as productivity tools.

Google’s Gemini changes are a first step to “building a true AI assistant,” Sissie Hsiao, a vice president at Google and general manager for Google Assistant and Bard, told reporters on a call Wednesday.

Google on Thursday also announced a new AI subscription option, for power users who want access to Gemini Ultra 1.0, Google’s most powerful AI model. Access costs $19.99 per month through Google One, the company’s paid storage offering. For existing Google One subscribers, that price includes the storage plans they may already be paying for. There’s also a two-month free trial available.

Thursday’s rollouts are available to users in more than 150 countries and territories, but they’re restricted to the English language for now. Google plans to expand language offerings to include Japanese and Korean soon, as well as other languages.

The Bard rebrand also affects Duet AI, Google’s former name for the “packaged AI agents” within Google Workspace and Google Cloud, which are designed to boost productivity and complete simple tasks for client companies including Wayfair, GE, Spotify and Pfizer. The tools will now be known as Gemini for Workspace and Gemini for Google Cloud.

Google One subscribers who pay for the AI subscription will also have access to Gemini’s assistant capabilities in Gmail, Docs, Sheets, Slides and Meet, executives told reporters Wednesday. Google hopes to incorporate more context into Gemini from users’ content in Gmail, Docs and Drive. For example, if you were responding to a long email thread, suggested responses would eventually take in context from both earlier messages in the thread and potentially relevant files in Google Drive.

As for the reason for the broad name change? Google’s Hsiao told reporters Wednesday that it’s about helping users understand that they’re interacting directly with the AI models that underpin the chatbot.

“Bard [was] the way to talk to our cutting-edge models, and Gemini is our cutting-edge models,” Hsiao said.

Eventually, AI agents could potentially schedule a group hangout by scanning everyone’s calendar to make sure there are no conflicts, book travel and activities, buy presents for loved ones or perform a specific job function such as outbound sales. Currently, though, the tools, including Gemini, are largely limited to tasks such as summarizing, generating to-do lists or helping to write code.

“We will again use generative AI there, particularly with our most advanced models and Bard,” Pichai said on the Jan. 30 earnings call, speaking about Google Assistant and Search. That “allows us to act more like an agent over time, if I were to think about the future and maybe go beyond answers and follow-through for users even more.”

Source: www.cnbc.com/2024/02/08/google-gemini-ai-launches-in-new-app-subscription.html

Microsoft pushes Copilot ahead of the Super Bowl

In their latest blogs and Super Bowl commercial, Microsoft announced their intention to showcase the capabilities of Copilot exactly one year after their entry into the AI space with Bing Chat. They have announced updates to their Android and iOS applications to make the user interface more sleek and user-friendly, along with a carousel for follow-up prompts.

Microsoft also introduced new features to Designer in Copilot to take image generation a step further with the option to edit generated images using follow-up prompts. The customizations can be anything from highlighting the image subject to enhancing colors and modifying the background. For Copilot Pro users, additional features such as resizing the images and changing the aspect ratio are also available.

Why does this matter?

Copilot unifies the AI experience for users on all major platforms by enhancing the experience on mobile platforms and combining text and image generative abilities. Adding additional features to the image generation model greatly enhances the usability and accuracy of the final output for users.

Source

Deepmind presents ‘self-discover’ framework for LLMs improvement

Google Deepmind, with the University of Southern California, has proposed a ‘self-discover’ prompting framework to enhance the performance of LLMs. Models such as GPT-4 and Google’s Palm 2 have witnessed a performance improvement on challenging reasoning benchmarks by 32% compared to the Chain of Thought (CoT) framework.

The framework works by identifying the reasoning technique intrinsic to the task and then proceeds to solve the task with the discovered technique ideal for the task. This framework also works with 10 to 40 times less inference computation, which means that the output will be generated faster using the same computational resources.

Deepmind presents ‘self-discover’ framework for LLMs improvement

Why does this matter?

Improving the reasoning accuracy of an LLM is largely beneficial to users as they can achieve the desired output with fewer prompts and with greater accuracy. Moreover, reducing the inference directly translates to lower computational resource consumption, leading to lower operating costs for enterprises.

Source

YouTube reveals plans to use AI tools to empower human creativity

YouTube CEO Neal Mohan revealed 4 new bets they have placed for 2024, with the first bet being on AI tools to empower human creativity on the platform. These AI tools include:

Dream Screen, which lets content creators generate custom backgrounds through AI with simple prompts of an idea.

Dream Track will allow content creators to generate custom music by just typing in the music theme and the artist they want to feature.

These new tools are mainly aimed to be used in YouTube Shorts and highlight a priority to move towards short-form content.

Why does this matter?

The democratization of AI tools for content creators allows them to offer better quality content to their viewers, which collectively boosts the quality of engagement on the platform. This also lowers the bar to entry for many aspiring artists and lets them create quality content without the added difficulty of generating custom video assets.

Source

What else is happening in AI on February 08th 2024

OpenAI forms a new team for child safety research.

OpenAI revealed the existence of a child safety team through their careers page, where they had open positions for a child safety enforcement specialist. The team will study and review AI-generated content for “sensitive content” to ensure that the generated content aligns with their platform policy. This is to prevent the misuse of OpenAI’s AI tools by underage users. (Link)

Elon Musk to financially support efforts to use AI to decipher Roman scrolls.

Elon Musk shared on X that the Musk Foundation will fund the effort to decipher the scrolls charred by the volcanic eruption of Mt.Vesuvius. The project run by Nat Freidman (former CEO of GitHub) states that the next stage of the effort will cost approximately $2 million, after which they should be able to read entire scrolls. The total cost to decipher all the discovered scrolls is estimated to be around $10 million. (Link)

Microsoft’s Satya Nadella urges India to capitalize on the opportunity of AI.

The CEO of Microsoft, Satya Nadella, at the Taj Mahal Hotel in Mumbai, expressed how India has an unprecedented opportunity to capitalize on the AI wave owing to the 5 million+ programmers in the country. He also stated that Microsoft will help train over 2 million employees in India with the skills required for AI development. (Link)

OpenAI introduces the creation of endpoint-specific API keys for better security.

The OpenAI Developers account on X announced their latest feature for developers to create endpoint-specific API keys. These special API keys allow for granular access and better security as they will only let specific registered endpoints access the API. (Link)

Ikea introduces a new ChatGPT-powered AI assistant for interior design.

On the OpenAI GPT store, Ikea launched its AI assistant, which helps users envision and draw inspiration to design their interior spaces using Ikea products. The AI assistant helps users input specific dimensions, budgets, preferences, and requirements for personalized furniture recommendations through a familiar ChatGPT-style window. (Link)

OpenAI is developing two AI agents to automate entire work processes

OpenAI is developing two AI agents aimed at automating complex tasks; one is device-specific for tasks like data transfer and filling out forms, while the other focuses on web-based tasks such as data collection and booking tickets.
The company aims to evolve ChatGPT into a super-smart personal assistant for work, capable of performing tasks in the user’s style, incorporating the latest data, and potentially being marketed as a standalone product or part of a software suite.
OpenAI’s efforts complement trends where companies like Google and startups are working towards AI agents capable of carrying out actions on behalf of users.
Source

Disney takes a $1.5B stake in Epic Games to build an ‘entertainment universe’ with Fortnite

Disney invests $1.5 billion in Epic Games to help create a new open games and entertainment universe, integrating characters and stories from franchises like Marvel, Star Wars, and Disney itself.
This collaboration aims to extend beyond traditional gaming, allowing players to interact, create, and share content within a persistent universe powered by Unreal Engine.
The partnership builds on previous collaborations between Disney and Epic Games, signaling Disney’s largest venture into the gaming world and hinting at future integration of gaming and entertainment experiences.

Google Bard rebrands as ‘Gemini’ with new Android app and Advanced model

Google has renamed its AI and related applications to Gemini, introducing a dedicated Android app and incorporating features formerly known as Duet AI in Google Workspace into the Gemini brand.
Gemini will replace Google Assistant as the default AI assistant on Android devices and is designed to be a comprehensive tool that is conversational, multimodal, and highly helpful.
Alongside the rebranding, Google announced the Gemini Ultra 1.0, a superior version of its large language model available through a new $20-monthly Google One AI Premium plan, aiming to set new benchmarks in AI capabilities.

Microsoft upgrades Copilot with enhanced image editing features, new AI model

Microsoft launched a new version of its Copilot artificial intelligence chatbot, featuring enhanced capabilities for users to create and edit images with natural language prompts.
The update introduces an AI model named Deucalion to enhance the “Balanced” mode of Copilot, promising richer and faster responses, alongside a redesigned user interface for better usability.
Additionally, Microsoft plans to further expand Copilot’s features, hinting at upcoming extensions and plugins to enhance functionality.

A Daily Chronicle of AI Innovations in February 2024 – Day 07: AI Daily News – February 07th, 2024

Apple’s MGIE: Making sky bluer with each prompt

Apple released a new open-source AI model called MGIE(MLLM Guided Image Editing). It has editing capabilities based on natural language instructions. MGIE leverages multimodal large language models to interpret user commands and perform pixel-level image manipulation. It can handle editing tasks like Photoshop-style modifications, optimizations, and local editing.

MGIE integrates MLLMs into image editing in two ways. First, it uses MLLMs to understand the user input, deriving expressive instructions. For example, if the user input is “make sky more blue,” the AI model creates an instruction, “increase the saturation of sky region by 20%.” The second usage of MLLM is to generate the output image.

Why does this matter?

MGIE from Apple is a breakthrough in the field of instruction-based image editing. It is an AI model focusing on natural language instructions for image manipulation, boosting creativity and accuracy. MGIE is also a testament to the AI prowess that Apple is developing, and it will be interesting to see how it leverages such innovations for upcoming products.

Source

Meta will label your content if you post an AI-generated image

Meta is developing advanced tools to label metadata for each image posted on their platforms like Instagram, Facebook, and Threads. Labeling will be aligned with “AI-generated” information in the C2PA and IPTC technical standards. These standards will allow Meta to detect AI-generated images from other platforms like Google, OpenAI, Microsoft, Adobe, Midjourney, and Shutterstock.

Meta wants to differentiate between human-generated and AI-generated content on its platform to reduce misinformation. However, this tool is also limited, as it can only detect still images. So, AI-generated video content still goes undetected on Meta platforms.

Why does this matter?

The level of misinformation and deepfakes generated by AI has been alarming. Meta is taking a step closer to reducing misinformation by labeling metadata and declaring which images are AI-generated. It also aligns with the European Union’s push for tech giants like Google and Meta to label AI-generated content.

Source

Smaug-72B: The king of open-source AI is here!

Abacus AI recently released a new open-source language model called Smaug-72B. It outperforms GPT-3.5 and Mistral Medium in several benchmarks. Smaug 72B is the first open-source model with an average score of over 80 in major LLM evaluations. According to the latest rankings from Hugging Face, It is one of the leading platforms for NLP research and applications.

Smaug 72B is a fine-tuned version of Qwn 72B, a powerful language model developed by a team of researchers at Alibaba Group. It helps enterprises solve complex problems by leveraging AI capabilities and enhancing automation.

Why does this matter?

Smaug 72B is the first open-source model to achieve an average score of 80 on the Hugging Face Open LLM leaderboard. It is a breakthrough for enterprises, startups, and small businesses, breaking the monopoly of big tech companies over AI innovations.

Source

What Else Is Happening in AI on February 07th, 2024

OpenAI introduces watermarks to DALL-E 3 for content credentials.

OpenAI has added watermarks to the image metadata, enhancing content authenticity. These watermarks will distinguish between human and AI-generated content verified through websites like “Content Credentials Verify.” Watermarks will be added to images from the ChatGPT website and DALL-E 3 API, which will be visible to mobile users starting February 12th. However, the feature is limited to still images only. (Link)

Microsoft introduces Face Check for secure identity verification.

Microsoft has unveiled “Face Check,” a new facial recognition feature, as part of its Entra Verified ID digital identity platform. Face Check provides an additional layer of security for identity verification by matching a user’s real-time selfie with their government ID or employee credentials. Azure AI services power face check and aims to enhance security while respecting privacy and compliance through a partnership approach. Microsoft’s partner BEMO has already implemented Face Check for employee verification(Link)

Stability AI has launched an upgraded version of its Stable Video Diffusion (SVD).

Stability AI has launched SVD 1.1, an upgraded version of its image-to-video latent diffusion model, Stable Video Diffusion (SVD). This new model generates 4-second, 25-frame videos at 1024×576 resolution with improved motion and consistency compared to the original SVD. It is available via Hugging Face and Stability AI subscriptions. (Link)

CheXagent has introduced a new AI model for automated chest X-ray interpretation.

CheXagent, developed in partnership with Stability AI by Stanford University, is a foundation model for chest X-ray interpretation. It automates the analysis and summary of chest X-ray images for clinical decision-making. CheXagent combines a clinical language model, a vision encoder, and a network to bridge vision and language. CheXbench is available to evaluate the performance of foundation models on chest X-ray interpretation tasks. (Link)

LinkedIn launched an AI feature to introduce users to new connections.

LinkedIn launched a new AI feature that helps users start conversations. Premium subscribers can use this feature when sending messages to others. The AI uses information from the subscriber’s and the other person’s profiles to suggest what to say, like an introduction or asking about their work experience. This feature was initially available for recruiters and has now been expanded to help users find jobs and summarize posts in their feeds. (Link)

Apple releases a new AI model

Apple has released “MGIE,” an open-source AI model for instruction-based image editing, utilizing multimodal large language models to interpret instructions and manipulate images.
MGIE offers features like Photoshop-style modification, global photo optimization, and local editing, and can be used through a web demo or integrated into applications.
The model is available as an open-source project on GitHub and Hugging Face Spaces.

Apple still working on foldable iPhones and iPads

Apple is developing “at least two” foldable iPhone prototypes inspired by the design of Samsung’s Galaxy Z Flip, though production is not planned for 2024 or 2025.
The company faces challenges in creating a foldable iPhone that matches the thinness of current models while accommodating battery and display needs.
Apple is also working on a folding iPad, approximately the size of an iPad Mini, aiming to launch a seven- or eight-inch model around 2026 or 2027.

Deepfake ‘face swap’ attacks surged 704% last year, study finds. Link

Deepfake “face swap” attacks increased by 704% from the first to the second half of 2023, as reported by iProov, a British biometric firm.
The surge in attacks is attributed to the growing ease of access to generative AI tools, making sophisticated face swaps both user-friendly and affordable.
Deepfake scams, including a notable case involving a finance worker in Hong Kong losing $25mln, highlight the significant threat posed by these technologies.

Humanity’s most distant space probe jeopardized by computer glitch

A computer glitch that began on November 14 has compromised Voyager 1’s ability to send back telemetry data, affecting insight into the spacecraft’s condition.
The glitch is suspected to be due to a corrupted memory bit in the Flight Data Subsystem, making it challenging to determine the exact cause without detailed data.
Despite the issue, signals received indicate Voyager 1 is still operational and receiving commands, with efforts ongoing to resolve the telemetry data problem.

A Daily Chronicle of AI Innovations in February 2024 – Day 06: AI Daily News – February 06th, 2024

Qwen 1.5: Alibaba’s 72 B, multilingual Gen AI model

Alibaba has released Qwen 1.5, the latest iteration of its open-source generative AI model series. Key upgrades include expanded model sizes up to 72 billion parameters, integration with HuggingFace Transformers for easier use, and multilingual capabilities covering 12 languages.

Comprehensive benchmarks demonstrate significant performance gains over the previous Qwen version across metrics like reasoning, human preference alignment, and long-context understanding. They compared Qwen1.5-72B-Chat with GPT-3.5, and the results are shown below:

The unified release aims to provide researchers and developers an advanced foundation model for possible downstream applications. Quantized versions allow low-resource deployment. Overall, Qwen 1.5 represents steady progress towards Alibaba’s goal of creating a “truly ‘good” generative model aligned with ethical objectives.

Why does this matter?

This release signals Alibaba’s intent to compete with Big Tech firms in steering the AI race. The upgraded model enables researchers and developers to create more capable assistants and tools. Qwen 1.5’s advancements could enhance education, healthcare, and sustainability solutions.

Source

AI software reads ancient words unseen since Caesar’s era

Nat Friedman (former CEO of Github) uses AI to decode ancient Herculaneum scrolls charred in the 79AD eruption of Mount Vesuvius. These unreadable scrolls are believed to contain a vast trove of texts that could reshape our view of figures like Caesar and Jesus Christ. Past failed attempts to unwrap them physically led Brent Seales to pioneer 3D scanning methods. However, the initial software struggled with the complexity.

A $1 million AI contest was launched ten months ago, attracting coders worldwide. Contestants developed new techniques, exposing ink patterns invisible to the human eye. The winning method by Luke Farritor and the team successfully reconstructed over a dozen readable columns of Greek text from one scroll. While not yet revelatory, this breakthrough after centuries has scholars hopeful more scrolls can now be unveiled using similar AI techniques, potentially surfacing lost ancient works.

Why does this matter?

The ability to reconstruct lost ancient knowledge illustrates AI’s immense potential to reveal invisible insights. Just like how technology helps discover hidden oil resources, AI could unearth ‘info treasures’ expanding our history, science, and literary canons. These breakthroughs capture the public imagination and signal a new data-uncovering AI industry.

Source

Roblox users can chat cross-lingually in milliseconds

Roblox has developed a real-time multilingual chat translation system, allowing users speaking different languages to communicate seamlessly while gaming. It required building a high-speed unified model covering 16 languages rather than separate models. Comprehensive benchmarks show the model outperforms commercial APIs in translating Roblox slang and linguistic nuances.

The sub-100 millisecond translation latency enables genuine cross-lingual conversations. Roblox aims to eventually support all linguistic communities on its platform as translation capabilities expand. Long-term goals include exploring automatic voice chat translation to better convey tone and emotion. Overall, the specialized AI showcases Roblox’s commitment to connecting diverse users globally by removing language barriers.

Why does this matter?

It showcases AI furthering connection and community-building online, much like transport innovations expanding in-person interactions. Allowing seamless cross-cultural communication at scale illustrates tech removing barriers to global understanding. Platforms facilitating positive societal impacts can inspire user loyalty amid competitive dynamics.

Source

What Else Is Happening in AI on February 06th, 2024

Semafor tests AI for responsible reporting

News startup Semafor launched a product called Signals – AI-aided curation of top stories by its reporters. An internal search tool helps uncover diverse sources in multiple languages. This showcases responsibly leveraging AI to enhance human judgment as publishers adapt to changes in consumer web habits. (Link)

Bumble’s new AI feature sniffs out fakes for safer matchmaking

Bumble has launched a new AI tool called Deception Detector to proactively identify and block fake profiles and scams. Testing showed it automatically blocked 95% of spam accounts, reducing user reports by 45%. This builds on Bumble’s efforts to use AI to make its dating and friend-finding platforms safer. (Link)

Huawei repurposes factory to prioritize AI chip production over its bestselling phones

Huawei is slowing production of its popular Mate 60 phones to ramp up manufacturing of its Ascend AI chips instead, due to growing domestic demand. This positions Huawei to boost China’s AI industry, given US export controls limiting availability of chips like Nvidia’s. It shows the strategic priority of AI for Huawei and China overall. (Link)

UK to spend $125M+ to tackle challenges around AI

The UK government will invest over $125 million to support responsible AI development and position the UK as an AI leader. This will fund new university research hubs across the UK, a partnership with the US on the responsible use of AI, regulators overseeing AI, and 21 projects to develop ML technologies to drive productivity. (Link)

Europ Assistance partnered with TCS to boost IT operations with AI

Europ Assistance, a leading global assistance and travel insurance company, has selected TCS as its strategic partner to transform its IT operations using AI. By providing real-time insights into Europ Assistance’s technology stack, TCS will support their business growth, improve customer service delivery, and enable the company to achieve its mission of providing “Anytime, Anywhere” services across 200+ countries. (Link)

AI reveals hidden text of 2,000-year-old scroll

A group of classical scholars, assisted by three computer scientists, has partially decoded a Roman scroll buried in the Vesuvius eruption in A.D. 79 using artificial intelligence and X-ray technology.
The scroll, part of the Herculaneum Papyri, is believed to contain texts by Philodemus on topics like food and music, revealing insights into ancient Roman life.
The breakthrough, facilitated by a $700,000 prize from the Vesuvius Challenge, led to the reading of over 2,000 Greek letters from the scroll, with hopes to decode 85% of it by the end of the year.

Adam Neumann wants to buy WeWork

Adam Neumann, ousted CEO and co-founder of WeWork, expressed interest in buying the company out of bankruptcy, claiming WeWork has ignored his attempts to get more information for a bid.
Neumann’s intent to purchase WeWork has been supported by funding from Dan Loeb’s hedge fund Third Point since December 2023, though WeWork has shown disinterest in his offer.
Despite WeWork’s bankruptcy and prior refusal of a $1 billion funding offer from Neumann in October 2022, Neumann believes his acquisition could offer valuable synergies and management expertise.

Midjourney hires veteran Apple engineer to build its ‘Orb’

Generative AI startup Midjourney has appointed Ahmad Abbas, a former Apple Vision Pro engineer, as head of hardware to potentially develop a project known as the ‘Orb’ focusing on 3D data capture and AI-generated content.
Abbas has extensive experience in hardware engineering, including his time at Apple and Elon Musk’s Neuralink, and has previously worked with Midjourney’s founder, David Holz, at Leap Motion.
While details are scarce, the ‘Orb’ may relate to generating and managing 3D environments and could signify Midjourney’s entry into creating hardware aimed at real-time generated video games and AI-powered 3D worlds.

Meta to start labeling AI-generated images

Meta is expanding the labeling of AI-generated imagery on its platforms, including content created with rivals’ tools, to improve transparency and detection of synthetic content.
The company already labels images created by its own “Imagine with Meta” tool but plans to extend this to images generated by other companies’ tools, focusing on elections around the world.
Meta is also exploring the use of generative AI in content moderation, while acknowledging challenges in detecting AI-generated videos and audio, and aims to require user disclosure for synthetic content.

Bluesky opens its doors to the public

Bluesky, funded by Twitter co-founder Jack Dorsey and aiming to offer an alternative to Elon Musk’s X, is now open to the public after being invite-only for nearly a year.
The platform, notable for its decentralized infrastructure called the AT Protocol and open-source code, allows developers and users greater control and customization, including over content moderation.
Bluesky challenges existing social networks with its focus on user experience and is preparing to introduce open federation and content moderation tools to enhance its decentralized social media model.

Bumble’s new AI tool identifies and blocks scam accounts, fake profiles

Bumble has introduced a new AI tool named Deception Detector to identify and block scam accounts and fake profiles, which during tests blocked 95% of such accounts and reduced user reports of spam by 45%.
The development of Deception Detector is in response to user concerns about fake profiles and scams on dating platforms, with Bumble research highlighting these as major issues for users, especially women.
Besides Deception Detector, Bumble continues to enhance user safety and trust through features like Private Detector for blurring unsolicited nude images and AI-generated icebreakers in Bumble For Friends.

A Daily Chronicle of AI Innovations in February 2024 – Day 05: AI Daily News – February 05th, 2024

How to access Google Bard in Canada as of February 05th, 2024

Download the Opera browser and go to https://bard.google.com

This is How ChatGPT help me save $250.

TLDR: ChatGPT helped me jump start my hybrid to avoid towing fee $100 and helped me not pay the diagnostic fee $150 at the shop.

My car wouldn’t start this morning and it gave me a warning light and message on the car’s screen. I took a picture of the screen with my phone, uploaded it to ChatGPT 4 Turbo, described the make/model, my situation (weather, location, parked on slope), and the last time it had been serviced.

I asked what was wrong, and it told me that the auxiliary battery was dead, so I asked it how to jump start it. It’s a hybrid, so it told me to open the fuse box, ground the cable and connect to the battery. I took a picture of the fuse box because I didn’t know where to connect, and it told me that ground is usually black and the other part is usually red. I connected it and it started up. I drove it to the shop, so it saved me the $100 towing fee. At the shop, I told them to replace my battery without charging me the $150 “diagnostic fee,” since ChatGPT already told me the issue. The hybrid battery wasn’t the issue because I took a picture of the battery usage with 4 out of 5 bars. Also, there was no warning light. This saved me $250 in total, and it basically paid for itself for a year.

I can deal with some inconveniences related to copyright and other concerns as long as I’m saving real money. I’ll keep my subscription, because it’s pretty handy. Thanks for reading!

source: r/artificialintelligence

Top comment: I can’t wait until AI like this is completely integrated into a home system like Alexa, and we have a friendly voice that just walks us through everything.

Google MobileDiffusion: AI Image generation in <1s on phones

Google Research introduced MobileDifussion, which can generate images from Android and iPhone with a resolution of 512*512 pixels in about half a second. What’s impressive about this is its comparably small model size of just 520M parameters, which makes it uniquely suited for mobile deployment. This is significantly less than the Stable Diffusion and SDX, which boast a billion parameters.

MobileDiffusion has the capability to enable a rapid image generation experience while typing text prompts.

Google MobileDiffusion: AI Image generation in <1s on phones

Google researchers measured the performance of MobileDiffusion on both iOS and Android devices using different runtime optimizers.

Why does this matter?

MobileDifussion represents a paradigm shift in the AI image generation horizon, especially in the smartphone or mobile space. Image generation models like Stable Diffusion and DALL-E are billions of parameters in size and require powerful desktops or servers to run, making them impossible to run on a handset. With superior efficiency in terms of latency and size, MobileDiffusion has the potential to be a friendly option for mobile deployments.

Source

Hugging Face enables custom chatbot creation in 2-clicks

Hugging Face tech lead Philipp Schmid said users can now create custom chatbots in “two clicks” using “Hugging Chat Assistant.” Users’ creations are then publicly available. Schmid compares the feature to OpenAI’s GPTs feature and adds they can use “any available open LLM, like Llama2 or Mixtral.”

Why does this matter?

Hugging Face’s Chat Assistant has democratized AI creation and simplified the process of building custom chatbots, lowering the barrier to entry. Also, open-source means more innovation, enabling a more comprehensive range of individuals and organizations to harness the power of conversational AI.

Source

Google to release ChatGPT Plus competitor ‘Gemini Advanced’ next week

According to a leaked web text, Google might release its ChatGPT Plus competitor named “Gemini Advanced” on February 7th. This suggests a name change for the Bard chatbot after Google announced “Bard Advanced” at the end of last year. The Gemini Advanced ChatBot will be powered by the eponymous Gemini model in the Ultra 1.0 release.

According to Google, Gemini Advanced is far more capable of complex tasks like coding, logical reasoning, following nuanced instructions, and creative collaboration. Google also wants to include multimodal capabilities, coding features, and detailed data analysis. Currently, the model is optimized for English but can respond to other global languages sooner.

Why does this matter?

Google’s Gemini Advanced will be an answer for OpenAI’s ChatGPT Plus. It signals increasing competition in the AI language model market, potentially leading to improved features and services for users. The only question is whether Ultra can beat GPT-4, and if that’s the case, what counters can OpenAI do that will be interesting to see.

Source

What Else Is Happening in AI on February 05th, 2024

NYU’s latest AI innovation echoes a toddler’s language learning journey

New York University (NYU) researchers have developed an AI system to behave like a toddler and learn a new language precisely. For this purpose, the AI model uses video recording from a child’s perspective to understand the language and its meaning, respond to new situations, and learn from new experiences. (Link)

GenAI to disrupt 200K U.S. entertainment industry jobs by 2026

CVL Economics surveyed 300 executives from six U.S. entertainment industries between Nov 17 and Dec 22, 2023, to understand the impact of Generative AI. The survey found that 203,800 jobs could get disrupted in the entertainment space by 2026. 72% of the companies surveyed are early adopters, of which 25% already use it, and 47% plan to implement it soon. (Link)

Apple CEO Tim Cook hints at major AI announcement ‘later this year’

Apple CEO Tim Cook hinted at Apple making a major AI announcement later this year during a meeting with the analysts during the first-quarter earnings showcase. He further added that there’s a massive opportunity for Apple with Gen AI and AI as they look to compete with cutting-edge AI companies like Microsoft, Google, Amazon, OpenAI, etc. (Link)

The U.S. Police Department turns to AI to review bodycam footage

Over the last decade, U.S. police departments have spent millions of dollars to equip their officers with body-worn cameras that record their daily work. However, the data collected needs to be adequately analyzed to identify patterns. Now, the department is turning to AI to examine this stockpile of footage to identify problematic officers and patterns of behavior. (Link)

Adobe to provide support for Firefly in the latest Vision Pro release

Adobe’s popular image-generating software, Firefly, is now announced for the new version of Apple Vision Pro. It now joins the company’s previously announced Lightroom photo app. People expected Adobe Lightroom to be a native Apple Vision Pro app from launch, but now it’s adding Firefly AI, the GenAI tool that produces images based on text descriptions. (Link)

Deepfake costs company $25 million

Scammers utilized AI-generated deepfakes to impersonate a multinational company’s CFO in a video call, tricking an employee into transferring over $25 million.
The scam involved deepfake representations of the CFO and senior executives, leading the employee to believe the request for a large money transfer was legitimate.
Hong Kong police have encountered over 20 cases involving AI deepfakes to bypass facial recognition, emphasizing the increasing abuse of deepfake technology in fraud and identity theft. Read more.

Amazon finds $1B jackpot in its 100 million+ IPv4 address stockpile

The scarcity of IPv4 addresses, akin to digital real estate, has led Amazon Web Services (AWS) to implement a new pricing scheme charging $0.005 per public IPv4 address per hour, opening up a significant revenue stream.
With IPv4 addresses running out due to the limit of 4.3 billion unique IDs and increasing demand from the growth of smart devices, AWS urges a transition to IPv6 to alleviate shortage and high administrative costs.
Amazon controls nearly 132 million IPv4 addresses, with an estimated valuation of $4.6 billion; the new pricing strategy could generate between $400 million to $1 billion annually from their use in AWS services.

Meta oversight board calls company’s deepfake rule ‘incoherent’

The Oversight Board criticizes Meta’s current rules against faked videos as “incoherent” and urges the company to urgently revise its policy to better prevent harm from manipulated media.
It suggests that Meta should not only focus on how manipulated content is created but should also add labels to altered videos to inform users, rather than just relying on fact-checkers.
Meta is reviewing the Oversight Board’s recommendations and will respond publicly within 60 days, while the altered video of President Biden continues to spread on other platforms like X (formerly Twitter).
Read more

Snap lays off 10% of workforce to ‘reduce hierarchy’

Snapchat’s parent company, Snap, announced plans to lay off 10% of its workforce, impacting over 500 employees, as part of a restructuring effort to promote growth and reduce hierarchy.
The layoffs will result in pre-tax charges estimated between $55 million to $75 million, primarily for severance and related costs, with the majority of these costs expected in the first quarter of 2024.
The decision for a second wave of layoffs comes after a previous reorganization focused on reducing layers within the product team and follows a reported increase in user growth and a net loss in Q3 earnings

First UK patients receive experimental messenger RNA cancer therapy

A revolutionary new cancer treatment known as mRNA therapy has been administered to patients at Hammersmith hospital in west London. The trial has been set up to evaluate the therapy’s safety and effectiveness in treating melanoma, lung cancer and other solid tumours.

The new treatment uses genetic material known as messenger RNA – or mRNA – and works by presenting common markers from tumours to the patient’s immune system.

The aim is to help it recognise and fight cancer cells that express those markers.

“New mRNA-based cancer immunotherapies offer an avenue for recruiting the patient’s own immune system to fight their cancer,” said Dr David Pinato of Imperial College London, an investigator with the trial’s UK arm.

Read More..

Pinato said this research was still in its early stages and could take years before becoming available for patients. However, the new trial was laying crucial groundwork that could help develop less toxic and more precise new anti-cancer therapies. “We desperately need these to turn the tide against cancer,” he added.

A number of cancer vaccines have recently entered clinical trials across the globe. These fall into two categories: personalised cancer immunotherapies, which rely on extracting a patient’s own genetic material from their tumours; and therapeutic cancer immunotherapies, such as the mRNA therapy newly launched in London, which are “ready made” and tailored to a particular type of cancer.

The primary aim of the new trial – known as Mobilize – is to discover if this particular type of mRNA therapy is safe and tolerated by patients with lung or skin cancers and can shrink tumours. It will be administered alone in some cases and in combination with the existing cancer drug pembrolizumab in others.

Researchers say that while the experimental therapy is still in the early stages of testing, they hope it may ultimately lead to a new treatment option for difficult-to-treat cancers, should the approach be proven to be safe and effective.

Nearly one in two people in the UK will be diagnosed with cancer in their lifetime. A range of therapies have been developed to treat patients, including chemotherapy and immune therapies.

However, cancer cells can become resistant to drugs, making tumours more difficult to treat, and scientists are keen to seek new approaches for tackling cancers.

Preclinical testing in both cell and animal models of cancer provided evidence that new mRNA therapy had an effect on the immune system and could be offered to patients in early-phase clinical trials.

AI Coding Assistant Tools in 2024 Compared

The article explores and compares most popular AI coding assistants, examining their features, benefits, and transformative impact on developers, enabling them to write better code: 10 Best AI Coding Assistant Tools in 2024

GitHub Copilot
CodiumAI
Tabnine
MutableAI
Amazon CodeWhisperer
AskCodi
Codiga
Replit
CodeT5
OpenAI Codex

Challenges for programmers

Programmers and developers face various challenges when writing code. Outlined below are several common challenges experienced by developers.

Syntax and Language Complexity: Programming languages often have intricate syntax rules and a steep learning curve. Understanding and applying the correct syntax can be challenging, especially for beginners or when working with unfamiliar languages.
Bugs and Errors: Debugging is an essential part of the coding process. Identifying and fixing bugs and errors can be time-consuming and mentally demanding. It requires careful analysis of code behavior, tracing variables, and understanding the flow of execution.
Code Efficiency and Performance: Writing code that is efficient, optimized, and performs well can be a challenge. Developers must consider algorithmic complexity, memory management, and resource utilization to ensure their code runs smoothly, especially in resource-constrained environments.
Compatibility and Integration: Integrating different components, libraries, or third-party APIs can introduce compatibility challenges. Ensuring all the pieces work seamlessly together and correctly handle data interchangeably can be complex.
Scaling and Maintainability: As projects grow, managing and scaling code becomes more challenging. Ensuring code remains maintainable, modular, and scalable can require careful design decisions and adherence to best practices.
Collaboration and Version Control: Coordinating efforts, managing code changes, and resolving conflicts can be significant challenges when working in teams. Ensuring proper version control and effective collaboration becomes crucial to maintain a consistent and productive workflow.
Time and Deadline Constraints: Developers often work under tight deadlines, adding pressure to the coding process. Balancing speed and quality becomes essential, and delivering code within specified timelines can be challenging.
Keeping Up with Technological Advancements: The technology landscape continually evolves, with new frameworks, languages, and tools emerging regularly. Continuous learning and adaptation pose ongoing challenges for developers in their professional journey.
Documentation and Code Readability: Writing clear, concise, and well-documented code is essential for seamless collaboration and ease of future maintenance. Ensuring code readability and comprehensibility can be challenging, especially when codebases become large and complex.
Security and Vulnerability Mitigation: Building secure software requires careful consideration of potential vulnerabilities and implementing appropriate security measures. Addressing security concerns, protecting against cyber threats, and ensuring data privacy can be challenging aspects of coding.

Now let’s see how this type of tool can help developers to avoid these challenges.

Advantages of using these tools

Reduce Syntax and Language Complexity: These tools help programmers tackle the complexity of programming languages by providing real-time suggestions and corrections for syntax errors. It assists in identifying and rectifying common mistakes such as missing brackets, semicolons, or mismatched parentheses.
Autocompletion and Intelligent Code Suggestions: It excels at autocompleting code snippets, saving developers time and effort. They analyze the context of the written code and provide intelligent suggestions for completing code statements, variables, method names, or function parameters.
These suggestions are contextually relevant and can significantly speed up the coding process, reduce typos, and improve code accuracy.
Error Detection and Debugging Assistance: AI Code assistants can assist in detecting and resolving errors in code. They analyze the code in real time, flagging potential errors or bugs and providing suggestions for fixing them.
By offering insights into the root causes of errors, suggesting potential solutions, or providing links to relevant documentation, these tools facilitate debugging and help programmers identify and resolve issues more efficiently.
Code Efficiency and Performance Optimization: These tools can aid programmers in optimizing their code for efficiency and performance. They can analyze code snippets and identify areas that could be improved, such as inefficient algorithms, redundant loops, or suboptimal data structures.
By suggesting code refactorings or alternative implementations, developers write more efficient code, consume fewer resources, and perform better.
Compatibility and Integration Support: This type of tool can assist by suggesting compatible libraries or APIs based on the project’s requirements. They can also help with code snippets or guide seamlessly integrating specific functionalities.
This support ensures smoother integration of different components, reducing potential compatibility issues and saving developers time and effort.
Code Refactoring and Improvement Suggestions: It can analyze existing codebases and suggest refactoring and improving code quality. They can identify sections of code that are convoluted, difficult to understand or violate best practices.
Through this, programmers enhance code maintainability, readability, and performance by suggesting more readable, modular, or optimized alternatives.
Collaboration and Version Control Management: Users can integrate with version control systems and provide conflict resolution suggestions to minimize conflicts during code merging. They can also assist in tracking changes, highlighting modifications made by different team members, and ensuring smooth collaboration within a project.
Documentation and Code Readability Enhancement: These tools can assist in improving code documentation and readability. They can prompt developers to add comments, provide documentation templates, or suggest more precise variable and function names.
By encouraging consistent documentation practices and promoting readable code, this tool can facilitate code comprehension, maintainability, and ease of future development.
Learning and Keeping Up with Technological Advancements: These tools can act as learning companions for programmers. They can provide documentation references, code examples, or tutorials to help developers understand new programming concepts, frameworks, or libraries. So developers can stay updated with the latest technological advancements and broaden their knowledge base.
Security and Vulnerability Mitigation: It can help programmers address security concerns by providing suggestions and best practices for secure coding. They can flag potential security vulnerabilities, such as injection attacks or sensitive data exposure, and offer guidance on mitigating them.

GitHub Copilot

GitHub Copilot, developed by GitHub in collaboration with OpenAI, aims to transform the coding experience with its advanced features and capabilities. It utilizes the potential of AI and machine learning to enhance developers’ coding efficiency, offering a variety of features to facilitate more efficient code writing.

Features:

Integration with Popular IDEs: It integrates with popular IDEs like Visual Studio, Neovim, Visual Studio Code, and JetBrains for a smooth development experience.
Support for multiple languages: Supports various languages such as TypeScript, Golang, Python, Ruby, etc.
Code Suggestions and Function Generation: Provides intelligent code suggestions while developers write code, offering snippets or entire functions to expedite the coding process and improve efficiency.
Easy Auto-complete Navigation: Cycle through multiple auto-complete suggestions with ease, allowing them to explore different options and select the most suitable suggestion for their code.

While having those features, Github Copilot includes some weaknesses that need to be considered when using it.

Code Duplication: GitHub Copilot generates code based on patterns it has learned from various sources. This can lead to code duplication, where developers may unintentionally use similar or identical code segments in different parts of their projects.
Inefficient code: It sometimes generates code that is incorrect or inefficient. This can be a problem, especially for inexperienced developers who may not be able to spot the errors.
Insufficient test case generation: When writing bigger codes, developers may start to lose touch with their code. So testing the code is a must. Copilot may lack the ability to generate a sufficient number of test cases for bigger codes. This can make it more difficult to identify and debug problems and to ensure the code’s quality.

Amazon CodeWhisperer

Amazon CodeWhisperer boosts developers’ coding speed and accuracy, enabling faster and more precise code writing. Amazon’s AI technology powers it and can suggest code, complete functions, and generate documentation.

Features:

Code suggestion: Offers code snippets, functions, and even complete classes based on the context of your code, providing relevant and contextually accurate suggestions. This aids in saving time and mitigating errors, resulting in a more efficient and reliable coding process.
Function completion: Helps complete functions by suggesting the following line of code or by filling in the entire function body.
Documentation generation: Generates documentation for the code, including function summaries, parameter descriptions, and return values.
Security scanning: It scans the code to identify possible security vulnerabilities. This aids in preemptively resolving security concerns, averting potential issues.
Language support: Available for various programming languages, including Python, JavaScript, C#, Rust, PHP, Kotlin, C, SQL, etc.
Integration with IDEs: It can be used with JetBrains IDEs, VS Code and more.

OpenAI Codex

This tool offers quick setup, AI-driven code completion, and natural language prompting, making it easier for developers to write code efficiently and effectively while interacting with the AI using plain English instructions.

Features:

Quick Setup: OpenAI Codex provides a user-friendly and efficient setup process, allowing developers to use the tool quickly and seamlessly.
AI Code Completion Tool: Codex offers advanced AI-powered code completion, providing accurate and contextually relevant suggestions to expedite the coding process and improve productivity.
Natural Language Prompting: With natural language prompting, Codex enables developers to interact with the AI more intuitively, providing instructions and receiving code suggestions based on plain English descriptions.

AI Weekly Rundown (January 27 to February 04th, 2024)

Major AI announcements from OpenAI, Google, Meta, Amazon, Apple, Adobe, Shopify, and more.

OpenAI announced new upgrades to GPT models + new features leaked
– They are releasing 2 new embedding models
– Updated GPT-3.5 Turbo with 50% cost drop
– Updated GPT-4 Turbo preview model
– Updated text moderation model
– Introducing new ways for developers to manage API keys and understand API usage
– Quietly implemented a new ‘GPT mentions’ feature to ChatGPT (no official announcement yet). The feature allows users to integrate GPTs into a conversation by tagging them with an ‘@’.
Prophetic introduces Morpheus-1, world’s 1st ‘multimodal generative ultrasonic transformer’
– This innovative AI device is crafted with the purpose of delving into the intricacies of human consciousness by facilitating control over lucid dreams. Morpheus-1 operates by monitoring sleep phases and gathering dream data to enhance its AI model. It is set to be accessible to beta users in the spring of 2024.
Google MobileDiffusion: AI Image generation in <1s on phones
– MobileDiffusion is Google’s new text-to-image tool tailored for smartphones. It swiftly generates top-notch images from text in under a second. With just 520 million parameters, it’s notably smaller than other models like Stable Diffusion and SDXL, making it ideal for mobile use.
New paper on MultiModal LLMs introduces over 200 research cases + 20 multimodal LLMs
– This paper ‘MM-LLMs’ discusses recent advancements in MultiModal LLMs which combine language understanding with multimodal inputs or outputs. The authors provide an overview of the design and training of MM-LLMs, introduce 26 existing models, and review their performance on various benchmarks. They also share key training techniques to improve MM-LLMs and suggest future research directions.
Hugging Face enables custom chatbot creation in 2-clicks
– The tech lead of Hugging Face, Philipp Schmid, revealed that users can now create their own chatbot in “two clicks” using the “Hugging Chat Assistant.” The creation made by the users will be publicly available to the rest of the community.
Meta released Code Llama 70B- a new, more performant version of its LLM for code generation.
It is available under the same license as previous Code Llama models. CodeLlama-70B-Instruct achieves 67.8 on HumanEval, beating GPT-4 and Gemini Pro.
Elon Musk’s Neuralink implants its brain chip in the first human
– Musk’s brain-machine interface startup, Neuralink, has successfully implanted its brain chip in a human. In a post on X, he said “promising” brain activity had been detected after the procedure and the patient was “recovering well”.
Google to release ChatGPT Plus competitor ‘Gemini Advanced’ next week
– Google might release its ChatGPT Plus competitor “Gemini Advanced” on February 7th. It suggests a name change for the Bard chatbot, after Google announced “Bard Advanced” at the end of last year. The Gemini Advanced Chatbot will be powered by eponymous Gemini model in the Ultra 1.0 release.
Alibaba announces Qwen-VL; beats GPT-4V and Gemini
– Alibaba’s Qwen-VL series has undergone a significant upgrade with the launch of two enhanced versions, Qwen-VL-Plus and Qwen-VL-Max.These two models perform on par with Gemini Ultra and GPT-4V in multiple text-image multimodal tasks.
GenAI to disrupt 200K U.S. entertainment industry jobs by 2026
– CVL Economics surveyed 300 executives from six U.S. entertainment industries between Nov 17 and Dec 22, 2023, to understand the impact of Generative AI. The survey found that 203,800 jobs could get disrupted in the entertainment space by 2026.
Apple CEO Tim Cook hints at major AI announcement ‘later this year’
– Apple CEO Tim Cook hinted at Apple making a major AI announcement later this year during a meeting with the analysts during the first-quarter earnings showcase. He further added that there’s a massive opportunity for Apple in Gen AI and AI horizon.
Microsoft released its annual ‘Future of Work 2023’ report with a focus on AI
– It highlights the 2 major shifts in how work is done in the past three years, driven by remote and hybrid work technologies and the advancement of Gen AI. This year’s edition focuses on integrating LLMs into work and offers a unique perspective on areas that deserve attention.
Amazon researchers have developed “Diffuse to Choose” AI tool
– It’s a new image inpainting model that combines the strengths of diffusion models and personalization-driven models, It allows customers to virtually place products from online stores into their homes to visualize fit and appearance in real-time.
Cambridge researchers developed a robotic sensor reading braille 2x faster than humans
– The sensor, which incorporates AI techniques, was able to read braille at 315 words per minute with 90% accuracy. It makes it ideal for testing the development of robot hands or prosthetics with comparable sensitivity to human fingertips.
Shopify boosts its commerce platform with AI enhancements
– Shopify is releasing new features for its Winter Edition rollout, including an AI-powered media editor, improved semantic search, ad targeting with AI, and more. The headline feature is Shopify Magic, which applies different AI models to assist merchants in various ways.
OpenAI is building an early warning system for LLM-aided biological threat creation
– In an evaluation involving both biology experts and students, it found that GPT-4 provides at most a mild uplift in biological threat creation accuracy. While this uplift is not large enough to be conclusive, the finding is a starting point for continued research and community deliberation.
LLaVA-1.6 released with improved reasoning, OCR, and world knowledge
– It supports higher-res inputs, more tasks, and exceeds Gemini Pro on several benchmarks. It maintains the data efficiency of LLaVA-1.5, and LLaVA-1.6-34B is trained ~1 day with 32 A100s. LLaVA-1.6 comes with base LLMs of different sizes: Mistral-7B, Vicuna-7B/13B, Hermes-Yi-34B.
Google rolls out huge AI updates:

Launches an AI image generator – ImageFX- It allows users to create and edit images using a prompt-based UI. It offers an “expressive chips” feature, which provides keyword suggestions to experiment with different dimensions of image creation. Google claims to have implemented technical safeguards to prevent the tool from being used for abusive or inappropriate content.
Google has released two new AI tools for music creation: MusicFX and TextFX- MusicFX generates music based on user prompts but has limitations with stringed instruments and filters out copyrighted content. TextFX, conversely, is a suite of modules designed to aid in the lyrics-writing process, drawing inspiration from rap artist Lupe Fiasco.
Google’s Bard is now powered by the Gemini Pro globally, supporting 40+ languages- The chatbot will have improved understanding and summarizing content, reasoning, brainstorming, writing, and planning capabilities. Google has also extended support for more than 40 languages in its “Double check” feature, which evaluates if search results are similar to what Bard generates.
Google’s Bard can now generate photos using its Imagen 2 text-to-image model, catching up to its rival ChatGPT Plus- Bard’s image generation feature is free, and Google has implemented safety measures to avoid generating explicit or offensive content.
Google Maps introduces a new AI feature to help users discover new places- The feature uses LLMs to analyze over 250M locations and contributions from over 300M Local Guides. Users can search for specific recommendations, and the AI will generate suggestions based on their preferences. Its currently being rolled out in the US.

Adobe to provide support for Firefly in the latest Vision Pro release
– Adobe’s popular image-generating software, Firefly, is now announced for the new version of Apple Vision Pro. It now joins the company’s previously announced Lightroom photo app.
Amazon launches an AI shopping assistant called Rufus in its mobile app
– Rufus is trained on Amazon’s product catalog and information from the web, allowing customers to chat with it to help find products, compare them, and get recommendations. The AI assistant will initially be available in beta to select US customers, with plans to expand to more users in the coming weeks.
Meta plans to deploy custom in-house chips later this year to power AI initiatives
– It could help reduce the company’s dependence on Nvidia chips and control the costs associated with running AI workloads. It could potentially save hundreds of millions of dollars in annual energy costs and billions in chip purchasing costs. The chip will work in coordination with commercially available GPUs.
And there was more…
– Google’s Bard surpasses GPT-4 to the Second spot on the leaderboard
– Google Cloud has partnered with Hugging Face to advance Gen AI development
– Arc Search combines a browser, search engine, and AI for unique browsing experience
– PayPal is set to launch new AI-based products
– NYU’s latest AI innovation echoes a toddler’s language learning journey
– Apple Podcasts in iOS 17.4 now offers AI transcripts for almost every podcast
– OpenAI partners with Common Sense Media to collaborate on AI guidelines
– Apple’s ‘biggest’ iOS update may bring a lot of AI to iPhones
– Shortwave email client will show AI-powered summaries automatically
– OpenAI CEO Sam Altman explores AI chip collaboration with Samsung and SK Group
– Generative AI is seen as helping to identify merger & acquisition targets
– OpenAI bringing GPTs (AI models) into conversations, Type @ and select the GPT
– Midjourney Niji V6 is out
– The U.S. Police Department turns to AI to review bodycam footage
– Yelp uses AI to provide summary reviews on its iOS app and much more
– The New York Times is creating a team to explore the use of AI in its newsroom
– Semron aims to replace chip transistors with ‘memcapacitors’
– Microsoft LASERs away LLM inaccuracies with a new method
– Mistral CEO confirms ‘leak’ of new open source model nearing GPT-4 performance
– Synthesia launches LLM-powered assistant to turn any text file into video in minutes
– Fashion forecasters are using AI to make decisions about future trends and styles
– Twin Labs automates repetitive tasks by letting AI take over your mouse cursor
– The Arc browser is incorporating AI to improve bookmarks and search results
– The Allen Institute for AI is open-sourcing its text-generating AI models
– Apple CEO Tim Cook confirmed that AI features are coming ‘later this year’
– Scientists use AI to create an early diagnostic test for ovarian cancer
– Anthropic launches ‘dark mode’ visual option for its Claude chatbot

A Daily Chronicle of AI Innovations in February 2024 – Day 03: AI Daily News – February 03rd, 2024

Google plans to launch ChatGPT Plus competitor next week

Google is set to launch “Gemini Advanced,” a ChatGPT Plus competitor, possibly on February 7th, signaling a name change from “Bard Advanced” announced last year.
The Gemini Advanced chatbot, powered by the Ultra 1.0 model, aims to excel in complex tasks such as coding, logical reasoning, and creative collaboration.
Gemini Advanced, likely a paid service, aims to outperform ChatGPT by integrating with Google services for task completion and information retrieval, while also incorporating an image generator similar to DALL-E 3 and reaching GPT-4 levels with the Gemini Pro model.
Source

Apple tested its self-driving car tech more than ever last year

Apple significantly increased its autonomous vehicle testing in 2023, almost quadrupling its self-driving miles on California’s public roads compared to the previous year.
The company’s testing peaked in August with 83,900 miles, although it remains behind more advanced companies like Waymo and Cruise in total miles tested.
Apple has reportedly scaled back its ambitions for a fully autonomous vehicle, now focusing on developing automated driving-assistance features similar to those offered by other automakers.
Source

Hugging Face launches open source AI assistant maker to rival OpenAI’s custom GPTs

Hugging Face has launched Hugging Chat Assistants, a free, customizable AI assistant maker that rivals OpenAI’s subscription-based custom GPTs.
The new tool allows users to choose from a variety of open source large language models (LLMs) for their AI assistants, unlike OpenAI’s reliance on proprietary models.
An aggregator page for third-party customized Hugging Chat Assistants mimics OpenAI’s GPT Store, offering users various assistants to choose from and use.
Source

Google’s MobileDiffusion generates AI images on mobile devices in less than a second

Google’s MobileDiffusion enables the creation of high-quality images from text on smartphones in less than a second, leveraging a model that is significantly smaller than existing counterparts.
It achieves this rapid and efficient text-to-image conversion through a novel architecture including a text encoder, a diffusion network, and an image decoder, producing 512 x 512-pixel images swiftly on both Android and iOS devices.
While demonstrating a significant advance in mobile AI capabilities, Google has not yet released MobileDiffusion publicly, viewing this development as a step towards making text-to-image generation widely accessible on mobile platforms.
Source

Meta warns investors Mark Zuckerberg’s hobbies could kill him in SEC filing

Meta warned investors in its latest SEC filing that CEO Mark Zuckerberg’s engagement in “high-risk activities” could result in serious injury or death, impacting the company’s operations.
The company’s 10-K filing listed combat sports, extreme sports, and recreational aviation as risky hobbies of Zuckerberg, noting his achievements in Brazilian jiu-jitsu and pursuit of a pilot’s license.
This cautionary statement, highlighting the potential risks of Zuckerberg’s personal hobbies to Meta’s future, was newly included in the 2023 filing and is a departure from the company’s previous filings.
Source

A Daily Chronicle of AI Innovations in February 2024 – Day 02: AI Daily News – February 02nd, 2024

Google bets big on AI with huge upgrades

1. Launches an AI image generator – ImageFX

It allows users to create and edit images using a prompt-based UI. It offers an “expressive chips” feature, which provides keyword suggestions to experiment with different dimensions of image creation. Google claims to have implemented technical safeguards to prevent the tool from being used for abusive or inappropriate content.

Additionally, images generated using ImageFX will be tagged with a digital watermark called SynthID for identification purposes. Google is also expanding the use of Imagen 2, the image model, across its products and services.

(Source)

2. Google has released two new AI tools for music creation: MusicFX and TextFX

MusicFX generates music based on user prompts but has limitations with stringed instruments and filters out copyrighted content.

TextFX, conversely, is a suite of modules designed to aid in the lyrics-writing process, drawing inspiration from rap artist Lupe Fiasco.

(Source)

3. Google’s Bard is now Gemini Pro-powered globally, supporting 40+ languages
The chatbot will have improved understanding and summarizing content, reasoning, brainstorming, writing, and planning capabilities. Google has also extended support for more than 40 languages in its “Double check” feature, which evaluates if search results are similar to what Bard generates.

(Source)

4. Google’s Bard can now generate photos using its Imagen 2 text-to-image model
Bard’s image generation feature is free, and Google has implemented safety measures to avoid generating explicit or offensive content.

(Source)

5. Google Maps introduces a new AI feature to help users discover new places
The feature uses LLMs to analyze over 250M locations and contributions from over 300M Local Guides. Users can search for specific recommendations, and the AI will generate suggestions based on their preferences. It’s currently being rolled out in the US.
(Source)

Amazon launches an AI shopping assistant for product recommendations

Amazon has launched an AI-powered shopping assistant called Rufus in its mobile app. Rufus is trained on Amazon’s product catalog and information from the web, allowing customers to chat with it to get help with finding products, comparing them, and getting recommendations.

The AI assistant will initially be available in beta to select US customers, with plans to expand to more users in the coming weeks. Customers can type or speak their questions into the chat dialog box, and Rufus will provide answers based on their training.

Why does this matter?

Rufus can save time and effort compared to traditional search and browsing. However, the quality of responses remains to be seen. For Amazon, this positions them at the forefront of leveraging AI to enhance the shopping experience. If effective, Rufus could increase customer engagement on Amazon and drive more sales. It also sets them apart from competitors.

Source

Meta to deploy custom in-house chips to reduce dependence on costly NVIDIA

Meta plans to deploy a new version of its custom chip aimed at supporting its AI push in its data centers this year, according to an internal company document. The chip, a second generation of Meta’s in-house silicon line, could help reduce the company’s dependence on Nvidia chips and control the costs associated with running AI workloads. The chip will work in coordination with commercially available graphics processing units (GPUs).

Why does this matter?

Meta’s deployment of its own chip could potentially save hundreds of millions of dollars in annual energy costs and billions in chip purchasing costs. It also gives them more control over the core hardware for their AI systems versus relying on vendors.

Source

AI, EO, DPA

The Biden administration plans to use the Defense Production Act to force tech companies to inform the government when they train AI models above a compute threshold.

Between the lines:

These actions are one of the first implementations of the broad AI Executive Order passed last year. In the coming months, more provisions from the EO will come into effect.
OpenAI and Google will likely need to disclose training details for the successors to GPT-4 and Gemini. The compute thresholds are still a pretty murky area – it’s unclear exactly when companies need to involve the government.
And while the EO was a direct response from the executive branch, Senators on both sides of the aisle are eager to take action on AI (and Big Tech more broadly).

Elsewhere in AI regulation:

Bipartisan senators unveil the DEFIANCE Act, which would federally criminalize deepfake porn, in the wake of Taylor Swift’s viral AI images.
The FCC wants to officially recognize AI-generated voices as “artificial,” which would make AI-powered robocalls illegal.
And a look at the US Copyright Office, which plans to release three very consequential reports this year on AI and copyright law.

What Else Is Happening in AI on February 02nd, 2024

The Arc browser is incorporating AI to improve bookmarks and search results

The new features in Arc for Mac and Windows include “Instant Links,” which allows users to skip search engines and directly ask the AI bot for specific links. Another feature, called Live Folders, will provide live-updating streams of data from various sources. (Link)

The Allen Institute for AI is open-sourcing its text-generating AI models

The model is OLMo, along with the dataset used to train them. These models are designed to be more “open” than others, allowing developers to use them freely for training, experimentation, and commercialization. (Link)

Apple CEO Tim Cook confirmed that AI features are coming ‘later this year’

This aligns with reports that iOS 18 could be the biggest update in the operating system’s history. Apple’s integration of AI into its software platforms, including iOS, iPadOS, and macOS, is expected to include advanced photo manipulation and word processing enhancements. This announcement suggests that Apple has ambitious plans to compete with Google and Samsung in the AI space. (Link)

Scientists use AI to create an early diagnostic test for ovarian cancer

Researchers at the Georgia Tech Integrated Cancer Research Center have developed a new test for ovarian cancer using AI and blood metabolite information. The test has shown 93% accuracy in detecting ovarian cancer in samples from the study group, outperforming existing tests. They have also developed a personalized approach to ovarian cancer diagnosis, using a patient’s individual metabolic profile to determine the probability of the disease’s presence. (Link)

Anthropic launches a new ‘dark mode’ visual option for its Claude chatbot. (Link)

Just click on the Profile > Appearance > Select Dark.

Meta’s plans to crush Google and Microsoft in AI

Mark Zuckerberg announced Meta’s intent to aggressively enter the AI market, aiming to outpace Microsoft and Google by leveraging the vast amount of data on its platforms.
Meta plans to make an ambitious long-term investment in AI, estimated to cost over $30 billion yearly, on top of its existing expenses.
The company’s strategy includes building advanced AI products and services for users of Instagram and WhatsApp, focusing on achieving general intelligence (AGI).

Tim Cook says big Apple AI announcement is coming later this year

Apple CEO Tim Cook confirmed that generative AI software features are expected to be released to customers later this year, during Apple’s quarterly earnings call.
The upcoming generative AI features are anticipated to be part of what could be the “biggest update” in iOS history, according to Bloomberg’s Mark Gurman.
Tim Cook emphasized Apple’s commitment to not disclose too much before the actual release but hinted at significant advancements in AI, including applications in iOS, iPadOS, and macOS.

Meta plans new in-house AI chip ‘Artemis’

Meta is set to deploy its new AI chip “Artemis” to reduce dependence on Nvidia chips, aiming for cost savings and enhanced computing to power AI-driven experiences.
By developing in-house AI silicon like Artemis, Meta aims to save on energy and chip costs while maintaining a competitive edge in AI technologies against rivals.
The Artemis chip is focused on inference processes, complementing the GPUs Meta uses, with plans for a broader in-house AI silicon project to support its computational needs.

Google’s Bard gets a free AI image generator to compete with ChatGPT

Google introduced a free image generation feature to Bard, using Imagen 2, to create images from text, offering competition to OpenAI’s multimodal chatbots like ChatGPT.
The feature introduces a watermark for AI-generated images and implements safeguards against creating images of known people or explicit content, but it’s not available in the EU, Switzerland, and the UK.
Bard with Gemini Pro has expanded to over 40 languages and 230 countries, and Google is also integrating Imagen 2 into its products and making it available for developers via Google Cloud Vertex AI.

Former CIA hacker sentenced to 40 years in prison

Joshua Schulte, a former CIA software engineer, was sentenced to 40 years in prison for passing classified information to WikiLeaks, marking the most damaging disclosure of classified information in U.S. history.
The information leaked, known as the Vault 7 release in 2017, exposed CIA’s hacking tools and methods, including techniques for spying on smartphones and converting internet-connected TVs into listening devices.
Schulte’s actions have been described as causing exceptionally grave harm to U.S. national security by severely compromising CIA’s operational capabilities and putting both personnel and intelligence missions at risk.

A Daily Chronicle of AI Innovations in February 2024 – Day 01: AI Daily News – February 01st, 2024

Shopify boosts its commerce platform with AI enhancements

Shopify unveiled over 100 new updates to its commerce platform, with AI emerging as a key theme. The new AI-powered capabilities are aimed at helping merchants work smarter, sell more, and create better customer experiences.

The headline feature is Shopify Magic, which applies different AI models to assist merchants in various ways. This includes automatically generating product descriptions, FAQ pages, and other marketing copy. Early tests showed Magic can create SEO-optimized text in seconds versus the minutes typically required to write high-converting product blurbs.

On the marketing front, Shopify is infusing its Audiences ad targeting tool with more AI to optimize campaign performance. Its new semantic search capability better understands search intent using natural language processing.

Why does this matter?

The AI advancements could provide Shopify an edge over rivals. In addition, the new features will help merchants capitalize on the ongoing boom in online commerce and attract more customers across different channels and markets. This also reflects broader trends in retail and e-commerce, where AI is transforming everything from supply chains to customer service.

Source

OpenAI explores how good GPT-4 is at creating bioweapons

OpenAI is developing a blueprint for evaluating the risk that a large language model (LLM) could aid someone in creating a biological threat.

In an evaluation involving both biology experts and students, it found that GPT-4 provides at most a mild uplift in biological threat creation accuracy. While this uplift is not large enough to be conclusive, the finding is a starting point for continued research and community deliberation.

Why does this matter?

LLMs could accelerate the development of bioweapons or make them accessible to more people. OpenAI is working on an early warning system that could serve as a “tripwire” for potential misuse and development of biological weapons.

Source

LLaVA-1.6: Improved reasoning, OCR, and world knowledge

LLaVA-1.6 releases with improved reasoning, OCR, and world knowledge. It even exceeds Gemini Pro on several benchmarks. Compared with LLaVA-1.5, LLaVA-1.6 has several improvements:

Increasing the input image resolution to 4x more pixels.
Better visual reasoning and OCR capability with an improved visual instruction tuning data mixture.
Better visual conversation for more scenarios, covering different applications. Better world knowledge and logical reasoning.
Efficient deployment and inference with SGLang.

Along with performance improvements, LLaVA-1.6 maintains the minimalist design and data efficiency of LLaVA-1.5. The largest 34B variant finishes training in ~1 day with 32 A100s.

A Daily Chronicle of AI Innovations in February 2024: LLaVA-1.6: Improved reasoning, OCR, and world knowledge — LLaVA-1.6: Improved reasoning, OCR, and world knowledge

Why does this matter?

LLaVA-1.6 is an upgrade to LLaVA-1.5, which has a simple and efficient design and great performance akin to GPT-4V.. LLaVA-1.5 has since served as the foundation of many comprehensive studies of data, models, and capabilities of large multimodal models (LMM) and has enabled various new applications. It shows the growing open-source AI community with fast-moving and freewheeling standards.

Source

The uncomfortable truth about AI’s impact on the workforce is playing out inside the big AI companies themselves.

The article discusses how the increasing investment in AI by tech giants like Microsoft and Google is affecting the global workforce. It highlights that these companies are slowing hiring in non-AI areas and, in some cases, cutting jobs in those divisions as they ramp up spending on AI. For example, Alphabet’s workforce decreased from over 190,000 employees in 2022 to around 182,000 at the end of 2023, with further layoffs in 2024. The article emphasizes that the integration of AI has raised concerns about job displacement and the need for a workforce strategy that integrates AI and keeps jobs through the modification of roles. It also mentions the importance of being adaptable and learning about the new wave of jobs that may emerge due to technological advances. The impact of AI on different types of jobs, including white-collar and high-paid positions, is also discussed

The article provides insights into how the adoption of AI by major tech companies is reshaping the workforce and the potential implications for job stability and creation. It underscores the need for a proactive workforce strategy to integrate AI and mitigate job displacement, emphasizing the importance of adaptability and learning to navigate the evolving job market. The discussion on the impact of AI on different types of jobs, including high-paid white-collar positions, offers a comprehensive view of the challenges and opportunities associated with AI integration in the workforce.

Cisco’s head of security thinks that we’re headed into an AI phishing nightmare

Source

The article discusses the potential impact of AI on cybersecurity, particularly in the context of phishing attacks. Jeetu Patel, Cisco’s executive vice president and general manager of security and collaboration, expresses concerns about the increasing sophistication of phishing scams facilitated by generative AI tools. These tools can produce written work that is challenging for humans to detect, making it easier for attackers to create convincing email traps. Patel emphasizes that this trend could make it harder for individuals to distinguish between legitimate activity and malicious attacks, posing a significant challenge for cybersecurity. The article highlights the potential implications of AI advancement for cybersecurity and the need for proactive measures to address these emerging threats.

The article provides insights into the growing concern about the potential misuse of AI in the context of cybersecurity, specifically in relation to phishing attacks. It underscores the need for heightened awareness and proactive strategies to counter the increasing sophistication of AI-enabled cyber threats. The concerns raised by Cisco’s head of security shed light on the evolving nature of cybersecurity challenges in the face of advancing AI technology, emphasizing the importance of staying ahead of potential threats and vulnerabilities.

What Else Is Happening in AI on February 01st, 2024

Microsoft LASERs away LLM inaccuracies.

Microsoft Research introduces Layer-Selective Rank Reduction (or LASER). While the method seems counterintuitive, it makes models trained on large amounts of data smaller and more accurate. With LASER, researchers can “intervene” and replace one weight matrix with an approximate smaller one. (Link)

Mistral CEO confirms ‘leak’ of new open source model nearing GPT-4 performance.

A user with the handle “Miqu Dev” posted a set of files on HuggingFace that together comprised a seemingly new open-source LLM labeled “miqu-1-70b.” Mistral co-founder and CEO Arthur Mensch took to X to clarify and confirm. Some X users also shared what appeared to be its exceptionally high performance at common LLM tasks, approaching OpenAI’s GPT-4 on the EQ-Bench. (Link)

Synthesia launches LLM-powered assistant to turn any text file or link into AI video.

Synthesia launched a tool to turn text-based sources into full-fledged synthetic videos in minutes. It builds on Synthesia’s existing offerings and can work with any document or web link, making it easier for enterprise teams to create videos for internal and external use cases. (Link)

AI is helping pick what you’ll wear in two years.

Fashion forecasters are leveraging AI to make decisions about the trends and styles you’ll be scrambling to wear. A McKinsey survey found that 73% of fashion executives said GenAI will be a business priority next year. AI predicts trends by scraping social media, evaluating runway looks, analyzing search data, and generating images. (Link)

Twin Labs automates repetitive tasks by letting AI take over your mouse cursor.

Paris-based startup Twin Labs wants to build an automation product for repetitive tasks, but what’s interesting is how they’re doing it. The company relies on models like GPT-4V) to replicate what humans usually do. Twin Labs is more like a web browser. The tool can automatically load web pages, click on buttons, and enter text. (Link)

SpaceX signs deal to launch private space station Link

Starlab Space has chosen SpaceX’s Starship megarocket to launch its large and heavy space station, Starlab, into orbit, aiming for a launch in a single flight.
Starlab, a venture between Voyager Space and Airbus, is designed to be fully operational from a single launch without the need for space assembly, targeting a 2028 operational date.
The space station will serve various users including space agencies, researchers, and companies, with SpaceX’s Starship being the only current launch vehicle capable of handling its size and weight.

Mistral CEO confirms ‘leak’ of new open source AI model nearing GPT-4 performance. Link

Mistral’s CEO Arthur Mensch confirmed that an ‘over-enthusiastic employee’ from an early access customer leaked a quantized and watermarked version of an old model, hinting at Mistral’s ongoing development of a new AI model nearing GPT-4’s performance.
The leaked model, labeled “miqu-1-70b,” was shared on HuggingFace and 4chan, attracting attention for its high performance on common language model benchmarks, leading to speculation it might be a new Mistral model.
Despite the leak, Mensch hinted at further advancements with Mistral’s AI models, suggesting the company is close to matching or even exceeding GPT-4’s performance with upcoming versions.

OpenAI says GPT-4 poses little risk of helping create bioweapons Link

OpenAI released a study indicating that GPT-4 poses at most slight risk in assisting in the creation of a bioweapon, according to their conducted research involving biology experts and students.
The study, motivated by concerns highlighted in President Biden’s AI Executive Order, aimed to reassure that while GPT-4 may slightly facilitate the creation of bioweapons, the impact is not statistically significant.
In experiments with 100 participants, GPT-4 marginally improved the ability to plan a bioweapon, with biology experts showing an 8.8% increase in plan accuracy, underscoring the need for further research on AI’s potential risks.

Microsoft, OpenAI to invest $500 million in AI robotics startup Link

Microsoft and OpenAI are leading a funding round to invest $500 million in Figure AI, a robotics startup competing with Tesla’s Optimus.
Figure AI, known for its commercial autonomous humanoid robot, could reach a valuation of $1.9 billion with this investment.
The startup, which partnered with BMW for deploying its robots, aims to address labor shortages and increase productivity through automation.

An AI headband to control your dreams. Link

Tech startup Prophetic introduced Halo, an AI-powered headband designed to induce lucid dreams, allowing wearers to control their dream experiences.
Prophetic is seeking beta users, particularly from previous lucid dream studies, to help create a large EEG dataset to refine Halo’s effectiveness in inducing lucid dreams.
Interested individuals can reserve the Halo headband with a $100 deposit, leading towards an estimated price of $2,000, with shipments expected in winter 2025.

Playing Doom using gut bacteria Link

The latest, weirdest way to play Doom involves using genetically modified E. coli bacteria, as explored in a paper by MIT’s Media Lab PhD student Lauren “Ren” Ramlan.
Ramlan’s method doesn’t turn E. coli into a computer but uses the bacteria’s ability to fluoresce as pixels on an organic screen to display Doom screenshots.
Although innovative, the process is impractical for gameplay, with the organic display managing only 2.5 frames in 24 hours, amounting to a game speed of 0.00003 FPS.

How to generate a PowerPoint in seconds with Copilot

A Daily Chronicle of AI Innovations in January 2024

I found an hidden Website which gives me all Ai tools at lowest price Website name VezyAi.com Thanks me Later
by /u/sandeep29x (Artificial Intelligence) on July 25, 2024 at 10:35 pm
https://vezyai.com submitted by /u/sandeep29x [link] [comments]
SearchGPT, OpenAI’s New Search AI, Looks a Lot Like Perplexity AI
by /u/Worst_Artist (Artificial Intelligence) on July 25, 2024 at 9:37 pm
submitted by /u/Worst_Artist [link] [comments]
Created an app that takes a link to any website documentation and generates an OpenAPI spec for it. (Took less than 5mins)
by /u/gvschaitanya (Artificial Intelligence) on July 25, 2024 at 2:05 pm
Documentation is a hard part for developers, isn't it? It can be a boring job. I've heard many of my colleagues mention this when we release new features. The core problem lies in having many developers working on a large feature, and sometimes parts of it are outsourced to freelancers. https://reddit.com/link/1ebvqlf/video/ev3xm6c79oed1/player Let's solve this problem in less than 10 minutes. submitted by /u/gvschaitanya [link] [comments]
[Discussion] Are AI models are becoming more capable of handling multi-step tasks independently, without needing pre-defined frameworks or extensive human guidance?
by /u/leao_26 (Artificial Intelligence) on July 25, 2024 at 6:54 am
submitted by /u/leao_26 [link] [comments]
Rob Thomas defines the concept of an "AI year": what previously took a year now happens in a week because the technology is moving so fast
by /u/Maxie445 (Artificial Intelligence) on July 25, 2024 at 5:23 am
submitted by /u/Maxie445 [link] [comments]
Researchers removed Llama 3's safety guardrails in just 3 minutes
by /u/Maxie445 (Artificial Intelligence) on July 25, 2024 at 5:19 am
submitted by /u/Maxie445 [link] [comments]
One-Minute Daily AI News 7/24/2024
by /u/Excellent-Target-847 (Artificial Intelligence) on July 25, 2024 at 4:41 am
Alphabet Reports 29% Jump in Profit as A.I. Efforts Begin to Pay Off.[1] Tesla’s profit margin is getting hammered by EV discounts and hefty AI spending.[2] After AgentGPT’s success, Reworkd pivots to web-scraping AI agents.[3] AI, Go Fetch! New NVIDIA NeMo Retriever Microservices Boost LLM Accuracy and Throughput.[4] Sources: [1] ~https://finance.yahoo.com/news/alphabets-earnings-set-the-stage-for-techs-ai-question-001439691.html~ [2] ~https://www.cnbc.com/2024/07/23/teslas-margin-getting-hammered-by-discounts-and-hefty-ai-spending.html~ [3] ~https://techcrunch.com/2024/07/24/reworkd-paul-graham-nat-friedman-daniel-gross-scrape-ai-agents/~ [4] ~https://blogs.nvidia.com/blog/nemo-retriever-microservices/~ submitted by /u/Excellent-Target-847 [link] [comments]
Harris likely to combine Biden AI policies with Silicon Valley-informed approach
by /u/thisisweird2021 (Artificial Intelligence) on July 25, 2024 at 4:34 am
submitted by /u/thisisweird2021 [link] [comments]
KLING AI Now Open For Worldwide Users
by /u/Embarrassed-Box-4861 (Artificial Intelligence) on July 25, 2024 at 3:34 am
Kuaishou's Kling AI has gone worldwide, ditching its China-only restrictions with the global launch of Kling AI version 1.0. The AI video generator platform is now accessible to users worldwide at KlingAI.com, where registration requires just an email address. Upon signing up, users receive 66 free daily credits for video creation. The platform supports both text-to-video and image-plus-text-to-video generation, putting it in direct competition with OpenAI's much-hyped Sora—which remains only available to select users. submitted by /u/Embarrassed-Box-4861 [link] [comments]
AI + human, what do you think?
by /u/Horizon__world (Artificial Intelligence) on July 24, 2024 at 9:42 pm
submitted by /u/Horizon__world [link] [comments]

August 1, 2023September 1, 2023

Unraveling August 2023: Spotlight on Generative AI

Unraveling August 2023: Spotlight on Generative AI, Tech, Sports and the Month’s Hottest Trends.

Welcome to the hub of the most intriguing and newsworthy trends of August 2023! In this era of rapid development, we know it’s hard to keep up with the ever-changing world of ai, technology, sports, entertainment, and global events. That’s why we’ve curated this one-stop blog post to provide a comprehensive overview of what’s making headlines and shaping conversations. From the mind-bending advancements in artificial intelligence to captivating news from the world of sports and entertainment, we’ll guide you through the highlights of the month. So sit back, get comfortable, and join us as we dive into the core of August 2023!

Amplify Your Brand’s Exposure with the AI Unraveled Podcast – Elevate Your Sales Today! Get your company/Product Featured in our AI Unraveled podcast here and spread the word to hundreds of thousands of AI enthusiasts around the world.

Unraveling August 2023: August 23-31, 2023

Latest AI News and Trends

OpenAI’s ChatGPT enters classrooms

OpenAI has released a guide for teachers using ChatGPT in their classroom. This guide includes suggested prompts, explanations about ChatGPT’s functionality and limitations, as well as insights into AI detectors and bias.

The company also highlights stories of educators successfully using ChatGPT to enhance student learning and provides prompts to help teachers get started. Additionally, their FAQ section offers further resources and answers to common questions about teaching with and about AI.

Example prompts to get you started

Get the Read Aloud Wonderland Bedtime Adventure iBook here

Why does this matter?

OpenAI’s teaching with AI empowers teachers with resources and insights to effectively use ChatGPT in classrooms, benefiting students’ learning experiences. While Competitors like Bard, Bing, and Claude may face pressure to offer similar comprehensive guidance to educators. Failing to do so could put them at a disadvantage in the increasingly competitive AI education market.

Meta announced 2 new AI updates: DINOv2, FACET (FAirness in Computer Vision Evaluation)

Meta has announced the commercial relicensing and expansion of DINOv2, a computer vision model, under the Apache 2.0 license to give developers and researchers more flexibility for downstream tasks.

Meta also introduces FACET (FAirness in Computer Vision Evaluation), a benchmark for evaluating the fairness of computer vision models in tasks such as classification and segmentation. The dataset includes 32,000 images of 50,000 people, with demographic attributes such as perceived gender age group, and physical features.

Why does this matter?

FACET ensures more equitable experiences when interacting with computer vision technology, reducing the risk of bias based on demographics. On the other hand, DINOv2’s availability under the Apache 2.0 license as it empowers developers and researchers to create more versatile computer vision applications.

GoT enhances the LLM capabilities

The Graph of Thoughts (GoT) framework improves the capabilities of LLMs by modeling information as a graph. LLM thoughts are represented as vertices, and edges represent dependencies between these thoughts. GoT allows for combining thoughts, distilling networks of thoughts, and enhancing thoughts using feedback loops.

Advertise with us - Post Your Good Content Here
We are ranked in the Top 20 on Google

AI Dashboard is available on the Web, Apple, Google, and Microsoft, PRO version

It outperforms other paradigms like Chain-of-Thought or Tree of Thoughts (ToT) in various tasks, increasing sorting quality by 62% and reducing costs by over 31%. It is also extensible, allowing for new thought transformations and advancing prompting schemes.

Why does this matter?

Invest in your future today by enrolling in this Azure Fundamentals - Pass the Azure Fundamentals Exam with Ease: Master the AZ-900 Certification with the Comprehensive Exam Preparation Guide!

This advancement brings LLM reasoning closer to human thinking and brain mechanisms such as recurrence, both of which form complex networks. It makes AI models more versatile and adaptable, with implications on various domains.

Google announces a wave of AI innovations

Google announced a slew of massive AI updates at the Google Cloud Next 2023 event. Here are some key announcements:

Vertex AI extends enterprise-ready generative AI development with new models and tooling. Google Cloud gets a curated collection of models across first-party, open-source, and third-party models, including Meta’s Llama 2 and Code Llama, Falcon, Anthropic’s Claude 2, and more. Google’s foundation models– PaLM, Codey, and Imagen– also get several updates.
Powered by DeepMind, a new tool called SynthID helps watermark and identify synthetic images created by Imagen.
Google is expanding its AI-optimized infrastructure with the general availability of Cloud TPU v5e and Nvidia-powered A3 VMs.
Duet AI in Workspace (aiding tasks across meetings, documents, Google Chat, Gmail, and more) is now generally available, and Duet AI in Google Cloud (to assist in code refactoring, improving, etc.) is expanding its preview and will be generally available later this year
.
Duet AI in Google Cloud also includes advancements for software development, application infrastructure and operations, data analytics, accelerating and modernizing databases, and security operations.
Search Generative Experience (SGE) launches in the first countries outside the U.S. — India and Japan (with multilingual and local language support).

Why does it matter?

The advancements seem to offer a complete solution for AI, from computing infrastructure to end-to-end software and services that support the full lifecycle of model training, tuning, and serving at global scale. It will help organizations harness the full potential of AI with data and cloud through a unified foundation.

Introducing Zapier AI Chatbot: Create custom AI chatbots with no code

Now you can build your own AI-powered chatbot through Zapier Interfaces, its no-code, automation-powered app builder currently in beta. You also have a variety of sharing options, so you can embed chatbots on your website or limit access to your team or external stakeholders.

The base AI Chatbot model is GPT-3.5. With Interfaces Premium, you can connect to other models (like GPT-4) using an API key from your personal OpenAI account.

Why does this matter?

This makes it easier for businesses and individuals to create custom AI chatbots, no coding required. It democratizes AI chatbot development, potentially increasing their accessibility across various industries/applications and fostering innovation in AI.

Meta researchers find AI “Déjà Vu”ing: Suggested ways to address the privacy risks; Meta’s ImageBind: The ultimate fusion of 6 data types in 1 AI model; Meta’s Sandbox: Where AI meets advertising; Meta bets big on AI with custom chips & a supercomputer; Meta scaling Speech Technology to 1,100+ languages; Meta’s MusicGen: The LLaMA moment for music AI; Meta disclosed AI behind Facebook and Instagram recommendations; Meta merges ChatGPT & Midjourney into one; Meta unveils Llama 2, a worthy rival to ChatGPT; Meta-Transformer lets AI models process 12 modalities; Meta collabs with Qualcomm to enable on-device AI apps using Llama 2; Meta’s AudioCraft is AudioGen + MusicGen + EnCodec; Meta challenges OpenAI with code-gen free software; Meta’s SeamlessM4T: The first all-in-one, multilingual multimodal AI; Meta to rival GPT-4 with a free Llama 3?

Meta researchers find AI “Déjà Vu”ing: Suggested ways to address the privacy risks

Researchers at Meta recently discovered an anomaly common across most Self Supervised Learning (SSL) algorithms and call it Déjà Vu. They said SSL models can unintendedly memorize specific parts in individual training samples rather than learning semantically meaningful associations.

The report shares the details of studies around this unintended memorization and also explores ways of avoiding it.

"Pass the AWS Cloud Practitioner Certification with flying colors: Master the Exam with 300+ Quizzes, Cheat Sheets, Flashcards, and Illustrated Study Guides - 2024 Edition"

Meta’s ImageBind: The ultimate fusion of 6 data types in 1 AI model

Meta has announced the new open-source AI model called ‘ImageBind’ that links together multiple data streams- text, audio, visual data, temperature, and movement readings. ImageBind is the first to combine 6 data types into a single embedding space.

The company also notes that other streams of sensory input could be added to future models, including touch, speech, smell, and brain fMRI signals.

Meta’s Sandbox: Where AI meets advertising

Meta has introduced an AI Sandbox for advertisers, which includes features such as alternative copy generation, background creation through text prompts, and image cropping for Facebook or Instagram ads. This new tool aims to assist advertisers in creating more diverse and engaging content using AI.

"Become a Canada Expert: Ace the Citizenship Test and Impress Everyone with Your Knowledge of Canadian History, Geography, Government, Culture, People, Languages, Travel, Wildlife, Hockey, Tourism, Sceneries, Arts, and Data Visualization. Get the Top 1000 Canada Quiz Now!"

The tools are still in beta, but they have the potential to revolutionize how ads are created and delivered.

Meta bets big on AI with custom chips & a supercomputer

Meta is making a big bet on AI by developing custom chips and a supercomputer. The company is developing its own chips called the Meta Training and Inference Accelerator (MTIA), which will be optimized for AI workloads and allow for more efficient training and running of complex models.

In addition, Meta is building a supercomputer, which will be used to train large-scale AI models for natural language processing and computer vision. These investments aim to enable the development of more advanced products and services, such as virtual assistants and augmented reality applications.

Meta scaling Speech Technology to 1,100+ languages

Meta’s Massively Multilingual Speech (MMS) project aims to address the lack of speech recognition models for most of the world’s languages, introduced Introducing speech-to-text, text-to-speech. Combining self-supervised learning techniques with a new dataset containing labeled data for over 1,100 languages and unlabeled data for nearly 4,000 languages.

The MMS models outperform existing ones and cover 10 times as many languages. The project’s goal is to increase accessibility to information for people who rely on voice as their primary means of accessing information. The models and code are publicly available for further research and development. The project aims to contribute to the preservation of the world’s diverse languages.

Meta’s AI Segmentation Game Changer

Meta’s researchers have developed HQ-SAM (High-Quality Segment Anything Model), a new model that improves the segmentation capabilities of the existing SAM. SAM struggles to segment complex objects accurately, despite being trained with 1.1 billion masks. HQ-SAM is trained on a dataset of 44,000 fine-grained masks from various sources, achieving impressive results on nine segmentation datasets across different tasks.

HQ-SAM retains SAM’s prompt design, efficiency, and zero-shot generalizability while requiring minimal additional parameters and computation. Training HQ-SAM on the provided dataset takes only 4 hours on 8 GPUs.

Unlock the Secrets of Africa: Master African History, Geography, Culture, People, Cuisine, Economics, Languages, Music, Wildlife, Football, Politics, Animals, Tourism, Science and Environment with the Top 1000 Africa Quiz and Trivia. Get Yours Now!

Meta plans to put AI everywhere on its platforms

Meta has announced plans to integrate generative AI into its platforms, including Facebook, Instagram, WhatsApp, and Messenger. The company shared a sneak peek of AI tools it was building, including ChatGPT-like chatbots planned for Messenger and WhatsApp that could converse using different personas. It will also leverage its image generation model to let users modify images and create stickers via text prompts.

Meta’s MusicGen: The LLaMA moment for music AI

META released MusicGen, a controllable music generation model for producing high-quality music. MusicGen can be prompted by both text and melody.

The best thing is anyone can try it for free now. It uses a single-stage transformer language model with efficient token interleaving patterns, eliminating the need for multiple models.

MusicGen will generate 12 seconds of audio based on the description provided. You can optionally provide a reference audio from which a broad melody will be extracted. Then the model will try to follow both the description and melody provided. You can also use your own GPU or a Google Colab by following the instructions on their repo.

Meta’s new human-like AI model for image creation

Meta has introduced a new model, Image Joint Embedding Predictive Architecture (I-JEPA), based on Meta’s Chief AI Scientist Yann LeCun’s vision to make AI systems learn and reason like animals and humans. It is a self-supervised computer vision model that learns to understand the world by predicting it.

Dive into a comprehensive AWS Cloud Practitioner CLF-C02 Certification guide, masterfully weaving insights from Tutorials Dojo, Adrian Cantrill, Stephane Maarek, and AWS Skills Builder into one unified resource.

The core idea: It learns by creating an internal model of the outside world and comparing abstract representations of images. It uses background knowledge about the world to fill in missing pieces of images, rather than looking only at nearby pixels like other generative AI models.

Key takeaways: The model

Captures patterns and structures through self-supervised learning from unlabeled data.
Predicts missing information at a high level of abstraction, avoiding generative model limitations

Delivers strong performance on multiple computer vision tasks while also being computationally efficient. Less data, less time, and less compute.
Can be used for many different applications without needing extensive fine-tuning and is highly scalable.

Meta’s all-in-one generative speech AI model

Meta introduces Voicebox, the first generative AI model that can perform various speech-generation tasks it was not specifically trained to accomplish with SoTA performance. It can perform:

Text-to-speech synthesis in 6 languages
Noise removal
Content editing
Cross-lingual style transfer
Diverse sample generation

One of the main limitations of existing speech synthesizers is that they can only be trained on data that has been prepared expressly for that task. Voicebox is built upon the Flow Matching model, which is Meta’s latest advancement on non-autoregressive generative models that can learn highly non-deterministic mapping between text and speech.

Meta disclosed AI behind Facebook and Instagram recommendations

Meta is sharing 22 system cards that explain how AI-powered recommender systems work across Facebook and Instagram. These cards contain information and actionable insights everyone can use to understand and customize their specific AI-powered experiences in Meta’s products.

Moreover, Meta also shared its top ten most important prediction models rather than everything in the system to not dive into much technical detail can sometimes obfuscate transparency.

Using an input audio sample of just two seconds in length, Voicebox can match the sample’s audio style and use it for text-to-speech generation.

Meta plans to dethrone OpenAI and Google

Meta plans to release a commercial AI model to compete with OpenAI, Microsoft, and Google. The model will generate language, code, and images. It might be an updated version of Meta’s LLaMA, which is currently only available under a research license.

Meta’s CEO, Mark Zuckerberg, has expressed the company’s intention to use the model for its own services and make it available to external parties. Safety is a significant focus. The new model will be open source, but Meta may reserve the right to license it commercially and provide additional services for fine-tuning with proprietary data.

Tesla’s $300M AI cluster is going live today; OpenAI launches ChatGPT Enterprise, the most powerful ChatGPT version yet; Usage of ChatGPT among Americans rises, but only slightly; IBM’s new analog AI chip challenges Nvidia; AI’s promise and peril in cancer research; Daily AI Update News from Tesla, OpenAI, Microsoft, DoorDash, Uber, Yahoo, and Quora

Tesla’s $300M AI cluster is going live today

Tesla is launching its highly-anticipated supercomputer today. The machine, employing 10,000 Nvidia H100 compute GPUs, will be used for various AI applications. It is said to be one of the most powerful machines in the world.

But NVIDIA is struggling to keep up with the GPU demand. Thus, Tesla is investing over $1B to develop its own supercomputer, Dojo, built on the company’s hyper-optimized custom-designed chip. Tesla is also activating Dojo simultaneously. Take a look at Tesla’s internal forecast for the compute power of Dojo.

Why does it matter?

Elon Musk recently revealed that Tesla plans to spend over $2B on AI training in 2023 and is hiring reputed AI engineers. But this move gives Tesla unparalleled compute power. It also underscores Tesla’s commitment to overcoming computational bottlenecks in AI and should provide substantial advantages over its rivals. Elon might be the next big thing in AI. What do you think?

OpenAI launches ChatGPT Enterprise, the most powerful ChatGPT version yet

Open has launched ChatGPT Enterprise, the most powerful version of ChatGPT yet. It offers enterprise-grade security and privacy, features for large-scale deployments, unlimited higher-speed GPT-4 access, 32K context for faster processing of longer inputs, advanced data analysis capabilities, customization options, and much more. OpenAI is also working on more features and will launch them soon.

Why does it matter?

Cloud Certification made simple. Ace your exams with Djamgatech.

This is a simple and safe way of deploying ChatGPT into core operations at organizations. It could be a solution for big companies that have banned ChatGPT at work over privacy concerns, like Apple, Amazon, Citigroup, and more. Maybe, this can pave the way for truly widespread adoption of AI in the business world.

Usage of ChatGPT among Americans rises, but only slightly

A recent survey conducted in July by Pew Research Center reveals 18% of U.S. adults have ever used ChatGPT. While 16% of those who have heard of the tool and are employed say they have used it for tasks at work.

The statistic is consistent with a similar survey conducted in March by the Pew Research Center that showed 14% of U.S. adults had tried ChatGPT. And about one in ten working adults who had heard of ChatGPT used it at work.

While this shows increased adoption of ChatGPT among Americans, it is not a significant one in the grand scheme of AI adoption today. In fact, only a few think it will have a major impact on their job.

Why does this matter?

These findings suggest AI’s penetration remains gradual. It is also clear that there is still work to be done in educating and acclimating the workforce to the benefits and implications of generative AI. Plus, given the lingering concerns and uncertainties about ChatGPT’s prowess, maybe it is too early to start worrying about AI replacing jobs.

What Else Is Happening in AI

Microsoft infuses AI with human-like reasoning via an “Algorithm of Thoughts”.

DoorDash launches AI-powered voice ordering to answer calls and curate recommendations.

Uber is working on an AI chatbot for its food delivery app.

Yahoo Mail introduces new AI-powered capabilities, including a ‘Shopping Saver’ tool.

Generative inbreeding, akin to inbreeding in genetics, is a concern as AI systems training on AI-generated content can degrade their performance and distort human culture.

Tesla’s $300M AI cluster is going live today
– Tesla is launching its highly-anticipated supercomputer today. The machine, employing 10,000 Nvidia H100 compute GPUs, will be used for various AI applications.
– But NVIDIA is struggling to keep up with the GPU demand. Thus, Tesla is investing over $1B to develop its own supercomputer, Dojo, built on the company’s hyper-optimized custom-designed chip. Tesla is also activating Dojo simultaneously.

OpenAI launches ChatGPT Enterprise, the most powerful ChatGPT version yet
– It offers enterprise-grade security and privacy, features for large-scale deployments, unlimited higher-speed GPT-4 access, 32K context for faster processing of longer inputs, advanced data analysis capabilities, customization options, and much more. OpenAI is also working on more features and will launch them soon.

Usage of ChatGPT among Americans rises, but only slightly
– A recent survey conducted in July by Pew Research Center reveals 18% of U.S. adults have ever used ChatGPT. While 16% of those who have heard of the tool and are employed say they have used it for tasks at work. The statistic is consistent with a similar survey conducted in March by the center.
– While it shows increased adoption of ChatGPT among Americans, it is not a significant one in the grand scheme of AI adoption today. In fact, only a few think it will have a major impact on their job.

Microsoft infuses AI with human-like reasoning via an “Algorithm of Thoughts”
– The technique guides the language model through a more streamlined problem-solving path. It utilizes in-context learning, enabling the model to explore different solutions in an organized manner systematically. The result? Faster, less resource-intensive problem-solving.

DoorDash launches AI-powered voice ordering service
– It will answer calls and provide customers with curated recommendations.

Uber is working on an AI chatbot for its food delivery app
– It will offer recommendations to food-delivery customers and help them more quickly place orders.

Yahoo Mail introduces new AI-powered capabilities
– The rollout includes upgrades to several of Yahoo Mail’s existing AI features and introduces a new Shopping Saver tool.

Poe by Quora lets you use all the AI chatbots in one place
– Its goal is to be the web browser for accessing AI chatbots, and it just got a bunch of updates.

IBM’s new analog AI chip challenges Nvidia

IBM has developed an analog AI chip that’s up to 14 times more energy-efficient than current digital chips, addressing the power-hungry nature of generative AI.
The analog chip’s ability to manipulate analog signals and its human brain-like operation could potentially challenge Nvidia’s dominance in AI hardware.
IBM’s prototype chip demonstrated significant energy efficiency gains, encoding millions of memory devices and modeling parameters while performing computations directly within memory.

AI’s promise and peril in cancer research

UK-based biotech startup Etcembly used generative AI to develop a novel immunotherapy targeting hard-to-treat cancers, demonstrating AI’s potential for medical advancements.
However, risks of AI in healthcare are evident, as a study reveals that AI-generated cancer treatment plans, like those from ChatGPT, contained factual errors and contradictory information.
While AI-powered tools hold promise, their clinical deployment without rigorous validation could lead to dangerous missteps, highlighting the importance of skepticism and human consultation.

AI eye scans can predict Parkinson’s years before symptoms; AI model gives paralyzed woman the ability to speak through a digital avatar; Meta’s coding version of Llama-2, CoDeF ensures smooth AI-powered video edits; Nvidia just made $6 billion in pure profit over the AI boom; 6 Ways to Choose a Language Model; Hugging Face’s Safecoder lets businesses own their own Code LLMs; Google, Amazon, Nvidia, and others pour $235M into Hugging Face; Amazon levels up our sports viewing experience with AI; Daily AI Update News from Stability AI, NVIDIA, Figma, Google, Deloitte and much more…

Unraveling August 2023: August 22nd, 2023

Latest AI News and Trends on August 22nd, 2023

Linkedin: Building soft (human) skills remains key in the age of AI

Summary: A new LinkedIn report reveals that AI skills are spreading quickly globally, with major growth in AI job postings and professionals adding AI abilities.

Job postings mentioning AI skills like GPT and ChatGPT have risen dramatically, with a 21x increase since November 2022.
LinkedIn members adding AI skills to profiles is accelerating globally. The number of members with AI skills was 9x larger in June 2023 compared to January 2016.
Singapore, Finland, Ireland, India and Canada have the fastest AI skills adoption rates based on LinkedIn’s AI Skills Index.
47% of US executives believe using generative AI will boost productivity. 40% think it will help drive revenue growth.
84% of US members have jobs that could use AI to automate at least 25% of repetitive tasks. This will also increase demand for people skills.
In the US, the fastest-growing in-demand skills since November 2022 are: Flexibility +158%, Professional ethics +120%, Social perceptiveness +118%, Self-management +83%.
Communication remains the top skill in demand in US job postings, with people skills like flexibility growing the fastest since ChatGPT launched.
92% of executives agree people skills are more important than ever in an AI-driven world.

Why It Matters: AI is transforming and disrupting every industry for sure, but it will never disrupt humanity. Human skills (also called soft skills) like creativity and emotional intelligence will only become more important.

YouTube and Universal Music Partner to Launch ‘AI Incubator’

YouTube is partnering with Universal Music to launch an incubator focused on exploring the use of AI in music. The incubator will work with artists and musicians, including Anitta, ABBA’s Björn Ulvaeus, and Max Ricther, to gather insights on generative AI experiments and research. YouTube CEO Neal Mohan stated that the incubator will inform the company’s approach as it collaborates with innovative artists, songwriters, and producers.

YouTube also plans to invest in AI-powered technology, including enhancing its copyright management tool, Content ID, to protect viewers and creators.

Why does this matter?

By partnering with renowned artists, the AI incubator explores the potential of AI-generated music, spotlighting the intersection of technology and artistry. This collab not only underscores AI’s growing role in creative industries but also demonstrates how industry giants can collaborate to drive innovation and shape the future of music production.

Understanding Tree of Thoughts prompting technique and how it can improve problem solving in LLM’s

In the ever-evolving landscape of artificial intelligence, Large Language models (LLMs) like GPT-3/GPT-4/Claude-2 and others have exhibited astonishing capabilities across various domains, from mathematical problem-solving to creative writing. However, there’s been a limitation in their approach – the left-to-right, token-by-token decision-making process, which doesn’t always align with complex problem-solving scenarios that demand strategic planning and exploration.

But what if we could enable these LLMs to think more strategically, explore multiple reasoning paths, and evaluate the quality of their thoughts in a deliberate manner? Some researchers have created a framework called “Tree of Thoughts” (ToT) which aims to fix this by enhancing the problem-solving prowess of large language models.

The Essence of ToT

At its core, ToT reimagines the reasoning process as an intricate tree structure. Each branch of this tree represents an intermediate “thought” or a coherent chunk of text that serves as a crucial step toward reaching a solution. Think of it as a roadmap where each stop is a meaningful milestone in the journey towards problem resolution. For instance, in mathematical problem-solving, these thoughts could correspond to equations or strategies.

But ToT doesn’t stop there. It actively encourages the LM to generate multiple possible thoughts at each juncture, rather than sticking to a single sequential thought generation process, as seen in traditional chain-of-thought prompting. This flexibility allows the model to explore diverse reasoning paths and consider various options simultaneously.

Source: Yao et el. (2023)

The Power of Self-Evaluation

One of ToT’s defining features is the model’s ability to evaluate its own thoughts. It’s like having an inbuilt compass to assess the validity or likelihood of success for each thought. This self-evaluation provides a heuristic, a kind of mental scorecard, to guide the LM through its decision-making process. It helps the model distinguish between promising paths and those that may lead to dead ends.

Systematic Exploration

ToT takes strategic thinking up a notch by employing classic search algorithms such as breadth-first search or depth-first search to systematically explore the tree of thoughts. These algorithms allow the model to look ahead, backtrack when necessary, and branch out to consider different possibilities. It’s akin to a chess player contemplating multiple moves ahead before making a move.

Customizable and Adaptable

One of ToT’s strengths is its modularity. Every component, from thought representation to generation, evaluation, and search algorithm, can be customized to fit the specific problem at hand. No additional model training is needed, making it highly adaptable to various tasks.

Real-World Applications

The true litmus test for any AI framework is its practical applications. ToT has been put to the test across different challenges, including the Game of 24, Creative Writing, and Mini Crosswords. In each case, ToT significantly boosted the problem-solving capabilities of LLMs over standard prompting methods. For instance, in the Game of 24, success rates soared from a mere 4% with chain-of-thought prompting to an impressive 74% with ToT.

Source: Yao et el. (2023)

The above image is a visual representation of the Game of 24 which is a mathematical reasoning challenge where the goal is to use 4 input numbers and arithmetic operations to reach the target number 24.

The tree of thought (ToT) approach represents this as a search over possible intermediate equation “thoughts” that progressively simplify towards the final solution.

First, the language model proposes candidate thoughts that manipulate the inputs (e.g. (10 – 4)).

Next, it evaluates the promise of reaching 24 from each partial equation by estimating how close the current result is. Thoughts evaluated as impossible are pruned.

The process repeats, generating new thoughts conditioned on the remaining options, evaluating them, and pruning. This iterative search through the space of possible equations allows systematic reasoning.

For example, the model might first try (10 – 4), then build on this by proposing (6 x 13 – 9) which gets closer to 24. After several rounds of generation and evaluation, it finally produces a complete solution path like: (10 – 4) x (13 – 9) = 24.

By deliberating over multiple possible chains of reasoning, ToT allows more structured problem solving compared to solely prompting for the end solution.

What is Explainable AI? Which industries are meant for XAI?

Trained AI algorithms work by taking the input and providing the output without explaining its inner workings. XAI aims at pointing out the rationale behind any decision by AI in such a way that humans can interpret it.

Deep learning works with neural networks just like the human brain works with neurons, where it uses a massive amount of training data to learn and identify patterns. It would be very difficult, or rather impossible, to dig into the rationale behind Deep Learning’s decision. Decisions like credit card eligibility or loan sanction are quite important to be explained by XAI. However, a few wrong decisions would not impact much. Whereas, in the case of healthcare, as discussed earlier, a doctor could not provide the appropriate treatment without knowing the rationale behind AI’s decision. Surgery on the wrong organ could be fatal.

4 Principles of Explainable AI

The US National Institute of Standards and Technology has developed four principles as guidelines to adopt fundamental properties of Explainable Artificial Intelligence (XAI) efficiently and effectively. These principles apply individually and independently from each other and guide us to better understand the working of the AI models.

1. Explanation:

This principle obligates the AI to generate a comprehensive explanation for humans to understand the process of generating the decisions with the required evidence and reasons. The standard for this evidence and reasons is governed by the next three principles.

2. Meaningful:

This principle is satisfied when a stakeholder understands the explanation provided in the first guiding principle. The explanation should not be complex and understood by the users on a group as well as individual level.

3. Explanation Accuracy:

The accuracy at which the AI explains the complicated process of generating the output is critical. Accuracy metrics may differ for individual stakeholders in terms of their explanation. The expected accuracy is 100% for all the stakeholders to understand the logic.

4. Knowledge Limits:

The last principle of XAI explains that the model can only be operated under the special conditions it has been modeled for. It is expected to operate under its limited knowledge to avoid any sort of discrepancy or unjustified business outcomes.

How does XAI work?

These principles help us define the expected output from the XAI model and how an ideal XAI model should be. However, it doesn’t indicate how the output has been achieved. Subdividing the XAI into three categories to better understand the rationale:

1. Explainable data: What data is used to train the model? Why the particular data is selected? How much biased is the data?

2. Explainable predictions: What features did the model use that lead to the particular output?

3. Explainable algorithms: How is the model layered? How do these layers lead to the prediction?

Based on individual instances, the explainability may change. For example, the neural network can only be explained using the Expainable Data category. Research is ongoing that is focused on finding ways to explain the predictions and algorithms. At present there are two approaches:

a. Proxy Modeling:

A different model from the original is used to approximate the actual model. This may result in different outcomes from the true model outcomes, as it is just an approximation.

b. Design for Interpretability:

The actual model is designed in such a way that it is easy to understand its working. However, this increases the risk of reduced predictive power and overall accuracy of the model.

The XAI is referred to as the White Box, as it explains the rationale behind its working. However, unlike the black box, its accuracy may decrease in order to provide an explainable reason for its outcome. Decision trees, Bayesian networks, sparse linear models, and many more are used as explainable techniques. Hopefully, with the advancements in the field, new studies will come up to increase the accuracy of the explanations.

Critical Industries for XAI

XAI would be helpful in those industries where machines play a key part in decision-making. These use cases might also be useful in your industry, as the details may vary, but the core principles remain the same.

1. Healthcare in XAI

As discussed earlier, the decisions made by AI in healthcare impact humans in a very critical way. A machine with XAI would help the healthcare staff save a lot of time, which they might use to focus on treating and attending to more patients. For example, diagnosing a cancerous area and explaining the reason in a matter of time helps the doctor to provide appropriate treatment.

2. Manufacturing in XAI

In the manufacturing industry, fixing or repairing equipment often depends on personnel expertise, which may vary. To ensure a consistent repair process, XAI can help provide ways to repair a machine type with an explanation, record the feedback from the worker, and continuously learn to find the best process to be followed. The workers need to trust the decision made by the machine in order to risk working on the equipment repair, which is the reason XAI becomes useful.

3. Autonomous vehicles in XAI

A self-driving car seems great until and unless it has made a bad decision, which can be deadly. If an autonomous car faces an inevitable accident scenario, the decision it makes impacts greatly on its future use, whether it saves the driver or the pedestrians. Providing the rationale for each decision an autonomous car takes, helps to improve people’s security on the road.

Strategize Your Social Media Campaigns with ChatGPT

Try the propmpt below:

You are a social media strategist. I am launching a crowdfunding campaign for an innovative portable solar charger and need to create a buzz on social media. I need a comprehensive social media strategy that covers platform selection, content ideas, posting frequency, engagement tactics, and analytics tracking. Please provide suggestions considering the latest trends in social media marketing and the behavior of tech-savvy, environmentally-conscious consumers.

Daily AI News 8/22/2023

Computer scientist Stephen Thaler’s bid to secure a copyright registration for an artwork created by artificial intelligence has been shot down for at least the third time by a Washington, D.C. court.

Scientists from the Korea Advanced Institute of Science & Technology (KAIST) have developed a humanoid robot capable of flying an aircraft without majorly adjusting the cockpit.

Zoom has made significant advancements in its artificial intelligence (AI) technology as it aims to empower customers to work smarter in a hybrid work environment.

Eye scans powered by AI could detect Parkinson’s disease in people before they have symptoms, a study has suggested.

People who use AI will replace workers who don’t: IBM

YouTube plans to compensate for AI music

YouTube will pay artists and rights holders for AI-generated music used on the platform. This aims to balance creative innovation and fair compensation.

Unraveling August 2023: Spotlight on Generative AI, Tech, Sports and the Month’s Hottest Trends.

Amplify Your Brand’s Exposure with the AI Unraveled Podcast – Elevate Your Sales Today!

Get your company/Product Featured in our AI Unraveled podcast here and spread the word to hundreds of thousands of AI enthusiasts around the world.

Unraveling August 2023: August 21st, 2023

Latest AI News and Trends on August 21st, 2023

OpenCopilot- AI sidekick for everyone

OpenCopilot allows you to have your own product’s AI copilot. With a few simple steps, it takes less than 5 minutes to build.

It integrates with your underlying APIs and can execute API calls whenever needed. It uses LLMs to determine if the user’s request requires calling an API endpoint. Then, it decides which endpoint to call and passes the appropriate payload based on the given API definition.

Why does this matter?

Shopify has an AI-powered sidekick, while Microsoft (Windows Copilot, Bing Copilot) and GitHub (GitHub Copilot) have copilots. The above innovation empowers every SaaS product to have its own AI copilots tailored for its unique products.

Google teaches LLMs to personalize

LLMs are already good at synthesizing text, but personalized text generation can unlock even more. New Google research has proposed an approach inspired by the practice of writing education for personalized text generation using LLMs. It has a multistage and multitask framework consisting of multiple stages: retrieval, ranking, summarization, synthesis, and generation.

In addition, they introduce a multitask setting that further helps the model improve its generation ability, which is inspired by the observation that a student’s reading proficiency and writing ability are often correlated. When evaluated on three public datasets, each covering a different and representative domain, the results showed significant improvements over various baselines.

Why does this matter?

Customizing style is essential for many domains like personal communication, dialogue, marketing copies, stories, etc., which is hard to do via pure prompt engineering or custom instructions. The research attempts to address this and highlights how we can take inspiration from how humans achieve tasks to apply it to LLMs.

Local Llama

For businesses, local LLMs offer competitive performance, cost reduction, dependability, and flexibility. This article by ScaleDown provides practical guidance on setting up and running LLMs locally using a user-friendly project.

Moreover, Llama-2 and its variants are the go-to models, and the community continually refines them. The article highlights some things to note when running Llama models locally, including memory and model loader challenges.

Why does this matter?

This helps make AI accessible to individuals and businesses while avoiding limitations and high expenses associated with commercial APIs. Locally deploying LLM also helps businesses have more over the model, customize it, integrate with existing systems, and enable full utilization of its capabilities.

AI creates lifelike 3D experiences from your phone video

Luma AI has introduced Flythroughs, an app that allows one-touch generation of photorealistic, cinematic 3D videos that look like professional drone captures. Record like you’re showing the place to a friend, and hit Generate– all on your iPhone. No need for drones, lidar, expensive real estate cameras, and a crew.

Flythroughs is built on Luma’s breakthrough NeRF and 3D generative AI and a brand new path generation model that automatically creates smooth dramatic camera moves.

Why does this matter?

This marks a significant leap in democratizing 3D content creation with AI and making it cost-efficient. It opens up new possibilities for storytelling and crafting stunning digital experiences for users across various industries.

Genetic Algorithm Optimized Neural Network Model for Malicious URL Detection

URL Genie is a web application implementing a Multilayer Perceptron Neural Network optimized using genetic algorithms. Detect whether a domain name or URL is malicious by inputting a URL.

Check it out!
https://github.com/ANG13T/url_genie

– Boosted.ai – AI stock screening, portfolio management, risk management

– JENOVA – AI stock valuation model that uses fundamental analysis to calculate intrinsic value

– Danielfin – Rates stocks and ETFs with an easy-to-understand global AI Score

– Comparables.ai – AI designed to find comparables for market analysis quickly and intelligently

Daily AI Update News from OpenCopilot, Google, Luma AI, AI2, and more

AI Copilot for your own SaaS product
– OpenCopilot allows you to have your own product’s AI copilot. It integrates with your underlying APIs and can execute API calls whenever needed. It uses LLMs to determine if the user’s request requires calling an API endpoint. Then, it decides which endpoint to call and passes the appropriate payload based on the given API definition.

Teach LLMs to Personalize
– New Google research has proposed a general approach for personalized text generation using LLMs, inspired by the practice of writing education. Upon evaluation, the results showed significant improvements over a variety of baselines.

Introducing Flythroughs, an app that creates lifelike 3D experiences from your phone video
– It allows one-touch generation of photorealistic, cinematic videos that look like professional drone captures. No need for drones, lidar, expensive real estate cameras, and a crew. Record like you’re showing the place to a friend, and hit Generate; all on your iPhone.

Big brands are increasingly using AI-generated ads, including Nestlé and Mondelez
– More brands see generative AI as a means to make creating an ad less painful and costly. However, there are concerns over whether to let people know it’s AI-generated, whether AI ads can receive copyright protections, and security risks when using AI.

AI2 drops the biggest open dataset yet for training language models
– Language models like GPT-4 and Claude are powerful and useful. Still, the data on which they are trained is a closely guarded secret. The AI2’s (Allen Institute for AI) new, huge text dataset, Dolma, is free to use and open to inspection.

Ex-Machine Zone CEO launches BeFake, an AI-based social media app
– Alias Technologies has introduced BeFake, a social media app for digital self-expression. Now available on both the App Store and Google Play, it aims to offer a refreshing alternative to the conventional reality portrayed on existing social media platforms.

Some of the world’s biggest advertisers, from food giant Nestle to consumer goods multinational Unilever, are experimenting with using generative AI software like ChatGPT and DALL-E to cut costs and increase productivity.

The New York Times may sue OpenAI over its AI chatbot ChatGPT, which uses the newspaper’s stories to generate text. The paper is unhappy that OpenAI is not paying for the use of its content and is also worried that ChatGPT could reduce its online traffic by providing answers based on its reporting.

Mantella allows you to have natural conversations with NPCs in Skyrim using your voice by leveraging Whisper for speech-to-text, ChatGPT for text generation, and xVASynth for text-to-speech. NPCs also have memories of your previous conversations and have awareness of in-game events.

British Prime Minister Rishi Sunak is set to spend 100 million pounds ($130 million) to buy thousands of computer chips to power artificial intelligence amid a global shortage and race for computing power.

Top 7 Best AI Design Software(Bonus)

Imagine a world where you reside in a luxurious home, an architectural marvel adorned with every comfort and amenity that one could possibly fathom. But it doesn’t stop there; your creativity knows no bounds, and you envision entire universes with their own laws of physics, teeming with diverse civilizations.

As you journey through life, your passions take an intriguing turn, guiding you towards the realm of digital marketing.

Yet, amid this shift in interests, a captivating question continues to linger in your mind like an enigmatic riddle: “If I possessed the power to design anything in the world, what wondrous creation would spring forth from my imagination?”

As your knowledge expands and your expertise in digital marketing deepens, you become acquainted with the remarkable world of graphic design software. Herein lies the key to unlock the gateway to your wildest ideas and aspirations.

With the vast array of possibilities that graphic design software offers, you come to realize that you can bring to life virtually anything your mind can conceive – and that realization holds true for anyone daring enough to venture into this realm.

While some graphic design software tools are tailored to cater to specialized fields, such as web design software that masters the dynamic nature of webpages or CAD software that focuses on technical drawings, at its core, graphic design software is an all-encompassing and versatile tool. It empowers individuals to transform their creative visions into tangible realities.

Within the confines of this article, we shall embark on a journey exploring the finest AI design software tools currently available. These cutting-edge tools are poised to revolutionize the design process and elevate your artistic capabilities to unprecedented heights.

By leveraging the power of artificial intelligence, these tools open up new horizons, enabling you to streamline and automate your design workflow like never before.

So, fasten your seatbelts and prepare to delve into the realm of limitless creativity. In the following sections, we shall uncover the potentials of AI-driven design software and how they stand as testaments to the boundless human imagination.

It’s time to manifest your artistic dreams into reality – let the voyage commence!

1. Adobe Photoshop CC

When it comes to harnessing the power of AI for creating mesmerizing visual graphics, few tools can rival the prowess of Adobe Photoshop CC. Renowned across the globe, this software stands as a beacon of creativity and innovation, empowering artists, designers, and digital enthusiasts to bring their imaginations to life in the most astonishing ways.

At the heart of Adobe Photoshop CC lies an impressive array of features that cater to every aspect of design. Whether you aim to craft captivating illustrations, design stunning artworks, or manipulate photographs with unprecedented precision, this software has got you covered.

With its user-friendly interface and intuitive controls, even those new to the world of digital design can quickly find themselves delving into the realm of endless possibilities.

One of the standout strengths of Photoshop lies in its ability to produce highly realistic and detailed images. From refining minute details in portraits to creating breathtaking landscapes, the software’s tools and filters enable artists to achieve a level of precision that defies belief.

The result is a visual masterpiece that captures the essence of the creator’s vision with unparalleled fidelity.

But Photoshop is not merely limited to polishing existing images; it opens the gates to boundless creativity by allowing users to remix and combine multiple images seamlessly. Whether it’s composing fantastical scenes or crafting surreal montages, the software’s blending capabilities grant designers the freedom to construct their own visual universes.

What truly sets Adobe Photoshop CC apart from the rest is its ingenious integration of artificial intelligence. The inclusion of AI-driven features elevates the design process to a whole new dimension.

Dull and lackluster photographs transform into jaw-dropping works of art with just a few clicks, as the software’s AI algorithms intelligently enhance colors, textures, and lighting, breathing life into every pixel.

Adobe’s suite of creative tools, including the likes of Adobe Illustrator and others, work in seamless harmony with Photoshop. This synergy empowers designers to amplify their creative potential even further.

Whether you’re crafting a logo, designing a website, or creating intricate vector graphics, the integration of these tools allows you to transcend the boundaries of imagination.

2. Planner 5D

Planner 5D stands as an ingenious AI-powered solution, offering you the gateway to realize your long-cherished dream of a perfect home or office space. With its cutting-edge technology, this software empowers you to dive into the realm of architectural creativity and interior design like never before.

The first remarkable feature that sets Planner 5D apart is its AI-assisted design capabilities. Imagine describing your ideal home or office, and watch as the AI effortlessly translates your vision into a stunning 3D representation. From grand entrances to cozy corners, the AI understands your preferences, ensuring that every aspect of your dream space aligns with your desires.

Gone are the days of struggling with pen and paper to create floor plans. Planner 5D streamlines the process, enabling you to effortlessly design detailed and precise floor plans for your dream space.

Whether you seek an open-concept layout or a series of interconnected rooms, this software provides the tools to bring your architectural visions to life.

But that’s not all – Planner 5D goes above and beyond to cater to every facet of interior design. With an extensive library of furniture and home décor items at your disposal, you can furnish and decorate your space with ease.

From stylish sofas and elegant dining tables to enchanting wall art and lighting fixtures, the possibilities are limitless.

The user-friendly 2D/3D design tool within Planner 5D is a testament to the software’s commitment to simplicity and innovation. Whether you’re an aspiring designer or a seasoned professional, navigating through the interface is a breeze, allowing you to create the perfect space for yourself, your family, or your business with utmost ease and precision.

For those seeking a more hands-off approach, Planner 5D also offers the option to hire a professional designer through their platform. This feature is a boon for individuals who desire a polished and expertly curated space but prefer to leave the intricate details to the experts.

By collaborating with skilled designers, you can rest assured that your dream home or office will become a reality, tailored to your unique taste and requirements.

3. Uizard

Uizard emerges as a game-changing tool that holds the power to transform the creative process for founders and designers alike. This innovative software enables you to breathe life into your ideas by swiftly converting your initial sketches into high-fidelity wireframes and stunning UI designs.

Gone are the days of spending endless hours painstakingly crafting wireframes and prototypes manually. With Uizard, the transformation from a low-fidelity sketch to a polished, high-fidelity wireframe or UI design can occur within mere minutes.

The speed and efficiency afforded by this cutting-edge technology empower you to focus on refining your concepts and iterating through ideas at an unprecedented pace.

Whether your vision encompasses web apps, websites, mobile apps, or any digital platform, Uizard stands as a reliable companion, streamlining the design process with its versatility. You no longer need to possess extensive design expertise, as the tool is intuitively designed to cater to users of all backgrounds and skill levels.

From tech-savvy founders to aspiring entrepreneurs, Uizard ensures that the creative journey remains accessible and enjoyable for everyone.

The user-friendly interface of Uizard opens up a realm of possibilities, allowing you to bring your vision to life with ease. Its intuitive controls and extensive feature set empower you to craft pixel-perfect designs that align with your unique style and brand identity.

Whether you’re a solo founder or part of a dynamic team, Uizard fosters seamless collaboration, enabling you to share and iterate on designs effortlessly.

One of the most significant advantages of Uizard lies in its ability to gather invaluable user feedback on your designs. By sharing your wireframes and UI designs with stakeholders, clients, or potential users, you can gain insights and refine your creations based on real-world perspectives.

This not only accelerates the decision-making process but also ensures that your final product resonates with your target audience.

4. Autodesk Maya

Enter the extraordinary realm of 3D animation with Autodesk Maya, a software that transcends conventional boundaries to grant you the power to breathe life into expansive worlds and intricate characters. Whether you’re an aspiring animator, a seasoned professional, or a visionary storyteller, Maya provides the tools to transform your creative visions into stunning reality.

Imagination knows no bounds with Maya, as its powerful toolsets empower you to embark on a journey of endless possibilities. From the grandest of cinematic tales to the most whimsical of animated adventures, this software serves as your creative canvas, waiting for your artistic touch to shape it.

Complexity is no match for Maya’s prowess, as it deftly handles characters and environments of any intricacy. Whether you seek to create lifelike characters with nuanced emotions or craft breathtaking landscapes that transcend the boundaries of reality, Maya’s capabilities rise to the occasion, ensuring that your artistic endeavors know no limits.

Designed to cater to professionals across various industries, Maya stands as the perfect companion for crafting high-quality 3D animations for movies, games, and an array of other purposes. Its versatility makes it a go-to choice for animators, game developers, architects, and designers alike, unleashing the potential to tell stories and visualize concepts with stunning visual fidelity.

The heart of Maya lies in its engaging animation toolsets, each one carefully crafted to nurture the growth of your virtual world. From fluid character movements to dynamic environmental effects, Maya opens the doors to your creative sanctuary, enabling you to weave intricate tales that captivate audiences across the globe.

But the journey doesn’t end there – with Autodesk Maya, you are the architect of your digital destiny. As you explore the depths of this software, you discover its seamless integration with other creative tools, expanding your capabilities even further.

The synergy between Maya and its counterparts unlocks new avenues for innovation, granting you the freedom to experiment, iterate, and refine your creations with ease.

5. Autodesk 3Ds Max

Aimed at architects, engineers, and a myriad of other professionals, this cutting-edge tool serves as a gateway to bring your imaginative designs to life with astonishing realism.

Architects find solace in Autodesk 3Ds Max as it empowers them to create stunningly realistic models of their architectural wonders. Engineers, too, discover the power of this software in crafting intricate and precise 3D models of their mechanical and industrial designs.

The software becomes a haven for creative professionals seeking to visualize and communicate their concepts with exceptional clarity and visual fidelity.

Beyond the realms of architecture and engineering, Autodesk 3Ds Max knows no bounds. Its versatility allows you to explore various dimensions of creativity, from crafting intricate product prototypes to fashioning enchanting animated characters.

Whatever your design aspirations may be, this software stands as a reliable companion, ensuring that your visions manifest into awe-inspiring digital realities.

In the fast-paced world of business and design, having a tool that caters to multiple purposes becomes invaluable. Autodesk 3Ds Max stands tall as a versatile and adaptable solution, making it a coveted asset for businesses and individuals alike.

Its potential to enhance the visual storytelling capabilities of any venture unlocks a new era of creativity and communication.

One of the most cherished qualities of Autodesk 3Ds Max lies in its user-friendly interface, fostering a seamless and intuitive design process. With this tool at your disposal, iteration becomes a breeze, allowing you to refine your designs effortlessly.

Each new iteration becomes a steppingstone towards perfection, ensuring that your final creation exudes excellence.

6. Foyr Neo

With Foyr Neo at your disposal, you can witness the transformation of your design ideas into reality in as little as a fifth of the time it takes with other software tools.

Gone are the days of grappling with complex design interfaces and spending endless hours on a single project. Foyr Neo streamlines the journey from a floor plan to a finished render, presenting you with a user-friendly interface that simplifies every step of the design process.

With its intuitive controls and seamless functionality, the software becomes an extension of your creative vision, ensuring that your ideas manifest into remarkable designs with utmost ease.

To further elevate your experience, Foyr Neo provides a thriving community and comprehensive training resources. This collaborative ecosystem allows you to connect with fellow designers, share insights, and gain inspiration from the collective creative pool.

Additionally, the abundance of training materials and support ensures that you can unlock the full potential of the software, mastering its capabilities and expanding your design horizons.

Bid farewell to the hassle of juggling multiple tools to complete a single project – Foyr Neo serves as the all-in-one solution to cater to your design needs. By integrating various design functionalities within a single platform, the software streamlines your workflow, saving you precious time and effort.

This seamless experience fosters uninterrupted creativity, enabling you to focus on the art of design without the burden of managing disparate software tools.

7. Let’s Enhance

With this cutting-edge software, you can witness a remarkable increase in image resolution of up to 16 times, all without sacrificing an ounce of quality.

Gone are the days of tedious manual editing, spending hours painstakingly enhancing your images pixel by pixel. Let’s Enhance simplifies the process, offering a swift and efficient solution to elevate your photos’ quality with ease.

Whether you’re a professional photographer seeking crisper images for print or a social media enthusiast aiming to enlarge your visuals, this software promises to deliver the perfect shot every time.

Let’s Enhance’s proficiency in improving image resolution, colors, and lighting automatically alleviates the burden of post-processing. By entrusting this task to the intelligent algorithms of the software, you gain more time to focus on the core aspects of your business or creative endeavors.

Embrace the art of delegation and allow Let’s Enhance to handle the technicalities while you concentrate on perfecting your artistic vision.

The applications of Let’s Enhance are vast and diverse. Photographers, designers, artists, and marketers alike can benefit from this powerful tool. Imagine effortlessly preparing your images for print, knowing they’ll boast impeccable clarity and sharpness.

Envision your social media posts grabbing attention with larger-than-life visuals, thanks to Let’s Enhance’s seamless enlargement capabilities.

But Let’s Enhance doesn’t stop at resolution enhancement. It also becomes a reliable ally in refining color palettes, breathing new life into dull or faded images, and balancing lighting for picture-perfect results.

Whether it’s subtle adjustments or dramatic transformations, the software empowers you to create visuals that captivate audiences and leave a lasting impression.

6 AI Text to Video compared (updated August 2023 ) Link

Runway
Features

– Text-to-video feature
– Automatic prompt suggestions
– The option to upload an image for reference
– Different previews to choose from before generating a video
– Free plan to test the tool out

Pros

– Best of AI text-to-video research
– Comprehensive set of tools for video editing
– Available as both a desktop and mobile app

Cons

– Gen-2 has limitations in generating intricate details, like fingers
– Gen-2 video generation is limited to 4 seconds per video
– The tool does not offer text-to-speech capabilities

Synthesia AI
Features

– 120+ voices and accents
– 140+ diverse AI avatars
– 60+ video templates designed by professional designers
– The option to have a custom avatar created

Integrating ChatGPT to Automate WhatsApp Responses

In today’s world, messaging apps are becoming increasingly popular, with WhatsApp being one of the most widely used. With the help of artificial intelligence, chatbots have become an essential tool for businesses to improve their customer service experience. Chatbot integration with WhatsApp has become a necessity for businesses that want to provide a seamless and efficient customer experience. ChatGPT is one of the popular chatbots that can be integrated with WhatsApp for this purpose. In this blog post, we will discuss how to integrate ChatGPT with WhatsApp and how this chatbot integration with WhatsApp can benefit your business.

The authors have uploaded all the codes on their GitHub Repository.

Real Time Multiplayer AI Trivia | Trivia Universe AI

Hey all ! I’ve been having a ton of fun getting this project launched and would greatly appreciate any feedback and/or requests !

https://TriviaUniverseAI.com

The site uses openAI to generate trivia on anything and everything you want ! You can then revisit trivia you or others have made and replay them at anytime.

Solo & real time multiplayer, daily challenge, infinite playability and is getting updates daily !

Current feature roadmap :

jeopardy mode ( multiple topics and large question count )
email / sms notifications for new daily challenges etc.
public lobbies / multiplayer against random players

Unraveling August 2023: August 20th, 2023

Latest AI News and Trends on August 20th, 2023

40% of workers need reskilling due to AI LINK

IBM’s study indicates that 40% of the global workforce, or 1.4 billion people, will need to reskill in the next three years due to AI’s rise.
While AI technologies, such as generative models, might shift job responsibilities, 87% of surveyed executives believe AI will augment jobs rather than replace them.
The focus in job skills has shifted from technical STEM skills (most important in 2016) to people skills like team management and adaptability (most important in 2023).

Meta did it first… Generative AI for producers

Generative AI is revolutionizing this decade’s technology, breaking into the realm of creativity once reserved for humans. Jobs are shifting, with some roles being replaced and others benefiting from AI assistance.

Content creators, take note! Meta just revealed that platforms like Facebook and Instagram will employ AI to produce music. This means no more copyright issues or losing business. Simply choose a genre, provide a sample, and the AI crafts tailor-made music for your videos.

Facebook’s music library becomes obsolete as Meta leads the way, while YouTube and TikTok will likely follow suit. As a content creator, AI eliminates rights concerns. However, creators of original music may face challenges.

AI’s impact extends to various fields, affecting writers, musicians, artists, and photographers. While some might feel the pinch, the creative economy as a whole benefits, making custom content creation easier.

Imagine conceiving, designing, and animating with AI—a reality that even big players like Disney face. This emerging world is thrilling and transformative.

To prepare, embrace AI. Integrate it into your work wherever possible. If you want to stay ahead and not fall behind to AI, leverage its capabilities.

Trustworthy LLMs: A survey and guideline for evaluating LLMs’ alignment

Ensuring alignment, which refers to making models behave in accordance with human intentions, has become a critical task before deploying LLMs in real-world applications. This new research has proposed a more fine-grained taxonomy of LLM alignment requirements. It not only helps practitioners unpack and understand the dimensions of alignments but also provides actionable guidelines for data collection efforts to develop desirable alignment processes.

It also thoroughly surveys the categories of LLMs that are likely to be crucial to improve their trustworthiness and shows how to build evaluation datasets for alignment accordingly.

The tool curates high-quality data that leads to improved LLM downstream performance and will significantly benefit LLM developers attempting to build pretraining datasets.

Microsoft-DataBricks collab may hurt OpenAI

Microsoft is reportedly planning to sell a new version of Databricks software, It helps customers create AI applications for their businesses. This move could potentially harm OpenAI, as Databricks allows companies to develop AI models from scratch or repurpose open-source models instead of licensing OpenAI’s proprietary ones.

Microsoft has been aggressively investing in AI services and integrating AI functionality across its products. Neither Microsoft nor Databricks have commented on the report.

What else happened in AI this week of August 12-20?

- Google appears to be readying new AI-powered tools for ChromeOS
- Zoom rewrites policies to make clear user videos aren’t used to train AI
- Anthropic raises $100M in funding from Korean telco giant SK Telecom
- Modular, AI startup challenging Nvidia, discusses funding at $600M valuation
- California turns to AI to spot wildfires, feeding on video from 1,000+ cameras
- FEC to regulate AI deepfakes in political ads ahead of 2024 election
- Google’s AI search offers AI-generated summaries, definitions, and coding improvements.
- Google Photos introduce a new AI feature, ‘Memories view’!
- Amazon using AI to enhance product reviews.
- WhatsApp test beta upgrade with new feature ‘custom AI-generated stickers’.
- Google is testing an AI assistant that will give you Life Advice.
- Robomart adopts “store-hailing” for self-driving stores delivered to customers.
- OpenAI acquires AI design studio Global Illumination to work on core products, ChatGPT
- The Associated Press releases guidelines for Generative AI to its journalists
- Consulting giant McKinsey unveils its own generative AI tool for employees: Lilli
- Opera for iOS will now include Aria, its browser AI built in collaboration with OpenAI
- UK is using AI road safety cameras to detect potential driver offenses in passing vehicles
- Adobe Express with AI Firefly app, now out of beta, is available worldwide
- Ex-Google Brain researchers have started an AI research company called Sakana AI in Tokyo.
- Runway, a gen AI video startup, has launched a new ‘Watch’ feature.
- Research shows AI bots beat CAPTCHA and humans.
- ML startup Arthur launched an open-source tool to help find the best LLM.
- Buildbox has launched a new tool called StoryGames.AI!

Latest Tech News and Trends on August 20th, 2023

Major concerns after Cruise robotaxi incidents

Following a recent collision between a Cruise robotaxi and a fire truck in San Francisco, the California DMV requested Cruise to halve its robotaxi fleet in the city.
The state agency is investigating “recent concerning incidents” with Cruise vehicles, emphasizing the need to ensure the safety of the public sharing the road with these autonomous vehicles.
This specific accident saw a Cruise Chevy Bolt EV hit by an emergency vehicle at an intersection, resulting in passenger injuries; it adds to a series of issues potentially affecting Cruise’s future operations.

As wildfires spread, Canadian leaders ask Meta to reverse its news ban

The Canadian government demands that Meta lift its ban on domestic news sharing, citing its impact on sharing information about wildfires.
Meta blocked news on Facebook and Instagram due to a new law requiring payment for news articles, but this move hampers access to crucial information.
Officials and citizens express concerns, urging Meta to reinstate news sharing for safety and emergency information during the wildfire crisis.

X to remove ‘block’ feature

Elon Musk suggests that Twitter’s block feature, except for direct messages, may be removed, causing concern among users.
Blocking is currently used to restrict interactions and visibility of accounts, while mute only hides posts; users value blocking for spam control and harassment prevention.
Musk’s statement prompts backlash and uncertainty about whether the feature will actually be removed.

Unraveling August 2023: August 18th, 2023

Latest AI News and Trends on August 18th, 2023

What is OpenAI code interpreter, and how does it work?

Discover the OpenAI code interpreter, an AI tool that translates human language into code. Learn about its functions, benefits and drawbacks in this guide.

The basics of OpenAI code interpreter

OpenAI, a leading entity in the field of artificial intelligence, has developed OpenAI code interpreter, a specialized model trained on extensive data sets to process and generate programming code.

OpenAi code interpreter is a tool that attempts to bridge the gap between human language and computer code, offering myriad applications and benefits. It represents a significant step forward in AI capabilities. It is grounded in advanced machine learning techniques, combining the strengths of both unsupervised and supervised learning. The result is a model that can understand complex programming concepts, interpret various coding languages, and generate human-like responses that align with coding practices.

New Generations of People Are Becoming More and More Indistinguishable from AI

One of the most concerning aspects of this trend is the way that new generations are rewriting previous information. In the past, people would typically come up with their ideas and opinions. However, today, it is much more common for people to simply rewrite information that they have found online. This is a trend that is being exacerbated by the rise of large language models (LLMs), which can generate text that is nearly indistinguishable from human-written text. Article: new-generations-of-people-are-becoming-more-and-more-indistinguishable-from-ai/

How Neolithics is using AI machine learning to reduce global food waste

Neolithics, an agritech company based in Israel, is using artificial intelligence and machine learning to reduce food waste and ensure food safety and quality through its optical sensing AI-powered solution known as Crystal.eye™. This technology, which can be mounted and configured in various ways, automates and upgrades quality control for fresh produce, in order to maximize utilization and reduce waste.

While the normal spectrum of visible light has 3 colors – red, green, and blue, Crystal.eye™ uses hyperspectral imaging, with over 400 spectra of light. This light can penetrate deep into a fruit or vegetable and allows the device to scan even inside the sample, eliminating the need to cut it open or grind it.

The images produce a unique fingerprint, which is then analyzed by Neolithics’ food scientists to identify various characteristics, such as firmness, moisture content, sugar content, acidity, and many more. The data is then fed to an AI machine learning engine, allowing the system to scan and analyze a large batch of samples in a matter of seconds.

The outcomes of the inspections are then instantly displayed on a digital dashboard and can be delivered as reports, tailored to each customer’s unique requirements. For example, french fry makers need to know how much dry matter is contained in the potatoes they process, while winemakers take into account the grapes’ acidity and sweetness to obtain the flavor profile they desire.

Using Crystal.eye™ allows growers and distributors to greatly expand their sampling, from the usual 1% to around 30% to 40%. This ensures greater accuracy and significantly reduces the chance of produce being discarded due to not meeting the customers’ requirements.

According to Wayne Nathanson, the company’s VP for Global Development, knowledge in food science is Neolithics’ main differentiator. While there are other companies that make the hardware to move around and sort fruits and vegetables, he says that usually these technologies work on exterior qualities, and aren’t able to analyze the produce’s interior. Most companies do not have a team of expert food scientists to fully harness the information gathered from the produce like Neolithics, he adds.

Currently, Crystal.eye™ can check the content or defects of produce, providing customers with various external or internal attributes. This solution has been launched and is being used by an increasing number of growers, distributors, and food processing companies. At the end of this year, Neolithics expects to update the technology with the capability to assess the produce’s maturity cycle, allowing customers to identify how long it will take before it spoils. The company is also working on being able to identify traces of pesticides and other banned chemicals on the produce, with release estimated for next year.

“Sustainability is very important to Neolithics, and our mission is to reduce food waste and improve food safety. Knowing how much food is wasted daily is a major motivator for making a difference. We want to eliminate food wastage across the supply chain, including removing the need to destroy the produce when it’s being inspected. We also want to get more edible quality produce to the consumer, by helping the various links of the supply chain distribute it better. There are 1.3 billion tons of wasted food annually, and there are roughly a billion people in the world experiencing hunger. We believe there’s an opportunity to feed more people with the food that is thrown out. This becomes more and more critical, the closer the world population gets to the 10 billion mark,” Nathanson says.

The new AI programming jobs that require only very basic programming skills

There has never been a more exciting and promising time to get into AI development. Forbes reports that job listings for ChatGPT-related positions increased 21 times since last November:

https://www.forbes.com/sites/emmylucas/2023/08/17/linkedin-says-chatgpt-related-job-postings-have-ballooned-21-fold-since-november/

They need both prompt engineers and programmers. But because of Copilot and other advances in AI programming they are looking for people with some basic programming skills but who mainly excel in advanced critical analysis and reasoning skills.

They basically need people who know how to think so for people with IQs above 130, (in the genius range) this could be a dream career. But really it’s not so much about IQ as it is about the ability to think rather than just mostly learn and remember. In fact programming courses must already be teaching this brand new kind of prompt engineering and programming.

I imagine that computer programming instruction is going through very rapid evolution right now as teaching fundamental programming skills more and more gives way to teaching how to most quickly and intelligently prompt AIs to do whatever programming is needed.

If incumbent programming schools are not changing fast enough they risk losing a substantial market share to startups that begin teaching much more marketable skills.

Many businesses today want to start using AIs but they don’t know how to go about it. Computer programmers and prompt engineers who can explain all of this to them have a ready and rapidly growing job market.

Yeah there could never be a better time to get into computer programming!

The importance of making superintelligent small LLMs

Google’s Gemini will set a new standard in AI largely because of the massive data set that it is trained on.

If you’re not familiar with Gemini yet, watch this amazingly intelligent 8-minute YouTube video:

The next step would be for Google to train that stronger intelligence to shift from relying on data to relying on principles for its logic and reasoning.

Once AI’s intelligence is based on principles, subsequent iterations will no longer require massive data for their training.

That achievement will level the playing field so that Gemini is much sooner joined by competitive or stronger models.

Once that happens, everything will get very intelligent.

As Hollywood strikes, 96% of entertainment companies are boosting generative AI spend

As the Hollywood strike continues, 96% of entertainment companies are ramping up their investments in generative AI, revealing a shift in the industry’s approach to content creation and potential concerns for its workforce.

If you want to stay ahead of the curve in AI and tech, look here first.

The rise in AI spending amidst the Hollywood strike

The Hollywood writer’s strike underscores a shift in the entertainment industry’s investment strategy.
Lucidworks’ research, one of the largest of its kind, shows 96% of executives prioritize generative AI investments.
Countries like China, the UK, France, India, and the U.S. have companies heavily investing in this technology.

AI’s potential impact on Hollywood content creation

Generative AI can produce content, virtual environments, and images, posing a potential disruption to traditional methods.
Predictions suggest that by 2025, up to 90% of Hollywood content could be influenced by AI.
There’s a growing concern among Hollywood writers about the rapid integration of AI and its effect on their careers.

The future of the entertainment industry with generative AI

The emergence of synthetic actors could revolutionize the way movies and shows are produced.
AI-driven actors don’t strike, age, or demand pay raises, presenting potential benefits for studios but challenges for human actors.

Source (Forbes)

Microsoft-DataBricks collab may hurt OpenAI

Microsoft has been aggressively investing in AI services and integrating AI functionality across its products. Neither Microsoft nor Databricks have commented on the report.

Why does this matter?

Microsoft’s reported intention to introduce an AI-focused Databricks software version carries implications for OpenAI. This software empowers businesses to craft AI solutions without relying on OpenAI’s proprietary models, potentially impacting OpenAI’s market.

Meta AI’s new RoboAgent with 12 skills

Meta and CMU Robotics Institute’s New Robotics research: RoboAgent. It is a universal robotic agent that can efficiently learn and generalize a wide range of non-trivial manipulation skills. It can perform 12 skills across 38 tasks, including object manipulation and re-orientation, and adapt to unseen scenarios involving different objects and environments.

The development of the RoboAgent was made possible through a distributed robotics infrastructure, a unified framework for robot learning, and a high-quality dataset. The agent also utilizes a language-conditioned multi-task imitation learning framework to enhance its capabilities. Meta is open-sourcing RoboSet, a large, high-quality robotics dataset collected with commodity hardware, to support and accelerate open-source research in robot learning.

Why does this matter?

RoboAgent has the potential to accelerate automation, manufacturing, and daily tasks as the end users can enjoy more capable and helpful robots at home. Industries can streamline operations with efficient automation, technology could push AI and robotics boundaries, and innovation might surge across sectors.

Meta challenges OpenAI with code-gen free software

Meta is set to release Code Llama, an open-source code-generating AI model that competes with OpenAI’s Codex. The software builds on Meta’s Llama 2 model and allows developers to automatically generate programming code and develop AI assistants that suggest code.

Llama 2 disrupted the AI industry by enabling companies to create AI apps without relying on proprietary software from major players like OpenAI, Google, or Microsoft. Code Llama is expected to launch next week, further challenging the dominance of existing code-generating AI models in the market.

Why does this matter?

Meta’s Code Llama is set to rival OpenAI’s Codex; this open-source AI model is an update of Meta’s Llama 2. This tool challenges giants like OpenAI, Google, and Microsoft, giving developers more control and reducing dependence on their proprietary tools.

AP sets new AI guidelines for newsrooms

The Associated Press has established standards for the use of generative AI in its newsroom, emphasizing that AI is not a replacement for human journalists and cautioning against creating publishable content with AI-generated text or images.
AP journalists are directed to treat AI-generated content as “unvetted source material” and apply editorial judgment and sourcing standards before considering it for publication.
The organization warns about the potential for AI to spread misinformation and advises its journalists to exercise caution, skepticism, and verify sources when dealing with AI-generated content.

Latest Tech News and Trends on August 18th, 2023

Scientists are leaving X

A significant portion of scientific researchers using X have reduced their usage or left the platform altogether, with over 47% decreasing usage and nearly 7% quitting, according to a survey by Nature.
About 47% of polled researchers have turned to alternative platforms, with Mastodon being the most popular, followed by LinkedIn and Instagram.
The change in researcher behavior on X is attributed to the platform’s evolving dynamics, increased content prioritization, and limited accessibility of its API for researchers.

Amazon imposes fees on self-shipping sellers

Starting from October 1st, third-party merchants on Amazon who ship their own packages will be required to pay a 2% fee per product sold.
This new fee is in addition to other charges Amazon already receives from merchants, including selling plan costs and referral fees based on product categories.
The fee comes as Amazon’s marketplace is under scrutiny, with the FTC planning to file an antitrust lawsuit over allegations that Amazon rewards third-party merchants using its logistics services while penalizing those fulfilling their own orders.

NYC bans TikTok from government devices

New York City is banning TikTok from government devices within 30 days, with immediate prohibition on downloading and usage by employees.
The NYC Cyber Command cited TikTok as a security threat to the city’s technical networks, prompting the decision.
While some states have broadly banned TikTok, most have restricted its use on government-owned tech, amid ongoing debates about the app’s security risks.

Unraveling August 2023: August 17th, 2023

Latest AI News and Trends on August 17th, 2023

You can now write one sentence to train an entire ML model.

How does it work?

You just describe the ML model you want…a chain of AI systems will take that sentence…it generates a dataset based on that sentence…and it trains a model for you…in ten minutes 😳

What does that mean?

Custom models in AI just got a whole lot easier. You can go from an idea (“a model that writes Python functions”) to a fully trained custom Llama-2 model in minutes 😮

Why should I care?

If you aren’t thinking about the impact of change in your industry, start now. It’s not linear and continuous, it’s exponential with step functions. 3 out of 4 C-suite executives believe that if they don’t scale artificial intelligence in the next five years, they risk going out of business entirely.

What should I do about it?

Further proof that AI is changing our work processes rapidly. You need to build a team and org that’s first and foremost, ready for change. And if you haven’t started pulling together an AI working group to get cracking on your AI usage principles and first AI use case, do it.

It’s open source, made by Matt Shumer, with an easy Google Colab — check it out here: GitHub – mshumer/gpt-llm-trainer

GPT-4 Code Interpreter masters math with self-verification

OpenAI’s GPT-4 Code Interpreter has shown remarkable performance on challenging math datasets. This is largely attributed to its step-by-step code generation and dynamic solution refinement based on code execution outcomes.

Expanding on this understanding, new research has introduced the innovative explicit code-based self-verification (CSV) prompt, which leverages GPT4-Code’s advanced code generation mechanism. This prompt guides the model to verify the answer and then reevaluate its solution with code.

The approach achieves an impressive accuracy of 84.32% on the MATH dataset, significantly outperforming the base GPT4-Code and previous state-of-the-art methods.

Why does this matter?

The study provides the first systematic analysis of the role of code generation, execution, and self-debugging in mathematical problem-solving. This highlights the importance of code understanding and generation capabilities in LLMs. Plus, the ideas presented can help build high-quality datasets that could potentially help improve the mathematical capabilities in open-source LLMs like Llama-2.

Can machine learning algorithms identify patients at risk of a delay in starting cancer treatment?

Multi-level machine learning models for estimating the risk of delay between cancer diagnosis and treatment initiation in a large group of cancer patients.

Study significance

Machine learning models that incorporate multi-level data sources can effectively identify cancer patients who are at a greater risk of experiencing treatment delays of more than 60 days after their initial cancer diagnosis.

Although neighborhood-level social determinants of health are incorporated in the study model as contributing variables, no significant impact of these factors was observed on the model performance. Furthermore, the model exhibits lower predictive effectiveness in vulnerable populations.

Future studies should include a higher proportion of vulnerable populations and more relevant social variables to improve the model performance.

Journal reference:

Frosch Z. A. K., Hasler, J., Handorf, E., et al. (2023). Development of a Multilevel Model to Identify Patients at Risk for Delay in Starting Cancer Treatment. JAMA Network Open. doi:10.1001/jamanetworkopen.2023.28712, https://jamanetwork.com/journals/jamanetworkopen/fullarticle/2808249

10 top AI jobs in 2023

Discover the top AI jobs shaping the future. Navigate the booming AI industry with insights on rewarding careers.

1. AI product manager

An AI product manager is similar to other program managers. Both jobs require a team leader to develop and launch a product. In this case, it is an AI product, but it’s not much different from any other product in terms of leading teams, scheduling and meeting milestones.

AI product managers need to know what goes into making an AI application, including the hardware, programming languages, data sets and algorithms, so that they can make it available to their team. Creating an AI app is not the same as creating a web app. There are differences in the structure of the app and the development process.

2. AI research scientist

An AI research scientist is a computer scientist who studies and develops new AI algorithms and techniques. They develop and test new AI models, collaborate with other researchers, publish research papers and speak at conferences. So, programming is only a small portion of what a research scientist does.

The tech industry is extremely open to self-taught and non-formally trained programmers, but it makes an exception for AI research scientists. They need to have a strong understanding of computer science, mathematics and statistics. Typically, they need graduate degrees.

3. Big data engineer

AI works with large data sets and so does its precursor, big data. A big data engineer is similar to an AI engineer because they are responsible for designing, building, testing and maintaining complex data processing systems that work with large data sets. But, instead of working with GPT or LaMDA, they work with big data tools, like Hadoop, Hive, Spark and Kafka.

Like AI researchers, big data engineers often have advanced degrees in mathematics and statistics. These degrees are necessary for designing, maintaining and building data pipelines based on massive data sets.

Check out these top data architect and data engineer certifications.

4. Business intelligence developer

Business intelligence (BI) is also a data-driven discipline that predates the modern AI rush. Like big data and AI, BI also relies on large data sets. BI developers use data analytics platforms, reporting tools and visualization techniques to turn raw data into meaningful insights to help organizations make informed decisions.

BI developers work with a variety of coding languages and tools from major vendors, including SQL, Python, Tableau from Salesforce and Power BI from Microsoft. They also need to have a strong understanding of business processes to help improve them through data insight.

5. Computer vision engineer

A computer vision engineer is a developer who specializes in writing programs that utilize visual input sensors, algorithms and systems. These systems see the world around them and act accordingly, such as self-driving and self-parking cars and facial recognition.

They use languages like C++ and Python, along with visual sensors, such as Mobileye from Intel. Examples of use cases include object detection, image segmentation, facial recognition, gesture recognition and scenery understanding.

6. Data scientist

A data scientist is a technology professional who collects, analyzes and interprets data to solve problems and drive decision-making within the organization. They are not necessarily programmers, although many do write their own applications. Mostly, they use data mining, big data and analytical tools.

Their use of business insights derived from data enables businesses to improve sales and operations; make better decisions; and develop new products, services and policies. They use predictive modeling to forecast future events, such as customer churn, and data visualization to display research results visually. Some also use machine learning to build models to automate these tasks.

7. Machine learning engineer

A machine learning engineer is responsible for developing and implementing machine learning training algorithms and models. Training is the demanding side of machine learning and is the most processor- and computation-intensive aspect of machine learning. Therefore, it requires the highest level of skill and training.

Because of the need for advanced math and statistics skills, most machine learning engineers have advanced degrees in computer science, math or statistics. They often continue training through certification programs or a master’s degree in machine learning, deep learning or neural networks.

8. Natural language processing engineer

A natural language processing (NLP) engineer is a computer scientist who specializes in the development of algorithms and systems that understand and process natural human language input.

One of the big differentiators between traditional search engines and generative AI interfaces, such as ChatGPT, is that search engines use keywords and gather information from large amounts of existing online data. Generative AI creates new content based on other examples and patterns, and it answers queries in a chat-type format.

Like machine learning engineers, NLP engineers are not necessarily programmers first. They need to understand linguistics as much as they need to understand programming. NLP projects require machine translation, text summarization, answering questions and understanding context.

9. Robotics engineer

A robotics engineer is a developer who designs, develops and tests software for running and operating robots. Robotics has advanced significantly in recent years, such as automated home cleaners and precision cancer surgery equipment. Robotics engineers may also use AI and machine learning to boost a robotic system’s performance.

As a result, robotics engineers are typically designing software that receives little to no human input but instead relies on sensory input. Therefore, a robotics engineer needs to debug the software and the hardware to make sure everything is functioning as it should.

Robotics engineers typically have degrees in engineering, such as electrical, electronic or mechanical engineering.

10. Software engineer

A software engineer can cover various activities in the software development chain, including design, development, testing and deployment. Engineering professionals are needed at all points of software development. The demands are so high that it’s rare to find someone well versed in all of them. Most engineers tend to specialize in one discipline.

What is a liquid neural network, really?

We spoke with MIT CSAIL head Daniela Rus about the emerging technology of liquid networks and implications for robotics.

The initial research papers date back to 2018, but for most, the notion of liquid networks (or liquid neural networks) is a new one. It was “Liquid Time-constant Networks,” published at the tail end of 2020, that put the work on other researchers’ radar. In the intervening time, the paper’s authors have presented the work to a wider audience through a series of lectures.

Ramin Hasani’s TEDx talk at MIT is one of the best examples. Hasani is the Principal AI and Machine Learning Scientist at the Vanguard Group and a Research Affiliate at CSAIL MIT, and served as the paper’s lead author.

“These are neural networks that can stay adaptable, even after training,” Hasani says in the video, which appeared online in January. When you train these neural networks, they can still adapt themselves based on the incoming inputs that they receive.”

The “liquid” bit is a reference to the flexibility/adaptability. That’s a big piece of this. Another big difference is size. “Everyone talks about scaling up their network,” Hasani notes. “We want to scale down, to have fewer but richer nodes.” MIT says, for example, that a team was able to drive a car through a combination of a perception module and liquid neural networks comprised of a mere 19 nodes, down from “noisier” networks that can, say, have 100,000.

“A differential equation describes each node of that system,” the school explained last year. “With the closed-form solution, if you replace it inside this network, it would give you the exact behavior, as it’s a good approximation of the actual dynamics of the system. They can thus solve the problem with an even lower number of neurons, which means it would be faster and less computationally expensive.”

The concept first crossed my radar by way of its potential applications in the robotics world. In fact, robotics make a small cameo in that paper when discussing potential real-world use. “Accordingly,” it notes, “a natural application domain would be the control of robots in continuous-time observation and action spaces where causal structures such as LTCs [Liquid Time-Constant Networks] can help improve reasoning.”

AI reconstructs song from brain activity

Neuroscientists recorded electrical activity from areas of the brain (yellow and red dots) as patients listened to the Pink Floyd song “Another Brick in the Wall, Part 1.” Using AI software, they were able to reconstruct the song from the brain recordings. This is the first time a song has been reconstructed from intracranial electroencephalography recordings.

Why does this matter?

By capturing the musicality of speech through neural signals, this research presents an innovative application of AI that could redefine how we interact and communicate, particularly for those who struggle with traditional modes of communication.

Saudi Arabia and UAE join the race for scarce Nvidia chips

Saudi Arabia has purchased at least 3,000 of Nvidia’s H100 chips at $40,000 apiece, while UAE has ordered a fresh batch of semiconductors to power its LLM. This signals the Gulf states’ intention to become major players in AI by buying up thousands of Nvidia’s GPUs which are vital in powering the boom in generative AI that has swept markets this year.

Why does this matter?

This makes them the latest to join the ever-growing queue of buyers for Nvidia chips to power AI ambitions. But will Nvidia be able to produce enough GPUs to meet the massive demand? It was reported in June that Nvidia GPUs are already in short supply (and very expensive).

Snapchat’s AI chatbot creates unexpected chaos

Snapchat users reported an unexpected video posted on the My AI chatbot’s Story, which some interpreted as showing a corner between a ceiling and a wall.
The unexpected post led to concerns and fears among users, with some believing the AI feature had become sentient or evolved, prompting some to delete the app.
Snapchat described the event as a “temporary outage”, which has since been resolved, and the AI chat feature temporarily stopped responding during this period.

Exploring the Power of Mojo Programming Language

Mojo is a new programming language that combines the usability of Python with the performance of C. It is designed to be the perfect language for developing AI models and applications. Mojo is fast, efficient, easy to use, and open source.
Mojo is based on the LLVM (Low Level Virtual Machine) compiler infrastructure, which is one of the most advanced compiler frameworks in the world right now. Mojo uses a new type of system that allows for better performance and error checking. Mojo has a built-in autotuning system that can automatically optimize your code for the specific hardware that you are using.

https://www.seaflux.tech/blogs/mojo-ai-programming-language

Top AI Image-to-Video Generators 2023

Genmo

Genmo is an artificial intelligence-driven video generator that takes text beyond the two dimensions of a page. Algorithms from natural language processing, picture recognition, and machine learning are used to adapt written information into visual form. It can turn text, pictures, symbols, and emoji into moving images. Background colors, characters, music, and other elements are just some of how the videos can be personalized. The movie will include the text and any accompanying images that you provide. The videos can be shared on many online channels like YouTube, Facebook, and Twitter. Videos made by Genmo’s AI can be used for advertising, instruction, explanation, and more. It’s a fantastic resource for companies, groups, and people who must rapidly and cheaply make interesting movies.

D-ID

D-ID is a video-making platform powered by artificial intelligence that makes producing professional-quality videos from text simple and quick. Using Stable Diffusion and GPT-3, the company’s Creative RealityTM Studio can effortlessly create videos in over a hundred languages. D-ID’s Live Portrait function makes short films out of still images, and the Speaking Portrait function gives a speech to written or spoken text. Its API has been refined with the help of tens of thousands of videos, allowing it to generate high-quality visuals. Digiday, SXSW, and TechCrunch have all recognized D-ID for their ability to help users create high-quality videos at a fraction of the expense of traditional approaches.

LeiaPix Converter

The LeiaPix Converter is a web-based, no-cost service that changes regular photographs into 3D Lightfield photographs. It employs AI to turn your images into lifelike, immersive 3D environments. Select the desired output format and upload your picture to LeiaPix Converter. The converted file can be exported in several forms, including the Leia Image Format, Side-by-Side 3D, Depth Map, and Lightfield Animation. The LeiaPix Converter’s output is great quality and straightforward to use. It’s a fantastic way to give your pictures a new feel and make unique visual compositions. It does a 3D Lightfield conversion from a 2D image. Leia Image Format, Side-by-Side 3D, Depth Map, and Lightfield Animation are only a few of the supported export formats that bring about excellent outcomes. Depending on the size of the image, the conversion procedure could take a while. The quality of your original photograph will affect the final conversion outcomes. Because the LeiaPix Converter is currently in beta, it may include problems or have functionality restrictions.

InstaVerse

A new open-source framework called instaVerse makes building your dynamic 3D environments easy. The background can be generated in response to AI cues, and players can then create their avatars to explore it. The first step in making a world in InstaVerse is picking a premade layout. Forests, cities, and even spaceships are just some of the many premade options available. After selecting a starter document, an AI assistant will guide you through the customization process. A forest with towering trees and a flowing river are just one of the many landscapes instaVerse may create at your command. Characters can also be generated in your universe. Humans, animals, and even robots are all included in the instaVerse cast of characters. Once a character has been created, you can use the keyboard or mouse to direct its actions. While InstaVerse is still in its early stages, it shows great promise as a robust platform for developing interactive 3D content. It’s simple to pick up and use and lets you make your special universes.

Sketch

Sketch is a web app for turning sketches into GIF animations. It’s a fun and easy method to make unique stickers and illustrations to share on social media or use in other projects. Using Sketch is as easy as posting your drawing online. Then, you may utilize the drawing tools to give your work some life with some animation. Objects can be repositioned, recolored, and given custom sound effects. You can save your finished animation as a GIF after you’re satisfied. Sketch is a fantastic program for both young and old. It’s a terrific opportunity to show off your imagination and get a feel for the basics of animation simultaneously. In terms of ease of use, Sketch is excellent. Sketch makes it easy to create beautiful animations, even if you have no prior experience with the medium. With Sketch’s many tools, you can design elaborate and intricate animations. You can save your finished animation as a GIF after you’re satisfied. After that, your animation is ready for sharing or further use.

NeROIC

NeROIC can reconstruct 3D models from photographs as an element of AI technology. NeROIC, created by a reputable tech company, has the potential to transform our perceptions and interactions with three-dimensional objects radically. NeROIC can create a 3D model of the user’s intended message using an approved image. The video-to-3D capabilities of NeROIC are comparable to its image-to-3D capability. This means a user can create an interactive 3D setting from a single video. Because of this, creating 3D scenes is faster and easier than ever.

DPT Depth

The discipline of computer science concerned with creating 3D models from 2D photographs is advancing quickly. Deep learning-based techniques may be used to train point clouds and 3D meshes to depict real-world scenes better. A potential method, DPT Depth Estimation, employs a deep convolutional network to read depth data from a picture and generate a point cloud model of the 3D object. DPT Depth Estimation uses monocular photos to input a deep convolutional network pre-trained on data from various scenes and objects. Following data collection, the web will use the information to create a point cloud from which 3D models can be made. When compared to conventional techniques like stereo-matching and photometric stereo, DPT’s performance can surpass a human’s. Because of its fast inference time, DPT is a promising candidate for real-time 3D scene reconstruction.

RODIN

RODIN is quickly becoming the go-to 2D-to-3D generator in artificial intelligence. The creation of 3D digital avatars is now drastically easier and faster than ever before, thanks to this breakthrough. Creating a convincing 3D character based on a person’s likeness has always been more difficult. RODIN is an artificial intelligence-driven technology that can generate convincing 3D avatars using private data such as a client’s photograph. Customers are immersed in the action by seeing these fabricated avatars in 360-degree views.

Google Gemini: Facts and rumors

What does Gemini stand for ?

That part at least seems pretty clear beyond a shadow of a doubt:
Generative Enhanced Multimodal Intelligent Network Interface.

The word “Gemini” comes from Latin and means “twins” in German.
Some possible meanings in the context of Google’s AI system:

Gemini combines two components: Text and image processing. It is, in a sense, a “twin system.”

Gemini could refer to the „twins“ Sergey Brin and Larry Page, the founders of Google.

Astrology assigns communication strength and flexibility to the zodiac sign Gemini. Gemini as an AI assistant aims to adapt linguistically and situationally.

The name suggests a dual strength or ability. Gemini aims to unite Google’s text and image AI to outperform the competition.

As a twin system, Gemini combines different perspectives and approaches, similar to different human characters. So the name is both an allusion to the system’s integrative capabilities and a promising indication of Google’s ambitions with this AI product.

Why is Google superior?

To do that, you have to understand WHAT treasure trove of data Google is actually sitting on. Here are a few facts:

Google, through its various services such as Google Search, YouTube and others, has an enormous amount of data that is very useful for developing AI systems.

On YouTube alone, over 500 hours of video material are uploaded every day, according to Statista. The total video database is over 30 million hours of video. The subtitles and transcripts of these videos give Google a gigantic text dataset for training language models.

According to a report by ARK Invest, Google owns over 130 exabytes of data. For comparison, 1 exabyte is equal to 1 billion gigabytes. This means that the entire data set comprises more than 130,000,000,000,000,000 bytes of information.

Google Search accounts for a large part of this data. Google says it processes over 40,000 search queries per second. That’s over 3.5 trillion search queries per year. From these queries and the clicked results, Google gains further insights.

Overall, it shows that Google has virtually inexhaustible data resources for AI research. Both the breadth of different types of data and the sheer volume should give Google a significant edge in the AI field.

Google – The Research Giant

In 2020, Google published over 1300 artificial intelligence research papers, according to the Papers with Code database. In 2021, Google increased the number of publications significantly again to over 2000 papers on AI and machine learning.

Topics included:

Computer Vision (image recognition)
Natural Language Processing (NLP)
Speech Recognition
reinforcement learning
Robotics
Multimodal AI
Recommender Systems
Applications in medicine

With over 3300 AI publications in 2020 and 2021, Google has greatly expanded its research output in artificial intelligence. The company is one of the most active players in this research field. This intensive work over the past few years is now being incorporated into the development of Gemini.

According to the AI publication database Papers with Code, Google published more than 1,500 artificial intelligence research papers in 2022 alone. That’s far more than other tech corporations like Meta or Microsoft.

This is a partial selection of Google’s most groundbreaking developments in AI in recent years. The list shows the enormous range of research from machine learning and computer vision to robotics and autonomous systems.

AlphaGo: Go game AI that defeated world champion Lee Sedol in 2016.
BERT (Bidirectional Encoder Representations from Transformers): breakthrough language model for NLP from 2018.
PaLM (Pathways Language Model): enormous language model with 540 billion parameters from 2022
PaLM-SayCan: variant of PaLM that can carry on human-like conversations
Imagen: image generation AI for realistic and creative images
MusicLM: AI for music composition and production
RLHF (Reinforcement Learning with Human Feedback): Reinforcement learning with human feedback
Model Based RL: reinforcement learning with explicit models of the environment
RobustFit: Robust neural network against data noise
T5: Text-to-text transfer transducer for various NLP tasks
ViT (Vision Transformer): Image recognition with Transformer architecture
WAYMO: Autonomous driving and robot cab service
ProteinFold: Protein structure prediction with Deep Learning
FLOOD: AI for flood prediction and prevention
SLIDE: pixel-level image segmentation
Switch Transformers: efficient architecture for very large transformers
MuZero: reinforcement learning without environmental model in games
Meena: conversational AI from 2020
DALL-E & DALL-E 2: text-to-image generation.

When you look at the sheer amount of data Google has collected over the years, it initially makes you dizzy. Over 500 hours of video footage are uploaded to YouTube every day. The total video database is over 30 million hours. Add to that countless search queries, texts, images and conversations. It’s an almost unimaginable amount of data.

Coupled with intensive research activity in the AI field, it adds up to enormous potential. In recent years, Google has produced groundbreaking innovations such as the BERT language model, the AlphaGo Go AI, and the DALL-E image generator. When you put all these puzzle pieces together, things take on almost frightening proportions.

Project: Google Gemini

With the new Gemini AI system, Google now seems to have bundled the essence of these years of data aggregation and research. If the company succeeds in combining all of its AI developments and treasure trove of data in this system, it would be a demonstration of the sheer power of innovation. It will be interesting to see whether Gemini can deliver on this promise. In any case, the expectations are huge – here what we know and what the rumors say:

Facts Google Gemini

There are already some facts from the Google Blog:

Gemini is supposed to be released this fall
Gemini combines text and image generation
Can create contextual images based on text generation
Has been trained with YouTube transcripts
Google lawyers are monitoring the training to avoid copyright issues
Gemini is said to have multiple modalities, e.g., text, image, audio, video
Sergey Brin is involved in development

Rumors

From Reddit and countless other sources on the web, there could be other features as well:

Gemini is said to be capable of AI image understanding and modification
Is said to combine text capabilities like GPT-4 with image generation
Has been developed from the ground up as a multimodal model
Could handle audio, video, 3D renderings, graphics, etc.
Shall learn with user interactions and thus become effective AGI
Architecture could enable lifelong learning
There are concerns about privacy and information leaks between users

Google Gemini and the (then new) AI market:

The AI market situation is likely to change significantly with the introduction of Google Gemini:

For OpenAI:

Strong new competitor for ChatGPT and DALL-E.
Google has significantly more resources and data
OpenAI could lose market share and come under pressure

For Anthropic:

Claude must stand up to Google Assistant with Gemini
Advantage due to focus on security and control
Risk of falling behind

For Microsoft:

Partnership with OpenAI important to compete with Google
Microsoft must further develop Azure AI services
Advantage due to strong cloud infrastructure

For others:

Startups could have a very hard time against Google
Consolidation in the market possible
Significantly higher innovation speed

Overall, competitive pressure in the AI market will increase sharply. With its resources, Google is in a very good starting position to take a leading role with Gemini. It will be more difficult for other providers to keep pace with Google. It remains to be seen whether the high expectations for Gemini are justified.

Google Gemini Conclusion

Google Gemini seems to be a very ambitious AI project that should give the company a competitive edge. The combination of different modalities in one model is new and could improve AI capabilities tremendously. However, there are still many unanswered questions regarding the specific capabilities and data security. The release this fall will show whether Google can deliver on its promise to outperform the competition. Much is still speculation, but expectations are high.

#ai #ki #google #gemini #text #image #multimodal

Artificial intelligence steps in to assist dementia patients with ‘AI Powered Smart Socks’

People suffering from dementia could live more independently thanks to a pair of AI-powered socks that can track everything from a patient’s heart rate to movement.

Called “SmartSocks,” the AI-powered apparel was created in partnership between the University of Exeter and researchers at the start-up company Milbotix, according to SWNS. The socks can monitor a patient’s heart rate, sweat levels and motion to prevent falls while also promoting independence for those with dementia.

“I came up with the idea for SmartSocks while volunteering in a dementia care home,” SmartSocks creator Zeke Steer, CEO of Milbotix, told SWNS. “The current product is the result of extensive research, consultation and development.”

Steer’s great-grandmother suffered from dementia, which also helped spark the creation of the socks.

“The foot is actually a great place to collect data about stress, and socks are a familiar piece of clothing that people wear every day; our research shows that socks can accurately recognize signs of stress, which could really help not just those with dementia but their caregivers, too,” Steer, who has a background in robotics and AI, told SWNS.

WHAT IS CHATGPT?

The socks send the data collected from the patient to an app, which flags caregivers when the patient appears to be in distress. The warning could prevent falls and even tragedies as caregivers can respond to a patient before their stress escalates.

“I think the idea of SmartSocks is an excellent way forward to help detect when a person is starting to feel anxious or fearful,” said Margot Whittaker, director of nursing and compliance at Southern Healthcare in the U.K.

A handful of care homes overseen by Southern Healthcare, including The Old Rectory in Exeter, are already testing the tech-powered socks on patients, who report they are happy with how easy the socks are to use.

“Anything that’s simple and easy to do, and is improving our look at life as a whole, I’m happy with,” dementia patient John Piper, 83, told the BBC.

The socks do not need to be recharged, according to Milbotix’s website, and can be machine washed.

There are other products on the market that can also track a dementia patient’s heart rate or sweat levels, but they often come in the form of wristbands and watches, which can pose issues to those with dementia.

“Wearable devices are fast becoming an important way of monitoring health and activity,” Imperial College London’s Health and Social Care Lead Sarah Daniels told SWNS. “At our center, we have been trialing a range of wristbands and watches. However, these devices present a number of challenges for older adults and people affected by dementia.”

Daniels said wristbands or watches often don’t hold long charges and are taken off by patients and then lost.

“SmartSocks offer a new and promising alternative, which could avoid many of these issues,” Daniels said.

The University of Exeter is investigating how beneficial the socks are for dementia patients.

Artificial intelligence platforms are revamping health care across many disciplines, including another U.K.-based system called CognoSpeak, which can monitor speech patterns in a bid to detect early signs of dementia or Alzheimer’s.

U.K.-based start-up SmartSocks has developed hosiery that can monitor a dementia patient’s heart rate, motion and sweat levels with AI and alert caregivers to potential problems.

AI TOOL GIVES DOCTORS PERSONALIZED ALZHEIMER’S TREATMENT PLANS FOR DEMENTIA PATIENTS

What Else Is Happening in AI on August 17th, 2023

GPT-4 Code Interpreter can enhance math skills with code-based self-verification
– OpenAI’s GPT-4 Code Interpreter’s remarkable performance in math datasets is largely attributed to its step-by-step code generation and dynamic solution refinement based on code execution outcomes. Expanding on this understanding, new research has introduced the innovative explicit code-based self-verification (CSV) prompt, which leverages GPT4-Code’s advanced code generation mechanism. This prompt guides the model to verify the answer and then reevaluate its solution with code.
– The approach achieves an impressive accuracy of 84.32% on the MATH dataset, significantly outperforming the base GPT4-Code and previous state-of-the-art methods.

AI just reconstructed a Pink Floyd song from brain activity, and it sounds shockingly clear
– Neuroscientists recorded electrical activity from areas of the brain as patients listened to the Pink Floyd song “Another Brick in the Wall, Part 1.” Using AI software, they were able to reconstruct the song from the brain recordings. This is the first time a song has been reconstructed from intracranial electroencephalography recordings.

Saudi Arabia and UAE join the race for scarce Nvidia chips
– Saudi Arabia has purchased at least 3,000 of Nvidia’s H100 chips at $40,000 apiece, while UAE has ordered a fresh batch of semiconductors to power its LLM. This signals their intention to become major players in AI.

OpenAI acquires Global Illumination to work on core products, including ChatGPT
– Its team leverages AI to build creative tools, infrastructure, and digital experiences. It previously designed and built products early on at Instagram and Facebook and has made significant contributions at YouTube, Google, Pixar, Riot Games, and other notable companies.

McKinsey unveils its own generative AI tool for employees: Lilli
– It is a chat application for employees designed that serves up information, insights, data, plans, and even recommends the most applicable internal experts for consulting projects, all based on 100K+ documents and interview transcripts.

Opera’s iOS web browser will now include Aria
– The AI assistant, Aria, is Opera’s browser AI product built in collaboration with OpenAI, integrated directly into the web browser, and free for all users.

Adobe Express with AI Firefly app is available worldwide
– The web app is now out of beta and can be used free of charge in web browsers.

The Associated Press releases guidelines for Generative AI to its journalists

UK is using AI road safety cameras to detect potential driver offenses in passing vehicles

The founder of Centricity, a data analytics firm using AI, is indicted for defrauding investors by manipulating financial data.

Leaders with a Montana digital academy say bringing artificial intelligence to high schools is an opportunity to embrace the future.

Google said to be testing new life coach AI for providing helpful advice to people.

Alibaba Cloud MagicBuild Community has launched the digital human video generation tool called LivePortrait. It can generate digital human videos from photos, text, or voice, which can be applied in scenarios such as live broadcasting and corporate marketing.

Latest Tech News and Trends on August 17th, 2023

How to add captions on your phone for any app you use

usatoday.com

The end of SIM cards: A new eSIM guide for Android users 2023

Are physical SIMs about to be a thing of the past? Jump into the latest, and discover eSIMs #sponsored

Latest Sport Football Soccer News and Trends on August 17th, 2023

Atletico Madrid takes the alleged ‘threat’ of João Félix terminating his contract as a joke. If he wants to do so, he will have to pay his €350m release clause. Link

BREAKING: Theo Walcott is set to retire. Walcott, 34, who left Southampton at the end of last season, scored more than 100 goals for Arsenal and won 47 England caps. Link

‘Minecraft’ To ‘FIFA Soccer’: Best iOS Games To Play On The Upcoming iPhone 15

From epic open-world adventures to mind-bending puzzles, these are the best iOS games to play on the upcoming iPhone 15.

Unraveling August 2023: August 16th, 2023

Latest AI News and Trends on August 16th, 2023

GPT-4 to replace content moderators

OpenAI aims to use its GPT-4 to solve the challenge of content moderation at scale. Also, they already used GPT-4 to develop and refine their own content policies. It provides three major benefits: consistent judgments, faster policy development, and improved worker well-being. However, perfect content moderation remains elusive, as both humans and machines make mistakes, particularly in handling misleading or aggressive content.

GPT-4 can interpret complex policy documentation and adapt instantly to updates, reducing the cycle from months to hours. This AI-assisted approach offers a positive future for digital platforms, where AI can help moderate online traffic and relieve the burden on human moderators.

Why does this matter?

GPT-4 can alleviate content moderation challenges and improve the efficiency and effectiveness of content moderation. This could be a solution for platforms like Facebook and Twitter, who’ve been grappling with content moderation for ages. OpenAI’s this approach could also appeal to smaller companies lacking resources.

Source

Meta beats ChatGPT in language model generation

Shepherd is a language model designed to critique and improve the outputs of other language models. It uses a high-quality feedback dataset to identify errors and provide suggestions for refinement. Despite its smaller size, Shepherd’s critiques are either equivalent or preferred to those from larger models like ChatGPT. In evaluations against competitive alternatives, Shepherd achieves a win rate of 53-87% compared to GPT-4.

Shepherd outperforms other models in human evaluation and is on par with ChatGPT. Shepherd offers a practical and valuable tool for enhancing language model generation.

Why does this matter?

Despite Shepherd’s smaller size, its critiques match or surpass those of larger models like ChatGPT, with a win rate of 53-87% against GPT-4. It excels in human evaluations and offers practical value in improving language model generation.

Source

Microsoft launches private ChatGPT

Microsoft now offers OpenAI’s ChatGPT model in its Azure OpenAI service, allowing developers and businesses to integrate conversational AI into their applications. ChatGPT can be used to power custom chatbots, automate emails, and provide summaries of conversations.

Azure OpenAI users can access a preview of ChatGPT starting today, with pricing set at $0.002 for 1,000 tokens. ChatGPT on Azure solution accelerator is an enterprise option. This solution provides a similar user experience to ChatGPT but is offered as your private ChatGPT.

Microsoft Azure ChatGPT offers several benefits to organizations:

Ensures data privacy with built-in guarantees and isolation from OpenAI-operated systems.
Allows full network isolation and offers enterprise-grade security controls.
Enhances business value by integrating internal data sources and services like ServiceNow.

Why does this matter?

Amid the excitement around ChatGPT, Microsoft has cleverly introduced an enterprise version to meet strong market demand. By prioritizing security, Azure simplifies and enhances companies’ access to AI advantages. Also, Microsoft’s move aims to boost productivity through code editing, task automation, and more and offers enterprises a more secure way to share their data with AI.

Source

Google enhances search with AI-driven summaries LINK

Google is experimenting with a generative AI feature in Search that generates key points from long-form web content.
The summarization tool will display “key points” from articles but will not work on content marked as paywalled by publishers.
Initially launching as an “early experiment” in Google’s opt-in Search Labs program, it will first be available on the Google app for Android and iOS

Nvidia’s stocks surge LINK

Nvidia’s stock rises 7% as investors see its GPUs remaining dominant in powering large language models.
Morgan Stanley reiterates Nvidia as a “Top Pick” due to strong earnings, AI spending shift, and ongoing supply-demand imbalance.
Despite recent fluctuations, Nvidia’s stock has tripled in 2023, and analysts anticipate long-term benefits from AI and favorable market conditions.

The Strength and Realism of AI Models While artificial intelligence models demonstrate immense computational power, there’s a debate regarding their biological plausibility. How do these digital frameworks compare to the natural intelligence of living organisms? Are they accurate representations or mere simulations?

Transportation Systems: The Paradox of Choice More choices in transportation systems might seem beneficial, but there’s a hidden challenge. With increased variety comes complexity, leading to inefficiencies and potential gridlocks.

AI’s Role in Pinpointing Cancer Origins Recent advancements in AI have developed a model that can assist in determining the starting point of a patient’s cancer, a crucial step in identifying the most effective treatment method. [Read more at MedicalTechNews.com]

AI’s Defense Against Image Manipulation In the era of deepfakes and manipulated images, AI emerges as a protector. New algorithms are being developed to detect and counter AI-generated image alterations. [Read more at DigitalSafetyWatch.com]

Streamlining Robot Control Learning Researchers have uncovered a more straightforward approach to teach robots control mechanisms, making the integration of robotics into various industries more efficient.

Accelerated Robotics Training Techniques A revolutionary methodology promises to slash the time required to instruct robots, optimizing their utility and deployment speed in multiple applications.

Armando Solar-Lezama: The Beacon of Computing Armando Solar-Lezama has been honored as the inaugural Distinguished Professor of Computing, acknowledging his invaluable contributions to the world of computer science.

Efficient Planning for Household Robots with AI AI integration has enabled household robots to plan tasks more efficiently, cutting their preparation time by half and allowing for more seamless operations in domestic environments.

The ChatGPT Impact: Boosting Writing Productivity A recent study highlights how ChatGPT enhances workplace productivity, particularly in writing tasks. The AI-driven tool provides a significant advantage for professionals in diverse sectors.

Reimagining Data Privacy in the Modern Era Data privacy is evolving, and it’s time to approach it with a fresh perspective. As digital footprints expand, there’s an urgent need to revisit and redefine what personal data protection means.

Daily AI News on August 16th, 2023

OpenAI’s GPT-4 for more reliable and higher quality content moderation
– OpenAI aims to use its GPT-4 to solve the challenge of content moderation at scale. GPT-4 could replace human moderators, offering similar accuracy and more consistency. OpenAI has already used GPT-4 to develop and refine its own content policies.
– It provides three major benefits: consistent judgments, faster policy development, and improved worker well-being. While AI has been used for content moderation before, OpenAI’s approach could be appealing to smaller companies lacking resources.
Microsoft launches ChatGPT for enterprises with Azure
– Microsoft is now offering OpenAI’s ChatGPT model in its Azure OpenAI service, allowing developers and businesses to integrate conversational AI into their applications. ChatGPT can be used to power custom chatbots, automate emails, and provide summaries of conversations.
– Azure OpenAI users can access a preview of ChatGPT starting today, with pricing set at $0.002 for 1,000 tokens and it promises more control and privacy compared to the public model.
Google is progressing with new AI updates!
– Search experience adds AI-powered summaries, definitions, and coding improvements. In addition it will include related diagrams or images for various topics, color-coded syntax highlighting for code snippets, making it easier for programmers to understand and debug generated code.
– Google Photos adds a scrapbook-like Memories view feature aided by AI which allows users to relive and share their most memorable moments. The feature creates a scrapbook-like timeline that includes trips, celebrations, and daily moments with loved ones. The new Memories view is launching today for U.S. users and is similar to a combination of Stories and Facebook Memories.
Amazon using AI to enhance product reviews
– Amazon is tapping into generative AI to create handy highlights that collects key points from customer reviews which will help shoppers quickly gauge product review.
– The feature is part of ongoing efforts to improve utility of 125M+ reviews from shoppers. It uses only trusted reviews from verified purchases, and Amazon.
WhatsApp test beta upgrade with new feature ‘custom AI-generated stickers’
– The feature is currently available to a limited number of beta testers, includes a “Create” button under the stickers tab, which opens a keyboard for users to type prompts for the AI model to generate custom stickers. The feature is a server-side change and is currently only available in version 2.23.17.8 of the beta version.

Apple’s AI advancements in the last few months

Don’t sleep on Apple’s AI plans. Here’s how they’ve been slowly ramping up their AI efforts in the last few months.

Apple’s AI-powered health coach might soon be at your wrists
Apple is reportedly developing an AI-powered health coaching service called Quartz, aimed at helping users improve their exercise, eating habits, and sleep quality. The service will use AI and data from the user’s Apple Watch to create personalized coaching programs, with plans to introduce a monthly fee. The company is also working on emotion-tracking tools and plans to launch an iPad version of the iPhone Health app this year.
Apple enters the AI race with new features
Apple announced a host of updates at the WWDC 2023. Yet, the word “AI” was not used even once, despite today’s pervasive AI hype-filled atmosphere. The phrase “machine learning” was used a couple of times. (And AI is nothing but machine learning).
However, here are a few announcements Apple made that use AI as the underlying technology.
- Apple Vision Pro, a revolutionary spatial computer that seamlessly blends digital content with the physical world. It uses advanced ML techniques.
- Upgraded Autocorrect in iOS 17 that is powered by a transformer language model for improved prediction capabilities.
- Improved Dictation in iOS 17 that leverages a new speech recognition model to make it even more accurate.
- Live Voicemail that turns voicemail audio into text on the fly, which is powered by a neural engine.
- Personalized Volume, which uses ML to understand environmental conditions and listening preferences over time to automatically fine-tune the media experience.
- Journal, a new app for users to reflect and practice gratitude, uses on-device ML for personalized suggestions to inspire entries.
Apple Trials a ChatGPT-like AI Chatbot
Apple is developing AI tools, including its own large language model called “Ajax” and an AI chatbot named “Apple GPT.” They are gearing up for a major AI announcement next year as it tries to catch up with competitors like OpenAI and Google.
Apple bets big on AI
Apple’s CEO Tim Cook has stated that AI and ML are embedded in every company product. This comes after concerns were raised about Apple’s lack of discussion on its AI plans while competitors have been actively incorporating the technology into their products. He also emphasized that AI is central to the design of Apple’s products, contradicting suggestions that the company has not yet integrated the technology.
Apple gearing up for an AI showdown
Apple has reportedly ordered servers from Foxconn Industrial Internet, a division of its supplier Foxconn, for the testing and training of AI services. The servers are specifically for Apple’s AI work, which has been a focus for the company for years. While Apple does not currently have a ChatGPT-like app for external use, it is believed that this division of Foxconn already supplies servers to ChatGPT OpenAI, Nvidia, and Amazon Web Services. The news comes amidst reports about Apple’s plans to compete in the AI chatbot market.

Latest Tech News on August 16th, 2023

US watchdog targets surveillance industry LINK

The U.S. Consumer Financial Protection Bureau (CFPB) plans to regulate data brokers selling personal data due to concerns about their impact on privacy, including sensitive data from vulnerable groups.
CFPB aims to prevent illegal collection and sharing of personal data by data brokers in the surveillance industry.
The proposal expands coverage under the Fair Credit Reporting Act to include data derived from payment histories, personal income, and criminal records, addressing concerns such as credit header data disclosure.

Tesla unveils cheaper Model S and Model X variantsLINK

Tesla introduces lower-priced options for the Model S and Model X with reduced range.
The “standard range” trim brings the Model S starting price to $78,490 with 320 miles of range and the Model X starting price to $88,490 with 269 miles of range.
This trim, previously discontinued, likely uses the same battery pack with a software lock, and Tesla may offer range unlocking for an additional fee.

Singapore pioneers in stablecoin crypto regulationLINK

Singapore’s financial regulator has finalized rules for stablecoins, making it one of the first jurisdictions to do so globally.
Stablecoins are digital currencies designed to maintain a constant value against fiat currency, with a market value of around $125 billion and dominated by tokens like USDT and USDC.
The Monetary Authority of Singapore’s framework outlines requirements including holding reserves in low-risk assets, timely redemption, and proper user disclosures for stablecoins mimicking the Singapore dollar or other G10 currencies.

Amazon Pharmacy offers major savings on insulinLINK

Amazon Pharmacy offers automatic coupons to help uninsured insulin-requiring patients save on medication costs.
The digital pharmacy shows pricing with and without insurance, estimates savings with eligible coupons, and simplifies the process compared to existing coupon programs.
Amazon Pharmacy’s initiative has been praised by healthcare advocates and aims to improve access to affordable treatments, but manufacturer coupons are not available for patients benefiting from certain healthcare programs.

Other Tech news you might like

Apple will soon start making settlement payments to claimants of the “Batterygate” class-action lawsuit, with potential payments of up to $65 per person.LINK

YouTube is removing cancer treatment content that’s “harmful or ineffective,” implementing new guidelines to remove unproven treatments and harmful advice.LINK

California regulators approved round-the-clock robotaxi service in San Francisco for Waymo and Cruise, but a group of Cruise vehicles caused a traffic backup, highlighting challenges.LINK

Microsoft is implementing an eight-strike suspension policy for Xbox community standards violations, scaling suspensions based on the number of strikes, with players able to appeal and view enforcement history.LINK

Unraveling August 2023: August 15th, 2023

Latest AI News and Trends on August 15th, 2023

Do It Yourself Custom AI Chatbot for Business in 10 Minutes (Open Source)

If you’re looking to “train” a custom chatbot on your data (SOPs, legal docs, financial reports, etc), I’d strongly suggest checking out AnythingLLM.
It’s the first chatbot with enterprise-grade privacy & security.
When using ChatGPT, OpenAI collects your data including:
– Prompts & Conversations
– Geolocation data
– Network activity information
– Commercial information e.g. transaction history
– Identifiers e.g. contact details
– Device and browser cookies
– Log data (IP address etc.)
However, if you use their API to interact with their LLMs like gpt-3.5 or gpt-4, your data is NOT collected. This is exactly why you should **build your own private & secure chatbot**. That may sound difficult, but Mintplex Labs (backed by Y-Combinator) just released AnythingLLM, which gives you the ability to build a chatbot in 10 minutes without code.
AnythingLLM provides you with the tools to easily build and manage your own private chatbot using API keys. Plus, you can expand your chatbot’s knowledge by importing data such as PDFs, emails, etc. This can be confidential data as only you have access to the database.
ChatGPT currently allows you to upload PDFs, videos and other data to ChatGPT via vulnerable plug-ins, BUT there is no way to determine if that data is secure or even know where it’s stored.
Easily build your own business-compliant and secure chatbot at useanything.com. All you need is an OpenAI or Azure OpenAI API key.
Or, if you prefer using the open source code yourself, here’s the GitHub repo: https://github.com/Mintplex-Labs/anything-llm.

AI powered tools for the recruitment industry

AI-driven recruiting and retention strategies utilize data-driven strategies for better candidate experiences and better hiring decisions. Here’s a list of a few tools that are useful for this purpose :
– Conversational AI To Recruit And Retain At Scale | Humanly.io : It is designed for high scale hiring in organizations. It enhances candidate engagement through automated chat interactions.

– MedhaHR : It’s an AI-driven healthcare talent sourcing platform that automates resume screening, provides personalized job recommendations, and offers cost-effective solutions.

– ZappyHire : It offers features such as candidate sourcing, resume screening, automated communication, and collaborative hiring.

– Sniper AI : It uses AI algorithms to source potential candidates, assess their suitability, and integrates with ATS for workflow optimization.

– PeopleGPT : PeopleGPT, developed by Juicebox (YC S22), is a tool that simplifies the process of searching for people data. Recruiters can input specific queries to find potential candidates.
Which tools have you been using, and more importantly is AI really helping you with recruitment?
More resources along with their pricing plans here

American companies are vigorously seeking AI specialists, leading to soaring salaries for high-demand roles. Amidst this recruitment frenzy, some organizations are offering nearly a million-dollar salary, especially to those experienced in AI.

Surge in AI Talent demand and salaries

American firms are hunting for AI experts, with some offering salaries nearing a million dollars.
Industries like entertainment and manufacturing want data scientists and machine-learning specialists.
Competition is fierce, with companies like Accenture investing in internal training and others considering acquisition of AI startups for talent.

The compensation landscape for AI roles

As AI expertise becomes more sought-after, compensation packages are rising.
Companies are offering mid-six-figure salaries, bonuses, and stock grants to lure experienced professionals.
While top positions like Netflix’s machine-learning platform product manager can reach up to $900,000 in total compensation, othersalike a prompt engineer might average $130,000 annually.

Source (Washington Post)

How to Manage Your Remote Team Effectively with ChatGPT?

Leading a remote team comes with unique challenges, from ensuring clear communication to fostering a sense of community. ChatGPT can be your expert consultant, offering suggestions based on best practices for remote team management.

You are a seasoned consultant in remote team management. I am the leader of a remote team working on a [define project]. I need advice on how to effectively manage my team, ensure clear communication, monitor progress, and maintain a positive team culture. Your suggestions should include strategies for scheduling and conducting virtual meetings, task assignment, progress tracking tools, and methods to promote team building in a virtual setting.

I asked ChatGPT to remove password protection from an Excel document, and it worked flawlessly

How are you uploading an excel document to chat gpt?

Using ChatGPT code interpreter: It’s a feature for GPT plus member as the old “bing search” which got disabled, You have code interpreter now where you can directly upload files.

Can it analyze conversations/texts? Yes it can analyse data and even give u back charts and feedback for gpt plus users.

Johns Hopkins Researchers Developed a Deep-Learning Technology Capable of Accurately Predicting Protein Fragments Linked to Cancer

Johns Hopkins Engineers and Cancer Researchers have collaboratively pioneered a breakthrough in personalized cancer therapy with their cutting-edge deep-learning technology.

Microsoft releases private ChatGPT for Business

Summary: Microsoft Azure allows organizations to run ChatGPT within their network for smoother work experiences. Think of it as your private, controlled, and extra valuable AI assistant. (source)

Key points:

Azure allows companies to run ChatGPT privately on their own networks, touting built-in data isolation from OpenAI.
The model connects to internal data services and sources, and is available on GitHub to install and deploy.
Benefits include privacy, control, and unique business value through internal data integration.

Why It Matters: For enterprises, this merger between ChatGPT and Azure opens a new realm of possibilities, with the cozy feeling of privacy and control. It’s more than a tech tool; it’s a tailored solution that could redefine how businesses work with AI.

Apple’s AI-powered health coach might soon be at your wrists

Apple is reportedly developing an AI-powered health coaching service called Quartz, aimed at helping users improve their exercise, eating habits, and sleep quality. The service will use AI and data from the user’s Apple Watch to create personalized coaching programs, with plans to introduce a monthly fee. The company is also working on emotion-tracking tools and plans to launch an iPad version of the iPhone Health app this year.

Why does this matter?

It’s only a matter of time before AI is deployed on IoT devices such as smartwatches. This confluence can definitely revolutionize our daily lives. AI can direct IoT devices to adapt and optimize settings based on external circumstances making them a lot more autonomous and helpful.

Source

Apple enters the AI race with new features

Apple announced a host of updates at the WWDC 2023. Yet, the word “AI” was not used even once, despite today’s pervasive AI hype-filled atmosphere. The phrase “machine learning” was used a couple of times. (And AI is nothing but machine learning). However, here are a few announcements Apple made that use AI as the underlying technology.

Apple Vision Pro, a revolutionary spatial computer that seamlessly blends digital content with the physical world. It uses advanced ML techniques.

Upgraded Autocorrect in iOS 17 that is powered by a transformer language model for improved prediction capabilities.
Improved Dictation in iOS 17 that leverages a new speech recognition model to make it even more accurate.
Live Voicemail that turns voicemail audio into text on the fly, which is powered by a neural engine.
Personalized Volume, which uses ML to understand environmental conditions and listening preferences over time to automatically fine-tune the media experience.
Journal, a new app for users to reflect and practice gratitude, uses on-device ML for personalized suggestions to inspire entries.

Why does this matter?

To the average user, AI can be scary. Perhaps it was Apple’s deliberate choice not to mention the word “AI”? Nevertheless, these updates and features demonstrate that Apple is indeed utilizing AI technologies in various aspects of its products and services, joining the likes of Google and Microsoft.

Source

Apple Trials a ChatGPT-like AI Chatbot

Apple is developing AI tools, including its own large language model called “Ajax” and an AI chatbot named “Apple GPT.” They are gearing up for a major AI announcement next year as it tries to catch up with competitors like OpenAI and Google.

The company has multiple teams developing AI technology and addressing privacy concerns. While Apple has been integrating AI into its products for years, there is currently no clear strategy for releasing AI technology directly to consumers. However, executives are considering integrating AI tools into Siri to improve its functionality and keep up with advancements in AI.

Why does this matter?

Apple’s development of AI tools, such as the language model “Ajax” and chatbot “Apple GPT,” signals the company’s efforts to catch up with competitors OpenAI and Google. The focus on addressing privacy concerns and the potential integration of AI into Siri shows Apple’s aim to enhance its product functionality and stay competitive.

Source

Apple bets big on AI

Apple’s CEO Tim Cook has stated that AI and ML are embedded in every company product. This comes after concerns were raised about Apple’s lack of discussion on its AI plans while competitors have been actively incorporating the technology into their products. He also emphasized that AI is central to the design of Apple’s products, contradicting suggestions that the company has not yet integrated the technology.

Cook reassured that Apple has invested in AI for years, and this year’s Research & Development spending has hit $22.61 billion. They are also hiring dozens of AI jobs in the US, France, and China, looking to fill roles that could help build Gen AI tools.

Why does this matter?

This move signifies the potential for enhanced personalization and contextual relevance in user interactions, leading to a more intuitive and tailored experience within the Apple ecosystem. The seamless integration of AI may also pave the way for groundbreaking applications in health, home automation, and more. Ultimately redefining how users interact with and benefit from Apple’s ecosystem of products and services.

Source

Apple gearing up for an AI showdown

Apple has reportedly ordered servers from Foxconn Industrial Internet, a division of its supplier Foxconn, for the testing and training of AI services. The servers are specifically for Apple’s AI work, which has been a focus for the company for years. While Apple does not currently have a ChatGPT-like app for external use, it is believed that this division of Foxconn already supplies servers to ChatGPT OpenAI, Nvidia, and Amazon Web Services. The news comes amidst reports about Apple’s plans to compete in the AI chatbot market.

Why does this matter?

Apple’s this latest move to order servers from Foxconn’s division for AI testing and training has caught attention. While Apple hasn’t launched a ChatGPT-like app yet, the supplier’s involvement with ChatGPT OpenAI, Nvidia, and Amazon Web Services hints at potential AI ventures. Apple seems like a potentially new big player in the AI game.

Source

Google Tests Using AI to Sum Up Entire Web Pages on Chrome

The ‘SGE while browsing’ feature can break down a long web page into easy-to-read key points.

Daily AI News August 15th, 2023

Talon Aerolytics, a leading innovator in SaaS, Digital Twin capture services and AI technology, has announced ha its groundbreaking cutting-edge AI-powered computer vision platform enables wireless operators to visualise and analyse network assets using end-to-end AI and machine learning. Link

Beijing is poised to implement sweeping new regulations for artificial intelligence services this week, trying to balance state control of the technology with enough support that its companies can become viable global competitors. Link

Saudi Arabia and the United Arab Emirates are buying up thousands of the high-performance Nvidia chips crucial for building artificial intelligence software, joining a global AI arms race that is squeezing the supply of Silicon Valley’s hottest commodity. Link

OpenAI likely to go bankrupt by the end of 2024. Link

Latest Tech News on August 15th, 2023

Youtube algorithm flaws?

Personally I’ve always been a huge fan of youtube but I always thought that their algorithm have actually gotten worse since the 2010s.

Supposedly Google should have perfected the algorithm at making simple recommendations; they have teams working on it yet i could think of a few things that could improve it.

From my experience, youtube always recommends the same stuff. If you like one video or click on it, it would keep showing that channels videos until you’re bombarded by it. It rarely gives you anything new, or reminds you of old topics you enjoyed. Sometimes videos are just stuck there for weeks, when i’m clearly not watching it. Sometimes something i really want to watch disappears and never comes back again. Furthermore It’s other sections/buttons do not show the videos i mention above, but rather completely unrelated content.

Just off the top of my head, I can think of a few things – becoming more ambitious every refresh; recommend new topics more often; remind you of old topics you like more often.

YouTube will remove cancer treatment misinformation

YouTube announced a new policy for combatting medical misinformation on the video sharing platform

Latest World and Sport News on August 15th, 2023

Fulton County grand jury returns an indictment in 2020 election probe for Georgia. Link

The highest paid football players in the world in 2023 according to Le Parisien

After Al-Hilal move, Neymar leapfrogs Romelu Lukaku as the player with the highest combined transfer fee in football history.

Asked ChatGPT “Can you write an essay about web browsers but using a tone of a depressed teenager?” And this was the result

Ugh, like, seriously? Web browsers, man, they’re just like another thing we have to deal with in this messed up digital world. It’s like, we can’t even escape the internet, even if we wanted to. I mean, sure, web browsers are supposed to be all convenient and stuff, making it, like, easier for us to, you know, “surf” the internet. But honestly, it’s just one more thing on my list of things that stress me out.

First off, there’s like a gazillion web browsers out there. Chrome, Firefox, Safari, Edge – it’s like a popularity contest or something. Everyone’s like, “Use this one, it’s the best,” but then someone else is like, “Nah, that one sucks, use this one.” And, like, who even has the energy to care anymore? It’s like trying to pick the coolest outfit to wear, but for your computer. Can’t I just, like, not care and use whatever comes pre-installed?

And then there’s all these updates. Like, seriously, can’t they just leave me alone? Every time I want to, like, chill and watch some random videos, there’s this annoying pop-up telling me to update my browser. And I’m like, “Dude, I just updated you, like, last week. Can you give me a break?” It’s like they’re trying to control my life or something.

And don’t even get me started on cookies and privacy stuff. Like, I get it, they want to, like, track my every move and sell my info to, like, who knows who. But, hello, can a girl get some privacy? It’s like they’re stalking me, following me around, knowing all my secrets. It’s not like I asked for this, you know?

Oh, and the extensions. Like, yeah, I get it, they’re supposed to, like, make my life easier or whatever. But honestly, half the time I don’t even know what they do. It’s like going to a party and not knowing anyone – you’re just there, hoping for the best. And, of course, they slow down my browser, because why not add more misery to my life?

But you know what’s the worst? Those error messages. It’s like the browser is mocking me, laughing at my inability to even load a simple webpage. “Aw, you wanted to check your favorite blog? Too bad, here’s an error message just for you.” It’s like they’re out to get me, to ruin my day, to remind me that technology hates me.

So, yeah, web browsers are just another thing in this never-ending cycle of frustration and confusion. They’re like that friend who always says they’ll be there for you but ends up ghosting you when you really need them. Maybe one day they’ll figure themselves out and actually make my life better. But until then, I’ll just be here, rolling my eyes every time I open a new tab. Whatever, man.

People are having sex in self-driving cars, apparently. Link

Top physicist says chatbots are just ‘glorified tape recorders’ Link

One small step for Spain, one final push for World Cup glory

Spain are just one step away from Women’s World Cup glory after their dramatic 2-1 semi-final win over Sweden on Tuesday and forward Salma Paralluelo said ‘La Roja’ had proved they have what it takes to be crowned champions.

Unraveling August 2023: August 14th, 2023

Manscape Refining Gentlemen

Latest AI News and Trends on August 14th, 2023

What is LLM? Understanding with Examples

What is LLM (Large Language Model)?

LLM (Large Language Model) is a type of AI model designed to understand and generate human-like text. These models are trained on vast amounts of text data and use deep learning techniques, such as deep neural networks, to process and generate language.

LLMs are capable of performing various natural language processing (NLP) tasks, including

Language translation
Text summarization
Question-answering
Sentiment analysis
Generating coherent and contextually relevant responses to user inputs

They are trained on a wide range of textual data sources, such as books, articles, websites, and other written content, allowing them to learn grammar, vocabulary, and contextual relationships in language.

Examples of Large Language Models

Some of the most popular large language models are:

GPT-3 by OpenAI: GPT-3 is a large language model that was first released in 2020. It has been trained on a massive dataset of text and code, and it can generate text, translate languages, write different kinds of creative content, and answer your questions in an informative way.
T5 by Google AI: T5 is a large language model that was first released in 2021. It is specifically designed for text generation tasks, and it can generate text that is more accurate, consistent, and creative than smaller language models.
LaMDA by Google AI: LaMDA is a large language model that was first released in 2022. It is specifically designed for dialogue applications, and it can hold natural-language conversations with users.
PaLM by Google AI: PaLM is a large language model that was first released in 2022. It is the largest and most powerful language model ever created, and it can perform a wide range of tasks, including text generation, translation, summarization, and question-answering.
FlaxGPT by DeepMind: FlaxGPT is a large language model that was first released in 2022. It is based on the Transformer architecture, and it can generate text that is more accurate and consistent than smaller language models.

https://www.seaflux.tech/blogs/llm-explained-with-examples

Advantages of LLM

Large language models (LLMs) have a number of advantages over traditional machine learning models. These advantages include:

Improved accuracy and performance: LLMs can be trained on massive datasets of text and code, which allows them to learn the nuances of human language and generate more accurate and consistent results than traditional machine-learning models.
Increased efficiency: LLMs can automate many tasks that were previously done manually, such as text classification, summarization, and translation. This can save businesses time and money, and free up human workers to focus on more creative and strategic tasks.
Expanded possibilities: LLMs can be used to create new and innovative products and services. For example, they can be used to develop chatbots that can hold natural-language conversations with customers or to create virtual assistants that can help users with tasks such as scheduling appointments or finding information.
Enhanced creativity: LLMs can be used to generate creative text formats, such as poems, code, scripts, musical pieces, emails, letters, and more with endless possibilities. This can be used to improve the quality of content or to create new and innovative forms of art and entertainment.
Reduced bias: LLMs can be trained on datasets that are more diverse than traditional datasets, which can help to reduce bias in their results. This is important for businesses and organizations that want to ensure that their products and services are fair and equitable for all users.

Challenges of LLM

Large language models (LLMs) are a powerful new technology, but they also come with several challenges. These challenges include:

Data requirements: LLMs require massive datasets of text and code to train. This can be a challenge for businesses and organizations that do not have access to large datasets.
Computational resources: LLMs require a lot of computational resources to train and run. This can be a challenge for businesses and organizations that lack the necessary resources.
Interpretability: LLMs are often difficult to interpret. This makes it difficult to understand how they work and to ensure that they are not generating harmful or biased results.
Bias: LLMs can be biased, depending on the data they are trained on. This can be a challenge for businesses and organizations that have ensured that their products and services are fair and equitable for all users.
Safety: LLMs can be used to generate harmful or misleading content. This can be challenging for businesses and organizations having a reputation for safe and secure services.

Use cases of LLM

The future of LLM models is bright. As this technology continues to develop, we can expect to see even more innovative and groundbreaking applications for LLMs in the future.

Some of the promising applications of LLMs include:

Virtual Assistants: LLMs could be used to power virtual assistants that are even more human-like and helpful than they are today. These virtual assistants could be used to provide a wide range of services, such as scheduling appointments, finding information, and controlling smart home devices.
Content Generation: LLMs could be used to generate more engaging and informative content. This content could be used to improve the customer experience, educate users, and entertain people.
Translation: LLMs could be used to translate text from one language to another more accurately and efficiently than ever before. This could help businesses to reach a wider audience and to provide better customer service.
Research: LLMs could be used to conduct research in a wider range of fields, such as natural language processing, machine translation, and artificial intelligence. This could help to advance our understanding of these fields and to develop new and innovative applications.
Education: LLMs could be used to create personalized learning experiences for students. These experiences could be tailored to each student’s individual needs and interests.
Healthcare: LLMs could be used to diagnose diseases, develop new treatments, and provide personalized care to patients.
Art and entertainment: LLMs could be used to create new forms of art and entertainment. This could include poems, code, scripts, musical pieces, emails, letters, etc.

Now that we have gone through the examples of Large Language Models, let us see how to utilize an LLM Library in different use cases along with code build. The LLM library used is provided by Hugging Face, called Transformer Library.

Introducing the Transformer Library

The transformer package, provided by huggingface.io, tries to solve the various challenges we face in the NLP field. It provides pre-trained models, tokenizers, configs, various APIs, ready-made pipelines for our inference, etc.

It is a large language model (LLM) developed by Hugging Face and a community of over 1000 researchers. It is trained on a massive dataset of text and code, and it can generate text, translate languages, and answer questions. Here we are going to see the following application of the Transformer Library:

Sentiment Analysis	Named Entity Recognition
Text Generation	Translate language
Question Answering Pipeline	Summarization

Before jumping to the examples of Transformer Library, we need to install the library to use it.

Install the Transformer Library

pip install transformers

By using the pipeline feature of the Transformers Library, you can easily apply LLMs for text generation, question answering, sentiment analysis, named entity recognition, translation, and more.

from transformers import pipeline

Example: Question Answering Pipeline

To perform question-answering using the Transformers library, you can utilize the pipeline feature with a pre-trained question-answering model. Here’s an example:

from transformers import pipeline

# Define the list of file paths
file_paths = ['document1.txt', 'document2.txt', 'document3.txt']

# Read the contents of each file and store them in a list
documents = []
for file_path in file_paths:
with open(file_path, 'r') as file:
document = file.read()
documents.append(document)

# Concatenate the documents using a newline character
context = "\n".join(documents)

# Use the pipeline with the updated context
nlp = pipeline("question-answering")
result = nlp(question="When did Mars Mission Launched?", context=context)

print(result['answer'])

The code prints the below output correctly to the question – When did Mars Mission Launch?

Output - 5 November 2013

IBM’s AI chip mimics the human brain

The human brain can achieve remarkable performance while consuming little power. IBM’s new prototype chip works similarly to connections in human brains. Thus, it could make AI more energy efficient and less battery draining for devices like smartphones.

The chip is primarily analogue but also has digital elements, which makes it easier to put into existing AI systems.

It addresses the concerns raised about emissions from warehouses full of computers powering AI systems. It could also cut the water needed to cool power-hungry data centers.

Why does this matter?

The advancements suggest the emergence of brain-like chips in the near future. It would mean large and more complex AI workloads could be executed in low-power or battery-constrained environments, for example, cars, mobile phones, and cameras. It promises new and better AI applications with reduced costs.

Source

NVIDIA’s tool to curate trillion-token datasets for pretraining LLMs

Most software/tools made to create massive datasets for training LLMs are not publicly released or scalable. This requires LLM developers to build their own tools to curate large language datasets. To meet this growing need, Nvidia has developed and released the NeMo Data Curator– a scalable data-curation tool that enables you to curate trillion-token multilingual datasets for pretraining LLMs. It can scale the following tasks to thousands of compute cores.

The tool curates high-quality data that leads to improved LLM downstream performance and will significantly benefit LLM developers attempting to build pretraining datasets.

Why does this matter?

Apart from improving model downstream performance with high-quality data, applying the above modules to your datasets helps reduce the burden of combing through unstructured data sources. Plus, it can potentially lead to greatly reduced pretraining costs, meaning relatively faster and cheaper development of AI applications.

Source

Trustworthy LLMs: A survey and guideline for evaluating LLMs’ alignment

It also thoroughly surveys the categories of LLMs that are likely to be crucial to improve their trustworthiness and shows how to build evaluation datasets for alignment accordingly.

Why does this matter?

The proposed framework facilitates a transparent, multi-objective evaluation of LLM trustworthiness. And it enables systematic iteration and deployment of LLMs. For instance, OpenAI has to devote six months to iteratively align GPT-4 before release. Thus, with clear and comprehensive guidance, it can facilitate faster time to market for AI applications that are safe, reliable, and aligned with human values.

Source

Amazon’s push to match Microsoft and Google in generative AI LINK

Amazon is developing proprietary chips, named “Inferentia” and “Trainium,” to rival Nvidia GPUs in terms of training and speeding up generative AI models.
The company’s late entry into the generative AI market has put it in a position of catch-up, with competitors like Microsoft and Google already investing heavily and integrating AI models into their products.
Despite Amazon’s cloud dominance, it aims to differentiate by leveraging its custom silicon capabilities, with Trainium offering significant price-performance improvements, although Nvidia remains dominant for training models.

World first’s mass-produced humanoid robots with AI brains LINK

Chinese start-up Fourier Intelligence showcased its humanoid robot GR-1, capable of walking on two legs at 5km/h carrying a 50kg load, highlighting the potential of bipedal robots.
Fourier originally focused on rehabilitation robotics, but in 2019, it embarked on creating humanoid robots, with GR-1 achieving success after three years of development.
While challenges remain in commercializing humanoid robots, Fourier aims to mass-produce GR-1 by year-end and sees potential applications in elderly care, education, and more.

Microsoft Designer: An AI-powered Canva: a super cool product that I just found!

I just found out about Microsoft Designer, which is an AI-powered tool for creating all types of graphics, from logos to invitations to social media posts. If you like Canva, you should check this out.

Some cool features:

Prompt-to-design: From just a short description, Designer uses DALLE-2 to generate original and editable designs.
Brand-kit: stay on-brand by instantly applying your fonts and color pallets to any design; it an even suggest color combinations.
Other AI tools: suggests hashtags and captions; replace background of an image with your imagination; erase items from an image; auto-fill a section of the image with generated image.

Source: this AI newsletter

ChatGPT costs OpenAI $700,000 PER Day

OpenAI is reportedly in “financial trouble” due to the astronomical costs of running ChatGPT, which is losing $700,000 daily. The article states OpenAI may go bankrupt in 2024 but I disagree because of their investment from Microsoft totaling $10B… there’s no way they can spend all of that right? let me know in the comments.

Costs Outpace Revenue

ChatGPT costs $700,000 per day to run.
Despite paid offerings, revenue can’t offset losses.
Projected 2023 revenue of $200M seems unlikely.

Mounting Problems

ChatGPT saw 12% drop in users from June to July.
Top talent being poached by rivals like Google and Meta.
GPU shortages hindering ability to train better models.

Increasing Competition

Cheaper open-source models can replace OpenAI’s APIs.
Musk’s xAI working on more right wing biased model.
Chinese firms buying up GPU stockpiles.

With ChatGPT’s massive costs outpacing revenue and problems like declining users and talent loss mounting, OpenAI seems to be in a precarious financial position as competition heats up.

Source: (link)

What Else Is Happening in AI on August 14th, 2023

Google appears to be readying new AI-powered tools for ChromeOS (Link)

Zoom rewrites policies to make clear user videos aren’t used to train AI (Link)

Anthropic raises $100M in funding from Korean telco giant SK Telecom (Link)

Modular, AI startup challenging Nvidia, discusses funding at $600M valuation (Link)

California turns to AI to spot wildfires, feeding on video from 1,000+ cameras (Link)

FEC to regulate AI deepfakes in political ads ahead of 2024 election (Link)

AI in Scientific Papers on August 14th, 2023

This research paper has found that LLMs can naturally read docs to learn how to use tools without any training. Instead of showing demonstration, just provide tool documentation. LLMs figured out how to use programs like image generators and video tracking software, without any new training [Link]
This paper analyses and visualises the political bias of major AI language models. ChatGPT and GPT-4 were most left-wing while Meta’s Llama was right-wing [Link]. This type of research is very important and highlights the inherent bias in these models. It’s practically impossible to remove bias also, and we don’t even know what they’ve been trained on. People need to understand, you control the models, you control what people see, especially as AI models are used more frequently and become mainstream
Remember the Westworld style paper with the 25 AI agents living their lives? It’s now open-source. It’s implications in gaming cannot be overstated. Can’t wait to see what comes of this [Link]
MetaGPT is framework using multiple agents to behave as an entire company – engineer, pm, architect etc. It has over 18k stars on github. This specialised for industries and companies will be powerful [Link]
This paper discusses reconstructing images from signals in the brain. Soon we’ll have brain interfaces that could read these signals consistently, maybe map everything you see? Potential is limitless [Link]
Nvidia is partnering with HuggingFace with DGX Cloud platform allowing people to train and tune AI models. They’re offering a “Training Cluster as a Service” which will help companies and individuals build and train models faster than ever [Link]
Stability AI has released their new AI LLM called StableCode. 16k context length and 3b params with other version on the way [Link]
This paper discusses a framework for designing and implementing complex interactions between AI systems called Flows [Link] Will be very important when building complex AI software in industry. Github will be uploaded soon [Link]
Nvidia announced that Adobe Firefly models will be available as APIs in Omniverse [Link] This thread breaks down what the Omniverse will look like [Link]
Anthropic CEO Dario Amodei thinks AI will reach educated levels of humans in 2-3 years [Link] For reference, Claude 2 is probably the second most powerful model alongside GPT4
Layerbrain is building AI agents that can be used across Stripe, Hubspot and slack using plain english [Link] Looks very cool
LLMs picking random numbers almost always pick the numbers 6-8 [Link]
Inflection founder Mustafa Suleyman says we’ll probably rely on LLMs more than the best trained and most experienced humans within 5 years [Link]. For context, Mustafa is one of the co founders of Google DeepMind – this guys knows AI
Writer, a startup using Nvidia’s NeMo discuss how it helped them build and scale over 10 models. NeMo isn’t publicly available but seems like a massive advantage considering Writer’s cloud infra, which is managed by 2 people, hosts a trillion API calls a month [Link] Link to NeMo [Link] Link to NeMo guardrails blog [Link]
Someone open-sourced smol-podcaster – it transcribes and labels speakers, formats the transcription, creates chapters with timestamps [Link]
Ultra realistic AI generated videos are coming. It’s impossible to tell they’re fake now [Link] Signup for early access here [Link]
Anthropic released Claude Instant 1.2. Its very fast, better at math and coding and hallucinates less [Link]
This guy released the code for his modded Google Nest Mini using OpenAI’s function calling to take notes and control his lights. Once Amazon & Apple integrates better LLMs into their prods, AI will truly be everywhere [Link]
If you search “As an AI language model” in Google Scholar a lot of papers come up… [Link]
OpenAI released custom instructions for ChatGPT free users, except for people in the US or UK [Link]
OpenAI, Google, Microsoft and Anthropic partnered with Darpa for their AI cyber challenge [Link]
PlayHT released their new text-to-voice ai model and it looks crazy good. Change the way its delivered by describing an emotion and much more [Link] [Link]
A paper by Google showcasing that AI models tend to repeat a user’s opinion back to them, even if its wrong. Thread breaking it down [Link] Link to paper [Link]
Medisearch comes out of YC and claims to have the best model for medical questions [Link]
Someone made a way to one-click install AudioLDM with gradio web ui [Link]
A way to make llama-2 much faster [Link]
WizardLM released a new math model that outperforms ChatGPT on math skills [Link]
A team of researchers trained an AI model to hear the sounds of keystrokes and steal data. Apparently it has a 95% success rate. Link to article [Link] Link to paper [Link]
Yann LeCun gave a talk at MIT about Objective-Driven AI [Link]
Google released 7 free courses on gen AI [Link] [Link] [Link] [Link] [Link] [Link] [Link]
Alpaca, a new AI tool for artists is out for public beta. It’s sketch to image is very powerful [Link]
One of the most lucrative businesses in the AI arms race? GPU cloud. Coreweave got $400M in funding and are set to make billions [Link]
Google releases a guidebook on best practices when designing with AI [Link]
A great article on LLMs in healthcare [Link]
Implement text-to-SQL using langchain, a breakdown[Link]
SDXL implemented in 520 lines of code in a single file [Link]
OpenAI released a blog on Special Projects – one of them involved trying to find secret breakthroughs in the world [Link]
Google announced Project IDX, a browser-based code environment. Brings app dev to the cloud and has AI features like code gen, completion etc [Link] A shot at replit it seems
Meta open-sourced AudioCraft – musicgen, audiogen and encodec. Definitely worth checking out [Link]
If you’re interested in fine-tuning open-source models like Llama-2, definitely check out this blog [Link] In some cases, fine-tuned llama2 is better than gpt4 (for sql generation for example). Overall a great read if you’re interested in fine tuning
Nvidia released the code for Neuralangelo, an AI model that reconstructs 3d surfaces from 2d videos [Link]
Create digital environments in seconds with Blockade labs. Wild stuff [Link]
This paper compares the answers of ChatGPT and stackoverflow for software engineering questions [Link] “52% of chatgpt answers are incorrect and 77% are verbose but are still preferred 39% of the time due to their comprehensiveness and well-articulated language style”. Only issue is this uses 3.5. Need this test with gpt4

Latest Tech News and Trends on August 14th, 2023

Privacy win: Starting today Facebook must pay $100.000 to Norway each day for violating our right to privacy. Link

College professors are going back to paper exams and handwritten essays to fight students using ChatGPT. Link

New Footage Shows Tesla On Autopilot Crashing Into Police Car After Alerting Driver 150 Times. Link

IBM’s prototype brain-like chip promises efficient, greener AI
– The human brain can achieve remarkable performance while consuming little power. IBM’s new prototype chip works similarly to connections in human brains. Thus, it could make AI more energy efficient and less battery draining for devices like smartphones. The chip is primarily analogue but also has digital elements, which makes it easier to put into existing AI systems.

NVIDIA’s tool to curate trillion-token datasets for pretraining LLMs
– To meet the growing demands for curating pretraining datasets for LLMs, Nvidia has released Data Curator as part of the NeMo framework. It is a scalable data-curation tool that enables you to curate trillion-token multilingual datasets for pretraining LLMs. It also curates high-quality data that leads to improved LLM downstream performance and will significantly benefit LLM developers attempting to build pretraining datasets.

Trustworthy LLMs: A survey and guideline for evaluating LLMs’ alignment
– New research has surveyed the categories of LLMs that are likely to be important for practitioners to focus on in order to improve LLMs’ trustworthiness. It explains in detail how to evaluate an LLM’s trustworthiness according to the above categories and build evaluation datasets for alignment accordingly in a more fine-grained manner.

ChromeOS might get some new AI-powered tools
– Google appears to be readying an AI writing tool for ChromeOS. Its code has hints of some AI tools for suggestions and rewrites.

Zoom rewrites policies to make clear your videos aren’t used to train AI tools
– Zoom has updated its terms of service and reworded a blog post explaining the recent changes. The company now explicitly states that “communications-like” customer data isn’t being used to train AI models for Zoom or third parties.

Anthropic raises $100M from Korean telco giant SK Telecom – They plan to co-develop a multilingual LLM customized for global telco firms.

Modular, AI startup challenging Nvidia, to be valued at $600M
– It is to raise Series A funding that would value it at roughly $600 million. Nvidia makes Cuda, the dominant software for writing ML apps that works only with Nvidia chips. Modular’s software aims to make it easier for AI developers to train and run their ML models on chips designed by other companies, including AMD, Intel, and Google.

AI avatars are coming. In my mind the biggest market for this might be content creators. People who need to appear on video and are tired of ensuring pitch perfect recordings.

LLMs have in-built political biases. Meta’s Llama has right-wing bias and GPT-4 has left-wing bias. Really? Who would’ve thought?

It is often said that “the devil is in the details”. As this article points out the question on AI regulation is going to be as much about laws as it is about procedure.

Amazon’s AI tool to help sellers write product descriptions. I don’t know if this is the right step forward. Currently, Amazon has an issue with cheap knockoffs. I don’t see how empowering these sellers will help.

A sober look at AI in education.

Implement Llama from scratch.

Fine Tuning Llama-2

Artificial General Intelligence – A gentle introduction

Discover a career in AI – Search 500+ job opportunities

GPTE just launched an AI job board with over 500 opportunities for those interested in joining the AI field professionally.

Latest Sport News on August 14th, 2023

Raphael Varane scored the winner 14 minutes from time as Manchester United gained a fortunate opening-weekend win against Wolves at Old Trafford. Link

Neymar transfer news: Al-Hilal agree deal with Paris St-Germain for Brazil forward. Link

Why is Saudi Pro League signing European clubs’ stars? Link

Moises Caicedo transfer news: Chelsea sign Brighton midfielder for £100m. Link

Harry Kane’s Bayern Munich may have found its goalkeeper;

Neymar to be paid $500K per social media post by Saudi Arabia;

Chelsea unveils Caicedo, breaks UK biggest transfer record;

Everton heartbroken at death of worker at new stadium;

Spalletti to succeed Mancini as Italy boss but issue emerges;

FC Dallas striker Jesus Ferreira wanted by Cadiz in LaLiga;

UEFA unsure on Athens hosting European final after fan violence;

Liverpool agrees $75m for Lavia, but can Chelsea scoop them?;

Kepa Arrizabalaga replaces Courtois at Real Madrid;

Second new Messi documentary in the works by Apple TV;

Unraveling August 2023: August 13th, 2023

Latest AI News and Trends on August 13th, 2023

Amazon wants you to pay with your palm LINK

Amazon is introducing Amazon One, a biometric hand-scanning service that allows users to pay at Whole Foods, Amazon Fresh stores, Panera restaurants, airports, stadiums, and Starbucks locations using their palm.
This move is part of Amazon’s effort to compete with Google and Apple in the digital wallet space, aiming to create a universal identity provider that goes beyond payments, potentially connecting to various services, including health records.
Amazon One uses near-infrared light to capture palm vein patterns and surface features, with a focus on security through encrypted hand scan transmission, but it faces privacy concerns and the challenge of convincing merchants to adopt the technology.

California’s AI-driven wildfire detection LINK

The California Department of Forestry and Fire Protection (Cal Fire) has launched the Alert California AI program in collaboration with UCSD, using AI and 360-degree cameras to detect potential wildfires by identifying abnormalities in camera feeds.
The program successfully detected and prevented a fledgling fire in the Cleveland National Forest, alerting firefighters who extinguished the flames within 45 minutes.
Alert California utilizes LiDAR scans and machine learning to differentiate between smoke and other particles, aiming to combat wildfires in the face of extreme climate conditions.

White House’s $1.2B carbon capture initiative LINK

The Department of Energy is providing grants of up to $1.2 billion to two direct air capture (DAC) projects aiming to remove over 2 million metric tons of CO2 annually, equivalent to emissions from 445,000 gas-powered cars.
The DAC projects in Texas and Louisiana, supported by the Regional Direct Air Capture Hubs program, will create jobs and could potentially remove up to 30 million tons of CO2 per year, contributing to the US goal of emissions neutrality by 2050.
The DOE aims to lower DAC costs below $100 per metric ton of CO2-equivalent and is funding feasibility studies, engineering projects, and a carbon removal credits program to achieve global impact on carbon reduction.

FTX’s Sam Bankman-Fried is back in jail LINK

Sam Bankman-Fried, former CEO of FTX, had his bail revoked ahead of his trial following allegations of leaking a diary to the New York Times.
Bankman-Fried faces charges including defrauding FTX investors and was initially under house arrest on a $250 million bond.
US District Court Judge revoked his bail due to alleged misconduct and possible witness intimidation, leading to potential detention at a detention center during trial.

AI can now outperform humans in Captcha tests LINK

A study reveals that humans are slower and less accurate than bots in solving Captcha tests, raising questions about their effectiveness.
Captchas are intended to deter bots from accessing services, preventing malicious activities like DDoS attacks and spam accounts.
Bots can outperform humans in solving certain types of Captchas, indicating an ongoing challenge in maintaining their efficacy.

Bots Are Better at Solving Captchas Than Humans, Research Shows

Not only are bots faster at solving captchas than humans they’re also more accurate.

Unraveling August 2023: August 12th, 2023

Latest AI News and Trends on August 12th 2023: Week Recap

Apple bets big on AI

Jupyter brings AI to notebooks

Jupyter AI is a tool that brings generative AI to Jupyter notebooks, allowing users to explore and work with AI models. It offers an %%ai magic command that turns the notebook into a reproducible generative AI playground, a native chat UI for working with generative AI as a conversational assistant, and support for various generative model providers.

Jupyter AI is compatible with JupyterLab, with version 1.x supporting JupyterLab 3.x, and version 2.x supporting JupyterLab 4.x. The main branch of Jupyter AI focuses on the newest supported version of JupyterLab, with features and bug fixes backported to JupyterLab 3 if deemed valuable.

ChatGPT’s emotional awareness is more than humans’. What?

A study found that ChatGPT has higher emotional awareness than humans. The machine was subjected to a standardized test measuring human emotional awareness and scored significantly higher. The test required participants to show empathy in fictional scenarios.

ChatGPT outperformed humans in all categories, achieving an overall score of 85 compared to 56 for men and 59 for women. The researchers suggest that ChatGPT could be helpful in psychotherapy, cognitive training, and diagnosing mental illness. Previous studies have shown that people perceive ChatGPT’s responses as more empathetic than medical professionals.

Microsoft’s many AI monetization plans

Microsoft has announced new Azure AI infrastructure advancements and availability to bring its customer closer to the transformative power of generative AI.

Azure OpenAI Service goes global: OpenAI’s most advanced models, including GPT-4 and GPT-35-Turbo, will now be available in multiple new regions and locations.
General availability of ND H100 v5 VMs for unprecedented AI processing and scale: -It also announced general availability of the ND H100 v5 Virtual Machine series, featuring the latest NVIDIA H100 Tensor Core GPUs and low-latency networking, propelling businesses into a new era of AI applications.

OpenAI launches a web crawler to train ChatGPT

Called GPTBot, the crawler will comb through the internet to train and enhance AI’s capabilities. It can be identified by the following user agent and string.

Web pages crawled with the GPTBot user agent may potentially be used to improve future models and are filtered to remove sources that require paywall access, are known to gather personally identifiable information (PII), or have text that violates our policies.

Moreover, OpenAI also revealed how websites can prevent GPTBot from accessing their sites, either partially or by opting out entirely.

AI deep fake audios are getting scarily realistic

Speech deepfakes are artificial voices generated by AI models. While studies investigating human detection capabilities are limited, a new experiment presented genuine and deep fake audio to individuals and asked them to identify the deep fakes. Listeners could correctly spot the deep fakes only 73% of the time.

The experiment was done in English and Mandarin to understand if language affects detection performance and decision-making rationale. However, there was no difference in detectability between the two languages.

NVIDIA’s Biggest AI Breakthroughs

Reveals a new chip GH200

Nvidia announced a new chip GH200, designed to run AI models. It has the same GPU as the H100, Nvidia’s current highest-end AI chip, but pairs it with 141 gigabytes of cutting-edge memory and a 72-core ARM central processor. This processor is designed for the scale-out of the world’s data centers.

The adoption of Universal Scene Description (OpenUSD)

Announced new frameworks, resources, and services to accelerate the adoption of Universal Scene Description (USD), known as OpenUSD. Through its Omniverse platform and a range of technologies and APIs, including ChatUSD and RunUSD, NVIDIA aims to advance the development of OpenUSD, a 3D framework that enables interoperability between software tools and data types for creating virtual worlds.

An AI Workbench

Introduced AI Workbench, a developer toolkit that simplifies creating, testing, and customizing pre-trained generative AI models. The toolkit allows developers to scale these models to various platforms, including PCs, workstations, enterprise data centers, public clouds, and NVIDIA DGX Cloud. This will speed up the adoption of custom generative AI for enterprises worldwide.

The Partnership between NVIDIA and Hugging Face

NVIDIA and Hugging Face have partnered to bring generative AI supercomputing to developers. Integrating NVIDIA DGX Cloud into the Hugging Face platform will accelerate the training and tuning of large language models (LLMs) and make it easier to customize models for various industries. This partnership aims to connect millions of developers to powerful AI tools, enabling them to build advanced AI applications more efficiently.

Google’s AI Surprise for Developers

Project IDX is an experiment by Google to improve full-stack, multi-platform app development. It aims to simplify the complex app development process across mobile, web, and desktop platforms. It is a browser-based development experience built on Google Cloud and powered by Codey, Google’s PaLM 2-based foundation model for programming tasks.

It allows developers to work from anywhere, import existing projects, and preview apps across platforms. It supports frameworks like Angular, Flutter, Next.js, React, Svelte, Vue and languages like JavaScript and Dart. AI capabilities like smart code completion and contextual code actions are also included. Google plans to add support for more languages like Python and Go in the future. Additionally, Project IDX integrates with Firebase hosting for easy deployment of web apps.

Stability AI launches LLM code generator

Stability AI has released StableCode, an LLM generative AI product for coding. It aims to assist programmers in their daily work and provide a learning tool for new developers. StableCode uses three different models to enhance coding efficiency. The base model was trained in various programming languages, including Python, Go, Java, and more. It was then further trained on 560B tokens of code.

The instruction model was tuned for specific use cases by training it on 120,000 code instruction/response pairs. StableCode offers a unique solution for developers to improve their coding skills and productivity.

Anthropic’s Claude Instant 1.2- Faster and safer LLM

Anthropic has released an updated version of Claude Instant, its faster, lower-priced yet very capable model which can handle a range of tasks including casual dialogue, text analysis, summarization, and document comprehension.

Claude Instant 1.2 incorporates the strengths of Claude 2 in real-world use cases and shows significant gains in key areas like math, coding, and reasoning. It generates longer, more structured responses and follows formatting instructions better. It has also made improvements on safety. It hallucinates less and is more resistant to jailbreaks, as shown below.

Source

Google attempts to answer if LLMs generalize or memorize

LLMs can certainly seem like they have a rich understanding of the world, but they might just be regurgitating memorized bits of the enormous amount of text they’ve been trained on. How can we tell if they’re generalizing or memorizing?

In this research, Google examines the training dynamics of a tiny model and reverse engineers the solution it finds – and in the process provides an illustration of the exciting emerging field of mechanistic interpretability. It seems that LLMs start by generalizing reasonably well but then change towards memorizing things.

Source

IBM plans to make Meta’s Llama 2 available on watsonx.ai

IBM will host Llama 2-chat 70B model in the watsonx.ai studio, with early access available to select clients and partners. This will build on IBM’s collaboration with Meta on open innovation for AI, including work with open-source projects developed by Meta. It will also support IBM’s strategy of offering both third-party and its own AI models.

Source

Apple gearing up for an AI showdown

While Apple does not currently have a ChatGPT-like app for external use, it is believed that this division of Foxconn already supplies servers to ChatGPT OpenAI, Nvidia, and Amazon Web Services. The news comes amidst reports about Apple’s plans to compete in the AI chatbot market.

Source

Midjourney’s present + future plans

Midjourney is rolling out a GPU cluster upgrade today. Pro and Mega users should see speedups of ~1.5x (/imagine from ~50 sec to ~30 sec). These renders should also be 1.5x cheaper.

They’re releasing V5.3, possibly next week, which will include features like inpainting and a new style (aesthetic) and may be only available on desktop. V6 is also in the works, aiming to enhance performance and language understanding. The website’s frontend is being worked on by a team, and it will be available for both desktop and mobile users. The launch date is approaching, but no specific date has been announced.

Source

MetaGPT tackling LLM hallucination

MetaGPT is a new framework that improves multi-agent collaboration by incorporating human workflows and domain expertise. It addresses the problem of hallucination in LLMs by encoding Standardized Operating Procedures (SOPs) into prompts, ensuring structured coordination.

The framework also mandates modular outputs, allowing agents to validate outputs and minimize errors. By assigning diverse roles to agents, MetaGPT effectively deconstructs complex problems.

Latest Tech News and Trends on August 12th 2023

Robotaxis greenlit for 24/7 operations in San Francisco LINK

California approved all-day paid robotaxi service in San Francisco, allowing unlimited self-driving car fleets.
The decision came amid objections from San Francisco officials, after a six-hour public comment session, and was a result of applications from Cruise (backed by GM) and Waymo (an Alphabet subsidiary).
Despite some challenges with driverless cars on the city’s streets, Cruise and Waymo see this approval as a pivotal step towards making their investments in self-driving technology profitable.

Russia launches its first lunar mission in 47 years LINK

Russia launches Luna-25, its first lunar mission since 1976, targeting the Moon’s south pole to potentially uncover water ice beneath its surface.
The mission is symbolic, referencing the Soviet Space Program era, and aims to project Russia as an influential world power amidst tensions following its 2022 Ukraine invasion.
Luna-25 is in competition with India’s Chandrayaan-3 mission, with both crafts expected to reach the Moon’s south pole around the same time.

Virgin Galactic debuts with its first civilian spaceflight LINK

Virgin Galactic’s second commercial flight, Galactic 02, took three private citizens to suborbital heights, including a historic mother-daughter duo.
The VSS Unity reached a peak altitude of 55 miles (88 kilometers) in an hour-long flight, with Kelly Latimer becoming the first woman pilot of a commercial spaceflight.
Following recent successes, Virgin Galactic aims for monthly commercial launches and is developing its Delta Class spacecraft for 2026, though substantial revenue from these flights is not anticipated.

Amazon penalizes excessive remote work LINK

Amazon warned US staff who didn’t spend enough time in the office after tracking their attendance.
The company’s office policy, effective since May, requires employees to be present at least three days a week.
Amazon responded to concerns by stating the warning was for those not adhering to the policy, but acknowledged potential inaccuracies in tracking.

Chinese firms invest billions in Nvidia GPUs LINK

Chinese internet giants, in response to US sanctions, are purchasing vast numbers of Nvidia GPUs to bolster their AI capabilities.
Companies like Alibaba, Baidu, ByteDance, and Tencent have reportedly spent around $1 billion on 100,000 Nvidia A800 GPUs, with further orders amounting to an additional $4 billion.
The GPUs are crucial for training large language models, and while the US seeks stricter export limitations on AI tech to China, US companies continue to design specific AI chips for the Chinese market.

Latest Football and Sport News on August 11th 2023

Australia ‘going nuts’ and soccer in the country ‘changed forever’ after the Matildas’ historic win

As Australia’s victorious players embarked on a lap of honor, home fans danced and cheered joyously inside a sold out Brisbane Stadium on Saturday. Neither players or supporters had experienced anything like it before.

Ronaldo wins first title at Al-Nassr with brace in Arab Club Champions Cup final

Cristiano Ronaldo guided nine-man Al-Nassr to their first ever Arab Club Champions Cup title after scoring twice in a 2-1 extra-time win over fellow Saudi side Al-Hilal on Saturday at the King Fahd Stadium.

World Cup Daily: England set up semifinal clash with Australia

It’s a case of old rivalry, new sport as England beat Colombia to set up a Women’s World Cup semifinal clash with perennial sporting rivals Australia.

Tom Brady makes first appearance at Birmingham City soccer game

Tom Brady made the trip across the pond to watch his new football club win on Saturday.

Bellingham scores on competitive debut ads Real wins 2-0

Jude Bellingham runs with the ball on his Real Madrid debut — Jude Bellingham’s goal came 37 minutes into his La Liga debut

England midfielder Jude Bellingham scored on his competitive Real Madrid debut as they began their La Liga season with victory at Athletic Bilbao.

Harry Kane makes Bayern Munich debut in German Super Cup defeat by RB Leipzig

Clinical Isak helps Newcastle hammer Villa

Alexander Isak celebrating after scoring — Alexander Isak suffered an injury hit debut season in the Premier League but got his second campaign off to the perfect start

Alexander Isak’s clinical finishing helped Newcastle United to an emphatic victory against Aston Villa on the opening weekend of the new Premier League campaign.

Unraveling August 2023: August 11th 2023

Latest AI News and Trends on August 11th 2023

AI Tutorial: Applying the 80/20 Rule in Decision-Making with ChatGPT

The Pareto Principle, or the 80/20 rule, is the idea that 80% of results come from 20% of efforts. This concept is integral to many aspects of life, including productivity, business, and personal growth. By embracing this principle with tools like ChatGPT, you can make more efficient decisions and concentrate on what’s most important.

Try the prompt below:

Employing the 80/20 rule, please help me analyze my e-commerce business. I want to know which 20% of my products are generating 80% of my sales and which 20% of my marketing efforts are leading to 80% of my traffic. Additionally, provide insights on how I can optimize my operations based on this principle.

MetaGPT tackling LLM hallucination

The framework also mandates modular outputs, allowing agents to validate outputs and minimize errors. By assigning diverse roles to agents, MetaGPT effectively deconstructs complex problems.

Why does this matter?

Experiments on collaborative software engineering benchmarks show that MetaGPT generates more coherent and correct solutions than chat-based multi-agent systems. And Integrating human knowledge into multi-agent systems opens up new possibilities for tackling real-world challenges.

Source

Will AI ads be allowed in the next US elections?

Summary: The Federal Election Commission (FEC) has initiated a process that may lead to the regulation of AI-generated deepfakes in political ads before the 2024 election, aiming to protect voters against this form of election disinformation. (source)

Key Points:

A petition to regulate ads using AI for mis-representation.
Existing campaigns, including Florida GOP Gov. Ron DeSantis’s, are already using this technology.
The FEC will decide on rules only after a 60-day public comment window, likely starting next week.
Regulation could impose guidelines for disclaimers, but not cover all deepfake-related threats, such as those from individual social media users.

Why It Matters: With elections around the corner, the potential use of AI in misleading political ads is a hot topic. The decision to possibly regulate AI shows an understanding of its possible risks, but the real test will be in getting rules on the books. It’s not just about politics; it’s about truth in a world where seeing is no longer believing.

What Else Is Happening in AI on August 11th 2023

Microsoft introduced new tools for global frontline workers, enhancing their capabilities. (Link)

Google keyboard’s new update could include AI-powered proofreading, AI emojis & more. (Link)

Runway’s new update allows you to extend your Gen-2 videos up to 18 seconds! (Link)

China’s internet giants, including Baidu, TikTok-owner, Alibaba have reportedly ordered $5B worth of Nvidia chips! (Link)

PlayHT2.0 is a new AI model that can “talk”? (Link)

A new AI algorithm has detected a potentially hazardous asteroid that had gone unnoticed by human observers, slated to fly by Earth. The algorithm, HelioLinc3D, was explicitly designed for the Vera Rubin Observatory currently under construction in Northern Chile.[Link]

The U.S. Defense Department has created a task force to evaluate and guide the application of generative artificial intelligence for national security purposes, amid an explosion of public interest in the technology. [Link]

China’s largest web and cloud providers (Alibaba, Baidu, ByteDance, and Tencent)are lining up to buy as many Nvidia GPUs as they can while they still can get their hands on them. [Link]

At Black Hat USA 2023, DARPA issued a call to top computer scientists, AI experts, software developers, and beyond to participate in the AI Cyber Challenge (AIxCC) – a two-year competition aimed at driving innovation at the nexus of AI and cybersecurity to create a new generation of cybersecurity tools. [Link]

Apple is working aggressively on AI
– Apple has reportedly ordered servers from Foxconn Industrial Internet, a division of its supplier Foxconn, for the testing and training of AI services. The servers are specifically for Apple’s AI work, which has been a focus for the company for years. While Apple does not currently have a ChatGPT-like app for external use, it is believed that this division of Foxconn already supplies servers to ChatGPT OpenAI, Nvidia, and Amazon Web Services. The news comes amidst reports about Apple’s plans to compete in the AI chatbot market.

Midjourney’s future plans revealed
– They’re rolling out a GPU cluster upgrade today. Pro and Mega users should see speedups of ~1.5x (/imagine from ~50 sec to ~30 sec). These renders should also be 1.5x cheaper.
– They’re releasing V5.3 possibly next week, will include features like inpainting and a new style (aesthetic) and may be only available on desktop.

Microsoft introduced new tools for global frontline workers, enhancing their capabilities
– The company’s Copilot offering utilizes generative AI to enhance the efficiency of service professionals. Microsoft highlights the significant size of the frontline workforce, estimating it to be 2.7 billion globally. The new tools and integrations are designed to empower these workers and address labor challenges faced by businesses.

Google keyboard’s new update could include AI-powered proofreading, AI emojis & more
– Google is enhancing its Gboard keyboard with new features powered by AI. These features include AI emojis, proofreading, and a drag mode that allows users to resize the keyboard to their liking. The updates have been discovered in the latest beta version of Gboard.

PlayHT2.0 is a new AI model that can “talk”
– It has an Instant Voice Cloning capability that can capture any voice and accent from just 3s of a speaker’s voice and synthesize speech in a truly conversational tone.
– Trained on over a million hours of speech across multiple languages, accents, and speaking styles.

Runway’s new update allows you to extend your Gen-2 videos up to 18 seconds.
– Available now in the browser and coming soon to iOS.

China’s internet giants, including Baidu, TikTok-owner ByteDance, Tencent, and Alibaba, have reportedly ordered $5 billion worth of Nvidia chips to power their AI ambitions. The orders, totaling about 100,000 A800 processors, are crucial for building generative AI systems. The chips are expected to be delivered this year. This move highlights China’s growing focus on AI technology and its desire to become a global leader in the field.

TikTok Introduces Toggle for AI-Generated Content Disclosure

TikTok is reportedly adding a toggle that enables creators to label AI-generated content, aiming to prevent content removal and enhance transparency.

Belva: Empower an AI agent to manage your phone calls effectively—an ideal solution for call management optimization.

Broadcast: Streamline the drafting and distribution of weekly updates using this AI-automated tool. It offers collaboration features, readership insights, and workflow optimization across platforms like Slack and Email.

Zefi: Enhance your product development process with this AI tool, integrating with development platforms to gather data, cluster feedback, assist in prioritization, and align stakeholders.

YT Transcripts by Editby: Download and edit YouTube videos easily with this tool, making it perfect for content creators seeking to repurpose their YouTube content.

AI Tools Database: Explore a comprehensive Notion database featuring 1350 useful AI tools curated by The Intelligo.

Latest Tech News and Trends on August 11th 2023

How to Stop Android Notifications from Turning On the Screen

Sick of notifications lighting up your Android screen? You can prevent this in your app settings. This guide explains how.

Usually, a notification will buzz on your phone or beep at you while displaying on the screen to be noticed. However, this behavior can drain your battery faster and become annoying in general to deal with. You can turn off an app’s notification behavior in your device’s settings.

There isn’t a universal setting to prevent all apps from waking the lock screen, so you’ll need to manage them individually. Here’s how.

How to Disable App Wake Screen Settings on Android

Unless you enable Airplane Mode or turn on your device’s Do Not Disturb option, apps will continue to wake your screen by default. So, you need to manage each app you want to stop notifications from tediously.

To stop notifications from turning on the screen on Android:

Swipe down from the top of the screen and tap Settings (gear icon) in the top-right corner.
Select the Notifications option from the Settings menu.
Tap the App notifications option to view your complete list of installed apps.
Select the app that you don’t want to wake your screen.
Tap the Silent option under the Alerts section.
You can also disable all app notifications by toggling off the Allow notifications switch. You won’t have access to the notification settings for all apps when you turn this off, however.
It’s also important to note that some apps will allow you to manage specific notifications by selecting the Notification categories option and toggling individual notification types on or off.

How to Use In-App Settings to Stop Apps Waking Your Android Screen

Depending on the app, you may be able to stop app notifications from turning on the screen from within the app itself. For example, in the Snapchat app’s settings menu, you can turn off the Wake Screen option for notifications that’s enabled by default.

Android 14 lets you easily check if an unknown object tracker is tracking you

Following last month’s announcement, Google is rolling out the ‘Unknown Tracker Alerts’ feature to Android users. Thanks to this alert notification, Android users will know if someone has placed an unknown tracker on them. Also, the unknown tracker alert will be sent when someone else’s tracking device is separated from them and is found to be traveling with another person.

iOS 17: how to create a Contact Poster on your iPhone

Contact Posters offer an engaging canvas to express creativity via otherwise mundane-looking contact cards. Here’s how to create them in iOS 17 on your iPhone.

he whole process of creating a Contact Poster is fairly easy. You can create a Contact Poster for your own number, or any other acquaintance in your contact list.

Another neat convenience that comes with Contact Posters is that you can share them by just bringing two iPhones close to each other. It also works if you tap your iPhone against an Apple Watch.

For this guide, we’ll go through the step-by-step process of creating a contact poster for fellow Digital Trends contributor Tushar Mehta. The process is identical if you are creating a contact poster for yourself. To do that, just tap on your name when it appears at the top of the contacts list in the Phone app.

Step 1: Open the Phone app on your iPhone and select the contact that needs a poster makeover. As you tap on a name, it will open the detailed contact page as shown in the image below.

iPhone Contact Poster feature in iOS 17 9

Step 2: On the contact page, tap on the Edit button in the top-right corner of the screen. On the next page, either tap on the circle with the contact name initials, or the pill-shaped Add Photo button.

iPhone Contact Poster feature in iOS 17 8

The US just invested more than $1 billion into carbon removal / The move represents a big step in the effort to suck CO2 out of the atmosphere—and slow down climate change. Link

Latest World USA Sport News on August 11th 2023

Orange juice prices to surge as US crops ravaged by disease and climate. Link

Teenage girl dies after being forced to stay in a ‘period hut’ in Nepal. Link

Nearly 50,000 Americans died by suicide in 2022, a record-high number: CDC. Link

Supreme Court blocks OxyContin maker’s bankruptcy deal that would shield Sackler family members. Link

New school bus routes a ‘disaster,’ Kentucky superintendent admits. Last kids got home at 10 pm. Link

2 minutes daily football news: Spain 2-1 Ned; Japan 1 – 2 Sweden; Harry Kane Caicedo; #soccer #footy

Liverpool have agreed a British record transfer fee of £111m with Brighton for midfielder Moises Caicedo.

England captain Harry Kane is set to have a medical at Bayern Munich after being given permission to travel to Germany by Tottenham.

Sweden produced a magnificent performance to book a semi-final date with Spain and leave Japan’s Women’s World Cup dreams in tatters

Teenage winger Salma Paralluelo came off the bench to score a 111th-minute winner as Spain beat the Netherlands to reach the Women’s World Cup semi-finals for the first time.

Off the pitch, few teams at this Women’s World Cup have been as dysfunctional and wracked by controversy as Spain.

Soccer Football Saudi Pro League kicks off after raiding Europe’s top football clubs.

Unraveling August 2023: August 10th 2023

Latest AI News and Trends on August 10th 2023

Advanced Library of 1000+ free GPT Workflows with HeroML – To Replace most “AI” Apps. By u/papsamir

Disclaimer: all links below are free, no ads, no sign-up required for open-source solution & no donation button. Workflow software is not only free, but open-source ❣️

This post is longer than I anticipated, but I think it’s really important and I’ve tried to add as many screenshots and videos to make it easier to understand. I just don’t want to pay for any more $9 a month chatgpt wrappers. And I don’t think you do either..

Hi again! About 4 months ago, I posted here about free libraries that let people quickly input their own values into cool prompts for free. Then I made some more, and heard a lot of feedback.

Lots of folks were saying that one prompt alone cannot give you the quality you expect, so I kept experimenting and over the last 3 months of insane keyboard-tapping, I deduced a conversational-type experience is always the best.

I wanted to have these conversations, though, without actually having them... I wanted to automate the conversations I was already having on ChatGPT!

There was no solution, nor a free alternative to the giants (and the lesser giants who I know will disappear after the AI hype dies off), so I went ahead and made an OPEN-SOURCE (meaning free, and meaning you can see how it was made) solution called HeroML.

It’s essentially prompts chained together, and prompts that can reference previous responses for ❣️ context ❣️

Here’s a super short video example I was almost too embarrassed to make (Youtube mirror: 36 Second video):

0:00

0:37

quick example of how HeroML workflow steps work

Simple Example of HeroML

There reason I wanted to make something like this is because I was seeing a lot of startups, for the lack of a better word, coming up with priced subscriptions to apps that do nothing more than chain a few prompts together, naturally providing more value than manually using ChatGPT, but ultimately denying you any customization of the workflow.

Let’s say you wanted to generate… an email! Here’s what that would look like in HeroML:

(BTW, each step is separated by ->>>>, so every time you see that, assume a new step has begun, the below example has 4 steps*)*

You are an email copywriter, write a short, 2 sentence email introduction intended for {{recipient}} and make sure to focus on {{focus_point_1}} and {{focus_point_2}}. You are writing from the perspective of me, {{your_name}}. Make sure this introduction is brief and do not exceed 2 sentences, as it's the introduction.

->>>>

Your task is to write the body of our email, intended for {{recipient}} and written by me, {{your_name}}. We're focusing on {{focus_point_1}} and {{focus_point_2}}. We already have the introduction:

Introduction:
{{step_1}}

Following on, write a short paragraph about {{focus_point_1}}, and make sure you adhere to the same tone as the introduction.

->>>>

Your task is to write the body of our email, intended for the recipient, "{{recipient}}" and written by me, {{your_name}}. We're focusing on {{focus_point_1}} and {{focus_point_2}}. We already have the introduction:

Introduction:
{{step_1}}

And also, we have a paragraph about {{focus_point_1}}:
{{step_2}}

Now, write a short paragraph about {{focus_point_2}}, and make sure you adhere to the same tone as the introduction and the first paragraph.

->>>> 

Your task is to write the body of our email, intended for {{recipient}} and written by me, {{your_name}}. We're focusing on {{focus_point_1}} and {{focus_point_2}}. We already have the introduction:

Introduction:
{{step_1}}

We also have the entire body of our email, 2 paragraphs, for {{focus_point_1}} & {{focus_point_2}} respectively:

First paragraph:
{{step_2}}

Second paragraph:
{{step_3}}

Your final task is to write a short conclusion the ends the email with a "thank you" to the recipient, {{recipient}}, and includes a CTA (Call to action) that requires them to reply back to learn more about {{focus_point_1}} or {{focus_point_2}}. End the conclusion with "Wonderful and Amazing Regards, {{your_name}}

It may seem like this is a lot of text, and that you could generate this in one prompt in ChatGPT, and that’s… true! This is just for examples-sake, and in the real-world, you could have 100 steps, instead of the four steps above, to generate anything where you can reuse both dynamic variables AND previous responses to keep context longer than ChatGPT.

For example, you could have a workflow with 100 steps, each generating hundreds (or thousands) of words, and in the 100th step, refer back to {{step_21}}. This is a ridiculous example, but just wanted to explain what is possible.

I’ll do a quick deep dive into the above example.

You can see I use a bunch of dynamic variables with the double curly brackets, there are 2 types:

Variables that you define in the first prompt, and can refer to throughout the rest of the steps

{{your_name}}, {{focus_point_1}}, etc.

Step Variables, which are basically just variables that references responses from previous steps..

{{step_1}} can be used in Step #2, to input the AI response from Step 1, and so on.

In the above example, we generate an introduction in Step 1, and then, in Step 2, we tell the AI that "We have already generated an introduction: {{step_1}}"

When you run HeroML, it won’t actually see these variables (the double-curly brackets), it will always replace them with the real values, just like the example in the video above!

Please don’t hesitate to ask any questions, about HeroML or anything else in relation to this.

Free Library of HeroML Workflows

I have spent thousands of dollars (from OpenAI Grant money, so do not worry, this did not make me broke) to test and create a tonne (over 1000+) workflows & examples for most industries (even ridiculous ones). They too are open-source, and can be found here:

Github Repo of 1000+ HeroML Workflows

However, the Repo allows you or any contributor to make changes to these workflows (the .heroml) files, and when those changes are approved, they will automatically be merged online.

For example, if you make an edit to this blog post workflow, after changes are approved, the changes will be applied to this deployed version.

There are thousands of workflows in the Repo, but they are just examples. The best workflows are ones you create for your specific needs.

How to run HeroML

Online Playground

There are currently two ways to run HeroML, the first one is running it on Hero, for example, if you want to run the blog post example I linked above, you would simply fill out the dynamic variables, here:

0:00

0:10

Example of hero app playground

This method has a setback, it’s free (if you keep making new accounts so you don’t have to pay), and the model is gpt-3.5 turbo.. I’m thinking of either adding GPT4, OR allow you to use your OWN OpenAI keys, that’s up to you.

Also, I’m rate limited because I don’t have any friends in OpenAI, so the API token I’m using is very restricted, why might mean if a bunch of you try, it won’t work too well, which is why for now, I recommend the HeroML CLI (in your terminal), since you can use your own token! (I recommend GPT-4)

My favorite method is the one below, since you have full control.

Local Machine with own OpenAI Key

I have built a HeroML compiler in Node.js that you can run in your terminal. This page has a bunch of documentation.

Running HeroML example and Output

Here’s an example of how to run it and what do expect.

This is the script

r/ChatGPT - Advanced Library of 1000+ free GPT Workflows (Part V) with HeroML - To Replace most "AI" Apps

simple HeroML script to generate colors, and then people’s names for each color.

This is how quick it is to run these scripts (based on how many steps):

0:00

0:19

using HeroML CLI with your own OpenAI Key

And this is the output (In markdown) that it will generate. (it will also generate a structured JSON if you want to clone the whole repo and build a custom solution)

Output in markdown, first line is response of first step, and then the list is response from second step. You can get desired output by writing better prompts 😊

Conclusion

Okay, that was a hefty post. I’m not sure if you guys will care about a solution like this, but I’m confident that it’s one of the better alternatives to what seems to be an AI-rug pull. I very much doubt that most of these “new AI” apps will survive very long if they don’t allow workflow customization, and if they don’t make those workflows transparent.

I also understand that the audience here is split between technical and non-technical, so as explained above, there are both technical examples, and non-technical deployed playgrounds.

Here’s a table of some of the (1000+) workflows you can play with (here’s the full list & repo):

Github Workflow Link is where to clone the app, or make edits to the workflow for the community.

Deployed Hero Playground is where you can view the deployed version of the link, and test it out. This is restricted to GPT3.5 Turbo, I’m considering allowing you to use your own tokens, would love to know if you’d like this solution instead of using the Hero CLI, so you can share and edit responses online.

Yes, I generated all the names with AI ✨, who wouldn’t?

That’s a wrap.

Thank you for all your support in my last few posts ❣️

I’ve worked pretty exclusively on this project for the last 2 months, and hope that it’s at least helpful to a handful of people. I built it so that even If I disappear tomorrow, it can still be built upon and contributed to by others. Someone even made a python compiler for those who want to use python!

I’m happy to answer questions, make tutorial videos, write more documentation, or fricken stream and make live scripts based on what you guys want to see. I’m obviously overly obsessed with this, and hope you’ve enjoyed this post!

This project is young, the workflows are new and basic, but I won’t pretend to be a professional in all of these industries, but you may be*! So your contribution to these workflows (whichever whose industries you are proficient in) are what can make them unbelievably useful for someone else.*

Have a wonderful day, and open-source all the friggin way 😇

How ChatGPT and other AI tools are helping workers make more money

Generative AI tools like ChatGPT is helping workers make more money by automating time-consuming tasks and creating new income streams and full-time jobs.

Universal Music collaborates with Google on AI song licensing LINK

Universal Music Group is negotiating with Google to license artists’ voices and melodies for AI-generated songs, with Warner Music also participating.
Artists could opt out of the system, but the move could allow fans to create deepfakes of their favorite musicians.
While this might be lucrative for record labels, it poses challenges for artists who want to keep their voices free from AI-cloning.

AI’s role in reducing airlines’ contrail climate impact LINK

Contrails from airplanes trap heat in Earth’s atmosphere, leading to a net warming effect.
Pilots at American used Google’s AI predictions and Breakthrough Energy’s models to choose altitudes less likely to produce contrails.
After 70 test flights, satellite imagery revealed a 54% reduction in contrails, suggesting commercial flights can lessen their environmental impact.

Anthropic’s Claude Instant 1.2- Faster and safer LLM

Why does this matter?

It looks like Claude Instant 1.2 is Anthropic’s safest AI model. However, it is an entry-level model intended to compete with similar offerings from OpenAI as well as startups such as Cohere. But with enhanced safety, skills, and context length same as Claude 2 (100K tokens), it can perhaps bring Anthropic a step closer to knowing how to challenge ChatGPT’s supremacy.

Source

Google attempts to answer if LLMs generalize or memorize

Why does this matter?

While there is no definitive conclusion from the research, it highlights the somewhat mysterious behavior of deep learning models, especially around the balance between memorization and generalization. It is also one step closer to understanding the exact dynamics of when and why certain models transition between these (and possibly back again).

Source

White House launches AI-based contest to secure government systems from hacks

The Competition

Teams compete to best secure vital software systems from cyber risks.
Up to 20 teams advance from qualifiers to win $2 million each at DEF CON 2024.
Finalists eligible for more prizes, including $4 million top prize at DEF CON 2025.

Innovating Cybersecurity with AI

Competitors required to open source their AI systems for widespread use.
Collaboration from AI leaders like Anthropic, Google, Microsoft, and OpenAI.
Aims to push boundaries of AI for national cyber defense.

Previous Government Hacking Contests

Similar to 2014 DARPA Cyber Grand Challenge to develop automated cybersecurity.
Various prizes offered to drive innovation through competition.
Hopes AI can keep defense ahead of evolving threats.

The U.S. launched a $20 million AI hacking challenge to incentivize developing AI cybersecurity to protect critical infrastructure. It aims to push AI capabilities for national defense through collaboration and competition.

What Else Is Happening in AI on August 10th 2023

Amazon is testing a tool that uses AI to help sellers write descriptions for listings Link

Spotify and Patreon integrated, allowing Patreon-exclusive audio on Spotify, benefiting podcasters and sidestepping Spotify’s aversion to RSS feeds. LINK

National-level data doesn’t support negative wellbeing impacts of Facebook saturation, but overlooks specific vulnerable groups and children. LINK

Lyft aims to eliminate surge pricing due to abundant driver supply and rider dissatisfaction, resulting in reduced revenue but increased user numbers. LINK

AI-generated books falsely using Jane Friedman’s name surfaced on Amazon and Goodreads, sparking concerns over copyright and author identity verification. LINK

DARPA’s AI Cyber Challenge, supported by top tech firms, aims to enhance software security using AI, focusing on open source vulnerabilities and cyberdefense. LINK

Google research attempts to answer whether ML models memorize or generalize
– While LLMs appear to have a rich understanding of the world, how do we know they’re not simply regurgitating from training data? In this new research, Google explores the phenomenon called grokking to learn more about how models learn.

IBM plans to make Meta’s Llama 2 available within its watsonx
– It will host Llama 2-chat 70B model in the watsonx.ai studio, with early access available to select clients and partners. This will build on IBM’s collaboration with Meta on open innovation for AI, including work with open-source projects developed by Meta. This will also support IBM’s strategy of offering both third-party and its own AI models.

Amazon is testing a tool that uses AI to help sellers write product descriptions
– This will be one of the first examples of Amazon integrating LLMs into its e-commerce business.

White House launches AI-based contest to secure government systems from hacks
– It has launched a $27M cyber contest to spur the use of AI to find and fix security flaws in the US government infrastructure in the face of growing use of the technology by hackers for malicious purposes.

Microsoft partners with Aptos blockchain to marry AI and web3
– The collaboration allows Microsoft’s AI models to be trained using Aptos’ verified blockchain information.

OpenAI has a new update for free ChatGPT users
– Custom instructions are now available to ChatGPT users on the free plan, except for in the EU & UK, where it will be rolling out soon.

Google’s redesigned Arts & Culture app includes AI-based features
– A “Poem Postcards” feature that lets users send AI-generated postcards to friends. Other features include a new Play tab, a TikTok-like “Inspire” feed, and more.

Latest Tech News and Trends on August 10th 2023

A.I. can identify keystrokes by just the sound of your typing and steal information with 95% accuracy, new research shows. Researchers had artificial intelligence listen to the sounds of typing through a phone and over Zoom, with eerie results. Link

YouTube is disabling links on Shorts to cut down on spam

YouTube Shorts is making a change to address the growing issues around spam on the short-form video platform.

Text Messages Showing Phone Numbers Instead of Names on Android? How to Fix It

Have your text messaging apps stopped showing the name of the message sender? Here’s what you can do to fix the problem.

Supermarket AI meal planner app suggests recipe that would create chlorine gas. Link

Latest World USA Sport News on August 10th 2023

In pics: Deadly wildfires wreak havoc on Hawaii’s Maui island

Wind-driven wildfires have claimed a minimum of six lives on Hawaii’s Maui island, causing extensive damage to hundreds of structures. Maui Mayor Richard Bissen Jr confirmed the fatalities, while rescue efforts continue.

Lawsuit filed after baby allegedly decapitated during delivery at metro Atlanta hospital. Link

6-year-old boy who shot his Virginia teacher said “I shot that b**** dead,” unsealed records show. Link

Paper exams, chatbot bans: Colleges seek to ‘ChatGPT-proof’ assignments. Link

36 confirmed dead: federal and local officials give updates on Maui fires. Link

Red Cross ends blood-donation restrictions that singled out gay and bisexual men. Link

Harry Kane transfer news: Bayern Munich agree deal in principle with Tottenham for striker. Link

Unraveling August 2023: August 09th 2023

Latest AI News and Trends on August 09th 2023

Step by Step Software Design and Code Generation through GPT

If you have used ChatGPT, or GPT in general, for software design and code generation, you might have noticed that for larger or trickier codes, it skips a lot of the implementation or misunderstands the design. That’s where tools like GPT Engineer and Aider come to help. However those tools for the most part keep the user out of the loop during the design. To explore the design space with GPT and be involved in decision making, you can use GPT-Synthesizer. GPT-synthesizer is a free and open-source tool which you can use for personal or commercial purposes. It uses LangChain to efficiently process larger codebases: https://github.com/RoboCoachTechnologies/GPT-Synthesizer

Collaboratively implement an entire software project with the help of an AI.

GPT-Synthesizer walks you through the problem statement and explores the design space with you through a carefully moderated interview process. If you have no idea where to start and how to describe your software project, GPT Synthesizer can be your best friend.

What makes GPT Synthesizer unique?

The design philosophy of GPT Synthesizer is rooted in the core, and rather contrarian, belief that a single prompt is not enough to build a complete codebase for a complex software. This is mainly due to the fact that, even in the presence of powerful LLMs, there are still many crucial details in the design specification which cannot be effectively captured in a single prompt. Attempting to include every bit of detail in a single prompt, if not impossible, would cause losing efficiency of the LLM engine. Powered by LangChain, GPT Synthesizer captures the design specification, step by step, through an AI-directed dialogue that explores the design space with the user.

GPT Synthesizer interprets the initial prompt as a high-level description of a programming task. Then, through a process, which we name “prompt synthesis”, GPT Synthesizer compiles the initial prompt into multiple program components that the user might need for implementation. This step essentially turns ‘unknown unknowns’ into ‘known unknowns’, which can be very helpful for novice programmers who want to understand an overall flow of their desired implementation. Next, GPT Synthesizer and the user collaboratively find out the design details that will be used in the implementation of each program component.

Different users might prefer different levels of interactivity depending on their unique skill set, their level of expertise, as well as the complexity of the task at hand. GPT Synthesizer distinguishes itself from other LLM-based code generation tools by finding the right balance between user participation and AI autonomy.

Installation

pip install gpt-synthesizer
For development:
- git clone https://github.com/RoboCoachTechnologies/GPT-Synthesizer.git
- cd gpt-synthesizer
- pip install -e .

Usage

GPT Sythesizer is easy to use. It provides you with an intuitive AI assistant in your command-line interface. See our demo for an example of using GPT Synthesizer.

GPT Synthesizer uses OpenAI’s gpt-3.5-turbo-16k as the default LLM.

Setup your OpenAI API key: export OPENAI_API_KEY=[your api key]

Run:

Start GPT Synthesizer by typing gpt-synthesizer in the terminal.
Briefly describe your programming task and the implementation language:
- Programming task: *I want to implement an edge detection method from live camera feed.*
- Programming language: *python*
GPT Synthesizer will analyze your task and suggest a set of components needed for the implementation.
- You can add more components by listing them in quotation marks: Components to be added: *Add 'component 1: what component 1 does', 'component 2: what component 2 does', and 'component 3: what component 3 does' to the list of components.*
- You can remove any redundant component in a similar manner: Components to be removed: *Remove 'component 1' and 'component 2' from the list of components.*
After you are done with modifying the component list, GPT Synthsizer will start asking questions in order to find all the details needed for implementing each component.
When GPT Synthesizer learns about your specific requirements for each component, it will write the code for you!
You can find the implementation in the workspace directory.

AI Is Building Highly Effective Antibodies That Humans Can’t Even Imagine

Robots, computers, and algorithms are hunting for potential new therapies in ways humans can’t—by processing huge volumes of data and building previously unimagined molecules.

AT AN OLD biscuit factory in South London, giant mixers and industrial ovens have been replaced by robotic arms, incubators, and DNA sequencing machines. James Field and his company LabGenius aren’t making sweet treats; they’re cooking up a revolutionary, AI-powered approach to engineering new medical antibodies.

In nature, antibodies are the body’s response to disease and serve as the immune system’s front-line troops. They’re strands of protein that are specially shaped to stick to foreign invaders so that they can be flushed from the system. Since the 1980s, pharmaceutical companies have been making synthetic antibodies to treat diseases like cancer, and to reduce the chance of transplanted organs being rejected.

But designing these antibodies is a slow process for humans—protein designers must wade through the millions of potential combinations of amino acids to find the ones that will fold together in exactly the right way, and then test them all experimentally, tweaking some variables to improve some characteristics of the treatment while hoping that doesn’t make it worse in other ways. “If you want to create a new therapeutic antibody, somewhere in this infinite space of potential molecules sits the molecule you want to find,” says Field, the founder and CEO of LabGenius. Read more

NVIDIA Releases Biggest AI Breakthroughs

– Nvidia announced a new chip GH200, designed to run AI models. It has the same GPU as the H100, Nvidia’s current highest-end AI chip, but pairs it with 141 gigabytes of cutting-edge memory and a 72-core ARM central processor. This processor is designed for the scale-out of the world’s data centers.

– NVIDIA has announced new frameworks, resources, and services to accelerate the adoption of Universal Scene Description (USD), known as OpenUSD. Through its Omniverse platform and a range of technologies and APIs, including ChatUSD and RunUSD, NVIDIA aims to advance the development of OpenUSD, a 3D framework that enables interoperability between software tools and data types for creating virtual worlds.

– NVIDIA has introduced AI Workbench, a developer toolkit that simplifies the creation, testing, and customization of pretrained generative AI models. The toolkit allows developers to scale these models to various platforms, including PCs, workstations, enterprise data centers, public clouds, and NVIDIA DGX Cloud. This will speed up the adoption of custom generative AI for enterprises worldwide.

– NVIDIA and Hugging Face have partnered to bring generative AI supercomputing to developers. The integration of NVIDIA DGX Cloud into the Hugging Face platform will accelerate the training and tuning of large language models (LLMs) and make it easier to customize models for various industries. This partnership aims to connect millions of developers to powerful AI tools, enabling them to build advanced AI applications more efficiently.

75% of Organizations Worldwide Set to Ban ChatGPT and Generative AI Apps on Work Devices

Although ChatGPT currently has over 100 million users in June 2023, the concerns for its security and trustworthiness grow. AI cybersecurity pioneer, BlackBerry, calls for caution with consumer-grade Generative AI tools in the workplace.

Some impressive figures

– 75% of global organizations are either implementing or contemplating bans on ChatGPT and other Generative AI applications in their workplaces.

– 61% view these measures as long-term or permanent due to concerns over data security, privacy, and corporate reputation.

– 83% believe unsecured apps present a cybersecurity threat to their corporate IT systems.

– 80% of IT decision-makers believe organizations have the right to control applications used for business.

– 74% feel that such bans indicate “excessive control” over corporate and BYO devices.

As AI tools get better and rules are set, companies might change their rules. It’s important to have tools to watch and manage how these AI tools are used at work.

Research was conducted in June/July 2023 by OnePoll on behalf of BlackBerry, into 2,000 IT Decision Makers across North America (USA and Canada), Europe (UK, France, Germany and the Netherlands), Japan and Australia.

Google launches Project IDX, an AI-enabled browser-based dev environment.

– For building web and multiplatform apps. It currently supports frameworks like Angular, Flutter, Next.js, React, Svelte, and Vue, and languages like JavaScript and Dart. The project is based on Visual Studio Code and integrates with Codey, Google’s PaLM 2-based foundation model for programming tasks.
– IDX offers features such as smart code completion, a chatbot for coding assistance, and the ability to add contextual code actions. Google plans to add support for more languages like Python and Go in the future.

Why does this matter?

By incorporating models like Codey, IDX offers tools like Studio Bot and Duet, Google IDX might revolutionize coding experiences in Android Studio and Google Cloud. Smart code completion, contextual actions, and an assistive chatbot can empower developers to write code more efficiently and maintain high standards.

Stability AI has released StableCode, an LLM generative AI product for coding.

– It aims to assist programmers in their daily work and provide a learning tool for new developers. StableCode uses three different models to enhance coding efficiency. The base model was trained on various programming languages, including Python, Go, Java, and more. It was then further trained on 560B tokens of code.

Hugging face launches tools for running LLMs on Apple devices.

– Hugging face have released a guide and alpha libraries/tools to support developers in running LLM models like Llama 2 on their Macs using Core ML.

Google AI is helping Airlines to reduce mitigate the climate impact of contrails.

– Google AI, American Airlines, and Breakthrough Energy collaborated to use AI and data analysis to develop contrail forecast maps. These maps help pilots choose routes that minimize contrail formation, reducing the climate impact of flights.

D-ID and ElevenLabs have announced a partnership to bring premium voices to D-ID’s

Creative RealityTM studio. This collaboration will allow users to create videos with more natural speech. The new features simplify the process and enable subscribers to add high-quality synthetic voices to their videos with one click. They offer AI-generated customized video narrators in 119 languages, making video creation easier and more cost-effective.

Google and Universal Music Group are in talks to license artists’ melodies and vocals for an AI-generated music tool.

– The tool would allow users to create AI-generated music using an artist’s voice, lyrics, or sounds. Copyright holders would be paid for the right to create the music, and artists would have the option to opt in.

Disney has formed a task force to explore the applications of AI across its entertainment conglomerate, despite the ongoing Hollywood writers’ strike.

– Disney currently has 11 job openings that require expertise in AI or machine learning, covering various departments such as Walt Disney Studios, engineering, theme parks, television, and advertising. The advertising team, in particular, is focused on building an AI-powered ad system for the future.

AI researchers claim 93% accuracy in detecting keystrokes over Zoom audio LINK

Researchers achieved over 90% accuracy in interpreting remote keystrokes by recording them and training a deep learning model on the unique sound profiles of individual keys.
Laptops, especially in quieter public places, are vulnerable to this kind of attack due to their consistent and non-modular keyboard acoustic profiles.
Previous methods achieved 74.3% to 91.7% accuracy in VoIP calls; the current research benefits from recent advancements in neural network technology, like self-attention layers, to enhance audio side channel attacks.

Researchers at the Massachusetts Institute of Technology (MIT) and the Dana-Farber Cancer Institute have discovered that the use of artificial intelligence (AI) could make it easier to determine the sites of origin for enigmatic cancers and enable doctors to choose more targeted treatments.[1]

Meta disbands protein-folding team in shift towards commercial AI.[2]

OpenAI has introduced GPTBot, a web crawler to improve AI models. GPTBot scrupulously filters out data sources that violate privacy and other policies.[3]

Disney has created a task force to study artificial intelligence and how it can be applied across the entertainment conglomerate, even as Hollywood writers and actors battle to limit the industry’s exploitation of the technology.[4]

Trending AI Tools

ChattyDocs: Create AI experts on selected topics using your documents. AI answers your or customers’ questions.
Social Bellow: AI-powered prompts to revolutionize content. Overcome writer’s block, craft engaging narratives, and personalize chatbots.
Neighborbrite Vision: Transform your yard with our AI app. Choose Desert, Modern, or Cottage vibes and get a custom design.
HomeStyler AI: Experience the future of interior design. Upload a picture and watch your space transform into a designer masterpiece.
CREaiD AI: Commercial Real Estate’s 1st AI chatbot. Connect with Verified Lenders and Primary Contacts, and get real-time insights.
Taylor AI: Fine-tune open-source LLMs in minutes. Focus on experimentation & building better models, not digging through libraries.
StudentAI: Every tool a student needs to save time and excel academically. From presentations to exams, we’ve got it covered.
liteLLM: Simplify using LLM APIs from OpenAI, Azure, Cohere, Anthropic, Replicate, and Google. Call all LLM APIs using chatGPT format.

Latest Tech News and Trends on August 09th 2023

GM’s EVs to offer vehicle-to-home charging by 2026 LINK

GM is introducing vehicle-to-home (V2H) bidirectional charging technology to its Ultium-based electric vehicles by 2026, allowing them to be used as backup power sources for homes.
The first models to feature this technology include the 2024 Chevrolet Silverado EV RST, GMC Sierra EV Denali Edition 1, and Cadillac Lyriq, among others.
This initiative is under GM Energy, a new business unit from GM launched in 2022, which offers various energy solutions including stationary storage and solar energy partnerships.

Norway imposes $100k daily fines on Meta over data harvesting LINK

Meta faces a new penalty from Norwegian regulators, amounting to 1 MILLION crowns (around $100,000) per day starting from August 14 due to privacy breaches.
Norway had previously announced a temporary ban on behavioural ads on Facebook and Instagram, and warned Meta of potential fines if violations were not addressed.
Despite Meta’s recent pledge to obtain EU user consent for personalized ads, Datatilsynet remains unimpressed and plans to continue daily fines until at least November 3, with the possibility of making them permanent.

The famously overworked visual effects workers behind the Marvel movies just voted to join a union. Link

Banks hit with $549 million in fines for use of Signal, WhatsApp to evade regulators’ reach. Link

Author discovers AI-generated counterfeit books written in her name on Amazon. Link

Latest World News – Sport News August 09th 2023

Lil Tay Dead: Internet Rapper’s Death Is ‘Under Investigation’ Variety

Claire Hope, the Canadian pre-teen internet rapper known as Lil Tay, has died, her management confirmed to Variety.

Hospitals overwhelmed as Maui wildfires rage; some residents flee

Wind-whipped wildfires in Hawaii forced hundreds of evacuations, overwhelmed hospitals in Maui and even sent some residents fleeing into the ocean.

9-year-old girl fatally shot by neighbor in front of her father after buying ice cream and riding her scooter, legal document says. Link

5 white nationalists sue Seattle man for allegedly leaking their identities. Link

Tory Lanez sentenced to 10 years for shooting Megan Thee Stallion in the foot. Link

Teenage cousin of Uvalde school shooter is arrested, accused of threatening to ‘do the same thing’ to a school. Link

Emergency rooms becoming the ‘dumping ground’ for mentally ill who often wait days for help. Link

Unraveling August 2023: August 08th 2023

Latest AI News and Trends on August 08th 2023

How to Leverage No-Code + AI to start a business with $0

Start your Business with $0

Need a Desinger — Use Canva
Need a Website — Use Carrd
Need Sales — Use Gumroad
Need a Writer — Use Claude
Need a CRM — Use Notion
Need Marketing — Use Buffer
Need Videos — Use CapCut

Remember, you don’t need to have the setup before starting a business Many successful businesses started w/ a notebook and an Excel sheet.

Leverage ChatGPT as Your Personal Finance Advisor

Are you an online business owner juggling numbers and financial decisions? With ChatGPT, you can gain insights and advice on managing your business’s finances more effectively.

Try the prompt below:

Act as a certified financial advisor. I'm the owner of an online business that's been operational for two years. Here's a simplified snapshot of my current financial situation:

Monthly revenue: $10,000
Monthly operating expenses (inventory, software, marketing, etc.): $6,000
Monthly net income: $4,000
Business savings: $20,000
Personal savings: $10,000

I'm looking to increase my savings, reduce expenses, and grow my business. Please provide a comprehensive financial plan which should include budgeting tips, investment advice, and strategies for improving my overall financial health. Remember to make your suggestions based on the best practices in personal finance and business finance management.

This prompt can be adjusted according to your unique financial circumstances. For example, if you’re more concerned about debt management, retirement planning, or making significant business investments, modify your request accordingly.

Note: ChatGPT can provide a helpful start in managing your finances, but it can’t be completely relied upon for professional financial advices. In addition, please be aware that sharing sensitive financial information online carries its own risks, even in a simulated conversation with AI.

What is Boosting in Machine Learning?

Boosting in machine learning is a technique that trains algorithms to work better together, improving accuracy and reducing bias. Learn how boosting works.

Deep Learning Model Detects Diabetes Using Routine Chest Radiographs

New deep learning tool leverages chest x-rays and electronic health record data to flag elevated diabetes risk up to three years before diagnosis.

OpenAI launches a web crawler to train ChatGPT

Called GPTBot, the crawler will comb through the internet to train and enhance AI’s capabilities. It can be identified by the following user agent and string.

Moreover, OpenAI also revealed how websites can prevent GPTBot from accessing their sites, either partially or by opting out entirely.

Why does it matter?

GPTBot can help AI models become more accurate and improve their general capabilities and safety. However, OpenAI has often landed in hot waters for how it collects data. Blocking the GPTBot may be OpenAI’s first step to allow internet users to opt out of having their data used for training its LLMs.

(Source)

AI deep fake audios are getting scarily realistic

Why does this matter?

As speech synthesis AI systems improve, it will become more difficult for humans to catch speech deepfakes. The study suggests the need for automated detectors to mitigate a human listener’s weaknesses. It also emphasizes that expanding fact-checking and detecting tools is a significant way to protect against deep fake threats by AI.

Source

Microsoft’s many AI monetization plans

Microsoft has announced new Azure AI infrastructure advancements and availability to bring its customer closer to the transformative power of generative AI.

Azure OpenAI Service goes global: OpenAI’s most advanced models, including GPT-4 and GPT-35-Turbo, will now be available in multiple new regions and locations.
General availability of ND H100 v5 VMs for unprecedented AI processing and scale: -It also announced general availability of the ND H100 v5 Virtual Machine series, featuring the latest NVIDIA H100 Tensor Core GPUs and low-latency networking, propelling businesses into a new era of AI applications.

Why does it matter?

These enhancements will allow more customers to leverage the capabilities of generative AI, driving innovation and transformation across various industries. It will also empower their businesses with greater computational power with significantly faster AI model performance.

(Source)

Understanding the Fuzzy Accuracy of Generative AI

Erroneous results from ChatGPT seem to be leading many scholars and pundits to dismiss it as useless or even dangerous. That might make sense at first glance, but only if we see it as just another type of search engine.

In this article, Mark Humphries suggests if you focus solely on its errors, you need to think about it in a different way. The article discusses in detail how chatbots are different from search engines (even though they seem similar). It also points out why tools like ChatGPT were not intended to be used as search engines and what exactly makes them revolutionary.

Why does this matter?

In an era when we are racing to adopt generative AI, understanding the usefulness of models like ChatGPT despite their tendency to hallucinate sometimes requires examining how they work during these instances and why.

Source

Google Search launched AI-powered grammar checker LINK

Google has introduced an AI-powered grammar check feature in its search bar, which is currently available only in English.
To use the feature, users can enter a sentence or phrase into Google Search, followed by “grammar check”, “check grammar” or “grammar checker”, and Google will indicate if the phrase is grammatically correct or suggest a correction if needed.
The grammar check tool is accessible on both Google desktop and mobile platforms.

Zoom can now train its AI using customer data LINK

Zoom’s updated Terms of Service in March gave the company the right to train AI on user data, but clarified in a recent blog post that they will not use audio, video, or chat content for AI training without customer consent.
The new terms sparked concern as Zoom customers must either agree to data use or leave a meeting if a call starts with generative AI features enabled; Zoom stated that customers decide whether to enable these AI features and share data for product improvement.
Zoom’s privacy track record is questionable, with a history of issues such as providing less secure encryption than claimed and sharing user data with Google and Facebook, leading to an $85 million settlement in 2021.

USEFUL AI TOOLS

Text Blaze: Eliminate repetitive typing with this Chrome extension that allows users to create customizable templates and shortcuts.
MakeLanding: Generate landing pages with ease using this AI-powered platform, which creates pages based on user-provided project descriptions, complete with custom copy, logos, and illustrations.
YouAI: Create and share your own AI tools for various purposes, including content creation, language learning, and professional services.
Storyboard AI: Reduce the time needed to create concepts, scripts, and full storyboards for video agencies and creators using AI.
AI Tools Database: Explore a comprehensive Notion database featuring 1350 useful AI tools curated by The Intelligo.

It’s already way beyond what humans can do’: will AI wipe out architects?

Latest Tech News and Trends on August 08th 2023

Netflix launches a game controller app for playing games on your TV

After last fall signaling its intentions to expand into cloud gaming, Netflix today launched a new app that will soon allow subscribers to play games on their TV. The app, dubbed “Netflix Game Controller,” lets you use your phone as a controller after pairing it with your TV in order to play…

X’s takeover of @music handle hints toward possible music plans

X, the company formerly known as Twitter, appears to be hinting toward plans to move deeper into the music industry or collaborate with artists and labels with its recent takeover of the @music handle on the social network. The account was originally operated by software developer Jeremy Vaught…

Canadian media approaching Competition Bureau to probe Meta’s news blocking

Media companies including the Canadian Broadcasting Corp and Radio-Canada are requesting Competition Bureau to investigate Meta Platforms’ decision to block news content on its platforms in the country, the CBC reported on Tuesday.

Novo weight-loss drug Wegovy shows heart benefit in trial

Experts say the new data could lead the US, where Wegovy costs $1,300 a month, to reassess that.

Google bans popular battery-draining Android apps with urgent delete warning

Millions of Android phone users are being urged to check their mobile devices, after apps were discovered to drain batteries by running constant background adverts when the screen is off…

McAfee named TV/DMB Player, Music Downloader, News, and Calendar applications as some of the popular applications compromised. The adverts in these apps don’t secretly start popping up until a few weeks after an initial installation – which makes spotting the scam far more difficult.

McAfee is in-turn urging users to take care and conduct thorough research before downloading any new apps onto their mobile devices – scouring the permissions before hitting the big green install button. It’s also a wise move to check the performance of a device after installing new software – keeping an eye out for indicators like rapidly draining battery-life or slower operating systems.

How to Get ChatGPT on iPhone and Android

ChatGPT is the most popular AI chatbot on the planet, and it’s not hard to see why. With its powerful natural language capabilities, remarkable versatility, and knack for engaging in […]

How to Check Battery Health on Android?

Want to Know How many Battery Cycles are Left on Android Device. Here is the Simple Guide to Check Battery Health on Android.

Check Battery Details Using the Settings App

In some devices, from the Settings app, you can check the battery health of the Android phone.

Open the Settings app > tap on Battery.
Tap on View Detailed Usage.

The above steps may vary a little depending on the model of the phone that you are using.

Dial a USSD Code to Know the Battery Health of your Android Device

Unstructured Supplementary Service Data, often abbreviated as USSD codes, are certain configurations of numerics and special symbols that return certain helpful information about your phone when dialed using the phone app.

To know the battery health of an Android device, there is a specific USSD code.

Launch the phone app and go to the phone keypad.
Dial the code *#*#4636#*#*.
Press the call button.

NOTE: The above USSD code may not work on all Android devices. However, you can try and see if it works or not. The above code is quite safe to dial and has no effect on your device or its data.

Install a Third-Party Battery Health Checker

It is always difficult to find a trusted third-party app to check the device’s battery health. There is the app AccuBattery which I tried on my Android device. It is quite simple to use and doesn’t ask for unnecessary permissions on your device.

Install AccuBattery from the Play Store.
Launch the app, and it will start calibrating your device’s battery.
Tap on Finish.
Let the battery charge drop to 15 percent.
Charge the Battery completely.
Next, tap on the tab Health to know your device’s battery health.

Check the Battery Health of Samsung Smartphones

If you have a Samsung smartphone, you can install a specific app called Samsung Members on the device. Using this app, you can deduce the overall battery usage and health of the battery present in the Samsung smartphone.

install Samsung Members app

Install the Samsung Members app from the Play Store.
Launch the app > tap on Discover > tap on Phone Diagnostics.
Tap on Battery Status to run a quick test.
The battery status will display on the screen.

How to Increase the Battery Life of a Phone?

Here are a couple of tips to increase your smartphone’s battery life.

Always use the official power adapter of the phone. If the power adapter got damaged, then get another official charger. Avoid using cheap third-party adapters.
Always charge your device up to 80 Percent. Also, set the device on charge when the battery level is around 30 -35 percent.
Never set your phone on charging and do activities like playing games or making phone calls. That will cause overheating and, in the long run, will damage the Battery’s health.
Use power saver mode whenever possible to avoid losing battery power.

Promote Efficient Battery Performance on Android Devices

Now, I hope you know the different methods to check the battery health status on your Android device. Also, follow the above tips to manage the battery health and increase it for more prolonged use on your phone.

Nuclear fusion scientists achieve net energy gain LINK

U.S. scientists at the Lawrence Livermore National Laboratory in California have successfully recreated a fusion ignition reaction, yielding an even higher energy gain than the initial experiment announced in December.
The fusion experiment required 2 megajoules of energy and produced 3 megajoules, indicating a significant milestone where fusion reactions output more energy than they consume, traditionally a major challenge in fusion research.
Despite these successes, the development of fusion power stations is still likely decades away, but these breakthroughs show potential for the development of clean, laser-induced fusion energy on Earth.

PayPal launches first major U.S. dollar-backed stablecoin LINK

PayPal has announced the rollout of its stablecoin, PayPal USD (PYUSD), issued by Paxos Trust Company and backed by U.S. dollar deposits and similar cash equivalents, marking a first for a major U.S. financial institution.
Eligible U.S. PayPal customers can transfer PYUSD between PayPal and compatible external wallets, use it for person-to-person payments and purchases, and convert other supported cryptocurrencies to and from PYUSD.
As an ERC-20 token on the Ethereum blockchain, PYUSD is available to a growing community of external developers, wallets, and web3 applications, and Paxos will publish monthly reports detailing the assets backing PYUSD.

$5 billion Google lawsuit over ‘incognito mode’ tracking moves a step closer to trial

Judge Yvonne Gonzalez-Rogers denied Google’s push for a summary judgment in a lawsuit over the way it tracked internet activity even after users switched to “Incognito mode.” Link

Apple will drop support for older iPhones. What does it mean for you?

The iPhone X, iPhone 8, iPhone 8 Plus, and older models won’t be eligible for iOS 17 updates. But any iPhone capable of running iOS 12 will still get Apple’s security updates – for two years.

How to Share Passwords With Family and Friends on Your iPhone

Apple lets you share your account passwords with a bunch of trustworthy people in a group using your iPhone. We’ll walk you through the process.

The ability to share passwords with a group of people you trust is one of the best hidden features of iOS 17. With shared passwords, you get the convenience of sharing your online accounts with multiple users while minimizing the risk of data theft.

How to Add People to Your Shared Password Group on an iPhone

When you create a new shared password group, you have complete control over the passwords you share with other people in the group. You can add or remove members or even delete the entire group anytime.

This feature can come in handy if you already use Family Sharing on your iPhone to share apps and subscriptions, as not all services support this feature, and you might need to share credentials with your family members.

Here’s how you can make a new shared password group and add people to it:

Launch the Settings app on your iPhone and select Passwords.
Enter your passcode or unlock it with Face ID for verification.
Tap the blue Get Started button and hit Continue
Enter the name of the group and tap Add People.
Search the name of the person you want to invite and tap Add in the top-right corner.
Tap Create and choose the passwords and passkeys you want to share.
Press the Move button.
After this, you’ll get a prompt asking if you want to notify the person. If so, press the Notify via Message and send an invitation. Else tap Not now.
Once you’ve successfully created a shared password group, you can easily add more people whenever you like. Go to your shared group, tap Manage, and repeat the steps you followed to add your contacts.

How to Add Passwords to Your Shared Group on an iPhone

If you want to add more passwords to your shared group, here’s what you need to do:

Go to Settings > Passwords and select the group.
Tap the plus (+) icon in the top-right corner and select Move Passwords to Group. You can also manually add a new password to the group by selecting New Password.

Apple is known for easy interoperability between its devices. That’s why many people say Apple is a walled garden—once you’re in the Apple ecosystem, it’s tough to get out because you’ll miss the convenience of owning Apple products.

For instance, it’s easy to share Wi-Fi passwords on your iPhone with another iPhone or even another Apple device like your Mac. As long as you have each other’s iCloud email addresses in the Contacts app, you can just bring your iPhone close to other Apple devices, and the one connected to Wi-Fi will automatically ask if you want to share the password. Here are the steps:

If the device that needs to connect is an iPhone or iPad, go to Settings > Wi-Fi. If it’s a Mac, go to System Settings > Wi-Fi. Then, tap on the desired network.
Now, bring the Wi-Fi-connected iPhone close to the device that needs to connect.
A Wi-Fi Password prompt will then appear on the Wi-Fi-connected iPhone, asking if its owner wants to share the Wi-Fi password.
Tap Share Password. Your iPhone will get the password and connect to the Wi-Fi network.

Innocent pregnant woman jailed amid faulty facial recognition trend.

In every reported case where police mistakenly arrested someone using facial recognition, that person has been Black.

There is a great documentary on Netflix. Titled “Coded Bias” that discusses many of the challenges related to race and AI image recognition.

This is a historic problem with image recognition, it does not work as well on darker complexions.

This is why we need to be careful when using these tools

Adding link to documentary: https://www.netflix.com/title/81328723

Detroit woman sues city after being falsely arrested while pregnant due to facial recognition technology
by u/Sorin61 in technology

In California, Car Buyers Are Choosing Electricity Over Gasoline in Record Numbers

Dungeons & Dragons tells illustrators to stop using AI to generate artwork for fantasy franchise

Latest World News on August 08th 2023

Trump blames Megan Rapinoe and wokeness for US Women’s World Cup exit

Hours after the US women’s national team crashed out of the World Cup, former US President Donald Trump seized the opportunity to blame the loss on star player Megan Rapinoe and the country’s “woke” path under President Joe Biden.

Large brawl in Alabama as people defend Black riverboat worker against white assailants. Link

Campbell will acquire Rao’s premium sauces parent company for $2.7 billion. Link

Texas hiker died at Utah national park while scattering father’s ashes. Link

Global child sexual abuse probe that was launched after two FBI agents were killed leads to almost 100 arrests. Link

NYC doctor sexually assaulted unconscious patients and filmed himself doing it, prosecutors say. Link

Appeals court upholds Josh Duggar’s conviction for downloading child sex abuse images. Link

Mother who was accused by Southwest of trafficking her biracial daughter files federal discrimination suit. Link

Latest Football Soccer Sport News on August 08th 2023:

Unraveling August 2023: August 07th 2023

Latest AI News and Trends on August 07th 2023

The end of ageing? A new AI is developing drugs to fight your biological clock

Using AI, scientists were able to discover molecules to tackle ageing cells in just minutes. Here’s why that’s a huge deal.

AI model can help determine where a patient’s cancer arose

The OncoNPC machine-learning model may help identify sites of origin for cancers whose origins are unknown, which could enable targeted tumor treatments.

AI facial recognition falsely identifies pregnant woman as felon

Detroit police wrongly arrested a pregnant woman based on incorrect facial recognition, the latest in a string of false identifications by law enforcement AI tools.

The Wrongful Arrest:

Porcha Woodruff was arrested for a robbery she didn’t commit due to AI facial recognition.
An 8-year-old photo led to her false identification by the AI system.
She’s now suing Detroit over the arrest that saw her jailed while pregnant.

A Systemic Issue:

At least 6 wrongful arrests linked to facial recognition AI have occurred.
All wrongly identified have been black people so far.
Critics argue it leads police to shoddy, biased investigations.

AI Accountability:

Powerful AI requires meticulous training and testing to avoid mistakes.
False arrests raise real concerns over reliance on imperfect technology.
Legal, ethical, and financial liabilities will pile up if issues persist.

TL;DR: Detroit police falsely arrested a pregnant woman based on incorrect facial recognition AI identification, prompting a lawsuit. Critics argue reliance on imperfect technology leads police to biased, shoddy investigations as wrongful arrests mount.

Source: (link)

Sam Altman concerned about AI’s impact on elections

OpenAI CEO Sam Altman expressed concerns about generative AI’s potential impacts on future elections, particularly with hyper-targeted synthetic media.
AI-generated media has already been used in American campaign ads for the 2024 election and has sometimes caused misinformation to spread.
Altman acknowledges the risks of the technology he’s helping develop and emphasizes the importance of raising awareness about its implications.

Apple bets big on AI

Why does this matter?

Source

Jupyter brings AI to notebooks

(official announcement)

(GitHub)

(Here is an example of how to use ChatGPT to generate working code within the notebook cells.)

Why does this matter?

Integrating advanced AI chat-based assistance directly into Jupyter’s environment may improve coding, summarization, error correction, and content generation tasks. And with support for leading LLMs like OpenAI, AI21, Anthropic, Cohere, and even local models, JupyterAI empowers users with a powerful toolset to streamline coding workflows and obtain accurate answers.

Source

ChatGPT’s emotional awareness is more than humans’. What?

ChatGPT outperformed humans in all categories, achieving an overall score of 85 compared to 56 for men and 59 for women. The researchers suggest that ChatGPT could be helpful in psychotherapy, cognitive training, and diagnosing mental illness. Previous studies have shown that people perceive ChatGPT’s responses as more empathetic than medical professionals.

Why does this matter?

This upgrade underscores AI’s ability to comprehend emotions and could help with therapy, mental health diagnosis, and making healthcare interactions more empathetic. This shows how AI can learn emotions and improve how it interacts with people.

Source

Promptpack: How to build a second-brain (featuring AI)

This Promptpack by Chantal Smith and Azeem Azhar explores how to build a second brain using AI-powered tools. It discusses the use of knowledge bases and the role of generative AI in research and knowledge processing. The author shares their experience using Notion as a smart knowledge repository and tools like Perplexity and Elicit to enhance search capabilities.

They also highlight ChatGPT as their favorite AI tool. The article emphasizes the importance of natural language processing and reasoning in the evolving data and knowledge management landscape.

Why does this matter?

This article explores how AI tools like Notion, Perplexity, and ChatGPT enhance knowledge management and research. Also highlights how these tools facilitate efficient information gathering, processing, and storage, emphasizing their relevance in leveraging natural language as a primary interface for data-driven reasoning.

Source

What Else Is Happening in AI on August 07th 2023

Salesforce introduces Einstein Studio to train AI models using Data Cloud.
– This new feature allows enterprises to connect and train their own AI models on proprietary data within Salesforce. Once trained, these models can be used to power various applications within Salesforce. The offering has already been tested by multiple enterprises and is now available for all users of Salesforce’s Data Cloud.

Rapper Lupe Fiasco collabs with Google for the launch of AI Tool TextFX!
– Now AI will assist rappers in their songwriting process by generating alternate meanings and phrases for chosen words. Google’s Creative Technologist, Aaron Wade, credits Fiasco with taking their vision for TextFX to another level, as he wanted a tool to explore the possibilities that can arise from words and concepts, rather than having an A.I. write lyrics for him.

Azure ChatGPT supporting GPT-4 is launched! (Link)

Salesforce introduces Einstein Studio to train AI models using Data Cloud. (Link)

White Castle wants to roll out AI-enabled voices to over 100 drive-thrus. (Link)

Rapper Lupe Fiasco collabs with Google for the launch of AI Tool TextFX! (Link)

Zoom’s new terms of service allow AI training on user content, no opt-out. (Link)

Latest Tech News and Trends on August 07th 2023

X will pay legal bills of people punished for posting on platform LINK

Elon Musk commits that his social media platform, X (formerly Twitter), will cover legal expenses for users “unfairly treated” by employers due to their site activity.
Musk’s declaration on X ensures there will be “no limits” to the financial support for legal bills.
In addition to funding legal battles, Musk promises to make these lawsuits “extremely loud” and to target the boards of directors of offending companies.

Apple explores lip-reading capabilities for Siri LINK

Apple has filed a patent for lip-reading technology using motion sensors, aiming to improve Siri’s speech recognition and battery life.
While this technology could enhance user privacy, it raises data protection concerns due to the potential collection of personal information.
Though the patent showcases Apple’s R&D efforts, it doesn’t confirm the actual implementation of the technology, and its primary focus remains uncertain.

MIT finds potential energy storage method in cement LINK

MIT researchers have developed a supercapacitor using cement, carbon black, and water, potentially allowing energy storage in a building’s foundation.
The cement-based material, when combined with a special salt solution, can act as a powerful supercapacitor, offering rapid energy delivery.
While the technology is promising, questions about its durability and long-term viability remain.

Startup crafts a high-speed tube propelling items to orbitnLINK

Longshot Space CEO Mike Grace is developing a hypersonic launch system that aims to provide a cheaper alternative to rockets for sending payloads into space.
The “Longshot” accelerator uses compressed gas to propel objects through very long concrete tubes, with the goal of achieving speeds up to Mach 25 to 30.
Despite its simplicity and the accompanying challenges, the project has backing from significant figures like OpenAI’s Sam Altman and Draper VC.

‘LK-99’ trend sparks superconductors market frenzy LINK

A team of scientists from South Korea and Virginia claim to have created a superconductor, called LK-99, that can transmit electrical currents without resistance at room temperature, which could result in significant advances in fields like computing and energy.
The claim has led to viral interest and significant stock market activity, particularly for companies with perceived connections to superconductors, though the scientific community remains skeptical and is actively working to verify the findings.
Even if LK-99 is confirmed as a viable room-temperature superconductor, substantial work will be needed to figure out how to implement it into commercial products, underscoring that the technology remains in early stages.

Google’s narrowing legal battlefield in antitrust case LINK

Federal Judge Amit Mehta has dismissed certain claims in an antitrust lawsuit against Google, ruling that the plaintiffs, including the Department of Justice, have not proven that Google is maintaining a monopoly by favoring its own products in search results.
The judge also dismissed antitrust allegations related to Android’s compatibility, Google Assistant, and certain other aspects of Google’s operations.
However, the DOJ can proceed with other arguments in the case, such as claims that Google abuses its power through deals requiring Android manufacturers to pre-load Google apps and make Google the default search engine on their devices.

SoftBank’s $150M claim against IRL for creating fake users LINK

SoftBank is suing defunct social app IRL, which it had previously invested in, alleging fraud and seeking $150 million in damages, after an internal investigation revealed 95% of the app’s users were fake.
IRL had claimed significant user numbers, including that it was downloaded by 25% of US teens and was growing at a 400% annual rate, figures SoftBank alleges were misrepresented and inflated using bots and a secret firm to skew data.
IRL is also under investigation by the SEC to ascertain whether the app violated security laws by misleading investors, with SoftBank’s complaint implicating IRL CEO Abraham Shafi and several of his family members in the alleged fraud.

Google’s $99 on-campus hotel offer to push hybrid work LINK

Google is running a summer promotion allowing full-time staff to book stays at the Bay View campus’ hotel for $99 per night to ease the transition to a hybrid workplace, thereby eliminating commuting for those who choose to stay.
While the offer may align with some apartment rental costs, it necessitates employees to pay for their stay, potentially leading to additional costs if they maintain a separate home, and the benefit is limited to those working at the Bay View campus.
This move coincides with increasing pressure from Google on remote workers to return to the office, amidst rising tensions, including a complaint lodged by YouTube contractors alleging the misuse of return-to-office policies to suppress labor organization.

Tesla jailbreak enabled by unpatchable hardware flaw LINK

Researchers from Technische Universität Berlin have reportedly jailbroken Tesla vehicles, unlocking features usually available through in-car purchases, and are set to present their findings at the 2023 Black Hat USA conference.
The jailbreak could potentially allow hackers to access hardware-protected keys used by Tesla for vehicle authentication and decrypt a vehicle’s internal storage, gaining access to personal user data.
The vulnerability is tied to an unpatchable flaw in each Tesla’s AMD processor, and the researchers used a voltage fault injection attack to manipulate the power flow and gain root privileges, a technique they have previously used to bypass AMD’s firmware TPM in PCs.

Unraveling August 2023: August 05th 2023

Latest AI News and Trends on August 05th 2023

AI Consciousness: The Next Frontier in Artificial Intelligence

Developing AI with emotions, desires, and the ability to learn and grow, raises many philosophical and ethical questions. Such AI may mimic human behavior to a certain extent, but the essence of being human—rooted in our unique biological and experiential nature—could remain distinct.

The Dawn of Proactive AI: Unprompted Conversations

With AI technology advancing rapidly, the possibility of AI initiating unprompted conversations might be within reach. However, these advancements also underline the need for stringent ethical guidelines to ensure respectful and beneficial human-AI interaction.

AI Therapists: Providing 24/7 Emotional Support

AI has revolutionized therapy by providing round-the-clock emotional support. As AI therapists become more sophisticated, they’re enhancing mental health care accessibility, yet also raising important questions about empathy and the human touch in therapy.

46 Generative AI Tools Transforming Businesses

Generative AI tools are providing businesses with unprecedented capabilities, from designing new products to automating content creation. However, as these tools evolve, it’s critical for businesses to understand and manage their ethical implications.

The Challenge of Converting 2D Images to 3D Models with AI

Creating an AI that can convert 2D images into 3D models presents a complex challenge, but strides are being made in this area. While no perfect solution exists yet, researchers are continually exploring alternative methods to solve this problem.

OpenAI is rolling out new updates to improve ChatGPT

OpenAI is shipping out a bunch of small updates over the next week to improve the ChatGPT experience. Here’s a tl;dr

1. Prompt examples: At the beginning of a new chat, you will now see examples to help you get started.

2. Suggested replies: ChatGPT will suggest relevant ways to continue your conversation.

3. GPT-4 by default: When starting a new chat as a Plus user, ChatGPT will remember your previously selected model – no more defaulting back to GPT-3.5.

4. Upload multiple files: Now, ChatGPT can analyze data and generate insights across multiple files.

5. Stay logged in: You’ll no longer be logged out every 2 weeks!

6. Keyboard shortcuts: Work faster with shortcuts, like ⌘ (Ctrl) + Shift + ; to copy last code block. Try ⌘ (Ctrl) + / to see the complete list.

Source

GPT-5 coming soon?

OpenAI has recently filed a Trademark application with the US Patent and Trademark Office for GPT-5. The application was filed on 18-07-2023 and is currently awaiting examination.

The trademark is intended to cover categories of:

Downloadable computer programs and software related to language models
The AI of human speech and text, NLP, ML-based language, and speech processing
Translation of text or speech and sharing datasets for ML
Conversion of audio data into text, voice, and speech recognition
Creating and generating text and developing and implementing artificial neural networks.

The application relates to Software as a Service (SaaS) in these areas.

Source

Google DeepMind finds way to put AI into robots

Google DeepMind has introduced Robotic Transformer 2 (RT-2), a first-of-its-kind vision-language-action (VLA) model that learns from both web and robotics data. It then translates this knowledge into generalized instructions for robotic control. This helps robots more easily understand and perform actions– in both familiar and new situations

Unraveling August 2023: Google Deepmind RT-2

The approach results in very performant robotic policies and, more importantly, leads to a significantly better generalization performance and emergent capabilities due to web-scale vision-language pretraining. Thus, internet-scale text, image, and video data can now be used to help robots develop better common sense.

Source

ChatGPT to Bard– Researchers find a way to turn AI chatbots evil

LLMs today undergo extensive fine-tuning to ensure they do not produce harmful content in their responses. However, new research has introduced an approach that automatically produces adversarial suffixes to prompt the models, which results in affirmative responses for objectionable queries.

Unlike traditional jailbreaks, these are built in an entirely automated fashion, allowing one to create virtually unlimited number of such attacks. Although built to target open-source LLMs, the strings easily transfer to many closed-source, publicly-available chatbots too, like ChatGPT, Bard, and Claude.

Source

Together AI extends Llama-2 to 32k context

Together AI has released LLaMA-2-7B-32K, a 32K context model built using Meta’s Position Interpolation and Together AI’s data recipe and system optimizations, including FlashAttention-2. You can fine-tune the model for targeted, long-context tasks– such as multi-document understanding, summarization, and QA. Here’s the model in Playground completing a book:

Upon evaluation, the model achieves comparable quality than the original LLaMA-2-7B base model.

Source

AI Daily News updates on August 05th 2023

While some schools have curbed the use of generative AI, the University of Hong Kong (HKU) is going all in and urging both its teachers and students to embrace the technology. The University of Hong Kong is supporting this by giving teachers and students free access to various generative AI tools, including Microsoft Azure OpenAI and OpenAI’s ChatGPT and DALL-E.

Intel’s CEO, Pat Gelsinger, has called NVIDIA the clear market leader who has done a great job within the AI space.

AI powerhouse, OpenAI has released some new features for its sensational chatbot, ChatGPT. The new features allow the chatbot to show suggested follow-up prompts at the bottom of its responses. The new features were announced by the company via a tweet on its official Twitter handle.

Asian Americans and women in the workforce are the most concentrated in fields where AI could assist or replace their job tasks, according to new research

Latest Tech News and Trends on August 05th 2023

Threads loses 80% of daily users LINK

Threads, a Twitter rival developed by Meta, had a record-breaking launch, reaching 100 million users within days, but its daily active user count has since declined by 82%.
Users are spending much less time on the app, with usage dropping from nearly 20 minutes per day at launch to barely three minutes per day now.
Despite the decline, Meta’s CEO, Mark Zuckerberg, remains optimistic about Threads and plans to focus on retention and improving the app’s features.

Apple’s third quarter shows mixed results: iPhone sales down, but subscriptions growing LINK

Apple’s third-quarter earnings for 2023 surpassed analyst expectations, but hardware revenue declined compared to the previous year.
iPhone, Mac, and iPad sales were down by 2%, 7%, and 20% respectively, while the “Other Products” category, including wearables, grew by 2%.
The highlight of the earnings report was Apple’s services division, which saw an 8% year-over-year growth, with more than 1 billion paying users in various subscription services, generating $21.21 billion in Q3 2023.

Alphabet sells 90% of its stake in struggling Robinhood LINK

Alphabet, the parent company of Google, reduced its stakes in several publicly traded firms, including Robinhood, 23andMe, and Duolingo.
The company sold nearly 90% of its stake in Robinhood and also trimmed significant positions in Duolingo and 23andMe.
Robinhood, which saw a surge of users during the pandemic, reported stronger-than-expected earnings but still faces challenges with depressed monthly active users.

FCC issues a record $300 million fine against largest robocall scam LINK

The FCC issued a record-breaking fine of $300 million to an international network of companies responsible for making over five billion illegal robocalls to more than 500 million phone numbers, including violating federal spoofing laws.
Phone companies were told to block the numbers used by the callers, resulting in a 99% decrease in calls.
The FCC described it as the largest illegal robocall operation ever investigated, and they are determined to stop the scammers behind these calls.

Bitfinex hackers who stole billions in crypto plead guilty LINK

Ilya Lichtenstein and Heather Morgan, the couple involved in the 2016 Bitfinex hack, have pleaded guilty in court.
Lichtenstein used advanced hacking tools to gain access to Bitfinex and moved 119,754 bitcoins to his own wallets, while Morgan helped him move and launder the stolen funds.
The couple set up false identities, used darknet markets and crypto exchanges, and purchased physical gold coins with the stolen money. Lichtenstein faces up to 20 years in prison, while Morgan could be sentenced to up to five years.

World’s First Tooth Regrowth Medicine Enters Clinical Trials — ‘Every Dentist’s Dream’ Could Be A Life-Changing Reality. Link

Frackers can use dangerous chemicals without disclosure due to “Halliburton loophole”. Link

The Reddit Protest Is Finally Over. Reddit Won. Link

NASA has reestablished full communications with Voyager 2. Link.

Latest News and Trends on August 05th 2023

A judge has ruled Texas’ abortion ban is too restrictive for women with pregnancy complications. Link

FDA approves first postpartum depression pill in the US. Link

Appeals court throws out conviction after judge says Black man ‘looks like a criminal to me’. Link

Unraveling August 2023: August 04th 2023

Latest AI News and Trends on August 04th 2023

AI Brings ‘Elvis’ Back: The New Age of Music Generation

In a unique feat of AI, ‘Elvis’ has been brought back to life, in a manner of speaking, to perform a humorous rendition of a modern classic. The technology behind this achievement demonstrates how AI is becoming an increasingly powerful tool in music generation and other creative fields.

Meta Releases Open Source AI Audio Tools, AudioCraft

Meta has released AudioCraft, an open-source suite of AI audio tools, marking a significant contribution to the AI audio technology sector. These tools are expected to facilitate advancements in audio synthesis, processing, and understanding.

Researchers Provoke AI to Misbehave, Expose System Vulnerabilities

Researchers have discovered a method to manipulate AI into displaying prohibited content, revealing potential vulnerabilities in these systems. This research underscores the importance of ongoing studies into the reliability and integrity of AI, as well as measures to safeguard against misuse.

Meta Leverages AI-Powered Chatbots to Drive User Engagement

Meta is planning to deploy AI-powered chatbots as part of a strategy to boost user numbers on their social media platforms. This approach signifies the growing influence of AI in enhancing user interaction and engagement on digital platforms.

Barriers To AI Adoption

Despite its evident potential, some businesses and industry leaders are maintaining a cautious distance from AI, hesitating to embrace its advantages to their operations.

Google’s AI Search: Now With Visuals!

Image Source: Google

Summary: Google’s Search Generative Experiment (SGE) is stepping up its AI game. Not only does it offer AI-powered results, but now also related images and videos, making searches easier and engaging. (source)

Key Points:

Google’s SGE now displays images and videos related to your search query.
Publication dates are also shown, so you know how fresh the information is.
Sign up to test these features via Search Labs

Why It Matters: This update takes Google’s AI search to a new level, providing a richer and more dynamic user experience. Getting information from searches will become easier than ever.

Tutorial: Craft Your Marketing Strategy with ChatGPT

Whether you’re a seasoned marketer or a startup founder, creating a comprehensive marketing strategy that captures the attention of your target audience can be a complex task. ChatGPT can serve as a sounding board, providing suggestions based on historical marketing knowledge and best practices.

Try the prompt below:

You are a seasoned marketing strategist. I'm launching a new eco-friendly skincare line targeted at millennials and Gen Z. I need a comprehensive marketing strategy that covers product positioning, branding, pricing, distribution channels, and promotion strategies. This should include social media campaigns, influencer partnerships, SEO, and content marketing tactics. Your recommendations should consider current marketing trends and consumer behaviors.

You can modify this prompt to suit your specific marketing needs. Whether you’re promoting a physical product, a digital service, or a personal brand, you can ask ChatGPT for tailored advice.

AI Won’t Replace Humans — But Humans With AI Will Replace Humans Without AI

Karim Lakhani is a professor at Harvard Business School who specializes in workplace technology and particularly AI. He’s done pioneering work in identifying how digital transformation has remade the world of business.

Machine learning helps researchers identify underground fungal networks

Justin Stewart left for Mount Chimborazo in August 2022 to collect fungal samples from the Ecuadoran volcano at an elevation of 4,000 meters, or about 13,000 feet.

DeepSpeed-Chat: Affordable RLHF training for AI

New Microsoft research has introduced DeepSpeed-Chat, a novel system that makes complex RLHF (Reinforcement Learning with Human Feedback) training fast, affordable, and easily accessible to the AI community (open-sourced). It has three key capabilities:

Easy-to-use Training and Inference Experience for ChatGPT Like Models
A DeepSpeed-RLHF pipeline that replicates the training pipeline from InstructGPT
A robust DeepSpeed-RLHF system that combines various optimizations for training and inference in a unified way

The system delivers unparalleled efficiency and scalability, enabling training of models with hundreds of billions of parameters in record time and at a fraction of the cost. Here’s how it compares to two other frameworks (Colossal-AI and HuggingFace DDP) for accelerating RLHF training on a single NVIDIA A100-40G commodity GPU.

Why does it matter?

The current landscape lacks an accessible, efficient, and cost-effective end-to-end RLHF training pipeline for powerful models like ChatGPT, particularly when training at the scale of billions of parameters. DeepSpeed-Chat paves the way for broader access to advanced RLHF training, thereby fostering innovation and further development in the field of AI.

(Source)

OpenAI is rolling out new updates to improve ChatGPT

OpenAI is shipping out a bunch of small updates over the next week to improve the ChatGPT experience. Here’s a tl;dr

1. Prompt examples: At the beginning of a new chat, you will now see examples to help you get started.

2. Suggested replies: ChatGPT will suggest relevant ways to continue your conversation.

3. GPT-4 by default: When starting a new chat as a Plus user, ChatGPT will remember your previously selected model – no more defaulting back to GPT-3.5.

4. Upload multiple files: Now, ChatGPT can analyze data and generate insights across multiple files.

5. Stay logged in: You’ll no longer be logged out every 2 weeks!

6. Keyboard shortcuts: Work faster with shortcuts, like ⌘ (Ctrl) + Shift + ; to copy last code block. Try ⌘ (Ctrl) + / to see the complete list.

Why does it matter?

These improvements make ChatGPT more user-friendly and streamline human-AI interactions, making it a more user-friendly and powerful tool overall. It will set the stage for improved and advanced AI applications as ChatGPT is today’s leading LLM.

(Source)

Latest versions of Vicuna, based on the open LLaMA-2

The latest Vicuna v1.5 series based on Llama 2 features 4K and 16K context lengths (has extended context length via positional interpolation by Meta), and have improved performance on almost all benchmarks. Vicuna 1.5 tl;dr

7B & 13B parameter versions
4096 and 16384 token context window
trained on 125k ShareGPT conversations
Commercial use
Evaluated with standard benchmarks, human preference, and LLM-as-a-judge

Why does this matter?

Since its release, Vicuna has been one of the most popular chat LLMs. It has enabled pioneering research on multi-modality, AI safety, and evaluation. Since the latest versions are based on the open-source Llama-2, they can be an open LLM alternative to ChatGPT/GPT-4.

Source

What Else Is Happening in AI on August 04th 2023

‘Every single’ Amazon team is working on generative AI, says CEO (Link)

Twilio’s new integration will bring OpenAI’s GPT-4 model to its Engage platform (Link)

Datadog launches generative AI assistant Bits and new model monitoring solution (Link)

Pinterest is now using next-gen AI for more relevant and personalized content and ads (Link)

AI.com now redirects to Elon Musk’s X.ai instead of taking you to ChatGPT (Link)

Unraveling August 2023: August 03rd 2023

Latest AI News and Trends on August 03rd 2023

Smartphone app uses machine learning to accurately detect stroke symptoms

Today at the Society of NeuroInterventional Surgery’s (SNIS) 20th Annual Meeting, researchers discussed a smartphone app created that reliably recognizes patients’ physical signs of stroke with the power of machine learning.

In the study, “Smartphone-Enabled Machine Learning Algorithms for Autonomous Stroke Detection,” researchers from the UCLA David Geffen School of Medicine and multiple medical institutions in Bulgaria used data from 240 patients with stroke at four metropolitan stroke centers. Within 72 hours of the start of the patients’ symptoms, researchers used smartphones to record videos of patients and test their arm strength in order to detect patients’ facial asymmetry, arm weakness, and speech changes-;all classic stroke signs.

To evaluate facial asymmetry, the study authors used machine learning to analyze 68 facial landmark points. To test arm weakness, the team used data from a smartphone’s standard internal 3D accelerometer, gyroscope, and magnetometer. To determine speech changes, researchers used mel-frequency cepstral coefficients, a typical sound recognition method that translates sound waves into images, to compare normal and slurred speech patterns. They then tested the app using neurologists’ reports and brain scan data, finding that the app was sensitive and specific enough to diagnose stroke accurately in nearly all cases.

AI and Machine Learning: The New Frontier in Global Anti-Money Laundering Efforts

The world of finance is no stranger to the nefarious activities of money laundering, a global menace that has proven to be a tough nut to crack for financial institutions and regulatory bodies. However, the advent of Artificial Intelligence (AI) and Machine Learning (ML) is heralding a new frontier in global anti-money laundering efforts, offering promising solutions to this age-old problem.

Money laundering, the process of making illegally-gained proceeds appear legal, is a complex and sophisticated crime. It often involves multiple transactions, used to disguise the origin of financial assets so that they appear to have originated from legitimate sources. Traditional methods of detecting and preventing money laundering have often fallen short, due to the sheer volume of financial transactions that occur daily and the clever tactics employed by money launderers.

Enter AI and ML, two technological advancements that are revolutionizing various sectors, including finance. These technologies are now being harnessed to combat money laundering, and early indications suggest they could be game-changers.

AI, with its ability to mimic human intelligence, and ML, a subset of AI that involves the science of getting computers to learn and act like humans, are being used to analyze vast amounts of financial data. They can sift through millions of transactions in a fraction of the time it would take a human, identifying patterns and anomalies that could indicate suspicious activity.

Moreover, these technologies are not just faster; they are also more accurate. Traditional anti-money laundering systems often generate a high number of false positives, leading to wasted time and resources. AI and ML, on the other hand, can learn from past data and improve their accuracy over time, reducing the number of false positives and allowing financial institutions to focus their resources on genuine threats.

The use of AI and ML in anti-money laundering efforts is not without its challenges. For one, these technologies require vast amounts of data to function effectively. This raises privacy concerns, as financial institutions must balance the need for effective anti-money laundering measures with the need to protect their customers’ personal information. Additionally, the use of AI and ML requires significant investment in technology and skilled personnel, which may be beyond the reach of smaller financial institutions.

Meta’s AudioCraft is AudioGen + MusicGen + EnCodec

Meta has introduced AudioCraft, a new family of generative AI models built for generating high-quality, realistic audio & music from text. AudioCraft is a single code base that works for music, sound, compression & generation — all in the same place. It consists of three models– MusicGen, AudioGen, and EnCodec.

Meta is also open-sourcing these models, giving researchers and practitioners access so they can train their own models with their own datasets for the first time. AudioCraft is also easy to build on and reuse. Thus, people who want to build better sound generators, compression algorithms, or music generators can do it all in the same code base and build on top of what others have done.

Why does it matter?

AudioCraft is a significant step forward in generative AI research. It opens up unprecedented possibilities for creating unique audio/music– whether for video games, merchandise promos, YouTube content, educational purposes, etc. Moreover, the open-source initiative will further help advance the field of AI-generated audio and music.

AudioCraft is for musicians what ChatGPT is for content writers.

Source: https://ai.meta.com/blog/audiocraft-musicgen-audiogen-encodec-generative-ai-audio

LLaMA2-Accessory: An Open-source Toolkit for LLM Development.

LLaMA2-Accessory is an advanced open-source toolkit for pre-training, fine-tuning, and deployment of Large Language Models (LLMs) and multimodal LLMs. Its repository is mainly inherited from LLaMA-Adapter with more advanced features.

Thus, it supports more datasets, tasks, visual encoders, and efficient optimization methods. (LLaMA-Adapter is a lightweight adaption method to efficiently fine-tune LLaMA into an instruction-following model).

Unraveling August 2023: Llama2 - Accessory — Unraveling August 2023: Llama2

Why does this matter?

It will allow to easily and quickly experiment with and build upon state-of-the-art language models, saving time and resources in the development process. Moreover, its open-source nature democratizes access to advanced AI tools, enhancing engagement and progress toward groundbreaking AI solutions across various industries and domains.

Source

Google’s AI can tell what music you listened to based on your brain signals.

In a cutting-edge collaborative study between Google and Osaka University, scientists have revealed a novel artificial intelligence (AI) system capable of producing music reminiscent of songs that individuals were listening to while undergoing brain scans.

The research team built an AI-based pipeline, called Brain2Music, that utilized functional magnetic resonance imaging (fMRI) data to recreate music corresponding to snippets of songs that subjects listened to. The fMRI technique observes oxygen-rich blood flow in the brain to determine the most active regions.

The collected brain scans were from five participants who listened to 15-second clips of various genres, such as blues, classical, hip-hop, and pop.

While there have been previous studies on reconstructing sounds like human speech or bird songs from brain activity, attempts to recreate music from brain signals have been rare.

The process began by training an AI program to associate features of music, such as genre, rhythm, mood, and instrumentation, with participants’ brain signals. The mood of the music was labeled by researchers with descriptive terms such as happy, sad, or exciting.

The AI was then customized for each participant, establishing connections between individual brain activity patterns and diverse musical elements.

Upon training, the AI could translate unseen brain imaging data into a format representing the musical elements of the original song clips. This information was fed into another AI model developed by Google, known as MusicLM, which was initially created to generate music from text descriptions.

MusicLM utilized this information to generate musical clips that fairly and accurately resembled the original song snippets, achieving an agreement level of about 60% in terms of mood. The genre and instrumentation in both the reconstructed and original music matched considerably more often than what could be attributed to chance.

Timo Denk, a software engineer at Google in Switzerland and the study’s co-author, emphasized that the method was robust across subjects, hinting at its likely effectiveness if applied to new individuals.

The underlying goal of the research is to enhance understanding of how the brain processes music. The team observed that listening to music activated specific brain regions, such as the primary auditory cortex and the lateral prefrontal cortex. The latter seems to be crucial for interpreting the meaning of songs, but more research is needed to validate this finding.

Intriguingly, the team also hopes to explore the possibility of reconstructing music that individuals are merely imagining, rather than actually hearing.

The study, published on July 20 in the preprint database arXiv, awaits peer review. The generated musical clips can be listened to online, showcasing a remarkable advancement in AI’s capabilities to bridge the gap between human cognition and machine interpretation.

Source.

AI use in breast cancer screening as good as TWO radiologists, study finds.

One of the most comprehensive trial of its kind has found that using AI in breast cancer screening is safe and can significantly reduce the workload of radiologists. It’s also revealed that AI-supported screening can detect cancer at a similar rate to standard double reading without increasing false positives, thereby possibly easing the pressure on medical professionals.

Here’s the source (The Guardian), which I summarized into a few key points:

AI in breast cancer screening and its benefits

AI’s effectiveness in screening is found to be on par with two radiologists working together, providing a new tool in early detection.
The technology almost halves the workload for radiologists, greatly improving the efficiency.
No increase in the false-positive rate, with 41 more cancers detected with AI support.

The study, results, and future implications

The study was a randomised controlled trial involving over 80,000 women, primarily from Sweden, comparing AI-supported screening with standard care.
Interim analysis considers AI use in mammography safe, with the potential to reduce radiologists’ workload by 44%. The lead author calls for further understanding, trials, and evaluations to assess the full potential and implications of AI…

What Else Is Happening in AI?

Instagram is working on labels for AI-generated content (Link)

Google’s generative search feature now shows related videos and images (Link)

Tinder tests AI photo selection feature to help users build profiles (Link)

Alibaba rolls out open-sourced AI model to take on Meta’s Llama 2 (Link)

IBM and NASA announced the availability of the watsonx.ai geospatial foundation model on (Link)

Nvidia researchers have created a new text-to-image personalization method called Perfusion. Unlike the million-dollar super heavyweight models out there Perfusion is 100KB and takes only four minutes to train.

Meta Platforms (META.O) on Wednesday introduced its open-source AI tool called AudioCraft that will help users to create music and audio based on text prompts. The AI tool is bundled with three models, AudioGen, EnCodec, and MusicGen, and works for music, sound, compression, and generation, Meta said.

As generative AI enters the mainstream, the crowdfunding platform Kickstarter has struggled to formulate a policy that satisfies parties on all sides of the debate.

In an astounding medical first, researchers have used AI-powered brain implants to restore movement and sensation for a man who was paralyzed from the chest down.

Latest Tech News and Trends on August 03rd 2023

Movie extras worry they’ll be replaced by AI. Hollywood is already doing body scans. Link

China considers limiting kids’ smartphone time to two hours per day | Younger children would face even stricter terms. Link

IRS vows to digitize all taxpayer documents by 2025. Link

New algorithm spots its first “potentially hazardous” near-Earth asteroid — and it’s 600 feet long. Link

Hong Kong grants its first crypto license under its new rules to HashKey Exchange, legalizing retail trading as officials seek to foster a global crypto hub (Bloomberg);

Intel files an application to significantly expand its Oregon campus, currently home to 22K workers, slated to begin as soon as 2025; the last upgrade cost $3B (Anton Shilov/AnandTech);

France’s AFP sues X in Paris, saying the company refused to discuss potential content distribution payments under a 2019 law; Elon Musk calls the move “bizarre” (Shubham Kalia/Reuters);

India cutting off internet in Manipur state on May 4 after ethnic conflict broke out did not quell the violence and provided cover for murders, rapes, and arson (Parth M.N./Wired);

Latest Soccer/Football/Sports News and Trends on August 03rd 2023

Germany out of Women’s World Cup in latest huge exit to boost England hopes

Germany have been knocked out of the Women’s World Cup after a 1-1 draw with South Korea in Brisbane, combined with Morocco’s win over Colombia which saw them join the South Americans in the last 16.

Morocco reach the knockout stage in their first ever Women’s World Cup

Knockout Stage Bracket for 2023 Women’s World Cup

Tom Brady invests in Birmingham City and joins the advisory Board

Golden Boot race for the Women’s World Cup after the group stage

Latest World and USA News on August 03 2023

‘Cancer-killing pill’ that appears to ‘annihilate’ solid tumours is now being tested on humans. Link

Body found in floating border barrier between Texas and Mexico. Link

DeSantis-controlled Disney World district gets rid of all diversity, equity and inclusion programs and staffers. Link

Federal court sides with Indiana trans schoolchildren on bathroom access. Link

A-listers including Oprah Winfrey, Meryl Streep, Leonardo DiCaprio donate $1 million each to SAG-AFTRA relief fund. Link

Federal jury acquits Louisiana trooper caught on camera pummeling Black motorist. Link

Atlantic orcas ‘learning from adults’ to target boats off Spain’s coast. Link

Australia Will Return Looted Sculptures to Cambodia;

Heat Illness Sickens Hundreds at Scout Jamboree in South Korea;

The Art of Telling Forbidden Stories in China;

36 Hours in Ho Chi Minh City, Vietnam: Things to Do and See;

VanMoof’s Bankruptcy Worries Owners of Electric Bikes;

‘Kill the Boer’ Song Fuels Backlash in South Africa and U.S.;

A Leader of Niger’s Coup Visits Mali, Raising Fears of a Wagner Alliance;

Are the Trump Indictments a Turning Point? History Says Not Likely.;

Amid Signs of a Covid Uptick, Researchers Brace for the ‘New Normal’;

Unraveling August 2023: August 02nd 2023

Latest AI News on August 02 2023

Google AI will replace your Doctor soon: Google DeepMind Advances Biomedical AI with ‘Med-PaLM M’

Google and DeepMind have introduced Med-PaLM M, a multimodal biomedical AI system that can interpret diverse types of medical data, including text, images, and genomics. The researchers curated a benchmark dataset called MultiMedBench, which covers 14 biomedical tasks, to train and evaluate Med-PaLM M.

The AI system achieved state-of-the-art performance across all tasks, surpassing specialized models optimized for individual tasks. Med-PaLM M represents a paradigm shift in biomedical AI, as it can incorporate multimodal patient information, improve diagnostic accuracy, and transfer knowledge across medical tasks. Preliminary evidence suggests that Med-PaLM M can generalize to novel tasks and concepts and perform zero-shot multimodal reasoning.

Why does this matter?

It brings us closer to creating advanced AI systems to understand and analyze various medical data types. Google DeepMind’s MultiMedBench and Med-PaLM M show promising performance and potential in healthcare applications. It means better healthcare tools that can handle different types of medical information, ultimately benefiting patients and healthcare providers.

Source

Meta is building AI friends for you

Meta, the owner of Facebook, is developing chatbots with different personalities to increase engagement on its platforms. These chatbots, known as “personas,” will mimic human conversations and may include characters like Abraham Lincoln or a surfer. The chatbots are expected to launch early in September and will provide users with search functions, recommendations, and entertainment.

The move is aimed at retaining users and competing with platforms like TikTok. However, there are concerns about privacy, data collection, and the potential for manipulation.

Why does this matter?

Meta’s move to develop AI-powered chatbots with different personas comes in response to competition from rivals like TikTok and Snap. TikTok has been gaining popularity and challenging established platforms like Facebook. Meanwhile, Snap has already launched its “My AI” feature, an experimental chatbot that has engaged 150 million users. Meta is also challenging companies like OpenAI, which launched ChatGPT. By introducing these chatbots, Meta aims to attract and retain users while staying at the forefront of AI innovation in social media.

Source

An Asian woman asked AI to improve her headshot and it turned her white

Unraveling August 2023: AI turn Asian to White

An Asian-American MIT grad used an AI image generator to make her headshot more professional but was shocked to find it altered her appearance to look white. The incident led to discussions about racial bias in AI, eliciting reactions from the CEO and highlighting concerns over the technology’s imperfections.

What happened and the reactions

Rona Wang, an Asian-American MIT grad, used Playground AI’s image editor to make her headshot look more professional, only to find that it lightened her skin and altered her race.
Wang expressed disbelief and concern over the incident, wondering if the AI assumed that she needed to be white to appear professional.
The incident quickly caught public attention, and both the CEO of Playground AI, Suhail Doshi, and media outlets reacted to it.

CEO’s response was evasive…

Suhail Doshi, the CEO of Playground AI, responded to the Boston Globe’s interview but did not directly address the concerns about racial bias.
He used a metaphor involving rolling a dice to question whether the incident was indicative of a systemic issue.

… which leads to the broader issue of racial bias in AI

Wang’s experience brought attention to the recurring problem of racial bias, a concern she had previously expressed.
Her evolving views on the AI’s bias and her struggles with AI photo generators highlight ongoing challenges in the industry.
The incident serves as a stark reminder of the imperfections in AI and raises questions about the haste to integrate such technology in various sectors.

Source (Futurism)

How China Is Using AI In Schools To Improve Education & Efficiency

1. AI Headband: Headbands measure how focused students are. Teachers and parents get this information on their computers.

2. Robots: Robots in classrooms look at students’ health and how involved they are in lessons.

3. Tracking Uniforms: Students wear special uniforms with chips that show where they are.

4. Surveillance Cameras: Cameras watch how often students look at their phones or yawn in class.

These efforts are part of a big experiment to use AI to make education in China better and more efficient.

Could this be the future of education worldwide?

Top 4 AI models for stock analysis/valuation?

– Boosted.ai – AI stock screening, portfolio management, risk management
– Danielfin – Rates stocks and ETFs with an easy-to-understand global AI Score
– JENOVA – AI stock valuation model that uses fundamental analysis to calculate intrinsic value
– Comparables.ai – AI designed to find comparables for market analysis quickly and intelligently

What Machine Learning Reveals About Forming a Healthy Habit. Link

Contrary to popular belief, behaviors don’t become habits after a “magic number” of days. Wharton’s Katy Milkman shares what machine learning is teaching scientists about habit formation.

“There’s this widely spread rumor that it takes 21 days to form a habit. You may have also heard it takes 90 days to form a habit. There are popular books that tout these numbers that don’t have a sound basis in research. What we find is there is no such magic number,” said Katy Milkman, a Wharton professor of operations, information and decisions.

What Else Is Happening in AI on August 02nd 2023

Uber is creating a ChatGPT-like AI bot, following competitors DoorDash & Instacart. (Link)

YouTube testing AI-generated video summaries. (Link)

AMD plans AI chips to compete Nvidia and calls it an opportunity to sell it in China. (Link)

Kickstarter needs AI projects to disclose model training methods. (Link)

UC hosting AI forum with experts from Microsoft, P&G, Kroger, and TQL. (Link)

AI employment opportunities are open at Coca-Cola and Amazon. (Link)

Latest Tech News on August 02nd 2023

Meta is so unwilling to pay for news under a new Canadian law that it’s starting to block it on Facebook and Instagram in that country. Meta permanently ending news availability on its platforms in Canada starting today. Link

Uber CEO balks after a reporter tells him the cost of his 2.9-mile Uber ride: ‘Oh my God. Wow.’ Link

Reddit beats film industry, won’t have to identify users who admitted torrenting. Link

Superconductor Breakthrough Replicated, Twice, in Preliminary Testing. Link

Tech experts are starting to doubt that ChatGPT and A.I. ‘hallucinations’ will ever go away: ‘This isn’t fixable’. Link’

Latest Football/Soccer/Sports News on August 02nd 2023

Women World Cup: France 6-3 Panama; Brazil 0-0 Jamaica; Argentina 0 – 2 Sweden; South Africa 3- Italy 2; Brazil and Argentina are out. Link

France 6-3 Panama; Brazil 0-0 Jamaica; France go through to the last 16 as group winners. The result confirms Brazil’s elimination. Jamaica are through in second place.

South Africa 3- Italy 2 South Africa are into the last 16 after claiming their first Women’s World Cup win with a thrilling 3-2 victory over Italy in Wellington.

Argentina 0 – 2 Sweden; Sweden beat Argentina to make it three wins from three at the Women’s World Cup, clinching top spot in Group G and a mouth-watering last-16 clash with the USA.

Bob Marley’s daughter, Cedella, is leading the Jamaican women’s soccer team

She created “Football is Freedom” to help raise funds for their trips, uniforms, and training camps.

Italian keeper Buffon hangs up gloves after 28 years

Italian goalkeeper Gianluigi Buffon has retired from football, the 45-year-old announced on Wednesday, after 28 years in which he reached exceptional milestones for club and country.

The girl who was too good: how Lauren James rose to World Cup stardom

Forward who trained as a youngster at Arsenal with women and boys is justifying the excitement over her talent with England

“UEFA or FIFA Must Find Solutions” – Liverpool Boss Jurgen Klopp Complains About Saudi Arabia’s Transfer Deadline. Link

Arsenal agree terms with Brentford keeper Raya; Dembele to PSG is done; Chelsea sign Rennes midfielder Ugochukwu; Mane leaves Bayern to join Ronaldo at Al-Nassr; Will Haaland continue breaking records. Link

Arsenal agree terms with Brentford keeper Raya – Gossip
Dembele to leave Barcelona for PSG – Xavi
Chelsea sign Rennes midfielder Ugochukwu
Mane leaves Bayern to join Ronaldo at Al-Nassr
Will Haaland continue breaking records

Erling Haaland broke all, well most of, the Premier League goalscoring records in his first season in England – so what can he do this season?

The Norway forward scored a record 36 goals in 35 league games to win the Golden Boot – and netted 52 goals, a record for a Manchester City player, in 53 games in all competitions.

He never looked back after his opening games, where he smashed many of the records for fast goalscoring starts in the Premier League that had been set back in 1992-93 by Coventry City’s Mick Quinn.

Haaland also helped City win the Treble of Premier League, Champions League and FA Cup.

What records can be break in 2023-24?

Katie Ledecky makes swimming history with major world championship wins. Link

Jalin Hyatt reported to have broken NFL record for the fastest speed at 24 MPH. Link

Guo Jincheng obliterates 50m world record, breaking :30 in S5 free at Para Swim Worlds. Link

Google Street View car evades police at 100 mph, crashes into creek, Indiana cops say. Link

Unraveling August 2023: Latest News on August 02nd 2023

Trump charged by Justice Department for efforts to overturn his 2020 presidential election loss. Link

FBI finds 200 sex trafficking victims, 59 missing children in two-week sweep. Link

Woman accused of killing bride in DUI golf cart crash must remain in custody, S.C. judge orders. Link

U.S. ban on popular lightbulb goes into effect. Link

The Pittsburgh synagogue gunman will be sentenced to death for the nation’s worst antisemitic attack. Link

Unraveling August 2023: August 01st 2023

Latest AI News on August 01st 2023

News Corp Leverages AI to Produce 3,000 Local News Stories per Week

News Corp Australia now uses AI to generate 3,000 local news articles each week. The executive chair, Michael Miller, disclosed this at the World News Media Congress in Taipei.

A team of four, known as the Data Local unit, utilizes AI to create localized news stories that span across various topics, including weather, fuel prices, and traffic reports. Peter Judd, News Corp’s data journalism editor, leads the team (he is also the credited author for many of these AI-generated stories).

News Corp’s AI supplements the work of reporters covering stories for the company’s 75 “hyperlocal” mastheads spread across Australia, from Penrith to Cairns. AI-generated content such as “Where to find the cheapest fuel in Penrith” is supervised by journalists. However, there is currently no indication within the articles that they are AI-assisted.

These thousands of AI-generated “articles” are more service-information-oriented, according to a News Corp spokesperson. They emphasized that the automated updates on local fuel prices, court lists, traffic, weather, and other areas are all overseen by the Data Local team’s journalists.

Miller revealed that a majority of their new subscribers sign up for the local news, but stay for the national, world, and lifestyle news. He also disclosed that 55% of all subscriptions are spurred by hyperlocal mastheads. Amidst the shift to digital platforms and local digital-only titles, News Corp seems to be harnessing the power of AI to enhance its hyperlocal news offerings.

The success of News Corp’s AI usage in journalism suggests a trend that other newsrooms in Australia, like ABC and Nine Entertainment, may be considering. As media companies explore AI applications, the question becomes how to use it effectively to enhance content accessibility, personalization, and more.

Source.

Workers are spilling more secrets to AI than to their friends

A new study reveals that workers are more open to sharing company secrets with AI tools than with friends. The research also highlights both the popularity of AI tools in workplaces and the potential security risks, with an emphasis on the growing challenges related to cybersecurity.

Here’s the source, which I summarized in a few main points:

Workers’ positive attitudes towards AI, especially in the US

A third of workers from the US and UK would continue using AI tools even if banned by their companies.
69% believe the benefits of AI tools outweigh the risks, with US workers being the most optimistic (74%).

Widespread use of AI in the workplace and lack of awareness about dangers

Half of the respondents use AI for tasks like research, copywriting, and data analysis.
CybSafe’s report emphasizes that businesses are not informing employees about risks, leading to potential threats like phishing scams.

Challenges in cybersecurity and distinguishing human from AI-generated content

64% of US workers have entered work-related information into AI tools, and 93% are potentially sharing confidential data with AI.
60% of respondents claim they can accurately distinguish human from AI content, yet the blurring line poses risks for cybercrime.

Google’s AI will auto-generate ads

Google Ads has introduced a new feature that uses AI to generate advertisements on its platform automatically. The feature utilizes Large Language Models and generative AI to create campaign workflows based on prompts from marketers.

Google Ads can analyze landing pages, successful queries, and approved headlines to generate new creatives. The company also highlighted its commitment to privacy and introduced enhanced privacy features like Privacy Sandbox.

Why does this matter?

Using LLMs and Generative AI, this AI tool for auto-generated ads will save time, ensure privacy, and empower small businesses to leverage AI. Integrating generative AI in content creation also promises exciting possibilities beyond advertising.

Source

Meta prepares AI chatbots with personas to try to retain users

Meta is preparing to launch AI chatbots with distinct personalities, in an effort to retain users on its platforms. This move aims to capitalize on the growing enthusiasm for AI technology and present a challenge to rivals like OpenAI, Snap, and TikTok.

If you want to stay up to date on the latest in AI and tech, look here first.

The article (Financial Times) is paywalled, so here’s a recap of the article’s main points:

Meta’s strategy for engaging users through chatbots

Meta is developing chatbots that exhibit distinct personalities, such as those of historical figures and characters, to create a more engaging and personalized user experience.
The company is targeting a launch as early as September, aiming to enhance user interaction with new search functions, recommendations, and entertaining experiences with these persona-driven chatbots.

Competitive landscape and user engagement

Meta’s aim is to boost engagement and keep pace with competitors like TikTok
They will introduce “personas” to provide search functions, recommendations, and entertainment
Finally, they plan to use these chatbots to collect user data for more relevant content targeting

Addressing challenges and ethical concerns

Unraveling August 2023: LLMs to think more like a human for answer quality

This research introduces “Skeleton-of-Thought” (SoT), a method to decrease the generation latency of large language models. SoT guides LLMs first to generate the skeleton of the answer and then complete the contents of each skeleton point in parallel.

This approach provides significant speed-up (up to 2.39x across 11 different LLMs) and can potentially improve answer quality regarding diversity and relevance. SoT is an initial attempt at optimizing LLMs for efficiency and encouraging them to think more like humans for better answers.

Research by: Microsoft Research And Department of Electronic Engineering, Tsinghua University.

Why does this matter?

By emulating human-like thinking processes, LLMs can deliver more natural and contextually appropriate answers, enhancing their practical applications across various domains, such as NLP, customer support, and information retrieval. This advancement brings us closer to creating AI systems that can interact with users more effectively, making them more valuable tools in our everyday lives.

Source

ChatGPT outperforms undergrads in SAT exams

Summary: UCLA researchers have discovered that GPT-3 matches or outperforms undergrad students in solving reasoning problems typically found on exams like the SAT. (source)

Key points:

In a study conducted at the University of California, Los Angeles, GPT-3 was tested against 40 undergraduates to solve reasoning problems. The AI scored an impressive 80%, leaving the humans trailing with a below 60% average.
In the SAT “analogy” questions that were unpublished online – GPT-3 outperformed the average human score yet again.
GPT-3 had a tougher time matching a piece of text with a short story that had the same message. But GPT-4, the next version of the model, did a better job.
While researchers recognize GPT-3’s advancements, they also acknowledge its limitations in social interactions, mathematical reasoning, and spatial problem-solving.

Why it Matters: Picture AI and humans, inching closer in a problem-solving marathon. This isn’t about robots stealing jobs, no. It’s about reshaping the way we learn and do business with AI.

Unraveling August 2023: ToolLLM masters 16k+ real-word APIs

ToolLLM is a framework that enhances the tool-use capabilities of open-source LLMs by training them to follow human instructions to use external tools (APIs). The framework includes a dataset called ToolBench, which contains instructions for using over 16,000 real-world APIs.

A depth-first search-based decision tree (DFSDT) is used to improve the planning and reasoning capabilities of the LLMs. An automatic evaluator called ToolEval is also developed to assess the performance of the LLMs. The results show that the trained LLM, ToolLLaMA, can execute complex instructions and generalize to unseen APIs, performing comparably to closed-source LLMs like ChatGPT.

Why does this matter?

ToolLLM, can execute complex instructions and perform comparably to closed-source models like ChatGPT. And it bridges the gap between language models and practical tool usage, making them more versatile and valuable for various applications.

Source

AI powered tools for email writing

GMPlus : GMPlus is a chrome extension that makes your email writing easier by providing a shortcut anytime you write an email. No need to switch to other tabs. It helps you compose high-quality emails in minutes.
NanoNets AI email autoresponder : It’s free no login AI email writer that helps you write an effective email copy in minutes. With this tool, you can automate your email responses and create compelling email copies.
Rytr : Rytr AI is an AI-powered writing tool that helps users generate high-quality content quickly and easily. It is easy to use and requires very little effort to generate email copy that converts.
Smartwriter AI : It is an AI email marketing tool that helps generate personalized emails that can get positive replies faster and cheaper. It automates email outreach, so you don’t have to research constantly.
Copy AI : It’s an easy to use copy-generating tools that can help you write copy real quick. It can generate copy for Instagram captions, nurturing email subject lines, cold outreach pitches.
Thoughts ?
More useful resources in this guide.

Tutorial: ChatGPT Prompt to Enhance Your Customer Service

In the evolving landscape of online businesses, excellent customer service remains pivotal. ChatGPT can play a vital role in elevating your customer service quality. In this tutorial, we will explore how you can utilize ChatGPT to ensure your customers feel valued, and their concerns are promptly addressed. Here’s a customized prompt you can try with ChatGPT to streamline your customer service approach.

Try the prompt below:

Assume the role of a customer service expert. I run an online store selling tech gadgets and I'm receiving an increasing volume of customer inquiries and complaints. I need a comprehensive plan to improve my customer service. This should include strategies for effectively managing and responding to customer inquiries, handling complaints, providing after-sales service, and turning negative experiences into positive ones. Your recommendations should be based on the latest best practices in customer service and consider the specific challenges of an online business.

This prompt can be customized according to your business’s specific needs. Whether you’re struggling with a high volume of inquiries, dealing with complex complaints, or looking to improve your overall customer satisfaction, you can seek advice from ChatGPT.

Daily AI Update News from Google DeepMind, Together AI, YouTube, Capgemini, Intel, and more

DoNotPay, an AI lawyer bot known as ChatGPT4, is transforming how users handle legal issues and save money. In under two years, this innovative robot has successfully overturned more than 160,000 parking tickets in cities like New York and London. Since its launch, it has resolved a total of 2 million related cases.

Microsoft hints Windows 11 Copilot with third-party AI plugins is almost here.

In an analyst note on Tuesday, the financial services arm of Swiss banking giant UBS raised its guidance for long-term AI end-demand forecast from 20% compound annual growth rate (CAGR) from 2020 to 2025 to 61% CAGR between 2022 to 2027.

The next generation of the successful OpenAI language model is already on the way. It has been discovered that the North American company has filed a registration application for the GPT-5 mark with the United States Patent and Trademark Office.

Dell and Nvidia join hands for Gen AI solutions
– The Dell Generative AI solutions portfolio builds on the initial Project Helix announcement made in May, which involved a close collaboration with Nvidia. The portfolio includes new validated designs to help enterprises deploy AI workloads on-premises. This partnership aims to assist customers in navigating the generative AI landscape and provide them with the necessary tools to successfully implement AI solutions in their businesses.

Google will update Assistant with similar tech like ChatGPT
– Google is planning to update its Assistant with features powered by generative AI, similar to ChatGPT and Bard. The company has already started exploring a “supercharged” Assistant powered by large language models. The team has begun working on this update, starting with mobile.

ChatGPT Android app is now available in all countries and regions where it is supported.

Incredible response to Meta’s Llama 2, 150K+ downloads in just a week!
– In just one week, they received over 150,000 download requests, showcasing the excitement and interest from the community. They are eagerly looking forward to seeing how developers and users utilize these models in their projects and applications.

Google DeepMind introduces AI model to control robots
– It has introduced Robotic Transformer 2 (RT-2), a first-of-its-kind vision-language-action (VLA) model that learns from both web and robotics data. It then translates this knowledge into generalized instructions for robotic control. This helps robots more easily understand and perform actions– in both familiar and new situations.
– The approach results in very performant robotic policies and, more importantly, leads to a significantly better generalization performance and emergent capabilities due to web-scale vision-language pretraining.

ChatGPT to Bard; researchers find a way to turn AI chatbots evil
– New research has introduced an approach that automatically produces adversarial suffixes to prompt language models, which results in affirmative responses for objectionable queries.
– Unlike traditional jailbreaks, the approach is built in an entirely automated fashion, allowing one to create virtually unlimited number of such attacks. Although built to target open-source LLMs, the strings easily transfer to many closed-source, publicly-available chatbots too, like ChatGPT, Bard, and Claude.

Together AI extends Llama-2 to 32k context
– It has released LLaMA-2-7B-32K, a 32K context model built using Meta’s Position Interpolation and Together AI’s data recipe and system optimizations, including FlashAttention-2. You can fine-tune the model for targeted, long-context tasks– such as multi-document understanding, summarization, and QA.

Forget subtitles; YouTube now dubs videos with AI-generated voices
– It is using Aloud, a free tool that automatically dubs videos using synthetic voices.

Capgemini will invest 2Bn euro in AI and double AI teams
– The Paris-based IT firm will invest 2 billion euro in AI and plans to double its data and AI teams in the next three years.

Intel plans to build AI into its every product
– Intel CEO Pat Gelsinger was very bullish on AI during the company’s Q2 2023 earnings call, telling investors that Intel plans to “build AI into every product that we build.”

GPT-4 passes first Harvard semester in humanities and social sciences
– In an experiment, a Harvard student had GPT-4 write seven essays on topics such as economic concepts, presidentialism in Latin America, and a literary analysis of a passage from Proust. GPT-4 earned a respectable 3.57 GPA.

AI Knowledge Nugget: Large Language Models and Nearest Neighbors

This thoughtful article by Sebastian Raschka, PhD explores using nearest-neighbor methods in the context of large language models. He highlights the beauty of simple techniques like nearest neighbor algorithms and discusses their potential for making significant contributions based on foundational or classic approaches. Nearest neighbor algorithms, though not as popular as before, are still widely used in practice, and the k-Nearest Neighbor algorithm is recommended as a benchmark for predictive performance in classification projects.

(A k-nearest neighbor classifier with k=5.)

The article also provides additional resources on improving computational performance for nearest-neighbor methods.

Why does this matter?

This article showcases a simple yet effective method. It demonstrates that foundational techniques can still be competitive in low-resource scenarios and highlights the potential of alternative approaches.

Source

Unraveling August 2023: Latest Sport News on August 01st 2023

Bayern Munich are prepared to break their club-record 80m euro (£68m) fee to sign 30-year-old England striker Harry Kane;

Tuesday’s gossip: Kane, Mbappe, Johnson, Lukaku, Vlahovic, Kolo Muani, Colwill, Verratti, Osimhen, Virgil van Dijk named new Liverpool captain, Trent Alexander-Arnold vice-captain.

Chelsea are now back in talks again with Juventus. Swap deal between Romelu Lukaku & Dušan Vlahović has been discussed again.

Bayern Munich are prepared to break their club-record 80m euro (£68m) fee to sign 30-year-old England striker Harry Kane from Tottenham. (Sky Sports)

Tottenham and Bayern held talks in London on Monday and are about £25m apart in their valuation of Kane. (Athletic – subscription)

Tottenham could use the money raised by Kane’s sale to bring in Barcelona’s Ivory Coast midfielder Franck Kessie, 26, and 28-year-old France defender Clement Lenglet. (Mundo Deportivo – in Spanish)

Tottenham are eyeing Nottingham Forest’s £50m-rated Wales forward Brennan Johnson, 22, if Kane is sold. (Mail)

Chelsea co-owner Todd Boehly faces competition from Barcelona in offering a player-plus-cash deal to Paris St-Germain for 24-year-old France forward Kylian Mbappe. (Independent)

Chelsea are exploring a potential swap deal involving Belgium striker Romelu Lukaku, 30, and Juventus’ 23-year-old Serbia forward Dusan Vlahovic. (Fabrizio Romano)

PSG have rekindled their interest in Eintracht Frankfurt’s 24-year-old France forward Randal Kolo Muani. (L’Equipe – in French)

Chelsea’s 20-year-old English defender Levi Colwell has agreed to sign a new six-year contract. (Guardian)

Man United are expected to announce decision regarding Mason Greenwood’s future opening PL game of the season on August 14.

Lauren James produced a sensational individual performance as England entertained to sweep aside China and book their place in the last 16 of the Women’s World Cup as group winners. Source: BBC

27 years ago today Alan Shearer signed for Newcastle for a world record £15 million. Measuring football inflation as changes in PL total income, that figure today would be £235 million

Unraveling August 2023: Latest Tech News on August 01st 2023

Scientists Create New Material Five Times Lighter and Four Times Stronger Than Steel.

Researchers from the University of Connecticut and colleagues have created a highly durable, lightweight material by structuring DNA and then coating it in glass. The resulting product, characterized by its nanolattice structure, exhibits a unique combination of strength and low density, making it potentially useful in applications like vehicle manufacturing and body armor. (Artist’s concept.)

First U.S. nuclear reactor built from scratch in decades enters commercial operation in Georgia

Unit 3’s reactor and cooling tower stand at Georgia Power Co.'s Plant Vogtle nuclear power plant in Waynesboro, Ga. — Unraveling August 2023: First U.S. nuclear reactor built from scratch in decades enters commercial operation in Georgia

Nuclear power now makes up about 25% of the generation of Georgia Power, the largest unit of Atlanta-based Southern Co.

https://www.nbcnews.com/science/science-news/first-us-nuclear-reactor-built-scratch-decades-enters-commercial-opera-rcna97258

A.I. is on a collision course with white-collar, high-paid jobs — and with unknown impact

About 1 in 5 American workers have a job with “high exposure” to artificial intelligence, according to Pew Research Center. It’s unclear if AI would enhance or displace these jobs.
Workers with the most exposure to AI like ChatGPT tend to be women, white or Asian, higher earners and have a college degree, Pew found.
Technology has led some to “lose out” in the past, largely when their job is substituted by automation, one expert said.

https://www.cnbc.com/2023/07/31/ai-could-affect-many-white-collar-high-paid-jobs.html

Amazon rolls out its virtual health clinic nationwide:

Amazon is expanding its virtual clinic service nationwide.
The company launched Amazon Clinic last November as a way for patients to connect with telemedicine providers to help receive treatment for common conditions such as acne and hair loss.
Amazon has been trying to break into the health-care industry for years with mixed success.
Amazon rolls out its virtual health clinic nationwide (cnbc.com)

YouTube plans to compensate for AI music

YouTube will pay artists and rights holders for AI-generated music used on the platform. This aims to balance creative innovation and fair compensation.

MidJourney introduced a new AI feature, ‘Vary’. (Link)

Fintech giant Paytm invests in AI to develop an Artificial General Intelligence software stack. (Link)

India using AI to bring voice-activated mobile payments, RBI’s new plan. (Link)

Developers exploring AI to create Text-to-Music Apps. (Link)

Chinese firm launches WonderJourney satellite with AI-powered ‘brain.’ (Link)

Unraveling August 2023: August 21st, 2023

Latest AI News and Trends on August 21st, 2023

OpenCopilot- AI sidekick for everyone

OpenCopilot allows you to have your own product’s AI copilot. With a few simple steps, it takes less than 5 minutes to build.

Why does this matter?

Source

Google teaches LLMs to personalize

Why does this matter?

Source

Local Llama

Why does this matter?

Source

AI creates lifelike 3D experiences from your phone video

Flythroughs is built on Luma’s breakthrough NeRF and 3D generative AI and a brand new path generation model that automatically creates smooth dramatic camera moves.

Why does this matter?

Source

Genetic Algorithm Optimized Neural Network Model for Malicious URL Detection

URL Genie is a web application implementing a Multilayer Perceptron Neural Network optimized using genetic algorithms. Detect whether a domain name or URL is malicious by inputting a URL.

Check it out!
https://github.com/ANG13T/url_genie

AI models for stock analysis and valuation?

– Boosted.ai – AI stock screening, portfolio management, risk management

– JENOVA – AI stock valuation model that uses fundamental analysis to calculate intrinsic value

– Danielfin – Rates stocks and ETFs with an easy-to-understand global AI Score

– Comparables.ai – AI designed to find comparables for market analysis quickly and intelligently

Daily AI Update News from OpenCopilot, Google, Luma AI, AI2, and more

Top 7 Best AI Design Software(Bonus)

As you journey through life, your passions take an intriguing turn, guiding you towards the realm of digital marketing.

By leveraging the power of artificial intelligence, these tools open up new horizons, enabling you to streamline and automate your design workflow like never before.

It’s time to manifest your artistic dreams into reality – let the voyage commence!

1. Adobe Photoshop CC

With its user-friendly interface and intuitive controls, even those new to the world of digital design can quickly find themselves delving into the realm of endless possibilities.

The result is a visual masterpiece that captures the essence of the creator’s vision with unparalleled fidelity.

Whether you’re crafting a logo, designing a website, or creating intricate vector graphics, the integration of these tools allows you to transcend the boundaries of imagination.

2. Planner 5D

Whether you seek an open-concept layout or a series of interconnected rooms, this software provides the tools to bring your architectural visions to life.

From stylish sofas and elegant dining tables to enchanting wall art and lighting fixtures, the possibilities are limitless.

By collaborating with skilled designers, you can rest assured that your dream home or office will become a reality, tailored to your unique taste and requirements.

3. Uizard

The speed and efficiency afforded by this cutting-edge technology empower you to focus on refining your concepts and iterating through ideas at an unprecedented pace.

From tech-savvy founders to aspiring entrepreneurs, Uizard ensures that the creative journey remains accessible and enjoyable for everyone.

Whether you’re a solo founder or part of a dynamic team, Uizard fosters seamless collaboration, enabling you to share and iterate on designs effortlessly.

This not only accelerates the decision-making process but also ensures that your final product resonates with your target audience.

4. Autodesk Maya

The synergy between Maya and its counterparts unlocks new avenues for innovation, granting you the freedom to experiment, iterate, and refine your creations with ease.

5. Autodesk 3Ds Max

Aimed at architects, engineers, and a myriad of other professionals, this cutting-edge tool serves as a gateway to bring your imaginative designs to life with astonishing realism.

The software becomes a haven for creative professionals seeking to visualize and communicate their concepts with exceptional clarity and visual fidelity.

Whatever your design aspirations may be, this software stands as a reliable companion, ensuring that your visions manifest into awe-inspiring digital realities.

Its potential to enhance the visual storytelling capabilities of any venture unlocks a new era of creativity and communication.

Each new iteration becomes a steppingstone towards perfection, ensuring that your final creation exudes excellence.

6. Foyr Neo

With Foyr Neo at your disposal, you can witness the transformation of your design ideas into reality in as little as a fifth of the time it takes with other software tools.

With its intuitive controls and seamless functionality, the software becomes an extension of your creative vision, ensuring that your ideas manifest into remarkable designs with utmost ease.

Additionally, the abundance of training materials and support ensures that you can unlock the full potential of the software, mastering its capabilities and expanding your design horizons.

This seamless experience fosters uninterrupted creativity, enabling you to focus on the art of design without the burden of managing disparate software tools.

7. Let’s Enhance

With this cutting-edge software, you can witness a remarkable increase in image resolution of up to 16 times, all without sacrificing an ounce of quality.

Embrace the art of delegation and allow Let’s Enhance to handle the technicalities while you concentrate on perfecting your artistic vision.

Envision your social media posts grabbing attention with larger-than-life visuals, thanks to Let’s Enhance’s seamless enlargement capabilities.

Whether it’s subtle adjustments or dramatic transformations, the software empowers you to create visuals that captivate audiences and leave a lasting impression.

6 AI Text to Video compared (updated August 2023 ) Link

Runway
Features

Pros

– Best of AI text-to-video research
– Comprehensive set of tools for video editing
– Available as both a desktop and mobile app

Cons

– Gen-2 has limitations in generating intricate details, like fingers
– Gen-2 video generation is limited to 4 seconds per video
– The tool does not offer text-to-speech capabilities

Synthesia AI
Features

– 120+ voices and accents
– 140+ diverse AI avatars
– 60+ video templates designed by professional designers
– The option to have a custom avatar created

Integrating ChatGPT to Automate WhatsApp Responses

The authors have uploaded all the codes on their GitHub Repository.

Real Time Multiplayer AI Trivia | Trivia Universe AI

Hey all ! I’ve been having a ton of fun getting this project launched and would greatly appreciate any feedback and/or requests !

https://TriviaUniverseAI.com

The site uses openAI to generate trivia on anything and everything you want ! You can then revisit trivia you or others have made and replay them at anytime.

Solo & real time multiplayer, daily challenge, infinite playability and is getting updates daily !

Current feature roadmap :

jeopardy mode ( multiple topics and large question count )
email / sms notifications for new daily challenges etc.
public lobbies / multiplayer against random players

Unraveling August 2023: August 20th, 2023

Latest AI News and Trends on August 20th, 2023

40% of workers need reskilling due to AI LINK

IBM’s study indicates that 40% of the global workforce, or 1.4 billion people, will need to reskill in the next three years due to AI’s rise.
While AI technologies, such as generative models, might shift job responsibilities, 87% of surveyed executives believe AI will augment jobs rather than replace them.
The focus in job skills has shifted from technical STEM skills (most important in 2016) to people skills like team management and adaptability (most important in 2023).

Meta did it first… Generative AI for producers

Imagine conceiving, designing, and animating with AI—a reality that even big players like Disney face. This emerging world is thrilling and transformative.

To prepare, embrace AI. Integrate it into your work wherever possible. If you want to stay ahead and not fall behind to AI, leverage its capabilities.

Trustworthy LLMs: A survey and guideline for evaluating LLMs’ alignment

It also thoroughly surveys the categories of LLMs that are likely to be crucial to improve their trustworthiness and shows how to build evaluation datasets for alignment accordingly.

Source

NVIDIA’s tool to curate trillion-token datasets for pretraining LLMs

The tool curates high-quality data that leads to improved LLM downstream performance and will significantly benefit LLM developers attempting to build pretraining datasets.

Source

Despite AI, Bing’s market share remains static LINK

Microsoft’s Bing AI Chat has failed to significantly boost Bing’s search engine market share despite its new capabilities.
Statistics from different sources show minimal changes in Bing’s global search engine share and web traffic.
Microsoft claims success based on its internal data, reporting over 100 million daily active users and strong growth since the launch of Bing Chat.

Microsoft-DataBricks collab may hurt OpenAI

Microsoft has been aggressively investing in AI services and integrating AI functionality across its products. Neither Microsoft nor Databricks have commented on the report.

Source

What else happened in AI this week of August 12-20?

- Google appears to be readying new AI-powered tools for ChromeOS
- Zoom rewrites policies to make clear user videos aren’t used to train AI
- Anthropic raises $100M in funding from Korean telco giant SK Telecom
- Modular, AI startup challenging Nvidia, discusses funding at $600M valuation
- California turns to AI to spot wildfires, feeding on video from 1,000+ cameras
- FEC to regulate AI deepfakes in political ads ahead of 2024 election
- Google’s AI search offers AI-generated summaries, definitions, and coding improvements.
- Google Photos introduce a new AI feature, ‘Memories view’!
- Amazon using AI to enhance product reviews.
- WhatsApp test beta upgrade with new feature ‘custom AI-generated stickers’.
- Google is testing an AI assistant that will give you Life Advice.
- Robomart adopts “store-hailing” for self-driving stores delivered to customers.
- OpenAI acquires AI design studio Global Illumination to work on core products, ChatGPT
- The Associated Press releases guidelines for Generative AI to its journalists
- Consulting giant McKinsey unveils its own generative AI tool for employees: Lilli
- Opera for iOS will now include Aria, its browser AI built in collaboration with OpenAI
- UK is using AI road safety cameras to detect potential driver offenses in passing vehicles
- Adobe Express with AI Firefly app, now out of beta, is available worldwide
- Ex-Google Brain researchers have started an AI research company called Sakana AI in Tokyo.
- Runway, a gen AI video startup, has launched a new ‘Watch’ feature.
- Research shows AI bots beat CAPTCHA and humans.
- ML startup Arthur launched an open-source tool to help find the best LLM.
- Buildbox has launched a new tool called StoryGames.AI!

Latest Tech News and Trends on August 20th, 2023

Major concerns after Cruise robotaxi incidents LINK

Following a recent collision between a Cruise robotaxi and a fire truck in San Francisco, the California DMV requested Cruise to halve its robotaxi fleet in the city.
The state agency is investigating “recent concerning incidents” with Cruise vehicles, emphasizing the need to ensure the safety of the public sharing the road with these autonomous vehicles.
This specific accident saw a Cruise Chevy Bolt EV hit by an emergency vehicle at an intersection, resulting in passenger injuries; it adds to a series of issues potentially affecting Cruise’s future operations.

As wildfires spread, Canadian leaders ask Meta to reverse its news ban LINK

The Canadian government demands that Meta lift its ban on domestic news sharing, citing its impact on sharing information about wildfires.
Meta blocked news on Facebook and Instagram due to a new law requiring payment for news articles, but this move hampers access to crucial information.
Officials and citizens express concerns, urging Meta to reinstate news sharing for safety and emergency information during the wildfire crisis.

X to remove ‘block’ feature LINK

Elon Musk suggests that Twitter’s block feature, except for direct messages, may be removed, causing concern among users.
Blocking is currently used to restrict interactions and visibility of accounts, while mute only hides posts; users value blocking for spam control and harassment prevention.
Musk’s statement prompts backlash and uncertainty about whether the feature will actually be removed.

Unraveling August 2023: August 18th, 2023

Latest AI News and Trends on August 18th, 2023

What is OpenAI code interpreter, and how does it work?

Discover the OpenAI code interpreter, an AI tool that translates human language into code. Learn about its functions, benefits and drawbacks in this guide.

The basics of OpenAI code interpreter

OpenAI, a leading entity in the field of artificial intelligence, has developed OpenAI code interpreter, a specialized model trained on extensive data sets to process and generate programming code.

New Generations of People Are Becoming More and More Indistinguishable from AI

How Neolithics is using AI machine learning to reduce global food waste

The new AI programming jobs that require only very basic programming skills

There has never been a more exciting and promising time to get into AI development. Forbes reports that job listings for ChatGPT-related positions increased 21 times since last November:

https://www.forbes.com/sites/emmylucas/2023/08/17/linkedin-says-chatgpt-related-job-postings-have-ballooned-21-fold-since-november/

If incumbent programming schools are not changing fast enough they risk losing a substantial market share to startups that begin teaching much more marketable skills.

Yeah there could never be a better time to get into computer programming!

The importance of making superintelligent small LLMs

Google’s Gemini will set a new standard in AI largely because of the massive data set that it is trained on.

If you’re not familiar with Gemini yet, watch this amazingly intelligent 8-minute YouTube video:

The next step would be for Google to train that stronger intelligence to shift from relying on data to relying on principles for its logic and reasoning.

Once AI’s intelligence is based on principles, subsequent iterations will no longer require massive data for their training.

That achievement will level the playing field so that Gemini is much sooner joined by competitive or stronger models.

Once that happens, everything will get very intelligent.

As Hollywood strikes, 96% of entertainment companies are boosting generative AI spend

If you want to stay ahead of the curve in AI and tech, look here first.

The rise in AI spending amidst the Hollywood strike

The Hollywood writer’s strike underscores a shift in the entertainment industry’s investment strategy.
Lucidworks’ research, one of the largest of its kind, shows 96% of executives prioritize generative AI investments.
Countries like China, the UK, France, India, and the U.S. have companies heavily investing in this technology.

AI’s potential impact on Hollywood content creation

Generative AI can produce content, virtual environments, and images, posing a potential disruption to traditional methods.
Predictions suggest that by 2025, up to 90% of Hollywood content could be influenced by AI.
There’s a growing concern among Hollywood writers about the rapid integration of AI and its effect on their careers.

The future of the entertainment industry with generative AI

The emergence of synthetic actors could revolutionize the way movies and shows are produced.
AI-driven actors don’t strike, age, or demand pay raises, presenting potential benefits for studios but challenges for human actors.

Source (Forbes)

Microsoft-DataBricks collab may hurt OpenAI

Microsoft has been aggressively investing in AI services and integrating AI functionality across its products. Neither Microsoft nor Databricks have commented on the report.

Why does this matter?

Source

Meta AI’s new RoboAgent with 12 skills

Why does this matter?

Source

Meta challenges OpenAI with code-gen free software

Why does this matter?

AP sets new AI guidelines for newsrooms LINK

The Associated Press has established standards for the use of generative AI in its newsroom, emphasizing that AI is not a replacement for human journalists and cautioning against creating publishable content with AI-generated text or images.
AP journalists are directed to treat AI-generated content as “unvetted source material” and apply editorial judgment and sourcing standards before considering it for publication.
The organization warns about the potential for AI to spread misinformation and advises its journalists to exercise caution, skepticism, and verify sources when dealing with AI-generated content.

Latest Tech News and Trends on August 18th, 2023

Scientists are leaving X LINK

A significant portion of scientific researchers using X have reduced their usage or left the platform altogether, with over 47% decreasing usage and nearly 7% quitting, according to a survey by Nature.
About 47% of polled researchers have turned to alternative platforms, with Mastodon being the most popular, followed by LinkedIn and Instagram.
The change in researcher behavior on X is attributed to the platform’s evolving dynamics, increased content prioritization, and limited accessibility of its API for researchers.

Amazon imposes fees on self-shipping sellers LINK

Starting from October 1st, third-party merchants on Amazon who ship their own packages will be required to pay a 2% fee per product sold.
This new fee is in addition to other charges Amazon already receives from merchants, including selling plan costs and referral fees based on product categories.
The fee comes as Amazon’s marketplace is under scrutiny, with the FTC planning to file an antitrust lawsuit over allegations that Amazon rewards third-party merchants using its logistics services while penalizing those fulfilling their own orders.

NYC bans TikTok from government devices LINK

New York City is banning TikTok from government devices within 30 days, with immediate prohibition on downloading and usage by employees.
The NYC Cyber Command cited TikTok as a security threat to the city’s technical networks, prompting the decision.
While some states have broadly banned TikTok, most have restricted its use on government-owned tech, amid ongoing debates about the app’s security risks.

Unraveling August 2023: August 17th, 2023

Latest AI News and Trends on August 17th, 2023

You can now write one sentence to train an entire ML model.

How does it work?

You just describe the ML model you want…a chain of AI systems will take that sentence…it generates a dataset based on that sentence…and it trains a model for you…in ten minutes 😳

What does that mean?

Custom models in AI just got a whole lot easier. You can go from an idea (“a model that writes Python functions”) to a fully trained custom Llama-2 model in minutes 😮

Why should I care?

What should I do about it?

It’s open source, made by Matt Shumer, with an easy Google Colab — check it out here: GitHub – mshumer/gpt-llm-trainer

GPT-4 Code Interpreter masters math with self-verification

The approach achieves an impressive accuracy of 84.32% on the MATH dataset, significantly outperforming the base GPT4-Code and previous state-of-the-art methods.

Why does this matter?

Source

Can machine learning algorithms identify patients at risk of a delay in starting cancer treatment?

Multi-level machine learning models for estimating the risk of delay between cancer diagnosis and treatment initiation in a large group of cancer patients.

Study significance

Future studies should include a higher proportion of vulnerable populations and more relevant social variables to improve the model performance.

Journal reference:

Frosch Z. A. K., Hasler, J., Handorf, E., et al. (2023). Development of a Multilevel Model to Identify Patients at Risk for Delay in Starting Cancer Treatment. JAMA Network Open. doi:10.1001/jamanetworkopen.2023.28712, https://jamanetwork.com/journals/jamanetworkopen/fullarticle/2808249

10 top AI jobs in 2023

Discover the top AI jobs shaping the future. Navigate the booming AI industry with insights on rewarding careers.

1. AI product manager

2. AI research scientist

3. Big data engineer

Check out these top data architect and data engineer certifications.

4. Business intelligence developer

5. Computer vision engineer

6. Data scientist

7. Machine learning engineer

8. Natural language processing engineer

A natural language processing (NLP) engineer is a computer scientist who specializes in the development of algorithms and systems that understand and process natural human language input.

9. Robotics engineer

Robotics engineers typically have degrees in engineering, such as electrical, electronic or mechanical engineering.

10. Software engineer

What is a liquid neural network, really?

We spoke with MIT CSAIL head Daniela Rus about the emerging technology of liquid networks and implications for robotics.

AI reconstructs song from brain activity

Why does this matter?

Source

Saudi Arabia and UAE join the race for scarce Nvidia chips

Why does this matter?

Source

Snapchat’s AI chatbot creates unexpected chaos LINK

Snapchat users reported an unexpected video posted on the My AI chatbot’s Story, which some interpreted as showing a corner between a ceiling and a wall.
The unexpected post led to concerns and fears among users, with some believing the AI feature had become sentient or evolved, prompting some to delete the app.
Snapchat described the event as a “temporary outage”, which has since been resolved, and the AI chat feature temporarily stopped responding during this period.

Exploring the Power of Mojo Programming Language

https://www.seaflux.tech/blogs/mojo-ai-programming-language

Top AI Image-to-Video Generators 2023

Genmo

D-ID

LeiaPix Converter

InstaVerse

Sketch

NeROIC

DPT Depth

RODIN

Google Gemini: Facts and rumors

What does Gemini stand for ?

That part at least seems pretty clear beyond a shadow of a doubt:
Generative Enhanced Multimodal Intelligent Network Interface.

The word “Gemini” comes from Latin and means “twins” in German.
Some possible meanings in the context of Google’s AI system:

Gemini combines two components: Text and image processing. It is, in a sense, a “twin system.”

Gemini could refer to the „twins“ Sergey Brin and Larry Page, the founders of Google.

Astrology assigns communication strength and flexibility to the zodiac sign Gemini. Gemini as an AI assistant aims to adapt linguistically and situationally.

The name suggests a dual strength or ability. Gemini aims to unite Google’s text and image AI to outperform the competition.

Why is Google superior?

To do that, you have to understand WHAT treasure trove of data Google is actually sitting on. Here are a few facts:

Google, through its various services such as Google Search, YouTube and others, has an enormous amount of data that is very useful for developing AI systems.

On YouTube alone, over 500 hours of video material are uploaded every day, according to Statista. The total video database is over 30 million hours of video. The subtitles and transcripts of these videos give Google a gigantic text dataset for training language models.

According to a report by ARK Invest, Google owns over 130 exabytes of data. For comparison, 1 exabyte is equal to 1 billion gigabytes. This means that the entire data set comprises more than 130,000,000,000,000,000 bytes of information.

Google Search accounts for a large part of this data. Google says it processes over 40,000 search queries per second. That’s over 3.5 trillion search queries per year. From these queries and the clicked results, Google gains further insights.

Google – The Research Giant

Topics included:

Computer Vision (image recognition)
Natural Language Processing (NLP)
Speech Recognition
reinforcement learning
Robotics
Multimodal AI
Recommender Systems
Applications in medicine

AlphaGo: Go game AI that defeated world champion Lee Sedol in 2016.
BERT (Bidirectional Encoder Representations from Transformers): breakthrough language model for NLP from 2018.
PaLM (Pathways Language Model): enormous language model with 540 billion parameters from 2022
PaLM-SayCan: variant of PaLM that can carry on human-like conversations
Imagen: image generation AI for realistic and creative images
MusicLM: AI for music composition and production
RLHF (Reinforcement Learning with Human Feedback): Reinforcement learning with human feedback
Model Based RL: reinforcement learning with explicit models of the environment
RobustFit: Robust neural network against data noise
T5: Text-to-text transfer transducer for various NLP tasks
ViT (Vision Transformer): Image recognition with Transformer architecture
WAYMO: Autonomous driving and robot cab service
ProteinFold: Protein structure prediction with Deep Learning
FLOOD: AI for flood prediction and prevention
SLIDE: pixel-level image segmentation
Switch Transformers: efficient architecture for very large transformers
MuZero: reinforcement learning without environmental model in games
Meena: conversational AI from 2020
DALL-E & DALL-E 2: text-to-image generation.

Project: Google Gemini

Facts Google Gemini

There are already some facts from the Google Blog:

Gemini is supposed to be released this fall
Gemini combines text and image generation
Can create contextual images based on text generation
Has been trained with YouTube transcripts
Google lawyers are monitoring the training to avoid copyright issues
Gemini is said to have multiple modalities, e.g., text, image, audio, video
Sergey Brin is involved in development

Rumors

From Reddit and countless other sources on the web, there could be other features as well:

Gemini is said to be capable of AI image understanding and modification
Is said to combine text capabilities like GPT-4 with image generation
Has been developed from the ground up as a multimodal model
Could handle audio, video, 3D renderings, graphics, etc.
Shall learn with user interactions and thus become effective AGI
Architecture could enable lifelong learning
There are concerns about privacy and information leaks between users

Google Gemini and the (then new) AI market:

The AI market situation is likely to change significantly with the introduction of Google Gemini:

For OpenAI:

Strong new competitor for ChatGPT and DALL-E.
Google has significantly more resources and data
OpenAI could lose market share and come under pressure

For Anthropic:

Claude must stand up to Google Assistant with Gemini
Advantage due to focus on security and control
Risk of falling behind

For Microsoft:

Partnership with OpenAI important to compete with Google
Microsoft must further develop Azure AI services
Advantage due to strong cloud infrastructure

For others:

Startups could have a very hard time against Google
Consolidation in the market possible
Significantly higher innovation speed

Google Gemini Conclusion

#ai #ki #google #gemini #text #image #multimodal

Artificial intelligence steps in to assist dementia patients with ‘AI Powered Smart Socks’

People suffering from dementia could live more independently thanks to a pair of AI-powered socks that can track everything from a patient’s heart rate to movement.

Steer’s great-grandmother suffered from dementia, which also helped spark the creation of the socks.

WHAT IS CHATGPT?

“Anything that’s simple and easy to do, and is improving our look at life as a whole, I’m happy with,” dementia patient John Piper, 83, told the BBC.

The socks do not need to be recharged, according to Milbotix’s website, and can be machine washed.

Daniels said wristbands or watches often don’t hold long charges and are taken off by patients and then lost.

“SmartSocks offer a new and promising alternative, which could avoid many of these issues,” Daniels said.

The University of Exeter is investigating how beneficial the socks are for dementia patients.

U.K.-based start-up SmartSocks has developed hosiery that can monitor a dementia patient’s heart rate, motion and sweat levels with AI and alert caregivers to potential problems.

AI TOOL GIVES DOCTORS PERSONALIZED ALZHEIMER’S TREATMENT PLANS FOR DEMENTIA PATIENTS

What Else Is Happening in AI on August 17th, 2023

Adobe Express with AI Firefly app is available worldwide
– The web app is now out of beta and can be used free of charge in web browsers.

The Associated Press releases guidelines for Generative AI to its journalists (Link)

UK is using AI road safety cameras to detect potential driver offenses in passing vehicles (Link)

The founder of Centricity, a data analytics firm using AI, is indicted for defrauding investors by manipulating financial data. LINK

Leaders with a Montana digital academy say bringing artificial intelligence to high schools is an opportunity to embrace the future.

Google said to be testing new life coach AI for providing helpful advice to people.

Latest Tech News and Trends on August 17th, 2023

How to add captions on your phone for any app you use

usatoday.com

The end of SIM cards: A new eSIM guide for Android users 2023

Are physical SIMs about to be a thing of the past? Jump into the latest, and discover eSIMs #sponsored

Latest Sport Football Soccer News and Trends on August 17th, 2023

Atletico Madrid takes the alleged ‘threat’ of João Félix terminating his contract as a joke. If he wants to do so, he will have to pay his €350m release clause. Link

BREAKING: Theo Walcott is set to retire. Walcott, 34, who left Southampton at the end of last season, scored more than 100 goals for Arsenal and won 47 England caps. Link

‘Minecraft’ To ‘FIFA Soccer’: Best iOS Games To Play On The Upcoming iPhone 15

From epic open-world adventures to mind-bending puzzles, these are the best iOS games to play on the upcoming iPhone 15.

Unraveling August 2023: August 16th, 2023

Latest AI News and Trends on August 16th, 2023

GPT-4 to replace content moderators

Why does this matter?

Source

Meta beats ChatGPT in language model generation

Shepherd outperforms other models in human evaluation and is on par with ChatGPT. Shepherd offers a practical and valuable tool for enhancing language model generation.

Why does this matter?

Source

Microsoft launches private ChatGPT

Microsoft Azure ChatGPT offers several benefits to organizations:

Ensures data privacy with built-in guarantees and isolation from OpenAI-operated systems.
Allows full network isolation and offers enterprise-grade security controls.
Enhances business value by integrating internal data sources and services like ServiceNow.

Why does this matter?

Source

Google enhances search with AI-driven summaries LINK

Google is experimenting with a generative AI feature in Search that generates key points from long-form web content.
The summarization tool will display “key points” from articles but will not work on content marked as paywalled by publishers.
Initially launching as an “early experiment” in Google’s opt-in Search Labs program, it will first be available on the Google app for Android and iOS

Nvidia’s stocks surge LINK

Nvidia’s stock rises 7% as investors see its GPUs remaining dominant in powering large language models.
Morgan Stanley reiterates Nvidia as a “Top Pick” due to strong earnings, AI spending shift, and ongoing supply-demand imbalance.
Despite recent fluctuations, Nvidia’s stock has tripled in 2023, and analysts anticipate long-term benefits from AI and favorable market conditions.

Daily AI News on August 16th, 2023

OpenAI’s GPT-4 for more reliable and higher quality content moderation
– OpenAI aims to use its GPT-4 to solve the challenge of content moderation at scale. GPT-4 could replace human moderators, offering similar accuracy and more consistency. OpenAI has already used GPT-4 to develop and refine its own content policies.
– It provides three major benefits: consistent judgments, faster policy development, and improved worker well-being. While AI has been used for content moderation before, OpenAI’s approach could be appealing to smaller companies lacking resources.
Microsoft launches ChatGPT for enterprises with Azure
– Microsoft is now offering OpenAI’s ChatGPT model in its Azure OpenAI service, allowing developers and businesses to integrate conversational AI into their applications. ChatGPT can be used to power custom chatbots, automate emails, and provide summaries of conversations.
– Azure OpenAI users can access a preview of ChatGPT starting today, with pricing set at $0.002 for 1,000 tokens and it promises more control and privacy compared to the public model.
Google is progressing with new AI updates!
– Search experience adds AI-powered summaries, definitions, and coding improvements. In addition it will include related diagrams or images for various topics, color-coded syntax highlighting for code snippets, making it easier for programmers to understand and debug generated code.
– Google Photos adds a scrapbook-like Memories view feature aided by AI which allows users to relive and share their most memorable moments. The feature creates a scrapbook-like timeline that includes trips, celebrations, and daily moments with loved ones. The new Memories view is launching today for U.S. users and is similar to a combination of Stories and Facebook Memories.
Amazon using AI to enhance product reviews
– Amazon is tapping into generative AI to create handy highlights that collects key points from customer reviews which will help shoppers quickly gauge product review.
– The feature is part of ongoing efforts to improve utility of 125M+ reviews from shoppers. It uses only trusted reviews from verified purchases, and Amazon.
WhatsApp test beta upgrade with new feature ‘custom AI-generated stickers’
– The feature is currently available to a limited number of beta testers, includes a “Create” button under the stickers tab, which opens a keyboard for users to type prompts for the AI model to generate custom stickers. The feature is a server-side change and is currently only available in version 2.23.17.8 of the beta version.

Apple’s AI advancements in the last few months

Don’t sleep on Apple’s AI plans. Here’s how they’ve been slowly ramping up their AI efforts in the last few months.

Apple’s AI-powered health coach might soon be at your wrists
Apple is reportedly developing an AI-powered health coaching service called Quartz, aimed at helping users improve their exercise, eating habits, and sleep quality. The service will use AI and data from the user’s Apple Watch to create personalized coaching programs, with plans to introduce a monthly fee. The company is also working on emotion-tracking tools and plans to launch an iPad version of the iPhone Health app this year.
Apple enters the AI race with new features
Apple announced a host of updates at the WWDC 2023. Yet, the word “AI” was not used even once, despite today’s pervasive AI hype-filled atmosphere. The phrase “machine learning” was used a couple of times. (And AI is nothing but machine learning).
However, here are a few announcements Apple made that use AI as the underlying technology.
- Apple Vision Pro, a revolutionary spatial computer that seamlessly blends digital content with the physical world. It uses advanced ML techniques.
- Upgraded Autocorrect in iOS 17 that is powered by a transformer language model for improved prediction capabilities.
- Improved Dictation in iOS 17 that leverages a new speech recognition model to make it even more accurate.
- Live Voicemail that turns voicemail audio into text on the fly, which is powered by a neural engine.
- Personalized Volume, which uses ML to understand environmental conditions and listening preferences over time to automatically fine-tune the media experience.
- Journal, a new app for users to reflect and practice gratitude, uses on-device ML for personalized suggestions to inspire entries.
Apple Trials a ChatGPT-like AI Chatbot
Apple is developing AI tools, including its own large language model called “Ajax” and an AI chatbot named “Apple GPT.” They are gearing up for a major AI announcement next year as it tries to catch up with competitors like OpenAI and Google.
Apple bets big on AI
Apple’s CEO Tim Cook has stated that AI and ML are embedded in every company product. This comes after concerns were raised about Apple’s lack of discussion on its AI plans while competitors have been actively incorporating the technology into their products. He also emphasized that AI is central to the design of Apple’s products, contradicting suggestions that the company has not yet integrated the technology.
Apple gearing up for an AI showdown
Apple has reportedly ordered servers from Foxconn Industrial Internet, a division of its supplier Foxconn, for the testing and training of AI services. The servers are specifically for Apple’s AI work, which has been a focus for the company for years. While Apple does not currently have a ChatGPT-like app for external use, it is believed that this division of Foxconn already supplies servers to ChatGPT OpenAI, Nvidia, and Amazon Web Services. The news comes amidst reports about Apple’s plans to compete in the AI chatbot market.

Latest Tech News on August 16th, 2023

US watchdog targets surveillance industry LINK

The U.S. Consumer Financial Protection Bureau (CFPB) plans to regulate data brokers selling personal data due to concerns about their impact on privacy, including sensitive data from vulnerable groups.
CFPB aims to prevent illegal collection and sharing of personal data by data brokers in the surveillance industry.
The proposal expands coverage under the Fair Credit Reporting Act to include data derived from payment histories, personal income, and criminal records, addressing concerns such as credit header data disclosure.

Tesla unveils cheaper Model S and Model X variantsLINK

Tesla introduces lower-priced options for the Model S and Model X with reduced range.
The “standard range” trim brings the Model S starting price to $78,490 with 320 miles of range and the Model X starting price to $88,490 with 269 miles of range.
This trim, previously discontinued, likely uses the same battery pack with a software lock, and Tesla may offer range unlocking for an additional fee.

Singapore pioneers in stablecoin crypto regulationLINK

Singapore’s financial regulator has finalized rules for stablecoins, making it one of the first jurisdictions to do so globally.
Stablecoins are digital currencies designed to maintain a constant value against fiat currency, with a market value of around $125 billion and dominated by tokens like USDT and USDC.
The Monetary Authority of Singapore’s framework outlines requirements including holding reserves in low-risk assets, timely redemption, and proper user disclosures for stablecoins mimicking the Singapore dollar or other G10 currencies.

Amazon Pharmacy offers major savings on insulinLINK

Amazon Pharmacy offers automatic coupons to help uninsured insulin-requiring patients save on medication costs.
The digital pharmacy shows pricing with and without insurance, estimates savings with eligible coupons, and simplifies the process compared to existing coupon programs.
Amazon Pharmacy’s initiative has been praised by healthcare advocates and aims to improve access to affordable treatments, but manufacturer coupons are not available for patients benefiting from certain healthcare programs.

Other Tech news you might like

Apple will soon start making settlement payments to claimants of the “Batterygate” class-action lawsuit, with potential payments of up to $65 per person.LINK

YouTube is removing cancer treatment content that’s “harmful or ineffective,” implementing new guidelines to remove unproven treatments and harmful advice.LINK

California regulators approved round-the-clock robotaxi service in San Francisco for Waymo and Cruise, but a group of Cruise vehicles caused a traffic backup, highlighting challenges.LINK

Unraveling August 2023: August 15th, 2023

Latest AI News and Trends on August 15th, 2023

Do It Yourself Custom AI Chatbot for Business in 10 Minutes (Open Source)

AI powered tools for the recruitment industry

– MedhaHR : It’s an AI-driven healthcare talent sourcing platform that automates resume screening, provides personalized job recommendations, and offers cost-effective solutions.

– ZappyHire : It offers features such as candidate sourcing, resume screening, automated communication, and collaborative hiring.

– Sniper AI : It uses AI algorithms to source potential candidates, assess their suitability, and integrates with ATS for workflow optimization.

Surge in AI Talent demand and salaries

American firms are hunting for AI experts, with some offering salaries nearing a million dollars.
Industries like entertainment and manufacturing want data scientists and machine-learning specialists.
Competition is fierce, with companies like Accenture investing in internal training and others considering acquisition of AI startups for talent.

The compensation landscape for AI roles

As AI expertise becomes more sought-after, compensation packages are rising.
Companies are offering mid-six-figure salaries, bonuses, and stock grants to lure experienced professionals.
While top positions like Netflix’s machine-learning platform product manager can reach up to $900,000 in total compensation, othersalike a prompt engineer might average $130,000 annually.

Source (Washington Post)

How to Manage Your Remote Team Effectively with ChatGPT?

You are a seasoned consultant in remote team management. I am the leader of a remote team working on a [define project]. I need advice on how to effectively manage my team, ensure clear communication, monitor progress, and maintain a positive team culture. Your suggestions should include strategies for scheduling and conducting virtual meetings, task assignment, progress tracking tools, and methods to promote team building in a virtual setting.

I asked ChatGPT to remove password protection from an Excel document, and it worked flawlessly

How are you uploading an excel document to chat gpt?

Using ChatGPT code interpreter: It’s a feature for GPT plus member as the old “bing search” which got disabled, You have code interpreter now where you can directly upload files.

Can it analyze conversations/texts? Yes it can analyse data and even give u back charts and feedback for gpt plus users.

Johns Hopkins Researchers Developed a Deep-Learning Technology Capable of Accurately Predicting Protein Fragments Linked to Cancer

Johns Hopkins Engineers and Cancer Researchers have collaboratively pioneered a breakthrough in personalized cancer therapy with their cutting-edge deep-learning technology.

Microsoft releases private ChatGPT for Business

Key points:

Azure allows companies to run ChatGPT privately on their own networks, touting built-in data isolation from OpenAI.
The model connects to internal data services and sources, and is available on GitHub to install and deploy.
Benefits include privacy, control, and unique business value through internal data integration.

Apple’s AI-powered health coach might soon be at your wrists

Why does this matter?

Source

Apple enters the AI race with new features

Apple Vision Pro, a revolutionary spatial computer that seamlessly blends digital content with the physical world. It uses advanced ML techniques.

Upgraded Autocorrect in iOS 17 that is powered by a transformer language model for improved prediction capabilities.
Improved Dictation in iOS 17 that leverages a new speech recognition model to make it even more accurate.
Live Voicemail that turns voicemail audio into text on the fly, which is powered by a neural engine.
Personalized Volume, which uses ML to understand environmental conditions and listening preferences over time to automatically fine-tune the media experience.
Journal, a new app for users to reflect and practice gratitude, uses on-device ML for personalized suggestions to inspire entries.

Why does this matter?

Source

Apple Trials a ChatGPT-like AI Chatbot

Why does this matter?

Source

Apple bets big on AI

Why does this matter?

Source

Apple gearing up for an AI showdown

Why does this matter?

Source

Google Tests Using AI to Sum Up Entire Web Pages on Chrome

The ‘SGE while browsing’ feature can break down a long web page into easy-to-read key points.

Daily AI News August 15th, 2023

OpenAI likely to go bankrupt by the end of 2024. Link

Latest Tech News on August 15th, 2023

Youtube algorithm flaws?

Personally I’ve always been a huge fan of youtube but I always thought that their algorithm have actually gotten worse since the 2010s.

Supposedly Google should have perfected the algorithm at making simple recommendations; they have teams working on it yet i could think of a few things that could improve it.

Just off the top of my head, I can think of a few things – becoming more ambitious every refresh; recommend new topics more often; remind you of old topics you like more often.

YouTube will remove cancer treatment misinformation

YouTube announced a new policy for combatting medical misinformation on the video sharing platform

Latest World and Sport News on August 15th, 2023

Fulton County grand jury returns an indictment in 2020 election probe for Georgia. Link

The highest paid football players in the world in 2023 according to Le Parisien

After Al-Hilal move, Neymar leapfrogs Romelu Lukaku as the player with the highest combined transfer fee in football history.

Asked ChatGPT “Can you write an essay about web browsers but using a tone of a depressed teenager?” And this was the result

People are having sex in self-driving cars, apparently. Link

Top physicist says chatbots are just ‘glorified tape recorders’ Link

One small step for Spain, one final push for World Cup glory

Unraveling August 2023: August 14th, 2023

Manscape Refining Gentlemen

Latest AI News and Trends on August 14th, 2023

What is LLM? Understanding with Examples

What is LLM (Large Language Model)?

LLMs are capable of performing various natural language processing (NLP) tasks, including

Language translation
Text summarization
Question-answering
Sentiment analysis
Generating coherent and contextually relevant responses to user inputs

Examples of Large Language Models

Some of the most popular large language models are:

GPT-3 by OpenAI: GPT-3 is a large language model that was first released in 2020. It has been trained on a massive dataset of text and code, and it can generate text, translate languages, write different kinds of creative content, and answer your questions in an informative way.
T5 by Google AI: T5 is a large language model that was first released in 2021. It is specifically designed for text generation tasks, and it can generate text that is more accurate, consistent, and creative than smaller language models.
LaMDA by Google AI: LaMDA is a large language model that was first released in 2022. It is specifically designed for dialogue applications, and it can hold natural-language conversations with users.
PaLM by Google AI: PaLM is a large language model that was first released in 2022. It is the largest and most powerful language model ever created, and it can perform a wide range of tasks, including text generation, translation, summarization, and question-answering.
FlaxGPT by DeepMind: FlaxGPT is a large language model that was first released in 2022. It is based on the Transformer architecture, and it can generate text that is more accurate and consistent than smaller language models.

https://www.seaflux.tech/blogs/llm-explained-with-examples

Advantages of LLM

Large language models (LLMs) have a number of advantages over traditional machine learning models. These advantages include:

Improved accuracy and performance: LLMs can be trained on massive datasets of text and code, which allows them to learn the nuances of human language and generate more accurate and consistent results than traditional machine-learning models.
Increased efficiency: LLMs can automate many tasks that were previously done manually, such as text classification, summarization, and translation. This can save businesses time and money, and free up human workers to focus on more creative and strategic tasks.
Expanded possibilities: LLMs can be used to create new and innovative products and services. For example, they can be used to develop chatbots that can hold natural-language conversations with customers or to create virtual assistants that can help users with tasks such as scheduling appointments or finding information.
Enhanced creativity: LLMs can be used to generate creative text formats, such as poems, code, scripts, musical pieces, emails, letters, and more with endless possibilities. This can be used to improve the quality of content or to create new and innovative forms of art and entertainment.
Reduced bias: LLMs can be trained on datasets that are more diverse than traditional datasets, which can help to reduce bias in their results. This is important for businesses and organizations that want to ensure that their products and services are fair and equitable for all users.

Challenges of LLM

Large language models (LLMs) are a powerful new technology, but they also come with several challenges. These challenges include:

Data requirements: LLMs require massive datasets of text and code to train. This can be a challenge for businesses and organizations that do not have access to large datasets.
Computational resources: LLMs require a lot of computational resources to train and run. This can be a challenge for businesses and organizations that lack the necessary resources.
Interpretability: LLMs are often difficult to interpret. This makes it difficult to understand how they work and to ensure that they are not generating harmful or biased results.
Bias: LLMs can be biased, depending on the data they are trained on. This can be a challenge for businesses and organizations that have ensured that their products and services are fair and equitable for all users.
Safety: LLMs can be used to generate harmful or misleading content. This can be challenging for businesses and organizations having a reputation for safe and secure services.

Use cases of LLM

The future of LLM models is bright. As this technology continues to develop, we can expect to see even more innovative and groundbreaking applications for LLMs in the future.

Some of the promising applications of LLMs include:

Virtual Assistants: LLMs could be used to power virtual assistants that are even more human-like and helpful than they are today. These virtual assistants could be used to provide a wide range of services, such as scheduling appointments, finding information, and controlling smart home devices.
Content Generation: LLMs could be used to generate more engaging and informative content. This content could be used to improve the customer experience, educate users, and entertain people.
Translation: LLMs could be used to translate text from one language to another more accurately and efficiently than ever before. This could help businesses to reach a wider audience and to provide better customer service.
Research: LLMs could be used to conduct research in a wider range of fields, such as natural language processing, machine translation, and artificial intelligence. This could help to advance our understanding of these fields and to develop new and innovative applications.
Education: LLMs could be used to create personalized learning experiences for students. These experiences could be tailored to each student’s individual needs and interests.
Healthcare: LLMs could be used to diagnose diseases, develop new treatments, and provide personalized care to patients.
Art and entertainment: LLMs could be used to create new forms of art and entertainment. This could include poems, code, scripts, musical pieces, emails, letters, etc.

Introducing the Transformer Library

Sentiment Analysis	Named Entity Recognition
Text Generation	Translate language
Question Answering Pipeline	Summarization

Before jumping to the examples of Transformer Library, we need to install the library to use it.

Install the Transformer Library

pip install transformers

By using the pipeline feature of the Transformers Library, you can easily apply LLMs for text generation, question answering, sentiment analysis, named entity recognition, translation, and more.

from transformers import pipeline

Example: Question Answering Pipeline

To perform question-answering using the Transformers library, you can utilize the pipeline feature with a pre-trained question-answering model. Here’s an example:

from transformers import pipeline

# Define the list of file paths
file_paths = ['document1.txt', 'document2.txt', 'document3.txt']

# Read the contents of each file and store them in a list
documents = []
for file_path in file_paths:
with open(file_path, 'r') as file:
document = file.read()
documents.append(document)

# Concatenate the documents using a newline character
context = "\n".join(documents)

# Use the pipeline with the updated context
nlp = pipeline("question-answering")
result = nlp(question="When did Mars Mission Launched?", context=context)

print(result['answer'])

The code prints the below output correctly to the question – When did Mars Mission Launch?

Output - 5 November 2013

IBM’s AI chip mimics the human brain

The chip is primarily analogue but also has digital elements, which makes it easier to put into existing AI systems.

It addresses the concerns raised about emissions from warehouses full of computers powering AI systems. It could also cut the water needed to cool power-hungry data centers.

Why does this matter?

Source

NVIDIA’s tool to curate trillion-token datasets for pretraining LLMs

The tool curates high-quality data that leads to improved LLM downstream performance and will significantly benefit LLM developers attempting to build pretraining datasets.

Why does this matter?

Source

Trustworthy LLMs: A survey and guideline for evaluating LLMs’ alignment

It also thoroughly surveys the categories of LLMs that are likely to be crucial to improve their trustworthiness and shows how to build evaluation datasets for alignment accordingly.

Why does this matter?

Source

Amazon’s push to match Microsoft and Google in generative AI LINK

Amazon is developing proprietary chips, named “Inferentia” and “Trainium,” to rival Nvidia GPUs in terms of training and speeding up generative AI models.
The company’s late entry into the generative AI market has put it in a position of catch-up, with competitors like Microsoft and Google already investing heavily and integrating AI models into their products.
Despite Amazon’s cloud dominance, it aims to differentiate by leveraging its custom silicon capabilities, with Trainium offering significant price-performance improvements, although Nvidia remains dominant for training models.

World first’s mass-produced humanoid robots with AI brains LINK

Chinese start-up Fourier Intelligence showcased its humanoid robot GR-1, capable of walking on two legs at 5km/h carrying a 50kg load, highlighting the potential of bipedal robots.
Fourier originally focused on rehabilitation robotics, but in 2019, it embarked on creating humanoid robots, with GR-1 achieving success after three years of development.
While challenges remain in commercializing humanoid robots, Fourier aims to mass-produce GR-1 by year-end and sees potential applications in elderly care, education, and more.

Microsoft Designer: An AI-powered Canva: a super cool product that I just found!

Some cool features:

Prompt-to-design: From just a short description, Designer uses DALLE-2 to generate original and editable designs.
Brand-kit: stay on-brand by instantly applying your fonts and color pallets to any design; it an even suggest color combinations.
Other AI tools: suggests hashtags and captions; replace background of an image with your imagination; erase items from an image; auto-fill a section of the image with generated image.

Source: this AI newsletter

ChatGPT costs OpenAI $700,000 PER Day

Costs Outpace Revenue

ChatGPT costs $700,000 per day to run.
Despite paid offerings, revenue can’t offset losses.
Projected 2023 revenue of $200M seems unlikely.

Mounting Problems

ChatGPT saw 12% drop in users from June to July.
Top talent being poached by rivals like Google and Meta.
GPU shortages hindering ability to train better models.

Increasing Competition

Cheaper open-source models can replace OpenAI’s APIs.
Musk’s xAI working on more right wing biased model.
Chinese firms buying up GPU stockpiles.

With ChatGPT’s massive costs outpacing revenue and problems like declining users and talent loss mounting, OpenAI seems to be in a precarious financial position as competition heats up.

Source: (link)

What Else Is Happening in AI on August 14th, 2023

Google appears to be readying new AI-powered tools for ChromeOS (Link)

Zoom rewrites policies to make clear user videos aren’t used to train AI (Link)

Anthropic raises $100M in funding from Korean telco giant SK Telecom (Link)

Modular, AI startup challenging Nvidia, discusses funding at $600M valuation (Link)

California turns to AI to spot wildfires, feeding on video from 1,000+ cameras (Link)

FEC to regulate AI deepfakes in political ads ahead of 2024 election (Link)

AI in Scientific Papers on August 14th, 2023

This research paper has found that LLMs can naturally read docs to learn how to use tools without any training. Instead of showing demonstration, just provide tool documentation. LLMs figured out how to use programs like image generators and video tracking software, without any new training [Link]
This paper analyses and visualises the political bias of major AI language models. ChatGPT and GPT-4 were most left-wing while Meta’s Llama was right-wing [Link]. This type of research is very important and highlights the inherent bias in these models. It’s practically impossible to remove bias also, and we don’t even know what they’ve been trained on. People need to understand, you control the models, you control what people see, especially as AI models are used more frequently and become mainstream
Remember the Westworld style paper with the 25 AI agents living their lives? It’s now open-source. It’s implications in gaming cannot be overstated. Can’t wait to see what comes of this [Link]
MetaGPT is framework using multiple agents to behave as an entire company – engineer, pm, architect etc. It has over 18k stars on github. This specialised for industries and companies will be powerful [Link]
This paper discusses reconstructing images from signals in the brain. Soon we’ll have brain interfaces that could read these signals consistently, maybe map everything you see? Potential is limitless [Link]
Nvidia is partnering with HuggingFace with DGX Cloud platform allowing people to train and tune AI models. They’re offering a “Training Cluster as a Service” which will help companies and individuals build and train models faster than ever [Link]
Stability AI has released their new AI LLM called StableCode. 16k context length and 3b params with other version on the way [Link]
This paper discusses a framework for designing and implementing complex interactions between AI systems called Flows [Link] Will be very important when building complex AI software in industry. Github will be uploaded soon [Link]
Nvidia announced that Adobe Firefly models will be available as APIs in Omniverse [Link] This thread breaks down what the Omniverse will look like [Link]
Anthropic CEO Dario Amodei thinks AI will reach educated levels of humans in 2-3 years [Link] For reference, Claude 2 is probably the second most powerful model alongside GPT4
Layerbrain is building AI agents that can be used across Stripe, Hubspot and slack using plain english [Link] Looks very cool
LLMs picking random numbers almost always pick the numbers 6-8 [Link]
Inflection founder Mustafa Suleyman says we’ll probably rely on LLMs more than the best trained and most experienced humans within 5 years [Link]. For context, Mustafa is one of the co founders of Google DeepMind – this guys knows AI
Writer, a startup using Nvidia’s NeMo discuss how it helped them build and scale over 10 models. NeMo isn’t publicly available but seems like a massive advantage considering Writer’s cloud infra, which is managed by 2 people, hosts a trillion API calls a month [Link] Link to NeMo [Link] Link to NeMo guardrails blog [Link]
Someone open-sourced smol-podcaster – it transcribes and labels speakers, formats the transcription, creates chapters with timestamps [Link]
Ultra realistic AI generated videos are coming. It’s impossible to tell they’re fake now [Link] Signup for early access here [Link]
Anthropic released Claude Instant 1.2. Its very fast, better at math and coding and hallucinates less [Link]
This guy released the code for his modded Google Nest Mini using OpenAI’s function calling to take notes and control his lights. Once Amazon & Apple integrates better LLMs into their prods, AI will truly be everywhere [Link]
If you search “As an AI language model” in Google Scholar a lot of papers come up… [Link]
OpenAI released custom instructions for ChatGPT free users, except for people in the US or UK [Link]
OpenAI, Google, Microsoft and Anthropic partnered with Darpa for their AI cyber challenge [Link]
PlayHT released their new text-to-voice ai model and it looks crazy good. Change the way its delivered by describing an emotion and much more [Link] [Link]
A paper by Google showcasing that AI models tend to repeat a user’s opinion back to them, even if its wrong. Thread breaking it down [Link] Link to paper [Link]
Medisearch comes out of YC and claims to have the best model for medical questions [Link]
Someone made a way to one-click install AudioLDM with gradio web ui [Link]
A way to make llama-2 much faster [Link]
WizardLM released a new math model that outperforms ChatGPT on math skills [Link]
A team of researchers trained an AI model to hear the sounds of keystrokes and steal data. Apparently it has a 95% success rate. Link to article [Link] Link to paper [Link]
Yann LeCun gave a talk at MIT about Objective-Driven AI [Link]
Google released 7 free courses on gen AI [Link] [Link] [Link] [Link] [Link] [Link] [Link]
Alpaca, a new AI tool for artists is out for public beta. It’s sketch to image is very powerful [Link]
One of the most lucrative businesses in the AI arms race? GPU cloud. Coreweave got $400M in funding and are set to make billions [Link]
Google releases a guidebook on best practices when designing with AI [Link]
A great article on LLMs in healthcare [Link]
Implement text-to-SQL using langchain, a breakdown[Link]
SDXL implemented in 520 lines of code in a single file [Link]
OpenAI released a blog on Special Projects – one of them involved trying to find secret breakthroughs in the world [Link]
Google announced Project IDX, a browser-based code environment. Brings app dev to the cloud and has AI features like code gen, completion etc [Link] A shot at replit it seems
Meta open-sourced AudioCraft – musicgen, audiogen and encodec. Definitely worth checking out [Link]
If you’re interested in fine-tuning open-source models like Llama-2, definitely check out this blog [Link] In some cases, fine-tuned llama2 is better than gpt4 (for sql generation for example). Overall a great read if you’re interested in fine tuning
Nvidia released the code for Neuralangelo, an AI model that reconstructs 3d surfaces from 2d videos [Link]
Create digital environments in seconds with Blockade labs. Wild stuff [Link]
This paper compares the answers of ChatGPT and stackoverflow for software engineering questions [Link] “52% of chatgpt answers are incorrect and 77% are verbose but are still preferred 39% of the time due to their comprehensiveness and well-articulated language style”. Only issue is this uses 3.5. Need this test with gpt4

Latest Tech News and Trends on August 14th, 2023

Privacy win: Starting today Facebook must pay $100.000 to Norway each day for violating our right to privacy. Link

College professors are going back to paper exams and handwritten essays to fight students using ChatGPT. Link

New Footage Shows Tesla On Autopilot Crashing Into Police Car After Alerting Driver 150 Times. Link

ChromeOS might get some new AI-powered tools
– Google appears to be readying an AI writing tool for ChromeOS. Its code has hints of some AI tools for suggestions and rewrites.

Anthropic raises $100M from Korean telco giant SK Telecom – They plan to co-develop a multilingual LLM customized for global telco firms.

AI avatars are coming. In my mind the biggest market for this might be content creators. People who need to appear on video and are tired of ensuring pitch perfect recordings.

LLMs have in-built political biases. Meta’s Llama has right-wing bias and GPT-4 has left-wing bias. Really? Who would’ve thought?

It is often said that “the devil is in the details”. As this article points out the question on AI regulation is going to be as much about laws as it is about procedure.

A sober look at AI in education.

Implement Llama from scratch.

Fine Tuning Llama-2

Artificial General Intelligence – A gentle introduction

Discover a career in AI – Search 500+ job opportunities

GPTE just launched an AI job board with over 500 opportunities for those interested in joining the AI field professionally.

Latest Sport News on August 14th, 2023

Raphael Varane scored the winner 14 minutes from time as Manchester United gained a fortunate opening-weekend win against Wolves at Old Trafford. Link

Neymar transfer news: Al-Hilal agree deal with Paris St-Germain for Brazil forward. Link

Why is Saudi Pro League signing European clubs’ stars? Link

Moises Caicedo transfer news: Chelsea sign Brighton midfielder for £100m. Link

Harry Kane’s Bayern Munich may have found its goalkeeper;

Neymar to be paid $500K per social media post by Saudi Arabia;

Chelsea unveils Caicedo, breaks UK biggest transfer record;

Everton heartbroken at death of worker at new stadium;

Spalletti to succeed Mancini as Italy boss but issue emerges;

FC Dallas striker Jesus Ferreira wanted by Cadiz in LaLiga;

UEFA unsure on Athens hosting European final after fan violence;

Liverpool agrees $75m for Lavia, but can Chelsea scoop them?;

Kepa Arrizabalaga replaces Courtois at Real Madrid;

Second new Messi documentary in the works by Apple TV;

Unraveling August 2023: August 13th, 2023

Latest AI News and Trends on August 13th, 2023

Amazon wants you to pay with your palm LINK

Amazon is introducing Amazon One, a biometric hand-scanning service that allows users to pay at Whole Foods, Amazon Fresh stores, Panera restaurants, airports, stadiums, and Starbucks locations using their palm.
This move is part of Amazon’s effort to compete with Google and Apple in the digital wallet space, aiming to create a universal identity provider that goes beyond payments, potentially connecting to various services, including health records.
Amazon One uses near-infrared light to capture palm vein patterns and surface features, with a focus on security through encrypted hand scan transmission, but it faces privacy concerns and the challenge of convincing merchants to adopt the technology.

California’s AI-driven wildfire detection LINK

The California Department of Forestry and Fire Protection (Cal Fire) has launched the Alert California AI program in collaboration with UCSD, using AI and 360-degree cameras to detect potential wildfires by identifying abnormalities in camera feeds.
The program successfully detected and prevented a fledgling fire in the Cleveland National Forest, alerting firefighters who extinguished the flames within 45 minutes.
Alert California utilizes LiDAR scans and machine learning to differentiate between smoke and other particles, aiming to combat wildfires in the face of extreme climate conditions.

White House’s $1.2B carbon capture initiative LINK

The Department of Energy is providing grants of up to $1.2 billion to two direct air capture (DAC) projects aiming to remove over 2 million metric tons of CO2 annually, equivalent to emissions from 445,000 gas-powered cars.
The DAC projects in Texas and Louisiana, supported by the Regional Direct Air Capture Hubs program, will create jobs and could potentially remove up to 30 million tons of CO2 per year, contributing to the US goal of emissions neutrality by 2050.
The DOE aims to lower DAC costs below $100 per metric ton of CO2-equivalent and is funding feasibility studies, engineering projects, and a carbon removal credits program to achieve global impact on carbon reduction.

FTX’s Sam Bankman-Fried is back in jail LINK

Sam Bankman-Fried, former CEO of FTX, had his bail revoked ahead of his trial following allegations of leaking a diary to the New York Times.
Bankman-Fried faces charges including defrauding FTX investors and was initially under house arrest on a $250 million bond.
US District Court Judge revoked his bail due to alleged misconduct and possible witness intimidation, leading to potential detention at a detention center during trial.

AI can now outperform humans in Captcha tests LINK

A study reveals that humans are slower and less accurate than bots in solving Captcha tests, raising questions about their effectiveness.
Captchas are intended to deter bots from accessing services, preventing malicious activities like DDoS attacks and spam accounts.
Bots can outperform humans in solving certain types of Captchas, indicating an ongoing challenge in maintaining their efficacy.

Bots Are Better at Solving Captchas Than Humans, Research Shows

Not only are bots faster at solving captchas than humans they’re also more accurate.

Unraveling August 2023: August 12th, 2023

Latest AI News and Trends on August 12th 2023: Week Recap

Apple bets big on AI

Jupyter brings AI to notebooks

ChatGPT’s emotional awareness is more than humans’. What?

ChatGPT outperformed humans in all categories, achieving an overall score of 85 compared to 56 for men and 59 for women. The researchers suggest that ChatGPT could be helpful in psychotherapy, cognitive training, and diagnosing mental illness. Previous studies have shown that people perceive ChatGPT’s responses as more empathetic than medical professionals.

Microsoft’s many AI monetization plans

Microsoft has announced new Azure AI infrastructure advancements and availability to bring its customer closer to the transformative power of generative AI.

Azure OpenAI Service goes global: OpenAI’s most advanced models, including GPT-4 and GPT-35-Turbo, will now be available in multiple new regions and locations.
General availability of ND H100 v5 VMs for unprecedented AI processing and scale: -It also announced general availability of the ND H100 v5 Virtual Machine series, featuring the latest NVIDIA H100 Tensor Core GPUs and low-latency networking, propelling businesses into a new era of AI applications.

OpenAI launches a web crawler to train ChatGPT

Called GPTBot, the crawler will comb through the internet to train and enhance AI’s capabilities. It can be identified by the following user agent and string.

Moreover, OpenAI also revealed how websites can prevent GPTBot from accessing their sites, either partially or by opting out entirely.

AI deep fake audios are getting scarily realistic

NVIDIA’s Biggest AI Breakthroughs

Reveals a new chip GH200

The adoption of Universal Scene Description (OpenUSD)

An AI Workbench

The Partnership between NVIDIA and Hugging Face

Google’s AI Surprise for Developers

Stability AI launches LLM code generator

Midjourney’s present + future plans

Midjourney is rolling out a GPU cluster upgrade today. Pro and Mega users should see speedups of ~1.5x (/imagine from ~50 sec to ~30 sec). These renders should also be 1.5x cheaper.

Source

MetaGPT tackling LLM hallucination

The framework also mandates modular outputs, allowing agents to validate outputs and minimize errors. By assigning diverse roles to agents, MetaGPT effectively deconstructs complex problems.

Latest Tech News and Trends on August 12th 2023

Robotaxis greenlit for 24/7 operations in San Francisco LINK

California approved all-day paid robotaxi service in San Francisco, allowing unlimited self-driving car fleets.
The decision came amid objections from San Francisco officials, after a six-hour public comment session, and was a result of applications from Cruise (backed by GM) and Waymo (an Alphabet subsidiary).
Despite some challenges with driverless cars on the city’s streets, Cruise and Waymo see this approval as a pivotal step towards making their investments in self-driving technology profitable.

Russia launches its first lunar mission in 47 years LINK

Russia launches Luna-25, its first lunar mission since 1976, targeting the Moon’s south pole to potentially uncover water ice beneath its surface.
The mission is symbolic, referencing the Soviet Space Program era, and aims to project Russia as an influential world power amidst tensions following its 2022 Ukraine invasion.
Luna-25 is in competition with India’s Chandrayaan-3 mission, with both crafts expected to reach the Moon’s south pole around the same time.

Virgin Galactic debuts with its first civilian spaceflight LINK

Virgin Galactic’s second commercial flight, Galactic 02, took three private citizens to suborbital heights, including a historic mother-daughter duo.
The VSS Unity reached a peak altitude of 55 miles (88 kilometers) in an hour-long flight, with Kelly Latimer becoming the first woman pilot of a commercial spaceflight.
Following recent successes, Virgin Galactic aims for monthly commercial launches and is developing its Delta Class spacecraft for 2026, though substantial revenue from these flights is not anticipated.

Amazon penalizes excessive remote work LINK

Amazon warned US staff who didn’t spend enough time in the office after tracking their attendance.
The company’s office policy, effective since May, requires employees to be present at least three days a week.
Amazon responded to concerns by stating the warning was for those not adhering to the policy, but acknowledged potential inaccuracies in tracking.

Chinese firms invest billions in Nvidia GPUs LINK

Chinese internet giants, in response to US sanctions, are purchasing vast numbers of Nvidia GPUs to bolster their AI capabilities.
Companies like Alibaba, Baidu, ByteDance, and Tencent have reportedly spent around $1 billion on 100,000 Nvidia A800 GPUs, with further orders amounting to an additional $4 billion.
The GPUs are crucial for training large language models, and while the US seeks stricter export limitations on AI tech to China, US companies continue to design specific AI chips for the Chinese market.

Latest Football and Sport News on August 11th 2023

Australia ‘going nuts’ and soccer in the country ‘changed forever’ after the Matildas’ historic win

Ronaldo wins first title at Al-Nassr with brace in Arab Club Champions Cup final

World Cup Daily: England set up semifinal clash with Australia

It’s a case of old rivalry, new sport as England beat Colombia to set up a Women’s World Cup semifinal clash with perennial sporting rivals Australia.

Tom Brady makes first appearance at Birmingham City soccer game

Tom Brady made the trip across the pond to watch his new football club win on Saturday.

Bellingham scores on competitive debut ads Real wins 2-0

England midfielder Jude Bellingham scored on his competitive Real Madrid debut as they began their La Liga season with victory at Athletic Bilbao.

Harry Kane makes Bayern Munich debut in German Super Cup defeat by RB Leipzig

Clinical Isak helps Newcastle hammer Villa

Alexander Isak’s clinical finishing helped Newcastle United to an emphatic victory against Aston Villa on the opening weekend of the new Premier League campaign.

Unraveling August 2023: August 11th 2023

Latest AI News and Trends on August 11th 2023

AI Tutorial: Applying the 80/20 Rule in Decision-Making with ChatGPT

Try the prompt below:

MetaGPT tackling LLM hallucination

The framework also mandates modular outputs, allowing agents to validate outputs and minimize errors. By assigning diverse roles to agents, MetaGPT effectively deconstructs complex problems.

Why does this matter?

Source

Will AI ads be allowed in the next US elections?

Key Points:

A petition to regulate ads using AI for mis-representation.
Existing campaigns, including Florida GOP Gov. Ron DeSantis’s, are already using this technology.
The FEC will decide on rules only after a 60-day public comment window, likely starting next week.
Regulation could impose guidelines for disclaimers, but not cover all deepfake-related threats, such as those from individual social media users.

What Else Is Happening in AI on August 11th 2023

Microsoft introduced new tools for global frontline workers, enhancing their capabilities. (Link)

Google keyboard’s new update could include AI-powered proofreading, AI emojis & more. (Link)

Runway’s new update allows you to extend your Gen-2 videos up to 18 seconds! (Link)

China’s internet giants, including Baidu, TikTok-owner, Alibaba have reportedly ordered $5B worth of Nvidia chips! (Link)

PlayHT2.0 is a new AI model that can “talk”? (Link)

China’s largest web and cloud providers (Alibaba, Baidu, ByteDance, and Tencent)are lining up to buy as many Nvidia GPUs as they can while they still can get their hands on them. [Link]

Runway’s new update allows you to extend your Gen-2 videos up to 18 seconds.
– Available now in the browser and coming soon to iOS.

TikTok Introduces Toggle for AI-Generated Content Disclosure

TikTok is reportedly adding a toggle that enables creators to label AI-generated content, aiming to prevent content removal and enhance transparency.

Belva: Empower an AI agent to manage your phone calls effectively—an ideal solution for call management optimization.

Zefi: Enhance your product development process with this AI tool, integrating with development platforms to gather data, cluster feedback, assist in prioritization, and align stakeholders.

YT Transcripts by Editby: Download and edit YouTube videos easily with this tool, making it perfect for content creators seeking to repurpose their YouTube content.

AI Tools Database: Explore a comprehensive Notion database featuring 1350 useful AI tools curated by The Intelligo.

Latest Tech News and Trends on August 11th 2023

How to Stop Android Notifications from Turning On the Screen

Sick of notifications lighting up your Android screen? You can prevent this in your app settings. This guide explains how.

There isn’t a universal setting to prevent all apps from waking the lock screen, so you’ll need to manage them individually. Here’s how.

How to Disable App Wake Screen Settings on Android

To stop notifications from turning on the screen on Android:

Swipe down from the top of the screen and tap Settings (gear icon) in the top-right corner.
Select the Notifications option from the Settings menu.
Tap the App notifications option to view your complete list of installed apps.
Select the app that you don’t want to wake your screen.
Tap the Silent option under the Alerts section.
You can also disable all app notifications by toggling off the Allow notifications switch. You won’t have access to the notification settings for all apps when you turn this off, however.
It’s also important to note that some apps will allow you to manage specific notifications by selecting the Notification categories option and toggling individual notification types on or off.

How to Use In-App Settings to Stop Apps Waking Your Android Screen

Android 14 lets you easily check if an unknown object tracker is tracking you

iOS 17: how to create a Contact Poster on your iPhone

Contact Posters offer an engaging canvas to express creativity via otherwise mundane-looking contact cards. Here’s how to create them in iOS 17 on your iPhone.

he whole process of creating a Contact Poster is fairly easy. You can create a Contact Poster for your own number, or any other acquaintance in your contact list.

Another neat convenience that comes with Contact Posters is that you can share them by just bringing two iPhones close to each other. It also works if you tap your iPhone against an Apple Watch.

Step 1: Open the Phone app on your iPhone and select the contact that needs a poster makeover. As you tap on a name, it will open the detailed contact page as shown in the image below.

The US just invested more than $1 billion into carbon removal / The move represents a big step in the effort to suck CO2 out of the atmosphere—and slow down climate change. Link

Latest World USA Sport News on August 11th 2023

Orange juice prices to surge as US crops ravaged by disease and climate. Link

Teenage girl dies after being forced to stay in a ‘period hut’ in Nepal. Link

Nearly 50,000 Americans died by suicide in 2022, a record-high number: CDC. Link

Supreme Court blocks OxyContin maker’s bankruptcy deal that would shield Sackler family members. Link

New school bus routes a ‘disaster,’ Kentucky superintendent admits. Last kids got home at 10 pm. Link

2 minutes daily football news: Spain 2-1 Ned; Japan 1 – 2 Sweden; Harry Kane Caicedo; #soccer #footy

Liverpool have agreed a British record transfer fee of £111m with Brighton for midfielder Moises Caicedo.

England captain Harry Kane is set to have a medical at Bayern Munich after being given permission to travel to Germany by Tottenham.

Sweden produced a magnificent performance to book a semi-final date with Spain and leave Japan’s Women’s World Cup dreams in tatters

Teenage winger Salma Paralluelo came off the bench to score a 111th-minute winner as Spain beat the Netherlands to reach the Women’s World Cup semi-finals for the first time.

Off the pitch, few teams at this Women’s World Cup have been as dysfunctional and wracked by controversy as Spain.

Soccer Football Saudi Pro League kicks off after raiding Europe’s top football clubs.

Unraveling August 2023: August 10th 2023

Latest AI News and Trends on August 10th 2023

Advanced Library of 1000+ free GPT Workflows with HeroML – To Replace most “AI” Apps. By u/papsamir

Disclaimer: all links below are free, no ads, no sign-up required for open-source solution & no donation button. Workflow software is not only free, but open-source ❣️

Hi again! About 4 months ago, I posted here about free libraries that let people quickly input their own values into cool prompts for free. Then I made some more, and heard a lot of feedback.

I wanted to have these conversations, though, without actually having them... I wanted to automate the conversations I was already having on ChatGPT!

It’s essentially prompts chained together, and prompts that can reference previous responses for ❣️ context ❣️

Here’s a super short video example I was almost too embarrassed to make (Youtube mirror: 36 Second video):

0:00

0:37

quick example of how HeroML workflow steps work

Simple Example of HeroML

Let’s say you wanted to generate… an email! Here’s what that would look like in HeroML:

(BTW, each step is separated by ->>>>, so every time you see that, assume a new step has begun, the below example has 4 steps*)*

You are an email copywriter, write a short, 2 sentence email introduction intended for {{recipient}} and make sure to focus on {{focus_point_1}} and {{focus_point_2}}. You are writing from the perspective of me, {{your_name}}. Make sure this introduction is brief and do not exceed 2 sentences, as it's the introduction.

->>>>

Your task is to write the body of our email, intended for {{recipient}} and written by me, {{your_name}}. We're focusing on {{focus_point_1}} and {{focus_point_2}}. We already have the introduction:

Introduction:
{{step_1}}

Following on, write a short paragraph about {{focus_point_1}}, and make sure you adhere to the same tone as the introduction.

->>>>

Your task is to write the body of our email, intended for the recipient, "{{recipient}}" and written by me, {{your_name}}. We're focusing on {{focus_point_1}} and {{focus_point_2}}. We already have the introduction:

Introduction:
{{step_1}}

And also, we have a paragraph about {{focus_point_1}}:
{{step_2}}

Now, write a short paragraph about {{focus_point_2}}, and make sure you adhere to the same tone as the introduction and the first paragraph.

->>>> 

Your task is to write the body of our email, intended for {{recipient}} and written by me, {{your_name}}. We're focusing on {{focus_point_1}} and {{focus_point_2}}. We already have the introduction:

Introduction:
{{step_1}}

We also have the entire body of our email, 2 paragraphs, for {{focus_point_1}} & {{focus_point_2}} respectively:

First paragraph:
{{step_2}}

Second paragraph:
{{step_3}}

Your final task is to write a short conclusion the ends the email with a "thank you" to the recipient, {{recipient}}, and includes a CTA (Call to action) that requires them to reply back to learn more about {{focus_point_1}} or {{focus_point_2}}. End the conclusion with "Wonderful and Amazing Regards, {{your_name}}

I’ll do a quick deep dive into the above example.

You can see I use a bunch of dynamic variables with the double curly brackets, there are 2 types:

Variables that you define in the first prompt, and can refer to throughout the rest of the steps

{{your_name}}, {{focus_point_1}}, etc.

Step Variables, which are basically just variables that references responses from previous steps..

{{step_1}} can be used in Step #2, to input the AI response from Step 1, and so on.

In the above example, we generate an introduction in Step 1, and then, in Step 2, we tell the AI that "We have already generated an introduction: {{step_1}}"

When you run HeroML, it won’t actually see these variables (the double-curly brackets), it will always replace them with the real values, just like the example in the video above!

Please don’t hesitate to ask any questions, about HeroML or anything else in relation to this.

Free Library of HeroML Workflows

Github Repo of 1000+ HeroML Workflows

However, the Repo allows you or any contributor to make changes to these workflows (the .heroml) files, and when those changes are approved, they will automatically be merged online.

For example, if you make an edit to this blog post workflow, after changes are approved, the changes will be applied to this deployed version.

There are thousands of workflows in the Repo, but they are just examples. The best workflows are ones you create for your specific needs.

How to run HeroML

Online Playground

0:00

0:10

Example of hero app playground

My favorite method is the one below, since you have full control.

Local Machine with own OpenAI Key

I have built a HeroML compiler in Node.js that you can run in your terminal. This page has a bunch of documentation.

Running HeroML example and Output

Here’s an example of how to run it and what do expect.

This is the script

simple HeroML script to generate colors, and then people’s names for each color.

This is how quick it is to run these scripts (based on how many steps):

0:00

0:19

using HeroML CLI with your own OpenAI Key

And this is the output (In markdown) that it will generate. (it will also generate a structured JSON if you want to clone the whole repo and build a custom solution)

Output in markdown, first line is response of first step, and then the list is response from second step. You can get desired output by writing better prompts 😊

Conclusion

I also understand that the audience here is split between technical and non-technical, so as explained above, there are both technical examples, and non-technical deployed playgrounds.

Here’s a table of some of the (1000+) workflows you can play with (here’s the full list & repo):

Github Workflow Link is where to clone the app, or make edits to the workflow for the community.

Yes, I generated all the names with AI ✨, who wouldn’t?

That’s a wrap.

Thank you for all your support in my last few posts ❣️

Have a wonderful day, and open-source all the friggin way 😇

How ChatGPT and other AI tools are helping workers make more money

Generative AI tools like ChatGPT is helping workers make more money by automating time-consuming tasks and creating new income streams and full-time jobs.

Universal Music collaborates with Google on AI song licensing LINK

Universal Music Group is negotiating with Google to license artists’ voices and melodies for AI-generated songs, with Warner Music also participating.
Artists could opt out of the system, but the move could allow fans to create deepfakes of their favorite musicians.
While this might be lucrative for record labels, it poses challenges for artists who want to keep their voices free from AI-cloning.

AI’s role in reducing airlines’ contrail climate impact LINK

Contrails from airplanes trap heat in Earth’s atmosphere, leading to a net warming effect.
Pilots at American used Google’s AI predictions and Breakthrough Energy’s models to choose altitudes less likely to produce contrails.
After 70 test flights, satellite imagery revealed a 54% reduction in contrails, suggesting commercial flights can lessen their environmental impact.

Anthropic’s Claude Instant 1.2- Faster and safer LLM

Why does this matter?

Source

Google attempts to answer if LLMs generalize or memorize

Why does this matter?

Source

White House launches AI-based contest to secure government systems from hacks

The Competition

Teams compete to best secure vital software systems from cyber risks.
Up to 20 teams advance from qualifiers to win $2 million each at DEF CON 2024.
Finalists eligible for more prizes, including $4 million top prize at DEF CON 2025.

Innovating Cybersecurity with AI

Competitors required to open source their AI systems for widespread use.
Collaboration from AI leaders like Anthropic, Google, Microsoft, and OpenAI.
Aims to push boundaries of AI for national cyber defense.

Previous Government Hacking Contests

Similar to 2014 DARPA Cyber Grand Challenge to develop automated cybersecurity.
Various prizes offered to drive innovation through competition.
Hopes AI can keep defense ahead of evolving threats.