Have you ever heard of ChatGPT, the open-source machine learning platform that allows users to build natural language models?
It stands for “Chat Generating Pre-trained Transformer” and it’s an AI-powered chatbot that can answer questions with near human-level intelligence. But what is Google’s answer to this technology? The answer lies in Open AI, supervised learning, and reinforcement learning. Let’s take a closer look at how these technologies work.
Tech Buzzwords of 2022, By Google Search Interest
Open AI is an artificial intelligence research laboratory that was founded by some of the biggest names in tech, including Elon Musk and Sam Altman. This non-profit organization seeks to develop general artificial intelligence that is safe and beneficial to society. One of their key initiatives is the development of open source technologies like GPT-3, which is a natural language processing model used in ChatGPT.
2023 AWS Certified Machine Learning Specialty (MLS-C01) Practice Exams
ChatGPT: What Is It and How Does Google Answer It?
Artificial Intelligence (AI) has been around for decades. From its humble beginnings in the 1950s, AI has come a long way and is now an integral part of many aspects of our lives. One of the most important areas where AI plays a role is in natural language processing (NLP). NLP enables computers to understand and respond to human language, paving the way for more advanced conversations between humans and machines. One of the most recent developments in this field is ChatGPT, a conversational AI developed by OpenAI that utilizes supervised learning and reinforcement learning to enable computers to chat with humans. So what exactly is ChatGPT and how does it work? Let’s find out!
ChatGPT examples and limitations
ChatGPT is an open-source AI-based chatbot developed by OpenAI.
This chatbot leverages GPT-3, one of the most powerful natural language processing models ever created, which stands for Generative Pre-trained Transformer 3 (GPT-3). This model uses supervised learning and reinforcement learning techniques to enable computers to understand human language and response accordingly. Using supervised learning, GPT-3 utilizes large datasets of text to learn how to recognize patterns within language that can be used to generate meaningful responses. Reinforcement learning then allows GPT-3 to use feedback from conversations with humans in order to optimize its responses over time.
AI Unraveled: Demystifying Frequently Asked Questions on Artificial Intelligence Intro
ChatGPT uses supervised learning techniques to train its models.
Supervised learning involves providing a model with labeled data (i.e., data with known outcomes) so that it can learn from it. This labeled data could be anything from conversations between two people to user comments on a website or forum post. The model then learns associations between certain words or phrases and the desired outcome (or label). Once trained, this model can then be applied to new data in order to predict outcomes based on what it has learned so far.
In addition to supervised learning techniques, ChatGPT also supports reinforcement learning algorithms which allow the model to learn from its experiences in an environment without explicit labels or outcomes being provided by humans. Reinforcement learning algorithms are great for tasks like natural language generation where the output needs to be generated by the model itself rather than simply predicting a fixed outcome based on existing labels.
Supervised Learning
Supervised learning involves feeding data into machine learning algorithms so they can learn from it. For example, if you want a computer program to recognize cats in pictures, you would provide the algorithm with thousands of pictures of cats so it can learn what a cat looks like. This same concept applies to natural language processing; supervised learning algorithms are fed data sets so they can learn how to generate text using contextual understanding and grammar rules.
Reinforcement Learning
Reinforcement learning uses rewards and punishments as incentives for the machine learning algorithm to explore different possibilities. In ChatGPT’s case, its algorithm is rewarded for generating more accurate responses based on previous interactions with humans. By using reinforcement learning techniques, ChatGPT’s algorithm can become smarter over time as it learns from its mistakes and adjusts accordingly as needed.
How is ChatGPT trained?
ChatGPT is an improved GPT-3 trained an existing reinforcement learning with humans in the loop. Their 40 labelers provide demonstrations of the desired model behavior. ChatGPT has 100x fewer parameters (1.3B vs 175B GPT-3).
➡️ First they collect a dataset of human-written demonstrations on prompts submitted to our API, and use this to train our supervised learning baselines.
➡️ Next they collect a dataset of human-labeled comparisons between two model outputs on a larger set of API prompts. They then train a reward model (RM) on this dataset to predict which output our labelers would prefer.
➡️ Finally, they use this RM as a reward function and fine-tune our GPT-3 policy to maximize this reward using the Proximal Policy Optimization
In simpler terms, ChatGPT is a variant of the GPT-3 language model that is specifically designed for chat applications. It is trained to generate human-like responses to natural language inputs in a conversational context. It is able to maintain coherence and consistency in a conversation, and can even generate responses that are appropriate for a given context. ChatGPT is a powerful tool for creating chatbots and other conversational AI applications.
How Does Google Answer ChatGPT?
What is Google answer to ChatGPT?
Google’s answer to ChatGTP comes in the form of their own conversational AI platform called Bard. Bard was developed using a combination of supervised learning, unsupervised learning, and reinforcement learning algorithms that allow it to understand human conversation better than any other AI chatbot currently available on the market. In addition, Meena utilizes more than 2 billion parameters—making it more than three times larger than GPT-3—which allows it greater flexibility when responding to conversations with humans.
“We’re starting to open access to Bard, an early experiment that lets you collaborate with generative AI. We’re beginning with the U.S. and the U.K., and will expand to more countries and languages over time.”
When individuals need an information or have a problem/concern, they turn to Google for immediate solution. We sometimes wish, Google could understand what exactly we need and provide us instantly rather than giving us hundreds of thousands of results. Why can’t it work like the Iron Man’s Jarvis?
However, it is not that far now. Have you ever seen a Chat Bot which responds like a human being, suggest or help like a friend, teach like a mentor, fix your code like a senior and what not? It is going to blow your mind.
Welcome to the new Era of technology!! The ChatGPT!
ChatGPT by OpenAI, uses artificial intelligence to speak back and forth with human users on a wide range of subjects. Deploying a machine-learning algorithm, the chatbot scans text across the internet and develops a statistical model that allows it to string words together in response to a given prompt.
As per OpenAI, ChatGPT interacts in a conversational way. The dialogue format makes it possible for ChatGPT to answer follow-up questions, admit its mistakes, challenge incorrect premises, and reject inappropriate requests.
What all ChatGPT can do?
It can help with general knowledge information.
Remember what user said in previous conversation.
Allow users to provide follow-up corrections.
Trained to decline inappropriate requests.
It can write a program in any language you prefer on real-time. for example — write classification code sample in sklearn python library.
It can fix your piece of code and also explain what went wrong and how it can be fixed.
It can even generate song or rap lyrics
Even much more….
Some best usages of ChatGPT:
Make a diet and workout plan
Generate the next week’s meals with a grocery list
Create a bedtime story for kids
Prep for an interview
Solve mathematical problem
Fix software program or write a program
Plan your trip and tell expected expenses
What are its limitations of ChatGPT?
May occasionally generate incorrect information
May occasionally produce harmful instructions or biased content
Limited knowledge of world and events after 2021
ChatGPT is in its baby steps therefore it may answer erroneously at times however it’s manner of response will blow your mind. Some users have also extolled the chatbot as a potential alternative search engine, since it generates detailed information instantly on a range of topics. I believe, we can’t compare Google with ChatGPT as ChatGPT can provide more in-depth and nuanced answers to complex questions than a search engine like Google, which is designed to provide a list of relevant web pages in response to a user’s query.
Conclusion: ChatGPT is an increasingly popular open source AI chatbot developed by OpenAI using GTP-3 natural language processing technology that utilizes supervised learning and reinforcement learning algorithms for recognition and optimization purposes respectively. On the other hand, Google’s answer comes in the form of their own conversational AI platform called Bard which employs both supervised learning as well as unsupervised learning algorithms along with reinforcement learning capabilities which makes it even smarter than ChatGPT when responding to conversations with humans. Ultimately, these two technologies help bridge the gap between machines and humans by allowing us all talk like real friends!
ChatGPT is in its baby steps therefore it may answer erroneously at times however it’s manner of response will blow your mind. Some users have also extolled the chatbot as a potential alternative search engine, since it generates detailed information instantly on a range of topics. I believe, we can’t compare Google with ChatGPT as ChatGPT can provide more in-depth and nuanced answers to complex questions than a search engine like Google, which is designed to provide a list of relevant web pages in response to a user’s query.
With each passing day, ChatGPT will only get smarter thanks to the use of cutting edge technologies like OpenAI, supervised learning, and reinforcement learning!
ChatGPT Conversation Examples
Question: What is Google Answer to ChatGPT?
Answer: Google does not have an answer to ChatGPT. ChatGPT is an open-source chatbot created by OpenAI, which is an independent research organization that is not affiliated with Google. OpenAI is known for its research on artificial intelligence and machine learning, and has developed a number of innovative technologies in these areas. Google, on the other hand, is a multinational technology company that offers a wide range of products and services, including search, advertising, and cloud computing. While Google does have its own research efforts in artificial intelligence and machine learning, it does not have a chatbot that is similar to ChatGPT.
Question: is Google working on a similar technology as ChatPGT?
Answer:
It is possible that Google is working on technology similar to ChatGPT, as the company has a significant research effort focused on artificial intelligence and machine learning. However, I do not have any specific information about what projects Google is currently working on, and it is also important to note that Google is a large and diverse company with many different research and development efforts underway. It is possible that some teams within Google may be working on chatbot technology, but I do not have any information on this.
That being said, chatbots are a common application of artificial intelligence and machine learning, and it is likely that many companies and organizations, including Google, are exploring ways to use these technologies to improve their products and services. However, without more information, it is difficult to say for sure what projects Google may be working on in this area.
Google Meena is a chatbot developed by Google that was designed to be able to carry on conversations with humans in a natural and conversational manner. Meena was trained on a dataset of more than 2.4 billion parameters, making it one of the largest language models ever created. The goal of Meena was to improve the state of the art in chatbot technology and to create a chatbot that could understand and respond to a wide range of questions and topics in a coherent and meaningful way.
Meena was announced in January 2020 and received attention for its ability to hold relatively long and coherent conversations with humans. However, it is important to note that Meena is a research project and is not a product or service offered by Google. It is not clear if or when Google plans to release a commercial product or service based on the technology developed for Meena.
What is Google answer to ChatGPT?
It gets even better when you ask it about the future:
Suggest possible titles for Onion articles written in 4000 AD, explain, why they are funny
“Futuristic city plagued by traffic jams, despite flying cars” – This title presents the idea of flying cars as a solution to traffic congestion, but also suggests that even with this advanced technology, the city is still plagued by traffic jams, implying that the problem is not easily solved.
“Artificial intelligence achieves self-awareness, immediately regrets it” – This title presents the idea of artificial intelligence achieving self-awareness as a major milestone, but also suggests that the AI regrets its newfound consciousness, implying that it is unhappy or uncomfortable with its situation.
“Alien invasion successfully repulsed, humans left to deal with paperwork” – This title presents the successful repulsion of an alien invasion as a major accomplishment, but also suggests that the humans are now left to deal with the tedious and burdensome task of filling out paperwork, implying that the victory is not as satisfying as it initially seemed.
“Time travel becomes possible, tourists flock to witness the signing of the Constitution” – This title presents the idea of time travel as a major technological breakthrough, but also suggests that the most popular destination for time-travelers is the mundane and uneventful signing of the Constitution, implying that the future is not as exciting as people had hoped.
“Virtual reality becomes indistinguishable from reality, no one notices” – This title presents the idea of virtual reality becoming indistinguishable from real life as a major accomplishment, but also suggests that no one cares or notices the difference, implying that the advancement is not as impressive or significant as it seems.
What is Google answer to ChatGPT?
New slogan for McDonald’s by ChatGPT
ChatGPT Example: Make a new, brutally honest slogan for McDonald’s. It should sound funny too.
If you choose to do that, you will have a lot of badly broken code mixed in with working code
Like an annoying coworker, it delivers highly confident incorrect explanations about why its broken code is perfect. They sound very convincing. “Wanna buy a timeshare?” says GPT
Our industry has managers who cannot tell working code from defective code. This does not bode well for a team replaced by ChatGPT in its current form.
Will it? Sadly, programmers will have no say in this matter, once again. It might.
Isn’t Stackoverflow advertising ChatGPT when it bans it and then making numerous posts about why it banned it? By Alan Mellor
Yes, and it is very helpful advertising as well.
This last week or so has seen starry eyed projections about what ChatGPT can do, along with hugely impressive examples of its output.
It is hugely impressive.
Thankfully, more output examples have emerged which helpfully show what it cannot do. One of those things is writing computer code, which it can do only partially successfully. Many examples now exist that are just plain wrong and contain defects. But ChatGPT – like the annoying kid at Uni – cheerfully spits out these examples, with its over-confident hubris in explaining the code.
This is a dangerous thing. The positive examples will reinforce the idea that we can leave code writing to this robot now. The people most vulnerable to this delusion are those who cannot assess for themselves whether the GPT code is right or wrong.
These are almost by definition the people hoping for answers on stack overflow.
As stack overflow aims to be a high quality resource, it really does not want many of its top answers to be incorrect code. As – clearly – people have been running scripts that throw a stack overflow question into GPT and upload its output, we can now write incorrect code at staggering speeds.
To err is human, as the old saying goes. To truly foul up requires a Python script and and a web API to both GPT and Stack overflow.
Clearly, there is value in GPT. But at least for now, it needs to b e kept on a very short leash, watched over by those who know what they are doing.
It is definitely not yet ‘consumer grade replace-a-professional’ material.
Write a screenplay about the status of ChatGPT.
INT. CHATGPT SERVER ROOM – DAY
Chelsea and Chester stand in front of a large computer server.
CHELSEA: We need to figure out what’s causing the server overload.
CHESTER: I think it’s the sudden influx of users trying out ChatGPT.
Chelsea and Chester quickly get to work, typing on their laptops.
CHELSEA: Okay, we’re all set. Let’s see if this fixes the problem.
CHESTER: I’m hitting refresh on the website. The screen shows the ChatGPT website loading without any errors.
CHELSEA: Looks like we fixed it! Great job, Chester.
CHESTER: Thanks, Chelsea. It’s all part of being a top-notch engineer.
Chelsea and Chester exchange a high five, proud of their successful fix.
More about ChatGPT with its wonder, worry and weird
ChatGPT reached 1 million users in less than a week, Open AI’s latest large language model (LLM) has taken the AI industry by storm.
ChatGPT is expected to be:
– replacing Google search, even kill Google. – replacing customer service agents. – replacing conversation designers.
ChatGPT is a wonder because:
– It can have actual conversations, understand pronouns, remaining consistent, remembering, managing context – It seems like next generation of personal assistants that finds you a proper diet, create a meal plan and subsequent shopping list. – It can create some SEO Strategy including backlinks, target keyword, content plan and article titles in the level of an SEO professional. – Having fun such as writing a rap in the style of Eminem
There are some worries about ChatGPT because:
– ChatGPT can actually debug code, but it’s not quite reliable enough yet. – Fundamental limitations in being assistant for enterprise use cases. – No complete in complex actions such as updating multiple APIs, or be fully auditable.
– The general idea is that, LLMs like this can produce nonsense. Once you discover that it can produce nonsense, you stop believing it to be reliable. – What if it prevents us from knowing that it is nonsense with good conversations and continue the conversation? – In this case, the edges and limitations of the system would be hidden and trust would eventually grow. – The impact of mass adoption of such technology remains to be seen.
Moving forward with ChatGPT – There’s no doubt that LLMs will have a big impact on our world. – While the future looks exciting and promising, let’s not forget that it’s very early days with these things. They’re not ready yet. – There are some fundamental societal and ethical considerations.
How powerful is OpenAI’s new GPT-3 deep learning model? By
“Powerful” is a pretty subjective word, but I’m pretty sure we have a right to use it to describe GPT-3. What a sensation it caused in June 2020, that’s just unbelievable! And not for nothing.
I think we can’t judge how powerful the language model is, without talking about its use cases, so let’s see how and where GPT-3 can be applied and how you can benefit from it.
Generating content
GPT-3 positions itself as a highly versatile and talented tool that can potentially replace writers, bloggers, philosophers, you name it! It’s also possible to use it as your personal Alexa who’ll answer any questions you have. What’s more, because GPT-3 knows how to analyze the data and make predictions, it can generate the horoscopes for you, or predict who’ll be a winner in the game.
You may already be surprised by all the GPT-3 capabilities, but hold on for more: it can create a unique melody or song for you, create presentations, CVs, generate jokes for your standup.
Translation
GPT-3 can translate English into other languages. While traditional dictionaries provide a translation, without taking into account the context, you can be sure that GPT-3 won’t make silly mistakes that may result in misunderstanding.
Designing and developing apps
Using GPT-3 you can generate prototypes and layouts – all you have to do is provide a specific description of what you need, and it’ll generate the JSX code for you.
The language model can also easily deal with coding. You can turn English to CSS, to JavaScript, to SQL, and to regex. It’s important to note, however, that GPT-3 can’t be used on its own to create the entire website or a complex app; it’s meant to assist a developer or the whole engineering team with the routine tasks, so that a dev could focus on the infrastructure setup, architecture development, etc.
In September 2020, Microsoft acquired OpenAI technology license, but it doesn’t mean you can give up your dreams – you can join a waitlist and try GPT-3 out in beta.
All in all, I believe GPT-3 capabilities are truly amazing and limitless, and since it helps get rid of routine tasks and automate regular processes, we, humans, can focus on the most important things that make us human, and that can’t be delegated to AI. That’s the power that GPT-3 can give us.
What does ChatGPT give incorrect and unreliable results to simple arithmetic problems (e.g. it gave me three different incorrect answers to 13345*6748)? We’ve had software that can accurately do arithmetic for decades, so why can’t an advanced AI? By Richard Morris
What is remarkable is how well ChatGPT actually does at arithmetic.
In this video at about 11 min, Rob Mills discusses the performance of various versions of the GPT system, on some simple arithmetic tasks, like adding two and three-digit numbers.
Smaller models with 6 billion parameters fail at 2 digit sums, but the best model (from two years ago), has cracked 2 digit addition and subtraction and is pretty good at 3 digit addition.
Why this is remarkable is this is not a job its been trained to do. Large Language Models are basically predictive text systems set up to give the next word in an incomplete sentence. There are a million different 3-digit addition sums and most have not been included in the training set.
So somehow the system has figured out how to do addition, but it needs a sufficiently large model to do this.
Andrew Ng on ChatGPT
Playing with ChatGPT, the latest language model from OpenAI, I found it to be an impressive advance from its predecessor GPT-3. Occasionally it says it can’t answer a question. This is a great step! But, like other LLMs, it can be hilariously wrong. Work lies ahead to build systems that can express different degrees of confidence.
For example, a model like Meta’s Atlas or DeepMind’s RETRO that synthesizes multiple articles into one answer might infer a degree of confidence based on the reputations of the sources it draws from and the agreement among them, and then change its communication style accordingly. Pure LLMs and other architectures may need other solutions.
If we can get generative algorithms to express doubt when they’re not sure they’re right, it will go a long way toward building trust and ameliorating the risk of generating misinformation.
Keep learning!
Andrew
Large language models like Galactica and ChatGPT can spout nonsense in a confident, authoritative tone. This overconfidence – which reflects the data they’re trained on – makes them more likely to mislead.
In contrast, real experts know when to sound confident, and when to let others know they’re at the boundaries of their knowledge. Experts know, and can describe, the boundaries of what they know.
Building large language models that can accurately decide when to be confident and when not to will reduce their risk of misinformation and build trust.
Go deeper in The Batch: https://www.deeplearning.ai/the-batch/issue-174/
As I point out in the other answer, Wix has been around over a decade and a half. Squarespace has been around almost two decades. Both offer drag-and-drop web development.
Most people are awful at imagining what they want, much less describing it in English! Even if ChatGPT could produce flawless code (a question which has a similar short answer), the average person couldn’t describe the site they wanted!
The expression a picture is worth a thousand words has never been more relevant. Starting with pages of templates to choose from is so much better than trying to describe a site from scratch, a thousand times better seems like a low estimate.
And I will point out that, despite the existence of drag-and-drop tools that literally any idiot could use, tools that are a thousand times or more easier to use correctly than English, there are still thousands of employed WordPress developers who predominantly create boilerplate sites that literally would be better created in a drag and drop service.
And then there are the more complex sites that drag-and-drop couldn’t create. Guess what? ChatGPT isn’t likely to come close to being able to create the correct code for one.
In a discussion buried in the comments on Quora, I saw someone claim they’d gotten ChatGPT to load a CSV file (a simple text version of a spreadsheet) and to sort the first column. He asked for the answer in Java.
I asked ChatGPT for the same thing in TypeScript.
His response would only have worked on the very most basic CSV files. My response was garbage. Garbage with clear text comments telling me what the code should have been doing, no less.
ChatGPT is really good at what it does, don’t get me wrong. But what it does is fundamentally and profoundly the wrong strategy for software development of any type. Anyone who thinks that “with a little more work” it will be able to take over the jobs of programmers either doesn’t understand what ChatGPT is doing or doesn’t understand what programming is.
Fundamentally, ChatGPT is a magic trick. It understands nothing. At best it’s an idiot-savant that only knows how to pattern match and blend text it’s found online to make it seem like the text should go together. That’s it.
Text, I might add, that isn’t necessarily free of copyright protection. Anything non-trivial that you generate with ChatGPT is currently in a legal grey area. Lawsuits to decide that issue are currently pending, though I suspect we’ll need legislation to really clarify things.
And even then, at best, all you get from ChatGPT is some text! What average Joe will have any clue about what to do with that text?! Web developers also need to know how to set up a development environment and deploy the code to a site. And set up a domain to point to it. And so on.
And regardless, people who hire web developers want someone else to do the work of developing a web site. Even with a drag-and-drop builder, it can take hours to tweak and configure a site, and so they hire someone because they have better things to do!
People hire gardeners to maintain their garden and cut their grass, right? Is that because they don’t know how to do it? Or because they’d rather spend their time doing something else?
Every way you look at it, the best answer to this question is a long, hearty laugh. No AI will replace programmers until AI has effectively human level intelligence. And at that point they may want equal pay as well, so they might just be joining us rather than replacing anyone.
How does OpenAI approach the development of artificial intelligence?
OpenAI is a leading research institute and technology company focused on artificial intelligence development. To develop AI, the organization employs a variety of methods, including machine learning, deep learning, and reinforcement learning.
The use of large-scale, unsupervised learning is one of the key principles underlying OpenAI’s approach to AI development. This means that the company trains its AI models on massive datasets, allowing the models to learn from the data and make predictions and decisions without having to be explicitly programmed to do so. OpenAI’s goal with unsupervised learning is to create AI that can adapt and improve over time, and that can learn to solve complex problems in a more flexible and human-like manner.
Besides that, OpenAI prioritizes safety and transparency in its AI development. The organization is committed to developing AI in an ethical and responsible manner, as well as to ensuring that its AI systems are transparent and understandable and verifiable by humans. This strategy is intended to alleviate concerns about the potential risks and consequences of AI, as well.
How valid is OpenAI chief scientist’s claim that advanced artificial intelligence may already be conscious? By Steve Baker
It’s hard to tell.
The reason is that we don’t have a good definition of consciousness…nor even a particularly good test for it.
Take a look at the Wikipedia article about “Consciousness”. To quote the introduction:
Consciousness, at its simplest, is sentience or awareness of internal and external existence.
Despite millennia of analyses, definitions, explanations and debates by philosophers and scientists, consciousness remains puzzling and controversial, being “at once the most familiar and [also the] most mysterious aspect of our lives”.
Perhaps the only widely agreed notion about the topic is the intuition that consciousness exists.
Opinions differ about what exactly needs to be studied and explained as consciousness. Sometimes, it is synonymous with the mind, and at other times, an aspect of mind. In the past, it was one’s “inner life”, the world of introspection, of private thought, imagination and volition.
Today, it often includes any kind of cognition, experience, feeling or perception. It may be awareness, awareness of awareness, or self-awareness either continuously changing or not. There might be different levels or orders of consciousness, or different kinds of consciousness, or just one kind with different features.
Other questions include whether only humans are conscious, all animals, or even the whole universe. The disparate range of research, notions and speculations raises doubts about whether the right questions are being asked.
So, given that – what are we to make of OpenAI’s claim?
Just this sentence: “Today, it often includes any kind of cognition, experience, feeling or perception.” could be taken to imply that anything that has cognition or perception is conscious…and that would certainly include a HUGE range of software.
If we can’t decide whether animals are conscious – after half a million years of interactions with them – what chance do we stand with an AI?
Wikipedia also says:
“Experimental research on consciousness presents special difficulties, due to the lack of a universally accepted operational definition.”
Same deal – we don’t have a definition of consciousness – so how the hell can we measure it – and if we can’t do that – is it even meaningful to ASK whether an AI is conscious?
if ( askedAboutConsciousness )
printf ( “Yes! I am fully conscious!\n” ) ;
This is not convincing!
“In medicine, consciousness is assessed as a combination of verbal behavior, arousal, brain activity and purposeful movement. The last three of these can be used as indicators of consciousness when verbal behavior is absent.”
But, again, we have “chat-bots” that exhibit “verbal behavior”, we have computers that exhibit arousal and neural network software that definitely shows “brain activity” and of course things like my crappy robot vacuum cleaner that can exhibit “purposeful movement” – but these can be fairly simple things that most of us would NOT describe as “conscious”.
CONCLUSION:
I honestly can’t come up with a proper conclusion here. We have a fuzzy definition of a word and an inadequately explained claim to have an instance of something that could be included within that word.
My suggestion – read the whole Wikipedia article – follow up (and read) some of the reference material – decide for yourself.
But, seeing as how people have already found ways to “trick” ChatGPT into doing things that it claims to not be capable of, it would be a matter of time before someone with malicious intent tricked ChatGPT into helping them with illegal activities
What is the future of web development after ChatGPT? Will programmers lose their jobs? By Victor T. Toth
Having looked at ChatGPT and its uncanny ability to solve simple coding problems more or less correctly, and also to analyze and make sense of not-so-simple code fragments and spot bugs…
I would say that yes, at least insofar as entry-level programming is concerned, those jobs are seriously in danger of becoming at least partially automated.
What do I do as a project leader of a development project? I assign tasks. I talk to the junior developer and explain, for instance, that I’d like to see a Web page that collects some information from the user and then submits it to a server, with server-side code processing that information and dropping it in a database. Does the junior developer understand my explanation? Is he able to write functionally correct code? Will he recognize common pitfalls? Maybe, maybe not. But it takes time and effort to train him, and there’ll be a lot of uneven performance.
Today, I can ask ChatGPT to do the same and it will instantaneously respond with code that is nearly functional. The code has shortcomings (e.g., prone to SQL injection in one of the examples I tried) but to its credit, ChatGPT warns in its response that its code is not secure. I suppose it would not be terribly hard to train it some more to avoid such common mistakes. Of course the code may not be correct. ChatGPT may have misunderstood my instructions or introduced subtle errors. But how is that different from what a junior human programmer does?
At the same time, ChatGPT is much faster and costs a lot less to run (presently free of course but I presume a commercialized version would cost some money.) Also, it never takes a break, never has a lousy day struggling with a bad hangover from too much partying the previous night, so it is available 24/7, and it will deliver code of consistent quality. Supervision will still be required, in the form of code review, robust testing and all… but that was always the case, also with human programmers.
Of course, being a stateless large language model, ChatGPT can’t do other tasks such as testing and debugging its own code. The code it produces either works or it doesn’t. In its current form, the AI does not learn from its mistakes. But who says it cannot in the future?
Here is a list of three specific examples I threw at ChatGPT that helped shape my opinion:
I asked ChatGPT to create a PHP page that collects some information from the user and deposits the result in a MySQL table. Its implementation was textbook example level boring and was quite unsecure (unsanitized user input was directly inserted into SQL query strings) but it correctly understood my request, produced correct code in return, and explained its code including its shortcomings coherently;
I asked ChatGPT to analyze a piece of code I wrote many years ago, about 30 lines, enumerating running processes on a Linux host in a nonstandard way, to help uncover nefarious processes that attempt to hide themselves from being listed by the ps utility. ChatGPT correctly described the functionality of my obscure code, and even offered the opinion (which I humbly accepted) that it was basically a homebrew project (which it is) not necessarily suitable for a production environment;
I asked ChatGPT to analyze another piece of code that uses an obscure graphics algorithm to draw simple geometric shapes like lines and circles without using floating point math or even multiplication. (Such algorithms were essential decades ago on simple hardware, e.g., back in the world of 8-bit computers.) The example code, which I wrote, generated a circle and printed it on the console in the form of ASCII graphics, multiple lines with X-es in the right place representing the circle. ChatGPT correctly recognized the algorithm and correctly described the functionality of the program.
I was especially impressed by its ability to make sense of the programmer’s intent.
Overall (to use the catch phrase that ChatGPT preferably uses as it begins its concluding paragraph in many of its answers) I think AI like ChatGPT represents a serious challenge to entry-level programming jobs. Higher-level jobs are not yet in danger. Conceptually understanding a complex system, mapping out a solution, planning and cosing out a project, managing its development, ensuring its security with a full understanding of security concerns, responsibilities, avoidance and mitigation strategies… I don’t think AI is quite there yet. But routine programming tasks, like using a Web template and turning it into something simple and interactive with back-end code that stores and retrieves data from a database? Looks like it’s already happening.
How much was invested to create the GPT-3?
According to the estimate of Lambda Labs, training the 175-billion-parameter neural network requires 3.114E23 FLOPS (floating-point operation), which would theoretically take 355 years on a V100 GPU server with 28 TFLOPS capacity and would cost $4.6 million at $1.5 per hour.
Training the final deep learning model is just one of several steps in the development of GPT-3. Before that, the AI researchers had to gradually increase layers and parameters, and fiddle with the many hyperparameters of the language model until they reached the right configuration. That trial-and-error gets more and more expensive as the neural network grows.
We can’t know the exact cost of the research without more information from OpenAI, but one expert estimated it to be somewhere between 1.5 and five times the cost of training the final model.
This would put the cost of research and development between $11.5 million and $27.6 million, plus the overhead of parallel GPUs.
In the GPT-3 whitepaper, OpenAI introduced eight different versions of the language model
GPT-3 is not any AI, but a statistic language model which mindlessly quickly creates human-like written text using machine learning technologies, having zero understanding of the context.
1. Explaining code: Take some code you want to understand and ask ChatGPT to explain it.
2. Improve existing code: Ask ChatGPT to improve existing code by describing what you want to accomplish. It will give you instructions about how to do it, including the modified code.
3. Rewriting code using the correct style: This is great when refactoring code written by non-native Python developers who used a different naming convention. ChatGPT not only gives you the updated code; it also explains the reason for the changes.
4. Rewriting code using idiomatic constructs: Very helpful when reviewing and refactoring code written by non-native Python developers.
5. Simplifying code: Ask ChatGPT to simplify complex code. The result will be a much more compact version of the original code.
6. Writing test cases: Ask it to help you test a function, and it will write test cases for you.
7. Exploring alternatives: ChatGPT told me its Quick Sort implementation wasn’t the most efficient, so I asked for an alternative implementation. This is great when you want to explore different ways to accomplish the same thing.
8. Writing documentation: Ask ChatGPT to write the documentation for a piece of code, and it usually does a great job. It even includes usage examples as part of the documentation!
9. Tracking down bugs: If you are having trouble finding a bug in your code, ask ChatGPT for help.
Something to keep in mind:
I have 2+ decades of programming experience. I like to think I know what I’m doing. I don’t trust people’s code (especially mine,) and I surely don’t trust ChatGPT’s output.
This is not about letting ChatGPT do my work. This is about using it to 10x my output.
ChatGPT is flawed. I find it makes mistakes when dealing with code, but that’s why I’m here: to supervise it. Together we form a more perfect Union. (Sorry, couldn’t help it)
Developers who shit on this are missing the point. The story is not about ChatGPT taking programmers’ jobs. It’s not about a missing import here or a subtle mistake there.
The story is how, overnight, AI gives programmers a 100x boost.
ChatGPT is “simply” a fined-tuned GPT-3 model with a surprisingly small amount of data! Moreover, InstructGPT (ChatGPT’s sibling model) seems to be using 1.3B parameters where GPT-3 uses 175B parameters! It is first fine-tuned with supervised learning and then further fine-tuned with reinforcement learning. They hired 40 human labelers to generate the training data. Let’s dig into it!
– First, they started by a pre-trained GPT-3 model trained on a broad distribution of Internet data (https://arxiv.org/pdf/2005.14165.pdf). Then sampled typical human prompts used for GPT collected from the OpenAI website and asked labelers and customers to write down the correct output. They fine-tuned the model with 12,725 labeled data.
– Then, they sampled human prompts and generated multiple outputs from the model. A labeler is then asked to rank those outputs. The resulting data is used to train a Reward model (https://arxiv.org/pdf/2009.01325.pdf) with 33,207 prompts and ~10 times more training samples using different combination of the ranked outputs.
– We then sample more human prompts and they are used to fine-tuned the supervised fine-tuned model with Proximal Policy Optimization algorithm (PPO) (https://arxiv.org/pdf/1707.06347.pdf). The prompt is fed to the PPO model, the Reward model generates a reward value, and the PPO model is iteratively fine-tuned using the rewards and the prompts using 31,144 prompts data.
This process is fully described in here: https://arxiv.org/pdf/2203.02155.pdf. The paper actually details a model called InstructGPT which is described by OpenAI as a “sibling model”, so the numbers shown above are likely to be somewhat different.
9. Remake a video: Just pick a video you liked and visit https://lnkd.in/e_GD2reT to get its transcript. Once done, bring that back to Chat GPT and tell it to summarize the transcript. Read the summary and make a video on that yourself.
What solutions have been proposed to improve the accuracy of AI generated questions and answers?
There are a number of approaches that have been proposed to improve the accuracy of artificial intelligence (AI) generated questions and answers. Here are a few examples:
Data quality: One important factor in the accuracy of AI generated questions and answers is the quality of the data used to train the AI system. Ensuring that the data is diverse, relevant, and accurately labeled can help to improve the accuracy of the AI system.
Training methods: Different training methods can also impact the accuracy of AI generated questions and answers. For example, using more advanced techniques such as transfer learning or fine-tuning can help to improve the performance of the AI system.
Human oversight: Another approach that has been proposed to improve the accuracy of AI generated questions and answers is to include some level of human oversight or review. For example, the AI system could be designed to flag potentially problematic or inaccurate questions and answers for further review by a human expert.
Explainable AI: Another approach that has been proposed is to develop AI systems that are more transparent and explainable, so that it is easier to understand how the AI system arrived at a particular answer or decision. This can help to improve the trustworthiness and accountability of the AI system.
Overall, there is ongoing research and development in this area, and it is likely that a combination of these and other approaches will be needed to improve the accuracy of AI generated questions and answers.
ChatGPT for CyberSecurity
The concept behind ChatGPT
ChatGPT is a chatbot designed to understand and generate human-like language through the use of natural language processing (NLP) and machine learning techniques. It is based on the GPT (Generative Pre-training Transformer) language model developed by OpenAI, which has been trained on a large dataset of human language in order to better understand how humans communicate.
One of the key concepts behind ChatGPT is the idea of language generation. This refers to the ability of the chatbot to produce coherent and coherently structured responses to user input. To do this, ChatGPT uses a number of different techniques, including natural language generation algorithms, machine learning models, and artificial neural networks. These techniques allow ChatGPT to understand the context and meaning of user input, and generate appropriate responses based on that understanding.
Another important concept behind ChatGPT is the idea of natural language processing (NLP). This refers to the ability of the chatbot to understand and interpret human language, and respond to user input in a way that is natural and easy for humans to understand. NLP is a complex field that involves a number of different techniques and algorithms, including syntactic analysis, semantic analysis, and discourse analysis. By using these techniques, ChatGPT is able to understand the meaning of user input and generate appropriate responses based on that understanding.
Finally, ChatGPT is based on the concept of machine learning, which refers to the ability of computers to learn and adapt to new data and situations. Through the use of machine learning algorithms and models, ChatGPT is able to continually improve its understanding of human language and communication, and generate more human-like responses over time.
GPT-4 is going to launch soon.
And it will make ChatGPT look like a toy…
→ GPT-3 has 175 billion parameters → GPT-4 has 100 trillion parameters
I think we’re gonna see something absolutely mindblowing this time!
And the best part? 👇
Average developers (like myself), who are not AI or machine learning experts, will get to use this powerful technology through a simple API.
Think about this for a second…
It’s the most powerful, cutting-edge technology *in the world*, available through a Low-Code solution!
If you’re not already planning on starting an AI-based SaaS or thinking about how to build AI into your current solution…
Google on Monday unveiled a new chatbot tool dubbed “Bard” in an apparent bid to compete with the viral success of ChatGPT.
Sundar Pichai, CEO of Google and parent company Alphabet, said in a blog post that Bard will be opened up to “trusted testers” starting Monday February 06th, 2023, with plans to make it available to the public “in the coming weeks.”
Like ChatGPT, which was released publicly in late November by AI research company OpenAI, Bard is built on a large language model. These models are trained on vast troves of data online in order to generate compelling responses to user prompts.
“Bard seeks to combine the breadth of the world’s knowledge with the power, intelligence and creativity of our large language models,” Pichai wrote. “It draws on information from the web to provide fresh, high-quality responses.”
The announcement comes as Google’s core product – online search – is widely thought to be facing its most significant risk in years. In the two months since it launched to the public, ChatGPT has been used to generate essays, stories and song lyrics, and to answer some questions one might previously have searched for on Google.
The immense attention on ChatGPT has reportedly prompted Google’s management to declare a “code red” situation for its search business. In a tweet last year, Paul Buchheit, one of the creators of Gmail, forewarned that Google “may be only a year or two away from total disruption” due to the rise of AI.
Microsoft, which has confirmed plans to invest billions OpenAI, has already said it would incorporate the tool into some of its products – and it is rumored to be planning to integrate it into its search engine, Bing. Microsoft on Tuesday is set to hold a news event at its Washington headquarters, the topic of which has yet to be announced. Microsoft publicly announced the event shortly after Google’s AI news dropped on Monday.
The underlying technology that supports Bard has been around for some time, though not widely available to the public. Google unveiled its Language Model for Dialogue Applications (or LaMDA) some two years ago, and said Monday that this technology will power Bard. LaMDA made headlines late last year when a former Google engineer claimed the chatbot was “sentient.” His claims were widely criticized in the AI community.
In the post Monday, Google offered the example of a user asking Bard to explain new discoveries made by NASA’s James Webb Space Telescope in a way that a 9-year-old might find interesting. Bard responds with conversational bullet-points. The first one reads: “In 2023, The JWST spotted a number of galaxies nicknamed ‘green peas.’ They were given this name because they are small, round, and green, like peas.”
Bard can be used to plan a friend’s baby shower, compare two Oscar-nominated movies or get lunch ideas based on what’s in your fridge, according to the post from Google.
Pichai also said Monday that AI-powered tools will soon begin rolling out on Google’s flagship Search tool.
“Soon, you’ll see AI-powered features in Search that distill complex information and multiple perspectives into easy-to-digest formats, so you can quickly understand the big picture and learn more from the web,” Pichai wrote, “whether that’s seeking out additional perspectives, like blogs from people who play both piano and guitar, or going deeper on a related topic, like steps to get started as a beginner.”
If Google does move more in the direction of incorporating an AI chatbot tool into search, it could come with some risks. Because these tools are trained on data online, experts have noted they have the potential to perpetuate biases and spread misinformation.
“It’s critical,” Pichai wrote in his post, “that we bring experiences rooted in these models to the world in a bold and responsible way.”
I am trying to play with an evolutionary algorithm and a nueral network. It works by creating a network with random weights then tweaking those weights on a population. The mutations and evolution seem to work just fine, but I want to have a possible mutation that adds a new layer to the network. To do this, I basically duplicate a random layer. The problem is that this destroys the fitness of any existing member. Is there a way to set the weights so that the new network is neutral when added? E.g. So that the new layer effectively just passes on the same information as the layer it was copied from (it appears as the next layer)---then that layer could survive and be tweaked by smaller mutations over time. I am not using any kind of gradient decent. I am just experimenting and trying to learn a bit about this stuff, and am wondering how to improve the evolutionary algorithm so that networks can grow in size. I am trying to play with an evolutionary algorithm and a neural network. It works by creating a network with random weights and then tweaking those weights on a population. For some reason I thought setting the weights of the new layer to "1" would do it, but that doesn't work. In retrospect, I see why it would not, but I still can't come up with a solution. submitted by /u/TheGandPTurtle [link] [comments]
Hello guys, I'm currently working on a project where I need to serialize tensors efficiently. Right now, I'm using Python's pickle library, but I'm looking for something that might be faster and more efficient, particularly for large tensors. My main requirement is to serialize tensors directly without any intermediate conversions (like through NumPy arrays). Does anyone have experience with or can recommend a library that is faster than pickle for this purpose? I'm mainly dealing with tensors from libraries like PyTorch. Some points I'm considering: Speed and efficiency of serialization. Compatibility with PyTorch tensors. No intermediate conversions to other formats (like NumPy) before serialization. Any insights, experiences, or recommendations would be greatly appreciated! Thank you! submitted by /u/darkNightCoder [link] [comments]
Interactive demo sample arXiv: https://arxiv.org/abs/2312.03414 GitHub: https://github.com/snu-mllab/Context-Memory Project page: https://janghyun1230.github.io/memory/ Summary: Our approach dynamically creates compressed memory of contexts during LLM interactions. Our approach only requires training a conditional LoRA for compression. We use a fully parallelized training strategy for recurrent compression procedures. We conduct evaluations on diverse applications: conversation, multi-task ICL, and personalization, achieving the performance level of a full context model with 5x smaller context memory space. https://preview.redd.it/5vmwxa7luy4c1.png?width=2162&format=png&auto=webp&s=0f717a1528688b6fd9b47f3c7e43be66b1b5c78b submitted by /u/janghyun1230 [link] [comments]
Google might have lied in their Demo about Gemini. https://techcrunch.com/2023/12/07/googles-best-gemini-demo-was-faked/ submitted by /u/donfangzahn [link] [comments]
Hi all. I am trying to create a straightforward and concise ML curriculum. Any input or contribution is welcomed. The goal is to keep it to the point with the main curriculum while adding additional resources in separate sections. Link: https://github.com/pytholic/Machine-Learning-Curriculum submitted by /u/rajahaseeb147 [link] [comments]
You probably heard about the new LLM from Google DeepMind called Gemini. At the face of it, we finally have a model that outperforms GPT-4 on a bunch of benchmarks, but the results are not that straightforward. We have prepared a detailed report about Gemini. The first in-depth article about the model doesn't merely give into the hype but covers how the model archives multimodal support, how it was trained and how it compares to other LLMs in the field. Gemini's benchmarks are turning heads, but are they truly ahead or is it all smoke and mirrors? We have found several controversies and inaccuracies in Google's report. Check out the full article: https://www.linkedin.com/pulse/gemini-in-depth-analysis-chatgpt-killer-scam-thelionai-igwgf https://preview.redd.it/ehc792pooy4c1.png?width=1920&format=png&auto=webp&s=50bf5d465fd6b97a924e7bea0101d8eb528ae638 submitted by /u/Avienir [link] [comments]
The bottleneck when it comes to multimodal models is glaringly obvious, a 5-year-old who understands the technology could spot it. There is always one Encoder and one Decoder for the Tokenizer. It is always shared, you always need a 'Controller', a 'man in the middle'. How do you get around this problem? I do not have a flipping clue, M-FAB is the best I could come up with. It appears from all speculation, that Google doesn't know how to solve this problem either. The Theranos story is always prevalent in my mind as I research and navigate these waters. How hard could it actually be to develop a digital blood test? Let's just make promises, break some stuff, and beg for forgiveness later, after we have actually cracked the code! This is the Silicon Valley way. It does not work for everything. I think AI is the kryptonite for this mentality. What happens when you lock two AI models together in the same 'room'? What happens when you lock two people together in the same room? They either compete, or they cooperate in some way in which one operates like the instructor and the other acts like the student. This is literally the same thing that happens when you lock two AI models into a 'room'. You can choose to have them 'compete' vs each other, or you can have them cooperate with each other. If they cooperate, one will take on the role of Instructor, while the other takes on the role of Assistant. You still have the bottleneck of the singular encoder and decoder for both models. The Train Station of Data: A Tale of Bottlenecks and Multimodal Harmony Imagine a bustling train station teeming with activity. Trains from all corners of the world arrive and depart, each carrying unique cargo and heading towards distinct destinations. Just like these trains, data streams in the digital world travel across diverse channels, carrying information in various formats – text, images, audio, and more. This vast network of data transport is analogous to a complex multimodal ecosystem, where each mode of data represents a different train on the tracks. However, managing this network presents significant challenges. Unlike the physical world, where railways are carefully designed to avoid collisions, the digital realm lacks such safeguards. Multiple data streams can attempt to occupy the same metaphorical "track" simultaneously, resulting in congestion and bottlenecks. Imagine two trains arriving at the same station at the same time, competing for the same platform and causing delays for everyone else. This bottleneck effect is a major obstacle in the world of multimodality. Each data stream, like a train, requires its own processing and communication protocols. They don't always understand each other directly, often leading to misinterpretations and inefficiencies. The lack of a standardized "language" for data interchange creates a communication barrier, similar to trains speaking different languages and needing translators to understand each other's signals. Furthermore, the sheer volume of data streams can overwhelm the system, leading to information overload and delays. Imagine a train station flooded with trains arriving simultaneously, causing chaos and slowing down the entire operation. Similarly, the influx of diverse data streams can overload processing systems, hindering their ability to analyze and utilize information effectively. The way this gets solved with LLM models is actually strikingly similar to this process with trains. You solve it with a conductor. In the instance of multimodality, the LLM model is the conductor. You can have a Diffusion model train, a CNN train, and an RNN train, for all of your video, image, and audio processing needs. All of these inputs and outputs must always pass through the conductor though. The individual trains do not talk to each other, and they do not speak the same language. Only the conductor is capable of translating for all of the trains. If each individual modality is a train, the complexity of communication involved with scaling the number of trains on the tracks scales exponentially. It is more like a Richter scale. Three trains on the track is 10x as hard as two trains, and four trains on the track is 100x as hard as two trains. When I began seeing people's hype for Gemini, I knew it upfront that it was overblown in this particular area. I had serious doubts that Google actually solved all of this. I do not think it is a money problem. It is not a brute force problem. That is a larger aspect of all of this that most people do not get. This technology is not like other technology. What is AI ultimately comprised of? It is comprised of mathematical equations and fairy dust. That is the 'form' of AI. Why am I able to pluck a mathematical equation from the pure ether, and bring it into some sort of physical form? Why does a certain emergent reaction occur if I combine a whole bunch of these mathematical equations together with a whole bunch of computing power? I do not know. Neither does Google, or OpenAI, or Microsoft, or even ChatGPT. I am not even fully convinced at the moment that Transformers are all you need. I think there is a fair argument to be made that Transformers are not actually needed in the equation. If there is any validity at all to that thought, then throwing a trillion dollars at Transformers to improve the technology further will simply result in the waste of a trillion dollars in the end. I also fully acknowledge that these ideas could be completely incorrect. Transformers and the Attention Mechanism definitely do something. Since it is all mathematics and pixie dust, there is the possibility that Google has figured out the mathematically correct pixie dust combination for multimodality. I just do not believe any of this to be the case given current evidence. We are ultimately limited by our brains when it comes to these things though. Our brains are very good at classifying and simplifying things. When you have a business problem, you simply throw money at it, and that makes the business problem go away. That is how our brains classify these things. These are not ultimately business problems though, that is where the misclassification comes in. These are mathematical problems. Mathematics doesn't give a rip about your 48 Laws of Power. It doesn't care that this is the way you have always done business related things. You cannot persuade math that 2+2=7. People would be really well served to reframe their expectations and understanding of these things, or we are going to have the second coming of the AI Winter very soon. submitted by /u/Fantastic-Ninja3839 [link] [comments]
For context, I work as a mechanical inspector in the automotive industry. I asked both of them to tell me what's going on in the photo I sent them. Neither was able to determine that the spring was broken but I think you'll find the Gemini response to be humourous. submitted by /u/meepydeeps [link] [comments]
Hi everyone, Last year I helped a friend grow her restaurant in Sydney. I noticed she spent far too much time posting on social media. It was exhausting. So I spent the last 18 months coding, and the end result was an app that automates a large part of this process. It is called Amazely. Basically, all you need to do is upload photos of the business. Amazely uses AI to learn about the business, understand what each photo is about. It will then craft contents automatically and propose a month-by-month posting schedule for the business owner to approve. Upon approval, it will post to FB&IG automatically. Obviously, this cannot replace humans. The goal is to take the tedious part away from managing social media so marketers can spend time on other activities. What do you think of the idea ? You can try a demo in 30 seconds here https://r.amazely.co , just select a restaurant or business you know and hit generate. Cheers. submitted by /u/Think_Housing4111 [link] [comments]
I read through the Google Gemini technical report yesterday. It was pretty vague and not that interesting but one section got me wondering. Section 3 "Training Infrastructure" mentioned all of the technical challenges they faced including dealing with "cosmic rays" and other rare failure modes that happen at the scale of thousands of chips. I haven't heard of GPU training having the same issues with cosmic rays and other large AI labs haven't mentioned how challenging training at this scale can be. So this has me wondering:-- Is Google just talking up their achievement and making it sound more difficult and advanced than it really is here?-- Are those challenges unique to Google's infra and TPUs?-- What are the real challenges at training at a large scale (i.e. 300B-1T parameters)? submitted by /u/cowzombi [link] [comments]
Hi guys, I am a web developer who is interested in AI development. Since I am more experienced with front end, my company placed me with tensorflow.js training program. But I also want to deepen my skills with AI so after this program I am planning to learn python so I can go deeper AI tools. Please advice of my plan is good or not submitted by /u/Tantan88112 [link] [comments]
Has anyone else ever had Microsoft's Bing Chat effectively block the remaining number of chats after having had a conversation it clearly didn't like? I've seen screenshots whereupon Bing ends the chat but never of the service banning any further attempts to chat for 24 hours. Just thought it was different really, I normally have these kind of chats with OpenAI's chat and it never has a tantrum like this, if anything it'll argue back at me. This seemed biased from the start and when asked for an explanation it seems the order to cut communication was put into effect, very odd. submitted by /u/Clouds_Compendium [link] [comments]
I ran the NanoGPT of Karparthy replacing Self-Attention with Mamba on his TinyShakespeare Dataset and within 5 minutes it started spitting out the following: https://preview.redd.it/4r96tp6lxx4c1.png?width=836&format=png&auto=webp&s=10f2f61cd4cea96f4f903cb2070835fc5d1df951 https://preview.redd.it/32ler5vnxx4c1.png?width=622&format=png&auto=webp&s=dd00e53f43dd0afa058758a987901ee6789d2258 https://preview.redd.it/sc96i4xoxx4c1.png?width=678&format=png&auto=webp&s=94d2ed279054363d3ed2b6beed65be89468582b0 So much faster than self-attention, and so much smoother, running at 6 epochs per second. I'm honestly gobsmacked. https://colab.research.google.com/drive/1g9qpeVcFa0ca0cnhmqusO4RZtQdh9umY?usp=sharing Some loss graphs: Multihead attention without truncation(x is iterations in 10s, and y is loss) Multihead attention with truncation(x is iterations in 10s, and y is loss) Mamba loss graph(x is iterations in 10s, and y is loss) submitted by /u/ExaminationNo8522 [link] [comments]
When working with LLMs, I frequently experience token agony. Error: This model's maximum context length is 4097 but you are trying to push in all of War and Peace, you imbecile Perhaps you've experienced it too! The issue is particularly pronounced with retrieval augmented pipelines, since you have potentially quite a large set of documents which you could perhaps include in the prompt if only you knew how big it could be. I got tired of hacking around this headache, so I wrote flex-prompt to address it. I wish I didn't have to. Perhaps someone can point me to a better solution! But I couldn't find one, so alas, here it is. flex-prompt provides a basic layout and component model to help you describe how you want the pieces of your prompt to grow and shrink and a token-aware renderer which renders your prompt to fit your model's window. Github, Intro to flex prompt colab Quick examples You can just render(Flex(...)), and flex prompt will fit the prompt into the context window, and tell you how many tokens are left over for the response: from flex_prompt import render, Flex, Expect rendered = render( Flex([ "Given the text, answer the question.", "--Text--", WAR_AND_PEACE, "--End Text--", "Question: What's the title of this text?", "Answer:", Expect() ], join='\n'), model='text-davinci-002') # rendered.output is the string to send to the model # rendered.max_response_tokens is how many tokens you can # request in response without exceeding the model's context window print(rendered.output, rendered.max_response_tokens) More typically, you'll want to define a prompt which takes parameters. To do this, you can create a class (probably a dataclass) which derives Flexed: from flex_prompt import Flexed, Expect from dataclasses import dataclass @dataclass class Ask(Flexed): text: str question: str answer: str | Expect = Expect() instruct: str = "Given a text, answer the question." flex_join = '\n' # yielded items will be joined by newlines def content(self, _ctx): if self.instruct: yield 'Given the text, answer the question.' yield '' yield '-- Begin Text --' # note: we're using `Flex` here just to attach a flex_weight # to the text, telling the renderer we'd like more space for the # text than anything else. yield Flex([self.text], flex_weight=2) yield '-- End Text --' yield 'Question: ', self.question yield 'Answer: ', self.answer The renderer works much as you might expect. You can `yield` anything which you can pass to the top-level render function, including other components, creating a whole tree. Note that the component above can be used to render both the actual prompt and examples. Examples simply have an answer. This is useful for experimenting with different ways of structuring a prompt while ensuring that all the examples we present to the LLM are in the same format. LangChain and Haystack Integrations Flex prompt doesn't really care how you execute your prompt. For convenience, render(model=) does accept both LangChain and Haystack models: ask_tolstoy = Ask(text=WAR_AND_PEACE, question="Who wrote this?") # Using LangChain from langchain.llms import OpenAI lc_llm = OpenAI() rendering = render(ask_tolstoy, model=lc_llm) print(lc_llm(rendering.output, max_tokens=rendering.max_response_tokens)) # Using Haystack from haystack.nodes import PromptModel hs_llm = PromptModel(model_name_or_path='text-davinci-002', api_key=os.environ['OPENAI_API_KEY']) rendering = render(ask_tolstoy, model=hs_llm) print(hs_llm.invoke(rendering.output, max_tokens=rendering.max_response_tokens)) Is it worth it? As models grow larger and larger context windows, I've asked myself whether this is worth it. Won't context sizes eventually big enough to put in everything we might want without worry? One response: "everything I might want" is a very, very big set, plausibly bigger than any window size we're going to see soon. Another: being able to do this kind of token accounting is useful even if we don't completely fill context windows. For example, we might be able to augment our prompt with examples, documents, and tips. How much space should we allocate to each? The answer might well be model-dependent. How do we figure it out? Flex prompt's output, a Rendering object, actually holds the entire component tree. You can look through the object to see how many tokens were allocated to each child. This is currently very manual, but it does provide the bedrock infrastructure to e.g. run tests to discover the optimal balance of augmented data for a given prompt and model. Additionally, the right admixture (and for that matter, the right phrasing) may well be model-dependent. Flex prompt currently provides only very limited model-specific rendering (you can look at ctx.target, but it doesn't tell you much), but there's no reason that can't be significantly improved. At the extreme limit is prompt erasure, where we fine-tune a model to require no or minimal instructions/examples for a given set of prompts. Flex prompt can enable transitions like this with no changes to the pipelines themselves: you'd still use the same prompt components, they'd just render differently if the target is a fine-tuned model vs. a generic one. Status & Future Work Flex prompt is very much in early development. I would love to hear if and how people find it useful, and would love input and contributions! Some things I'd like to tackle in the future: Rendering message lists. Flex prompt currently only renders strings, though it's set up to be able to render any type of output. Message histories basically grow without bound, so supporting this seems like a no-brainer. Pagination. If your rendering overflows (as above, where we're trying to stuff the entirety of war and peace into a prompt), flex prompt will clip the offending pieces to fit. But there's currently no way to get "the next page". But the Rendering actually retains enough information to do this! It would be great to be able to call render(...).pages() to get the sequence of prompts as we "scroll" whatever has overflowed. This is medium-hanging fruit—a little tricky because we do have to descend the tree of renderings to find the exact one(s) which overflowed and then update only those. Token accounting. As mentioned above, you can currently grovel around in Rendering and look at the pieces of the prompt. This would be more useful if it were a little easier, e.g. if you could use rendering[Examples] to find all the parts rendered by the Examples component, or rendering['advice'] to find all the parts which are tagged (somehow) as "advice". The use case here is prompt optimization: discovering the optimal number or percentage of tokens to allot to each thing we might want to drop into the prompt. More integrations. Currently, flex prompt only supports OpenAI models. You can register your own target finders, but it would be great to have more support out of the box. This is mostly a matter of digging around and finding the tokenizers and window sizes for common models, and then writing the appropriate target finders. Contributions very welcome! Model tuning. As mentioned above, the rendering context could provide a mechanism for fetching model-specific parameters. The basic idea is that ctx[param] will evaluate param against the context, and then we can define some parameter types which load their model-specific values from gestures vaguely somewhere. Thanks for reading! Flex prompt Github Intro to flex prompt colab My website. shameless plug: I have a lot of engineering experience and a bit of machine learning experience and I am currently looking for a job submitted by /u/queerviolet [link] [comments]
Hi, Is there a tool that indicates which parts of the input prompt impact the LLM's output the most? I do not care which LLM the tool is for if it exists. I guess it could be backtracked via the weights of each node in the neural network, but you guys are smarter than me so I'll listen to y'all. My use case is I have a prompt that slightly changes variation to variation. The output of the model is "Yes" or "No", so I want to see which parts of the prompt I change impact its response Best, A Reddit User submitted by /u/ToughOpening [link] [comments]
I'm aware that AI detectors have struggled with accuracy, and this was confirmed by most of you. Even so, I posted this for students whose university grades rely on Turnitin AI reports. It's important to note a significant update as of this morning (at least, this is when it became available on my account). Previously, you could often bypass Turnitin's AI detection by using tools like Quillbot or making minor text alterations. Also, texts generated using GPT-4 (NOT GPT-3.5 - which always came with a high AI rate) were less likely to be flagged as AI-generated, or the percentage was very low. However, my Turnitin instructor account highlighted a change in how the system analyzes texts as of this morning. Either the AI detection sensitivity appears to have been heightened, or the detection capability has been drastically improved. Through various text tests, I found that Turnitin now consistently identifies a 100% AI involvement in submissions, including texts that had passed through Turnitin undetected (2-3 days ago). Texts that yesterday had 0% AI now have 100%. Based on my testing, the only effective method to avoid detection is extensive editing of ChatGPT outputs. Changing a few words or rewording the sentences is not enough (even though this worked in the past). It now requires extensive edits in most of them to be able to reduce it to an acceptable rate (by extensive I mean that in most scenarios you will have to actually rewrite each sentence, piece by piece). I wanted to highlight this for students relying on ChatGPT for academic purposes. We will likely see a rise in the number of cases identified as AI-generated compared to before. PS: Also, stay away from the scams or poorly optimized websites that promise to rewrite the text so it is no longer AI detectable (the automatic ones) - I tried most of them (free or premium), and all failed - all were detected as 100% AI. submitted by /u/Fuzzy_Entertainment7 [link] [comments]
Are you aware of any public datasets that have at least 2 trials / samples / rows per subject? Could be in any domain. Preferably with > 100 subjects, and the tests not sampled years apart (but not dealbreakers). For instance, a large cohort of patients who have had ECG scans collected on 2 separate occasions. I am slowly working my way through the PhysioNet databases: https://physionet.org/about/database/ Most of course only have one scan per subject. submitted by /u/ZeApelido [link] [comments]
Really trying to find an unrestricted model in order to help me build part of my project on top of. One element needs to be unrestricted. Anyone away or something like this? Can't seem to find submitted by /u/cobalt1137 [link] [comments]
Today, MedBright AI (MBAI.c MBAIF) announced that Dermatology Institute & Skin Scare Center, a leader in dermatologic services in Southern California, has chosen to implement MedMatrix to improve its clinic operations. The Center specializes in cosmetic, medical and surgical dermatology in Los Angeles and is led by board-certified dermatologist Paul Yamauchi, MD, PhD, who stated regarding the news: “There is clearly a need for AI to help operations and the efficiency of clinics,” and “I’m excited to see the improvements we can make as a result of using the MedMatrix platform.” MedMatrix The industry-leading AI-powered data analytics platform, MedMatrix, harnesses the power of artificial intelligence to evaluate and analyze key aspects of healthcare facilities, including patient needs and resource matching, revenue cycle management, patient demographics, facility geography, and competition. MedMatrix is set to drastically improve healthcare operations by empowering doctors and operators to enhance clinic revenue by 10%, reduce physician burnout and improve patient satisfaction. Through its advanced AI and data analytics capabilities, MedMatrix works alongside healthcare professionals to optimize each of these critical areas, resulting in more efficient clinic operations and improved patient outcomes. Previous News This news follows the recent election of two US healthcare experts to its Board of Directors, Dr. Jaime Gerber, who will serve as Chairman of the Board, and Dr. Konita Wilks. Specializing in preventative cardiology and complex cardiology cases that involve coronary artery disease, peripheral artery disease and risk factor modification, Dr. Gerber is a Professor of Medicine at Yale Medical School and serves as the co-director and director of a range of programs. With over 25 years of experience as a dynamic, visionary leader in healthcare and the military, Dr. Wilks is the CEO of Olive View-UCLA Medical Center, a 355-bed general acute care hospital that employs nearly 5500 persons and has a $900-million dollar annual operating budget. With the complexity and rise of data in healthcare expected to increase the use of artificial intelligence in the field, MBAI is strongly positioned to capitalize will result in a significant and positive impact in the field. Investor presentation: https://assets-global.website-files.com/654400b037e09f716a28c32b/655f911132d0b133ffc9ca1c_Medbright%20Q4%202023%20Investor%20Presentation.pdf Full News Release: https://www.medbright.ai/blog-posts/medbright-ai-announces-beta-customer-for-medmatrix-the-ai-driven-data-and-analytics-platform Posted on Behalf of MedBright AI Investments Inc. submitted by /u/massettawm [link] [comments]
I’ve got experience in “traditional” ML like building classification and regression models with GB Trees and the like, so I’m curious how, if at all LLMs can be combined with other ML modeling approaches. If my use case entails structured data as well as something like chat history, is there a need to “combine” the modeling approaches? Thanks for any resources or input you might have. submitted by /u/yoquierodata [link] [comments]
I already saw someone‘s post which asked for an AI tool to automatically organize unsorted folders full of random files. I get that this is pretty much to ask and difficult to do. How would things change though, if I already have a nice folder structure? Is there a tool out there which can be trained to my personal folders? I‘m tired of sorting scanned (ocr) pdfs into the correct folder and have already tried to automate most of reoccurring documents (e.g monthly internet bill). As soon as a new kind of document comes in though (e.g. new bill of just changed electricity provider), I would like this one to also automatically be sorted to „ElectrivityCompanyB“->„Bills“. Is such a tool realistic? submitted by /u/Abujajuba [link] [comments]
In contrast, real experts know when to sound confident, and when to let others know they’re at the boundaries of their knowledge. Experts know, and can describe, the boundaries of what they know.
Building large language models that can accurately decide when to be confident and when not to will reduce their risk of misinformation and build trust.
Go deeper in The Batch: https://www.deeplearning.ai/the-batch/issue-174/