What are the top 3 methods used to find Autoregressive Parameters in Data Science?


To find autoregressive parameters, you first need to understand what autoregression is. Autoregression is a statistical method that models a variable as a linear regression on lagged values of that same variable. In other words, it is a model that uses past values of a dependent variable to predict future values of that same variable.

In time series analysis, autoregression is the use of previous values in a time series to predict future values. In other words, it is a form of regression in which the dependent variable is forecast as a linear combination of its own past values. The parameter values for the autoregression model are typically estimated using the method of least squares.

The autoregressive parameters are the coefficients in the autoregressive model. These coefficients can be estimated in a number of ways, including ordinary least squares (OLS), maximum likelihood (ML), or least squares with L1 regularization (LASSO). Once estimated, the autoregressive parameters can be used to predict future values of the dependent variable.

To find the autoregressive parameters, you need to use a method known as least squares regression. This method finds the parameters that minimize the sum of the squared residuals. The residual is simply the difference between the predicted value and the actual value. So, in essence, you are finding the parameters that best fit the data.
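As a quick sketch (using NumPy on simulated data, since the post itself includes no code), least-squares estimation of an AR(1) parameter amounts to regressing the series on its own lagged values:

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulate an AR(1) series: y_t = 0.7 * y_{t-1} + noise
n, phi = 500, 0.7
y = np.zeros(n)
for t in range(1, n):
    y[t] = phi * y[t - 1] + rng.normal()

# Regress y_t on y_{t-1}: the design matrix holds the lagged values
X = y[:-1].reshape(-1, 1)   # y_{t-1}
target = y[1:]              # y_t

# Least squares finds the beta minimizing the sum of squared residuals
beta, *_ = np.linalg.lstsq(X, target, rcond=None)
print(round(beta[0], 2))    # close to the true value 0.7
```

`np.linalg.lstsq` solves the same minimization described above; with more lags you would simply add more columns of lagged values to `X`.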


How to Estimate Autoregressive Parameters?

There are three main ways to estimate autoregressive parameters: ordinary least squares (OLS), maximum likelihood (ML), and least squares with L1 regularization (LASSO).

Ordinary Least Squares: Ordinary least squares is the simplest and most common method for estimating autoregressive parameters. This method estimates the parameters by minimizing the sum of squared errors between actual and predicted values.

Maximum Likelihood: Maximum likelihood is another common method for estimating autoregressive parameters. This method estimates the parameters by maximizing the likelihood function. The likelihood function is a mathematical function that quantifies the probability of observing a given set of data given certain parameter values.

Least Squares with L1 Regularization: Least squares with L1 regularization (LASSO) is another method for estimating autoregressive parameters. It minimizes the sum of squared errors between actual and predicted values while also penalizing large coefficients. L1 regularization adds to the error function an extra term proportional to the sum of the absolute values of the estimated coefficients, which shrinks some of them exactly to zero and yields sparser models.

Finding Autoregressive Parameters: The Math Behind It
To find the autoregressive parameters using least squares regression, you first need to set up your data in a certain way. You need to have your dependent variable in one column and your independent variables in other columns. For example, let’s say you want to use three years of data to predict next year’s sales (the dependent variable). Your data would look something like this:

| Year | Sales |
|------|-------|
| 2016 | 100 |
| 2017 | 150 |
| 2018 | 200 |

Next, you need to calculate the means for each column. For our sales example, that would look like this:


$$ \bar{x} = \frac{2016+2017+2018}{3} = 2017, \qquad \bar{y} = \frac{100+150+200}{3} = 150 $$

Now we can calculate each element in what’s called the variance-covariance matrix:

$$ \operatorname {Var} (X)=\sum _{i=1}^{n}\left({x_{i}}-{\bar {x}}\right)^{2} $$



$$ \operatorname {Cov} (X,Y)=\sum _{i=1}^{n}\left({x_{i}}-{\bar {x}}\right)\left({y_{i}}-{\bar {y}}\right) $$

For our sales example, with $X$ = year and $Y$ = sales, those calculations look like this:

$$ \operatorname {Var} (X)=\sum _{i=1}^{3}\left({x_{i}}-{\bar {x}}\right)^{2}=(2016-2017)^{2}+(2017-2017)^{2}+(2018-2017)^{2}=2 $$


$$ \operatorname {Cov} (X,Y)=\sum _{i=1}^{3}\left({x_{i}}-{\bar {x}}\right)\left({y_{i}}-{\bar {y}}\right)=(2016-2017)(100-150)+(2017-2017)(150-150)+(2018-2017)(200-150)=100 $$

Now we can finally calculate our least-squares slope. For a single regressor, the OLS formula reduces to:

$$ \hat {\beta }=\frac{\operatorname {Cov} (X,Y)}{\operatorname {Var} (X)}=\frac{100}{2}=50 $$

That's it! Our estimated coefficient is 50, meaning sales grow by about 50 units per year in this example. In a genuine autoregressive model, the regressor would be the lagged value of sales itself rather than the year, but the least-squares machinery is identical; an AR(1) model takes the form

$$ Y_{t+1}=\hat {\beta }\, Y_{t}+\varepsilon _{t+1}, $$

where $\varepsilon _{t+1}$ is an error term. And that's how you solve for autoregressive parameters! Of course, in reality you would be working with much larger datasets, but the underlying principles are still the same. Once you have your autoregressive parameters, you can plug them into the equation and start making predictions!
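These hand calculations are easy to sanity-check in a few lines of plain Python:

```python
years = [2016, 2017, 2018]
sales = [100, 150, 200]

x_bar = sum(years) / len(years)  # mean of the years
y_bar = sum(sales) / len(sales)  # mean of the sales

# Sum-of-squares versions of variance and covariance, as in the formulas above
var_x = sum((x - x_bar) ** 2 for x in years)
cov_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(years, sales))
slope = cov_xy / var_x

print(var_x, cov_xy, slope)  # 2.0 100.0 50.0
```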

Which Method Should You Use?
The estimation method you should use depends on your particular situation and goals. If you are looking for simple and interpretable results, then Ordinary Least Squares may be the best method for you. If you are looking for more accurate predictions, then Maximum Likelihood or Least Squares with L1 Regularization may be better methods for you.


Autoregressive models STEP BY STEP:

1) Download data: The first step is to download some data. This can be done by finding a publicly available dataset or by using your own data if you have any. For this example, we will be using data from the United Nations Comtrade Database.

2) Choose your variables: Once you have your dataset, you will need to choose the variables to use in your autoregression model. In our case, we will use lagged import and export values of goods between countries as our predictors.

3) Estimate your model: After choosing your variables, you can estimate your autoregression model using the method of least squares. OLS estimation is available in many statistical software packages, such as R or Stata.

4) Interpret your results: Once you have estimated your model, it is important to interpret the results in order to understand what they mean. The coefficients represent the effect that each independent variable has on the dependent variable. In our case, the coefficients represent the effect that imports and exports have on trade balance. A positive coefficient indicates that an increase in the independent variable leads to an increase in the dependent variable while a negative coefficient indicates that an increase in the independent variable leads to a decrease in the dependent variable.

5) Make predictions: Finally, once you have interpreted your results, you can use your autoregression model to make predictions about future values of the dependent variable based on past values of the predictors.
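Assuming Python rather than R or Stata, the five steps above can be sketched end to end on simulated data (the series below is illustrative, not the UN Comtrade data):

```python
import numpy as np

rng = np.random.default_rng(3)

# Steps 1-2: obtain a series and choose the variable to model (here, a simulated "trade balance")
n, phi_true = 300, 0.8
balance = np.zeros(n)
for t in range(1, n):
    balance[t] = phi_true * balance[t - 1] + rng.normal()

# Step 3: estimate an AR(1) model by least squares (with an intercept column)
X = np.column_stack([np.ones(n - 1), balance[:-1]])
coef, *_ = np.linalg.lstsq(X, balance[1:], rcond=None)
intercept, phi_hat = coef

# Step 4: interpret -- phi_hat is the effect of last period's value on this period's
print(f"estimated AR(1) coefficient: {phi_hat:.2f}")  # close to the true 0.8

# Step 5: predict the next value from the last observed one
prediction = intercept + phi_hat * balance[-1]
print(f"one-step-ahead forecast: {prediction:.2f}")
```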

Conclusion: In this blog post, we have discussed what autoregression is and how to find autoregressive parameters. 

Estimating an autoregression model is a relatively simple process that can be done in many statistical software packages, such as R or Stata.

In statistics and machine learning, autoregression is a modeling technique used to describe the linear relationship between a dependent variable and one or more of its lagged values. To find the autoregressive parameters, you can use least squares regression, which minimizes the sum of squared residuals. This blog post also explained how to set up your data for least squares regression and how to calculate the variance and covariance before finally calculating your autoregressive parameters. After finding your parameters, you can plug them into an autoregressive equation to start making predictions about future values!

We have also discussed three different methods for estimating those parameters: Ordinary Least Squares, Maximum Likelihood, and Least Squares with L1 Regularization. The appropriate estimation method depends on your particular goals and situation.

