Very ML
State-of-the-art Machine Learning News Feed
/r/MachineLearning
последний пост 4 часа назад
[R] Reproducible prompt protocol induces consistent self-referential responses across LLMs (Claude, GPT, Gemini)
[R] Reproducible prompt protocol induces consistent self-referential responses across LLMs (Claude, GPT, Gemini)

Your request has been blocked due to a network policy.

If you're running a script or application, please register or sign in with your developer credentials here.

Additionally make sure your User-Agent is not empty and is something unique and descriptive and try again.

if you're supplying an alternate User-Agent string, try changing back to default as that can sometimes result in a block.

If you think that we've incorrectly blocked you or you would like to discuss easier ways to get the data you want, please file a ticket here.

4 часа назад @ reddit.com
[N] Both OpenAI and DeepMind are claiming ICPC gold-level performance
[N] Both OpenAI and DeepMind are claiming ICPC gold-level performance

Your request has been blocked due to a network policy.

If you're running a script or application, please register or sign in with your developer credentials here.

Additionally make sure your User-Agent is not empty and is something unique and descriptive and try again.

if you're supplying an alternate User-Agent string, try changing back to default as that can sometimes result in a block.

If you think that we've incorrectly blocked you or you would like to discuss easier ways to get the data you want, please file a ticket here.

5 часов назад @ reddit.com
[D] How about we review the reviewers?
[D] How about we review the reviewers?

Your request has been blocked due to a network policy.

If you're running a script or application, please register or sign in with your developer credentials here.

Additionally make sure your User-Agent is not empty and is something unique and descriptive and try again.

if you're supplying an alternate User-Agent string, try changing back to default as that can sometimes result in a block.

If you think that we've incorrectly blocked you or you would like to discuss easier ways to get the data you want, please file a ticket here.

5 часов назад @ reddit.com
[D] AAAI 2026, a rejected paper just commented.
[D] AAAI 2026, a rejected paper just commented. [D] AAAI 2026, a rejected paper just commented.

Your request has been blocked due to a network policy.

If you're running a script or application, please register or sign in with your developer credentials here.

Additionally make sure your User-Agent is not empty and is something unique and descriptive and try again.

if you're supplying an alternate User-Agent string, try changing back to default as that can sometimes result in a block.

If you think that we've incorrectly blocked you or you would like to discuss easier ways to get the data you want, please file a ticket here.

6 часов назад @ reddit.com
[D] How do you handle experiment tracking across teams?
[D] How do you handle experiment tracking across teams?

Your request has been blocked due to a network policy.

If you're running a script or application, please register or sign in with your developer credentials here.

Additionally make sure your User-Agent is not empty and is something unique and descriptive and try again.

if you're supplying an alternate User-Agent string, try changing back to default as that can sometimes result in a block.

If you think that we've incorrectly blocked you or you would like to discuss easier ways to get the data you want, please file a ticket here.

6 часов назад @ reddit.com
[D] can we trust agents for time series forecasting?
[D] can we trust agents for time series forecasting? [D] can we trust agents for time series forecasting?

Your request has been blocked due to a network policy.

If you're running a script or application, please register or sign in with your developer credentials here.

Additionally make sure your User-Agent is not empty and is something unique and descriptive and try again.

if you're supplying an alternate User-Agent string, try changing back to default as that can sometimes result in a block.

If you think that we've incorrectly blocked you or you would like to discuss easier ways to get the data you want, please file a ticket here.

9 часов назад @ reddit.com
[R] Need model/paper/code suggestion for document template extraction
[R] Need model/paper/code suggestion for document template extraction

Your request has been blocked due to a network policy.

If you're running a script or application, please register or sign in with your developer credentials here.

Additionally make sure your User-Agent is not empty and is something unique and descriptive and try again.

if you're supplying an alternate User-Agent string, try changing back to default as that can sometimes result in a block.

If you think that we've incorrectly blocked you or you would like to discuss easier ways to get the data you want, please file a ticket here.

9 часов назад @ reddit.com
[D] Need suggestion for Traffic prediction Model
[D] Need suggestion for Traffic prediction Model

Your request has been blocked due to a network policy.

If you're running a script or application, please register or sign in with your developer credentials here.

Additionally make sure your User-Agent is not empty and is something unique and descriptive and try again.

if you're supplying an alternate User-Agent string, try changing back to default as that can sometimes result in a block.

If you think that we've incorrectly blocked you or you would like to discuss easier ways to get the data you want, please file a ticket here.

12 часов назад @ reddit.com
[D] WACV round 1 revised papers for round 2 -- rebuttal guidelines
[D] WACV round 1 revised papers for round 2 -- rebuttal guidelines

Your request has been blocked due to a network policy.

If you're running a script or application, please register or sign in with your developer credentials here.

Additionally make sure your User-Agent is not empty and is something unique and descriptive and try again.

if you're supplying an alternate User-Agent string, try changing back to default as that can sometimes result in a block.

If you think that we've incorrectly blocked you or you would like to discuss easier ways to get the data you want, please file a ticket here.

17 часов назад @ reddit.com
[D] How is IEEE TIP viewed in the CV/AI/ML community?
[D] How is IEEE TIP viewed in the CV/AI/ML community?

Your request has been blocked due to a network policy.

If you're running a script or application, please register or sign in with your developer credentials here.

Additionally make sure your User-Agent is not empty and is something unique and descriptive and try again.

if you're supplying an alternate User-Agent string, try changing back to default as that can sometimes result in a block.

If you think that we've incorrectly blocked you or you would like to discuss easier ways to get the data you want, please file a ticket here.

21 час назад @ reddit.com
[D] AAAI - phase 1 rejection rate?
[D] AAAI - phase 1 rejection rate?

Your request has been blocked due to a network policy.

If you're running a script or application, please register or sign in with your developer credentials here.

Additionally make sure your User-Agent is not empty and is something unique and descriptive and try again.

if you're supplying an alternate User-Agent string, try changing back to default as that can sometimes result in a block.

If you think that we've incorrectly blocked you or you would like to discuss easier ways to get the data you want, please file a ticket here.

23 часа назад @ reddit.com
Why I’m going back to the AI Agent Security Research Summit [R]
Why I’m going back to the AI Agent Security Research Summit [R]

Your request has been blocked due to a network policy.

If you're running a script or application, please register or sign in with your developer credentials here.

Additionally make sure your User-Agent is not empty and is something unique and descriptive and try again.

if you're supplying an alternate User-Agent string, try changing back to default as that can sometimes result in a block.

If you think that we've incorrectly blocked you or you would like to discuss easier ways to get the data you want, please file a ticket here.

1 day назад @ reddit.com
[P] I build a completely free website to help patients to get secondary opinion on mammogram, loading AI model inside browser and completely local inference without data transfer. Optional LLM-based radiology report generation if needed.
[P] I build a completely free website to help patients to get secondary opinion on mammogram, loading AI model inside browser and completely local inference without data transfer. Optional LLM-based radiology report generation if needed. [P] I build a completely free website to help patients to get secondary opinion on mammogram, loading AI model inside browser and completely local inference without data transfer. Optional LLM-based radiology report generation if needed.

Your request has been blocked due to a network policy.

If you're running a script or application, please register or sign in with your developer credentials here.

Additionally make sure your User-Agent is not empty and is something unique and descriptive and try again.

if you're supplying an alternate User-Agent string, try changing back to default as that can sometimes result in a block.

If you think that we've incorrectly blocked you or you would like to discuss easier ways to get the data you want, please file a ticket here.

1 day, 5 hours назад @ reddit.com
[D] A thought I had: Are Agents, KAG, etc., just a temporary form of 'prompt engineering' that will fade away?
[D] A thought I had: Are Agents, KAG, etc., just a temporary form of 'prompt engineering' that will fade away?

Your request has been blocked due to a network policy.

If you're running a script or application, please register or sign in with your developer credentials here.

Additionally make sure your User-Agent is not empty and is something unique and descriptive and try again.

if you're supplying an alternate User-Agent string, try changing back to default as that can sometimes result in a block.

If you think that we've incorrectly blocked you or you would like to discuss easier ways to get the data you want, please file a ticket here.

1 day, 9 hours назад @ reddit.com
[D] Last round interview at Canva for MLE
[D] Last round interview at Canva for MLE

Your request has been blocked due to a network policy.

If you're running a script or application, please register or sign in with your developer credentials here.

Additionally make sure your User-Agent is not empty and is something unique and descriptive and try again.

if you're supplying an alternate User-Agent string, try changing back to default as that can sometimes result in a block.

If you think that we've incorrectly blocked you or you would like to discuss easier ways to get the data you want, please file a ticket here.

1 day, 10 hours назад @ reddit.com
Towards Data Science
последний пост 5 часов назад
Analysis of Sales Shift in Retail with Causal Impact: A Case Study at Carrefour
Analysis of Sales Shift in Retail with Causal Impact: A Case Study at Carrefour Analysis of Sales Shift in Retail with Causal Impact: A Case Study at Carrefour

Additionally, it gradually builds a valuable history of sales shift impact estimates.

We consider that a reliable sales shift impact estimate should accurately assess both lost sales and portion of sales transferred to other products.

Therefore, we analyzed sales shift only when the theoretical maximum sales shift rate exceeds 5% of sales in its sub-family.

The theoretical maximum sales shift rate for this product was 9.5%, and our previous analyses showed very high sales shift rates in the dairy product family.

ConclusionIn this article we proposed a causal approach to estimate the sales shift effect when a product becomes unavailable, using Causal Impact.

5 часов назад @ towardsdatascience.com
RAG Explained: Understanding Embeddings, Similarity, and Retrieval
RAG Explained: Understanding Embeddings, Similarity, and Retrieval RAG Explained: Understanding Embeddings, Similarity, and Retrieval

Contextual Embeddings : Contextual embeddings take into account that the meaning of a word can change based on context.

Cosine similarity is a measure of how similar two vectors (embeddings) are.

Given two vectors A and B, cosine similarity is calculated as follows:Image by authorSimply put, cosine similarity is calculated as the cosine of the angle between two vectors, and it ranges from 1 to -1.

In this way, cosine similarity is the dominant metric used for quantifying the similarity between embeddings.

For example, in some cases, a chunk might only be considered if its similarity score exceeds a certain threshold (e.g., cosine similarity > 0.3).

6 часов назад @ towardsdatascience.com
Evaluating Your RAG Solution
Evaluating Your RAG Solution Evaluating Your RAG Solution

This article will give an introduction to RAG by building our own chatbot with open-source research data using LangChain and more.

Building a RAG PipelineFor our RAG solution, we will use abstracts from open-source research related to artificial intelligence.

Keep answers to 3 sentences or shorter.<|end|> <|assistant|>""" #define prompt template prompt = PromptTemplate( template=template, input_variables=["context", "question"]) #create RAG pipeline rag = RetrievalQA.from_chain_type(llm=llm, chain_type="stuff", retriever=retriever, return_source_documents=True, chain_type_kwargs={"prompt": prompt}, verbose = True) #test rag response rag.invoke("What are the most recent advancements in compu…

6 часов назад @ towardsdatascience.com
Deploying AI Safely and Responsibly
Deploying AI Safely and Responsibly Deploying AI Safely and Responsibly

In this article, four of the expert panelists each tackle one common myth about AI trust and explain what you need to know to make your AI projects a trustworthy, safe success.

– Just as elevators require strong cables and pulleys to carry weight safely, AI systems rely on scalable, reliable data pipelines.

By measuring trust metrics and making them transparent to end-users, trustworthiness improves the adoption of AI systems.

Further ReadingAbout the AuthorsVrushali Channapattan is the Director of Engineering at Okta, where she leads Data and AI initiatives with a strong focus on Responsible AI.

She is a frequent speaker on topics such as responsible AI, data quality, and observability, an…

11 часов назад @ towardsdatascience.com
ROC AUC Explained: A Beginner’s Guide to Evaluating Classification Models
ROC AUC Explained: A Beginner’s Guide to Evaluating Classification Models ROC AUC Explained: A Beginner’s Guide to Evaluating Classification Models

Now, in binary classification models, we have another way to evaluate the model, and that is ROC AUC.

We need to change the way we evaluate our model, and the best way to evaluate classification models with an imbalanced dataset is ROC AUC.

Now we know that there is another method to evaluate classification models, i.e., ROC AUC.

Now we need to interpret the ROC curve, and for that we will generate the ROC curve using Python on our dataset.

$$\text{Orange Rectangle: } l \times b = 0.33 \times 0.33 = 0.11$$$$\text{Green Rectangle: } l \times b = 0.34 \times 0.33 = 0.11$$$$\text{Red Rectangle: } l \times b = 0.33 \times 0.33 = 0.11$$$$\text{Total AUC} = 0.11 + 0.11 + 0.11 = 0.33$$This way we …

12 часов назад @ towardsdatascience.com
Building a Unified Intent Recognition Engine
Building a Unified Intent Recognition Engine Building a Unified Intent Recognition Engine

Yet across enterprise teams, intent recognition often happens in silos, each team building bespoke pipelines for different products, from troubleshooting assistants to chatbots and issue triage tools.

Wouldn’t it be better to create a modular system that different teams could configure for their own needs without starting from scratch?

That question set us on the path to what we now call the Unified Intent Recognition Engine (UIRE).

Preprocessing steps, summarization (if required), embedding and vector search tools (like MiniLM, SBERT, FAISS, Pinecone), similarity scoring logic, threshold tuning frameworks,.

Closing ThoughtsThe Unified Intent Recognition Engine is less a packaged product an…

1 day, 6 hours назад @ towardsdatascience.com
Using Python to Build a Calculator
Using Python to Build a Calculator Using Python to Build a Calculator

Let’s utilize Python basics, including functions, conditional statements, dictionaries, and loops, to create our version of a calculator.

def add(num1, num2): return num1 + num2 def subtract(num1, num2): return num1 - num2 def multiply(num1, num2): return num1 * num2 def divide(num1, num2): return num1/num2Creating a Dictionary of OperationsLet us also create a dictionary of operations .

But now we want to restart the calculator, so for this purpose, we will use a Python function concept called recursion.

For the calculator to restart, we will define it as a function, and it will be called when we want to restart the calculator function without carrying on the result.

Recursion in Python al…

1 day, 6 hours назад @ towardsdatascience.com
My Experiments with NotebookLM for Teaching
My Experiments with NotebookLM for Teaching My Experiments with NotebookLM for Teaching

Why NotebookLM for EducationThe tagline for NotebookLM is understand anything, and it really lives up to that.

But it can be an effective tool for teaching as well, due to its features like mind maps, flashcards, quiz generator, customizable reports, video overviews, etc.

NotebookLM + Google AI StudioI love the mindmaps from NotebookLM because they capture the whole topic so well.

NotebookLM for learning a new languageI’ve been experimenting with NotebookLM for language learning, and the new Flashcards + Custom Reports feature fits really well.

ConclusionIn this article, I’ve shared some of my experiments in the AI for education space using NotebookLM.

1 day, 7 hours назад @ towardsdatascience.com
How to Enrich LLM Context to Significantly Enhance Capabilities
How to Enrich LLM Context to Significantly Enhance Capabilities How to Enrich LLM Context to Significantly Enhance Capabilities

with an enormous corpus of text data, where they, during their pre-training stage, essentially consume the entire internet.

Retrieving information beforehandThe easiest approach is to retrieve additional data by fetching it before processing any live requests.

This is essentially how Anthropic has set up their deep research system, where they create one orchestrator agent that can spawn sub-agents to fetch additional information.

LLM use cases for metadataUntil now, I’ve discussed why you should utilize additional data and how to get a hold of it.

ConclusionIn this article, I’ve discussed how to significantly enhance your LLM by providing it with additional data.

1 day, 11 hours назад @ towardsdatascience.com
Why Your A/B Test Winner Might Just Be Random Noise
Why Your A/B Test Winner Might Just Be Random Noise Why Your A/B Test Winner Might Just Be Random Noise

In today’s post I want to see how it affects and fools us in our A/B tests, by using a fictional example to illustrate better what I’m trying to share.

But he wants to prove it and he decides to run a typical A/B test.

The A/B test is simple: the squad is divided in two groups where one keeps on warming up as usual (group A) while the other is instructed a new warm-up routine (group B).

The Problem: Random Noise Looks Like a WinOn paper our coach’s test looks convincing.

Generalizing Beyond SportsThe warm-up story is just a vivid case study, but the same pitfalls show up in every single A/B test one performs.

1 day, 12 hours назад @ towardsdatascience.com
A Visual Guide to Tuning Gradient Boosted Trees
A Visual Guide to Tuning Gradient Boosted Trees A Visual Guide to Tuning Gradient Boosted Trees

There are a bunch of gradient boosted tree libraries, including XGBoost, CatBoost, and LightGBM.

Notable hyperparameters, which I’ll look into more later, include learning_rate (how steep the gradient is, default 0.1), and n_estimators (similar to random forest — the number of trees).

Fitting took 2.2s, predicting took 0.005s, and the results:Metric max_depth=None MAE 0.369 MAPE 0.216 MSE 0.289 RMSE 0.538 R² 0.779So, quicker than the default random forest, but slightly worse performance.

This is why a GBT only works as a combined ensemble, not as separate standalone trees like in a random forest.

It can also help with stakeholder management — execs prefer pretty pictures to tables of number…

2 days, 6 hours назад @ towardsdatascience.com
Implementing the Coffee Machine Project in Python Using Object Oriented Programming
Implementing the Coffee Machine Project in Python Using Object Oriented Programming Implementing the Coffee Machine Project in Python Using Object Oriented Programming

Project WorkingIn my previous article on the Coffee Machine, I thoroughly explained the working of the project.

class MenuItem: def __init__(self, name, water, milk, coffee, cost): self.name = name self.cost = cost self.ingredients = { "water": water, "milk": milk, "coffee": coffee }Suppose we want to create an object menuitem of the class MenuItem .

In that case, we will need to give it the following parameters: name of the item, the amount of water required to make this menu item, the amount of milk required to make this menu item, the amount of coffee required to make this menu item, and the cost of this menu item.

Creating an Object from the “Menu” ClassTo put this class and its associa…

2 days, 6 hours назад @ towardsdatascience.com
You Only Need 3 Things to Turn AI Experiments into AI Advantage
You Only Need 3 Things to Turn AI Experiments into AI Advantage You Only Need 3 Things to Turn AI Experiments into AI Advantage

It also happened in the era of business intelligence, big data, data science, analytics, and machine learning.

Having a unified data platform, having a single source of data truth, having golden tables, data governance, etc.

As a late mover one can build a context platform instead of a pure data platform.

Below is one example of a largely open source stack that brings together both the Context Fabric and AgentOS.

Agent Builder Sprints: Teach and empower staff to build agents on approved infrastructure and own their efficiency goals.

2 days, 7 hours назад @ towardsdatascience.com
Learn How to Use Transformers with HuggingFace and SpaCy
Learn How to Use Transformers with HuggingFace and SpaCy Learn How to Use Transformers with HuggingFace and SpaCy

So let’s learn how to use SpaCy with Hugging Face’s models!

RoBERTa model builds on top of BERT with some key differences: https://arxiv.org/abs/1907.11692.

If you have a dataset, you can further train the RoBERTa model using spaCy to fine-tune it on the specific downstream task you’re trying to solve.

Once converted and saved as .spacy files, these can be passed directly into spacy train , which is much faster than using plain JSON or text files.

python —m spacy train config.cfg --output ./output --gpu-id 0You will see the training starting with and you can monitor the loss of the TextCategorizer component.

2 days, 11 hours назад @ towardsdatascience.com
How to Become a Machine Learning Engineer (Step-by-Step)
How to Become a Machine Learning Engineer (Step-by-Step) How to Become a Machine Learning Engineer (Step-by-Step)

As a machine learning engineer, you get to tackle fascinating problems, experiment with cutting-edge tools, and actually positively affect the world.

So in this article, I’ll give you a clear and simple learning roadmap to becoming a machine learning engineer, along with the best resources.

These are literally used inside every machine learning algorithm for training and learning.

Machine LearningTo everyone’s surprise, we need to learn machine learning to be a machine learning engineer!

Deep Learning (Adaptive Computation and Machine Learning series) — This book is written by one of the Godfathers of AI, Yoshua Bengio.

2 days, 13 hours назад @ towardsdatascience.com
Distill.pub Distill.pub
последний пост None
TheSequence TheSequence
последний пост 14 часов назад
The Sequence AI of the Week #721: Stop Blaming Temperature: Fighting Nondeterminism in LLM Inference
The Sequence AI of the Week #721: Stop Blaming Temperature: Fighting Nondeterminism in LLM Inference The Sequence AI of the Week #721: Stop Blaming Temperature: Fighting Nondeterminism in LLM Inference

Created Using GPT-5Large language models should feel deterministic when we ask them to be.

Set temperature to zero, fix the seed, and you expect the same bytes back for the same prompt.

Yet many practitioners have watched a production endpoint return slightly different completions across requests even under greedy decoding.

This isn’t a ghost in the GPU so much as a systems property: modern inference servers perform dynamic batching, kernels make shape‑dependent choices, and tiny numeric nudges can flip a token early in generation and cascade into a visibly different output.

The result is a practical path to “same input, same output” under real production load.

14 часов назад @ thesequence.substack.com
The Sequence Knowledge #720: A Cool Intro to Sparse Autoencoders for AI Interpretability
The Sequence Knowledge #720: A Cool Intro to Sparse Autoencoders for AI Interpretability The Sequence Knowledge #720: A Cool Intro to Sparse Autoencoders for AI Interpretability

Created Using GPT-5Today we will Discuss:An intro to sparse autoencoders.

OpenAI’s research about scaling sparse autoencoders.

💡 AI Concept of the Day: An Introduction to Sparse AutoencodersToday, we are going to discuss one of the most interesting architectures in the world of mechanistic interpretability.

Sparse autoencoders are a class of neural network models designed to learn compact, high-level representations of input data by enforcing a sparsity constraint on the hidden units.

This sparse activation pattern not only promotes efficient coding but also lays the foundation for interpretability by making individual hidden units more selective.

1 day, 14 hours назад @ thesequence.substack.com
The Sequence Radar #719: Oracle’s Quiet AI Decade, Loud Week
The Sequence Radar #719: Oracle’s Quiet AI Decade, Loud Week The Sequence Radar #719: Oracle’s Quiet AI Decade, Loud Week

The Sequence AI of the Week: Reviews Thinking Machines’ opening work on non-deterministic foundation modelsSubscribe Now to Not Miss Anything:📝 Editorial: Oracle’s Quiet AI Decade, Loud WeekOracle just had the kind of AI week that forces a narrative rewrite.

Experiments with Llama-3.2-3B show that LSP matches or surpasses data-driven RL baselines, demonstrating the feasibility of perpetual self-improvement without additional training data.

AI Lab: Google DeepMind, Google ResearchSummary: This work introduces SimpleQA Verified, a rigorously filtered 1,000-prompt benchmark that addresses noise, redundancy, and biases in OpenAI’s SimpleQA dataset.

AI Lab: MiniMax, HKUST, University of Waterloo…

3 days, 14 hours назад @ thesequence.substack.com
The Sequence Opinion #718: From Scale to Skill: The Rise of Post‑Training
The Sequence Opinion #718: From Scale to Skill: The Rise of Post‑Training The Sequence Opinion #718: From Scale to Skill: The Rise of Post‑Training

Created Using GPT-5Modern “frontier” AI models – spanning language, vision, and multimodal systems – are now built in two major phases.

First comes pretraining, where a large model (often called a foundation model) is trained on broad data to acquire general knowledge.

In this essay, we explore the transition from pretraining to post-training for cutting-edge AI models across modalities.

A dedicated section delves into reinforcement learning in post-training (especially RLHF and the newer RLAIF), discussing benefits and current limitations.

We close with reflections on how post-training is shaping the future of deploying and researching frontier AI models.

6 days, 13 hours назад @ thesequence.substack.com
The Sequence AI of the Week #717: First Trillion Among the Majors: Qwen-Max
The Sequence AI of the Week #717: First Trillion Among the Majors: Qwen-Max The Sequence AI of the Week #717: First Trillion Among the Majors: Qwen-Max

Created Using GPT-5Alibaba’s newest release Qwen3-Max is one of the most impressive frontier AI models ever created boosting 1 trillion parameters!

Altough we are still learning about the details behind Qwen3-Max, I wanted to share some initial findings and impressions.

Qwen3‑Max sits at the intersection of three trends: very large‑scale pretraining, sparse (Mixture‑of‑Experts) computation for throughput and cost, and aggressive post‑training that pushes reasoning, coding, and long‑context behavior.

It’s delivered as a cloud model with an OpenAI‑compatible API and aims squarely at the front of the pack—reporting strong results on capability suites like Arena‑Hard, LiveBench, LiveCodeBench, …

1 week назад @ thesequence.substack.com
The Sequence Radar #716: Sometimes, Circuits is All You Need
The Sequence Radar #716: Sometimes, Circuits is All You Need The Sequence Radar #716: Sometimes, Circuits is All You Need

This site requires JavaScript to run correctly.

Please turn on JavaScript or unblock scripts

1 week, 1 day назад @ thesequence.substack.com
The Sequence Radar #715: Qwen-Max: The Trillion-Parameter MoE You Can Actually Ship
The Sequence Radar #715: Qwen-Max: The Trillion-Parameter MoE You Can Actually Ship The Sequence Radar #715: Qwen-Max: The Trillion-Parameter MoE You Can Actually Ship

Subscribe Now to Not Miss Anything:📝 Editorial: The Trillion-Parameter MoE You Can Actually ShipAlibaba Qwen just released one of the most impressive AI models ever created.

Qwen‑Max is Alibaba Cloud’s flagship MoE language model, delivered through an OpenAI‑compatible API.

Under the hood, MoE gives Qwen‑Max high capacity without paying the full dense‑model cost for every token.

On capability, Qwen‑Max performs especially well on math, code, and hard multi‑step prompts—the stuff that actually blocks teams in daily workflows.

AI Lab: National University of Singapore, University of Oxford, Shanghai AI Lab, UCL, UIUC, Brown, Imperial College, CAS, CUHK, Fudan, Bristol, Georgia, UCSD, UCSB, Dal…

1 week, 3 days назад @ thesequence.substack.com
The Sequence Opinion #714: The AI Chip Cold War: NVIDIA, Intel, Huawei and
The Sequence Opinion #714: The AI Chip Cold War: NVIDIA, Intel, Huawei and The Sequence Opinion #714: The AI Chip Cold War: NVIDIA, Intel, Huawei and

Created Using GPT-5The AI chip wars have been front and center of the public debates in AI recently.

Nowhere is this more evident than in the competition for AI accelerator chips – the high-performance graphics processing units (GPUs) and specialized AI chips needed to train and run advanced models.

NVIDIA has emerged as the dominant supplier of these AI chips worldwide, making it one of the world’s most valuable tech companies.

In response, China is racing to develop its own alternatives, with Huawei leading the charge to build high-end AI chips domestically.

NVIDIA, China, and the AI Chip Export Dilemma

1 week, 6 days назад @ thesequence.substack.com
The Sequence AI of the Week #713: Inside the Amazing Hermes 4, an Open Reasoning Model
The Sequence AI of the Week #713: Inside the Amazing Hermes 4, an Open Reasoning Model The Sequence AI of the Week #713: Inside the Amazing Hermes 4, an Open Reasoning Model

Created Using GPT-5A year ago, AI lab Nous Research stunned the AI community by showing Hermes, a model completely pretrained in synthetic data.

The latest version of this endeavor came out last week.

Hermes 4 is Nous Research’s latest open‑weight family aimed at a question that hangs over today’s reasoning models: can we combine long‑form, structured thinking with crisp assistant behavior in a way that is both performant and reproducible?

Just as importantly, the team ships a transparent recipe—synthesized supervision with DataForge, verified trajectories with Atropos, and a standardized evaluation harness—that others can audit or extend.

For researchers and practitioners who care about co…

2 weeks назад @ thesequence.substack.com
The Sequence Knowledge #712: Mechanistic Interpretability and Diving Into the Mind of Claude
The Sequence Knowledge #712: Mechanistic Interpretability and Diving Into the Mind of Claude The Sequence Knowledge #712: Mechanistic Interpretability and Diving Into the Mind of Claude

Created Using GPT-5Today we will Discuss:An overview of mechanistic interpretability.

Anthropic’s brekthought paper that dives into “Claude’s mind”.

💡 AI Concept of the Day: What is Mechanistic Interpretability?

Mechanistic interpretability is revolutionizing how we understand and trust modern AI systems.

Rather than treating neural networks as inscrutable black boxes, this approach aims to dissect models into meaningful components—circuits, neurons, and pathways—and trace how data flows and transforms through them.

2 weeks, 1 day назад @ thesequence.substack.com
The Sequence Radar #711: Flash, But Precise: Inside Gemini 2.5 Flash Image
The Sequence Radar #711: Flash, But Precise: Inside Gemini 2.5 Flash Image The Sequence Radar #711: Flash, But Precise: Inside Gemini 2.5 Flash Image

Subscribe Now to Not Miss Anything:📝 Editorial: Flash, But Precise: Inside Gemini 2.5 Flash ImageGemini 2.5 Flash Image (internally nicknamed “nano-banana”) is Google’s new native image generation and editing model, designed to combine low-latency, cost-efficient inference with materially better visual quality and controllability than the Gemini 2.0 Flash image features.

It’s available now in Google AI Studio, the Gemini API, and Vertex AI.

Architecturally/operationally, Flash Image is positioned as a native image model rather than a multimodal text model with an image head.

AI Lab: Google ResearchSummary: This work introduces Adaptive Precise Boolean rubrics, a framework for evaluating hea…

2 weeks, 3 days назад @ thesequence.substack.com
The Sequence Opinion #710: The Inference Cloud Wars: Speed, Scale, and the Road to Commoditization
The Sequence Opinion #710: The Inference Cloud Wars: Speed, Scale, and the Road to Commoditization The Sequence Opinion #710: The Inference Cloud Wars: Speed, Scale, and the Road to Commoditization

Since the launch of ChatGPT in late 2022, a new class of startups has emerged to meet the exploding demand for running large AI models in real time.

These AI inference cloud providers – exemplified by companies like Together AI and Fireworks AI – offer hosted APIs to run generative models (large language models, image generators, etc.)

on their optimized GPU infrastructure.

In the past two years, they have proliferated rapidly, fueled by surging interest in AI applications and the need for cost-effective, customizable model deployment.

The race is on to provide the fastest, cheapest, and most flexible AI inference – but as the field matures, it may quickly commoditize, reshaping the landsca…

2 weeks, 6 days назад @ thesequence.substack.com
The Sequence #710: Learning About DeepSeek v3.1 in 10 Key Points
The Sequence #710: Learning About DeepSeek v3.1 in 10 Key Points The Sequence #710: Learning About DeepSeek v3.1 in 10 Key Points

Created Using DeepSeek v3.1DeepSeek-V3.1 is not a new model family so much as a decisive unification of three capabilities that have usually lived apart: a high-throughput generalist LLM, an explicit “thinking” reasoner, and an execution-competent agent.

It sits on the DeepSeek-V3 base—an economical but large Mixture-of-Experts (MoE) transformer with Multi-Head Latent Attention (MLA)—and extends it along three axes: (1) hybrid inference that supports both “thinking” and “non-thinking” modes in a single checkpoint via chat-template control; (2) long-context and tool/agent upgrades driven by additional continued pretraining and post-training; and (3) operational polish (function-calling stric…

3 weeks назад @ thesequence.substack.com
The Sequence Knowledge #709: Explainable-by-Design: An Intro to Intrinsic Interpretability in Generative AI
The Sequence Knowledge #709: Explainable-by-Design: An Intro to Intrinsic Interpretability in Generative AI The Sequence Knowledge #709: Explainable-by-Design: An Intro to Intrinsic Interpretability in Generative AI

Created Using GPT-5Today we will Discuss:An introduction to intrinsic interpretability in frontier AI models.

A review of MIT’s famous network dissection and Intrinsic Interpretability paper.

💡 AI Concept of the Day: What is Intrinsic Interpretability?

Intrinsic interpretability refers to the characteristic of AI models whereby their decision-making processes and internal representations are inherently understandable to humans.

This emphasis on built‑in explainability is increasingly important for frontier models, whose massive scale and complexity can otherwise render them inscrutable and heighten risks in high‑stakes applications.

3 weeks, 1 day назад @ thesequence.substack.com
The Sequence Radar: Two Drops, One Direction: The Week Agentic AI Got Practical
The Sequence Radar: Two Drops, One Direction: The Week Agentic AI Got Practical The Sequence Radar: Two Drops, One Direction: The Week Agentic AI Got Practical

Subscribe Now to Not Miss Anything:📝 Editorial: Two Drops, One Direction: The Week Agentic AI Got PracticalTwo standout model releases this week—DeepSeek’s V3.1 and Cohere’s Command A Reasoning—move agentic AI from buzz to build.

It packages reasoning, tool use, and multilingual competence with the guardrails large organizations expect—strong controls, observability, and predictable throughput.

AI Lab: Ant GroupSummary: Atom-Searcher proposes a reinforcement learning framework that decomposes reasoning into "Atomic Thoughts," enabling fine-grained supervision through Reasoning Reward Models (RRMs).

AI Lab: MIT and Google ResearchSummary: This paper introduces MaxAdaptiveDegree (MAD) and MAD…

3 weeks, 3 days назад @ thesequence.substack.com
Synced Review
последний пост 5 months, 1 week назад
DeepSeek Signals Next-Gen R2 Model, Unveils Novel Approach to Scaling Inference with SPCT
DeepSeek Signals Next-Gen R2 Model, Unveils Novel Approach to Scaling Inference with SPCT DeepSeek Signals Next-Gen R2 Model, Unveils Novel Approach to Scaling Inference with SPCT

DeepSeek AI, a prominent player in the large language model arena, has recently published a research paper detailing a new technique aimed…Continue reading on SyncedReview »

5 months, 1 week назад @ medium.com
Automating Artificial Life Discovery: The Power of Foundation Models
Automating Artificial Life Discovery: The Power of Foundation Models Automating Artificial Life Discovery: The Power of Foundation Models

The recent Nobel Prize for groundbreaking advancements in protein discovery underscores the transformative potential of foundation models…Continue reading on SyncedReview »

8 months, 2 weeks назад @ medium.com
Llama 3 Meets MoE: Pioneering Low-Cost High-Performance AI
Llama 3 Meets MoE: Pioneering Low-Cost High-Performance AI Llama 3 Meets MoE: Pioneering Low-Cost High-Performance AI

Continue reading on SyncedReview »

8 months, 3 weeks назад @ medium.com
DeepMind’s JetFormer: Unified Multimodal Models Without Modelling Constraints
DeepMind’s JetFormer: Unified Multimodal Models Without Modelling Constraints DeepMind’s JetFormer: Unified Multimodal Models Without Modelling Constraints

Recent advancements in training large multimodal models have been driven by efforts to eliminate modeling constraints and unify…Continue reading on SyncedReview »

8 months, 3 weeks назад @ medium.com
NVIDIA’s nGPT: Revolutionizing Transformers with Hypersphere Representation
NVIDIA’s nGPT: Revolutionizing Transformers with Hypersphere Representation NVIDIA’s nGPT: Revolutionizing Transformers with Hypersphere Representation

The Transformer architecture, introduced by Vaswani et al. in 2017, serves as the backbone of contemporary language models. Over the years…Continue reading on SyncedReview »

8 months, 4 weeks назад @ medium.com
From Token to Conceptual: Meta Introduces Large Concept Models in Multilingual AI
From Token to Conceptual: Meta Introduces Large Concept Models in Multilingual AI From Token to Conceptual: Meta Introduces Large Concept Models in Multilingual AI

Large Language Models (LLMs) have become indispensable tools for diverse natural language processing (NLP) tasks. Traditional LLMs operate…Continue reading on SyncedReview »

9 months назад @ medium.com
NVIDIA’s Hybrid: Combining Attention and State Space Models for Breakthrough Performance of Small…
NVIDIA’s Hybrid: Combining Attention and State Space Models for Breakthrough Performance of Small… NVIDIA’s Hybrid: Combining Attention and State Space Models for Breakthrough Performance of Small…

Language models (LMs) based on transformers have become the gold standard in natural language processing, thanks to their exceptional…Continue reading on SyncedReview »

9 months, 1 week назад @ medium.com
From Response to Query: The Power of Reverse Thinking in Language Models
From Response to Query: The Power of Reverse Thinking in Language Models From Response to Query: The Power of Reverse Thinking in Language Models

Continue reading on SyncedReview »

9 months, 1 week назад @ medium.com
Yann LeCun Team’s New Research: Revolutionizing Visual Navigation with Navigation World Models
Yann LeCun Team’s New Research: Revolutionizing Visual Navigation with Navigation World Models Yann LeCun Team’s New Research: Revolutionizing Visual Navigation with Navigation World Models

Navigation is a fundamental skill for any visually-capable organism, serving as a critical tool for survival. It enables agents to locate…Continue reading on SyncedReview »

9 months, 1 week назад @ medium.com
The Future of Vision AI: How Apple’s AIMV2 Leverages Images and Text to Lead the Pack
The Future of Vision AI: How Apple’s AIMV2 Leverages Images and Text to Lead the Pack The Future of Vision AI: How Apple’s AIMV2 Leverages Images and Text to Lead the Pack

The landscape of vision model pre-training has undergone significant evolution, especially with the rise of Large Language Models (LLMs)…Continue reading on SyncedReview »

9 months, 2 weeks назад @ medium.com
Redefining Music AI: The Power of Sony’s SoniDo as a Versatile Foundation Model
Redefining Music AI: The Power of Sony’s SoniDo as a Versatile Foundation Model Redefining Music AI: The Power of Sony’s SoniDo as a Versatile Foundation Model

A foundation model refers to a pre-trained model developed on extensive datasets, designed to be versatile and adaptable for a range of…Continue reading on SyncedReview »

9 months, 2 weeks назад @ medium.com
DeepMind’s Socratic Learning with Language Games: The Path to Self-Improving Superintelligence
DeepMind’s Socratic Learning with Language Games: The Path to Self-Improving Superintelligence DeepMind’s Socratic Learning with Language Games: The Path to Self-Improving Superintelligence

Continue reading on SyncedReview »

9 months, 3 weeks назад @ medium.com
Revolutionizing AI on a Budget: Apple’s Roadmap for Small Language Models Training Success
Revolutionizing AI on a Budget: Apple’s Roadmap for Small Language Models Training Success Revolutionizing AI on a Budget: Apple’s Roadmap for Small Language Models Training Success

While large language models (LLMs) dominate the AI landscape, Small-scale Large Language Models (SLMs) are gaining traction as…Continue reading on SyncedReview »

9 months, 3 weeks назад @ medium.com
Redefines Consistency Models”: OpenAI’s TrigFlow Narrows FID Gap to 10% with Efficient Two-Step…
Redefines Consistency Models”: OpenAI’s TrigFlow Narrows FID Gap to 10% with Efficient Two-Step… Redefines Consistency Models”: OpenAI’s TrigFlow Narrows FID Gap to 10% with Efficient Two-Step…

Consistency models (CMs) are a cutting-edge class of diffusion-based generative models designed for rapid and efficient sampling. However…Continue reading on SyncedReview »

9 months, 3 weeks назад @ medium.com
Precision in Pixels: NVIDIA’s Edify Image Model Combines High Quality with Unmatched Control
Precision in Pixels: NVIDIA’s Edify Image Model Combines High Quality with Unmatched Control Precision in Pixels: NVIDIA’s Edify Image Model Combines High Quality with Unmatched Control

The field of text-to-image synthesis has advanced rapidly, with state-of-the-art models now generating highly realistic and diverse images…Continue reading on SyncedReview »

9 months, 3 weeks назад @ medium.com
📓 Cool Blogs
ODS.ai Habr ODS.ai Habr
последний пост 1 month, 3 weeks назад
DRAGON: динамический бенчмарк для оценки RAG-систем на русском языке
DRAGON: динамический бенчмарк для оценки RAG-систем на русском языке DRAGON: динамический бенчмарк для оценки RAG-систем на русском языке

Ответ: Кэисукэ ТибаSPARQL-запрос SimpleSELECT DISTINCT ?s ?r ?o WHERE { { SELECT ?s ?r ?o WHERE { ?s ?r ?o . }

GROUP BY ?s ?r HAVING(count(?o) = 1) } { SELECT ?s ?r ?o WHERE { ?s ?r ?o . }

Ответ: Национальная система платежных карт (НСПК) Центр биометрических технологий (ЦБТ) ЕБСSELECT ?s ?r ?o ?len WHERE { { SELECT ?s ?r (COUNT(?o1) as ?len) (GROUP_CONCAT(DISTINCT(STR(?o1));separator="|") AS ?o) WHERE { ?s ?r ?o1 . }

FILTER(?o != ?o1) } GROUP BY ?o ?o1 ?r ?r1 HAVING(COUNT(?s) = 1) } UNION { SELECT ?s ?r ?o ?r1 ?s1 WHERE { ?s ?r ?o .

FILTER(?o != ?o1) } GROUP BY ?o ?o1 ?r ?r1 HAVING(COUNT(?s) = 1) } UNION { SELECT ?s ?r ?o ?r1 ?s1 WHERE { ?s ?r ?o .

1 month, 3 weeks назад @ habr.com
RKNN Toolkit2: конвертация моделей и симуляция NPU Rockchip
RKNN Toolkit2: конвертация моделей и симуляция NPU Rockchip RKNN Toolkit2: конвертация моделей и симуляция NPU Rockchip

В этой статье я хочу поделиться своим опытом по конвертации нейросети в формат rknn с помощью библиотеки rknn-toolkit2.

Вот как выглядят веса pytorch модели в Netron:веса pytorch модели в NetronВажно!

Конвертация onnx модели в rknnДалее создается объект RKNN , который управляет процессом конвертации и инференса модели на платформе Rockchip.

На этом этапе происходит подготавка модели к конвертации в формат RKNN и последующему запуску на NPU Rockchip.

Создание и экспорт rknn моделиНа этом этапе происходит конвертация ONNX-модели во внутренний формат RKNN, оптимизация графа и подготовка к запуску на NPU Rockchip.

2 months назад @ habr.com
MERA Code: всесторонняя оценка генерации кода в прикладных сценариях
MERA Code: всесторонняя оценка генерации кода в прикладных сценариях MERA Code: всесторонняя оценка генерации кода в прикладных сценариях

🔗MERA Code🔗GitHub с кодом и данными🔗Коллекция на Hugging Face🔗Статья на arxiv🔗Репозиторий проекта на GitVerseЧто такое MERA Code?

Современные кодовые языковые модели и модели общего назначения (ChatGPT, Claude, Qwen, YandexGPT, GigaChat и др.)

Список текущих задач MERA Code и их характеристикКаталог задач MERA Code и их подробное описание представлено на сайте.

В MERA Code промпты строго подобраны под задачу и корректный выбор ответа.

В заключениеMERA Code — это попытка закрыть важный пробел в тестировании LLM: насколько они действительно полезны в реальной, локализованной разработке.

2 months назад @ habr.com
Байесовская собака: анализ пёсьего компаса
Байесовская собака: анализ пёсьего компаса Байесовская собака: анализ пёсьего компаса

", подумал я. И, к счастью, у меня как раз под рукой оказался идеальный подопытный.

Стандартное арифметическое среднее между 360° и 0° даст нам 180°, несмотря на то, что и 360°, и 0° указывают в одном направлении.

Нулевая гипотеза утверждает, что данные распределены равномерно по кругу, альтернативная — что это не так.

from pingouin import circ_vtest v, pval = circ_vtest(data['radians'], dir=np.pi) print(f"V-statistics: {v:.3f}; p-value: {pval:.6f}")>> V-statistics: 24.127; p-value: 0.002904Вот мы и подобрались к чему-то интересному!

Априорное распределение и функция правдоподобияПредположим, что у нас есть:Априорное распределение с параметрамиФункция правдоподобия для нового наблюдения с п…

5 months, 3 weeks назад @ habr.com
Создаем воспоминания. Осваиваем FLUX, LoRA и ComfyUI
Создаем воспоминания. Осваиваем FLUX, LoRA и ComfyUI Создаем воспоминания. Осваиваем FLUX, LoRA и ComfyUI

Такие модели можно обучать с нуля и это дорого, нужен кластер с GPU (видеокарты) и много данных.

В домене текст-картинка бывают открытые модели, типа Stable Diffusion, Kandinsky и FLUX, бывают закрытые, типа DALL-E.Открытую модель можно дообучать разными способами.

Борис СтругацкийОсобенности: Для личностей типа Стругацких или Бродского, качественных фотографий крайне мало, но много и не надо.

Можно и фразу.

Владимир СурдинАлексей СемихатовВидео с их лекциями можно найти повсеместно, начать можно с канала Вселенная плюс на YouTube и в телеграм.

8 months, 2 weeks назад @ habr.com
Как нейросети, RL и байесовскую оптимизацию стали использовать на ускорителях заряженных частиц
Как нейросети, RL и байесовскую оптимизацию стали использовать на ускорителях заряженных частиц Как нейросети, RL и байесовскую оптимизацию стали использовать на ускорителях заряженных частиц

Один из них — поддержание стабильной орбиты пучка частиц (траектории, по которой происходит движение), которая критически важна для точности экспериментов.

Вот, кстати, наша статья про то, как сейсмические вибрации будут влиять на орбиту пучка в СКИФ: Beam Stability .

Классические подходы к стабилизации орбиты пучка в ускорителяхДля стабилизации орбиты используют датчики положения пучка (BPM) и магнитные корректоры.

По словам авторов получились следующие преимущества:Ускорение процесса коррекции орбиты и повышение точности по сравнению с классическими методами, такими как SVD.

Задача агента:Автоматическое восстановление орбиты пучка заряженных частиц за ограниченное время и минимальное коли…

8 months, 4 weeks назад @ habr.com
Machine Learning Mastery
последний пост 1 week, 1 day назад
Multi-Agent Systems: The Next Frontier in AI-Driven Cyber Defense
Multi-Agent Systems: The Next Frontier in AI-Driven Cyber Defense Multi-Agent Systems: The Next Frontier in AI-Driven Cyber Defense

The increasing sophistication of cyber threats calls for a systemic change in the way we defend ourselves against them.

1 week, 1 day назад @ machinelearningmastery.com
ROC AUC vs Precision-Recall for Imbalanced Data
ROC AUC vs Precision-Recall for Imbalanced Data ROC AUC vs Precision-Recall for Imbalanced Data

When building machine learning models to classify imbalanced data — i.

1 week, 1 day назад @ machinelearningmastery.com
7 Scikit-learn Tricks for Optimized Cross-Validation
7 Scikit-learn Tricks for Optimized Cross-Validation 7 Scikit-learn Tricks for Optimized Cross-Validation

Validating machine learning models requires careful testing on unseen data to ensure robust, unbiased estimates of their performance.

1 week, 2 days назад @ machinelearningmastery.com
A Gentle Introduction to Batch Normalization
A Gentle Introduction to Batch Normalization A Gentle Introduction to Batch Normalization

Deep neural networks have drastically evolved over the years, overcoming common challenges that arise when training these complex models.

1 week, 5 days назад @ machinelearningmastery.com
Small Language Models are the Future of Agentic AI
Small Language Models are the Future of Agentic AI Small Language Models are the Future of Agentic AI

This article provides a summary of and commentary on the recent paper

1 week, 6 days назад @ machinelearningmastery.com
10 Python One-Liners Every Machine Learning Practitioner Should Know
10 Python One-Liners Every Machine Learning Practitioner Should Know 10 Python One-Liners Every Machine Learning Practitioner Should Know

Developing machine learning systems entails a well-established lifecycle, consisting of a series of stages from data preparation and preprocessing to modeling, validation, deployment to production, and continuous maintenance.

2 weeks назад @ machinelearningmastery.com
3 Ways to Speed Up and Improve Your XGBoost Models
3 Ways to Speed Up and Improve Your XGBoost Models 3 Ways to Speed Up and Improve Your XGBoost Models

Extreme gradient boosting ( XGBoost ) is one of the most prominent machine learning techniques used not only for experimentation and analysis but also in deployed predictive solutions in industry.

2 weeks, 1 day назад @ machinelearningmastery.com
5 Key Ways LLMs Can Supercharge Your Machine Learning Workflow
5 Key Ways LLMs Can Supercharge Your Machine Learning Workflow 5 Key Ways LLMs Can Supercharge Your Machine Learning Workflow

Experimenting, fine-tuning, scaling, and more are key aspects that machine learning development workflows thrive on.

2 weeks, 5 days назад @ machinelearningmastery.com
7 Pandas Tricks for Efficient Data Merging
7 Pandas Tricks for Efficient Data Merging 7 Pandas Tricks for Efficient Data Merging

Data merging is the process of combining data from different sources into a unified dataset.

2 weeks, 6 days назад @ machinelearningmastery.com
How to Decide Between Random Forests and Gradient Boosting
How to Decide Between Random Forests and Gradient Boosting How to Decide Between Random Forests and Gradient Boosting

When working with machine learning on structured data, two algorithms often rise to the top of the shortlist: random forests and gradient boosting .

2 weeks, 6 days назад @ machinelearningmastery.com
A Gentle Introduction to Bayesian Regression
A Gentle Introduction to Bayesian Regression A Gentle Introduction to Bayesian Regression

In this article, you will learn: • The fundamental difference between traditional regression, which uses single fixed values for its parameters, and Bayesian regression, which models them as probability distributions.

3 weeks назад @ machinelearningmastery.com
10 Useful NumPy One-Liners for Time Series Analysis
10 Useful NumPy One-Liners for Time Series Analysis 10 Useful NumPy One-Liners for Time Series Analysis

Working with time series data often means wrestling with the same patterns over and over: calculating moving averages, detecting spikes, creating features for forecasting models.

3 weeks, 1 day назад @ machinelearningmastery.com
Logistic vs SVM vs Random Forest: Which One Wins for Small Datasets?
Logistic vs SVM vs Random Forest: Which One Wins for Small Datasets? Logistic vs SVM vs Random Forest: Which One Wins for Small Datasets?

When you have a small dataset, choosing the right machine learning model can make a big difference.

3 weeks, 2 days назад @ machinelearningmastery.com
5 Scikit-learn Pipeline Tricks to Supercharge Your Workflow
5 Scikit-learn Pipeline Tricks to Supercharge Your Workflow 5 Scikit-learn Pipeline Tricks to Supercharge Your Workflow

Perhaps one of the most underrated yet powerful features that scikit-learn has to offer, pipelines are a great ally for building effective and modular machine learning workflows.

3 weeks, 2 days назад @ machinelearningmastery.com
Seeing Images Through the Eyes of Decision Trees
Seeing Images Through the Eyes of Decision Trees Seeing Images Through the Eyes of Decision Trees

In this article, you'll learn to: • Turn unstructured, raw image data into structured, informative features.

3 weeks, 6 days назад @ machinelearningmastery.com
ML in Production
последний пост None
Sorta Insightful Sorta Insightful
последний пост 1 month назад
Ten Years Later
Ten Years Later Ten Years Later

Every now and then, someone asks me why I blog, and I don’t know really know what to tell them.

That’s another reason I’m not celebrating 10 years with more gusto, I know I’ve been writing less.

Indiana Jones and the Great Circle: I don’t know how they did it, but Indiana Jones and the Great Circle was just fun all the way through.

My one complaint is that the hand-to-hand combat feels like the worst part of the game, so of course they put a bunch of upgrades behind learning parry timings you’ll never use later.

I have not tried Peak, but Another Crab’s Treasure was really good and is worth playing if you’re interested in a Souls-like.

1 month назад @ alexirpan.com
Brony Musicians Seize The Means of Production: My Eyewitness Account to BABSCon 2025
Brony Musicians Seize The Means of Production: My Eyewitness Account to BABSCon 2025 Brony Musicians Seize The Means of Production: My Eyewitness Account to BABSCon 2025

A music concert in the evenings, typically set up as a rave with EDM or rock music made by brony musicians.

She has been involved in organizing pony music concerts for over a decade, for both BABSCon and other pony conventions.

Thank you, BABSCon ChairsThe brony musicians immediately jump into an emergency Discord call with Pinkaboo, to get her side of the story.

Other conventions start tweeting in support of the brony musicians, with no one taking BABSCon’s side.

It’s hard for me to explain why I like MLP fan music, because brony music really isn’t accessible.

1 month, 4 weeks назад @ alexirpan.com
Who is AI For?
Who is AI For? Who is AI For?

I think the easy answer to this question is that right now, AI is for the AI developers.

Code is useful, it makes money, it is a testbed for AI speeding up the development of AI, and it is easy.

I’m working in AI because it pays well and is potentially really good for the world.

The artists did not know what AI was, but when they learned, they quickly decided they did not want it.

It feels like the most likely outcome is that people go all-in on pushing raw intelligence, in the way that AI developers can measure it, leaving behind those that are not like AI developers.

5 months, 2 weeks назад @ alexirpan.com
MIT Mystery Hunt 2025
MIT Mystery Hunt 2025 MIT Mystery Hunt 2025

This has spoilers for MIT Mystery Hunt 2025.

I enjoyed it more than their 2018 Hunt, which is commonly cited as an all-time good Mystery Hunt.

In this Mystery Hunt it was reversed, where the act of unlocking is easy but the value and difficulty of a feeder varied.

In my free time pre-Hunt, I went to Puzzled Pint, where I tried to all-brain a logic puzzle (solve it without writing anything).

I’m looking forward to solving “No Assembly Required” in Mystery Hunt 2026, a puzzle that gives you the answer for no work.

7 months, 3 weeks назад @ alexirpan.com
Using AI to Get the Neopets Destruct-o-Match Avatar
Using AI to Get the Neopets Destruct-o-Match Avatar Using AI to Get the Neopets Destruct-o-Match Avatar

If AI can be superhuman at Go, surely AI can be slightly-worse-than-experts at Destruct-o-Match if we try?

Step 0: Is Making a Destruct-o-Match AI Against Neopets Rules?

I believe the precedent is in favor of a Destruct-o-Match AI being okay.

As long as I’m the one inputting moves the Destruct-o-Match AI recommends, I should be okay.

To write a game AI, we first need to implement the rules of the game in code.

8 months, 1 week назад @ alexirpan.com
Late Takes on OpenAI o1
Late Takes on OpenAI o1 Late Takes on OpenAI o1

I realize how late this is, but I didn’t get a post out while o1 was fresh, and still feel like writing one despite it being cold.

(Also, OpenAI just announced they’re going to ship new stuff starting tomorrow so it’s now or never to say something.)

OpenAI o1 is a model release widely believed (but not confirmed) to be a post-trained version of GPT-4o.

If true, that makes this video especially useful for understanding OpenAI o1.

Which I suppose is part of why I’m talking about o1 rather than building o1.

9 months, 2 weeks назад @ alexirpan.com
Lil'Log
последний пост None
The Spectator
последний пост None
Off the Convex Path
последний пост None
Piekniewski's blog
последний пост None
fast.ai NLP fast.ai NLP
последний пост None
Sebastian Ruder
последний пост None
Andrew Karpathy blog
последний пост None
大トロ 大トロ
последний пост None
🔬 Science
Papers With Code Papers With Code
последний пост 1 month, 3 weeks назад
/henry123-boy/ SpatialTrackerV2: 3D Point Tracking Made Easy
/henry123-boy/ SpatialTrackerV2: 3D Point Tracking Made Easy /henry123-boy/ SpatialTrackerV2: 3D Point Tracking Made Easy

We present SpatialTrackerV2, a feed-forward 3D point tracking method for monocular videos.

Going beyond modular pipelines built on off-the-shelf components for 3D tracking, our approach unifies the intrinsic connections between point tracking, monocular depth, and camera pose estimation into a high-performing and feedforward 3D point tracker.

It decomposes world-space 3D motion into scene geometry, camera ego-motion, and pixel-wise object motion, with a fully differentiable and end-to-end architecture, allowing scalable training across a wide range of datasets, including synthetic sequences, posed RGB-D videos, and unlabeled in-the-wild footage.

By learning geometry and motion jointly from …

1 month, 3 weeks назад @ paperswithcode.com
/antof27/ Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation
/antof27/ Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation /antof27/ Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation

Calisthenics skill classification is the computer vision task of inferring the skill performed by an athlete from images, enabling automatic performance assessment and personalized analytics.

Traditional methods for calisthenics skill recognition are based on pose estimation methods to determine the position of skeletal data from images, which is later fed to a classification algorithm to infer the performed skill.

This work proposes a direct approach to calisthenics skill recognition, which leverages depth estimation and athlete patch retrieval to avoid the computationally expensive human pose estimation module.

Using Depth Anything V2 for depth estimation and YOLOv10 for athlete localizat…

1 month, 3 weeks назад @ paperswithcode.com
/snowflakedb/ Arctic Inference with Shift Parallelism: Fast and Efficient Open Source Inference System for Enterprise AI
/snowflakedb/ Arctic Inference with Shift Parallelism: Fast and Efficient Open Source Inference System for Enterprise AI /snowflakedb/ Arctic Inference with Shift Parallelism: Fast and Efficient Open Source Inference System for Enterprise AI

Inference is now the dominant AI workload, yet existing systems force trade-offs between latency, throughput, and cost.

Arctic Inference, an open-source vLLM plugin from Snowflake AI Research, introduces Shift Parallelism, a dynamic parallelism strategy that adapts to real-world traffic while integrating speculative decoding, SwiftKV compute reduction, and optimized embedding inference.

It achieves up to 3.4 times faster request completion, 1.75 times faster generation, and 1.6M tokens/sec per GPU for embeddings, outperforming both latency- and throughput-optimized deployments.

Already powering Snowflake Cortex AI, Arctic Inference delivers state-of-the-art, cost-effective inference for ent…

1 month, 3 weeks назад @ paperswithcode.com
/NVIDIA/ FourCastNet 3: A geometric approach to probabilistic machine-learning weather forecasting at scale
/NVIDIA/ FourCastNet 3: A geometric approach to probabilistic machine-learning weather forecasting at scale /NVIDIA/ FourCastNet 3: A geometric approach to probabilistic machine-learning weather forecasting at scale

FourCastNet 3 advances global weather modeling by implementing a scalable, geometric machine learning (ML) approach to probabilistic ensemble forecasting.

The approach is designed to respect spherical geometry and to accurately model the spatially correlated probabilistic nature of the problem, resulting in stable spectra and realistic dynamics across multiple scales.

FourCastNet 3 delivers forecasting accuracy that surpasses leading conventional ensemble models and rivals the best diffusion-based methods, while producing forecasts 8 to 60 times faster than these approaches.

In contrast to other ML approaches, FourCastNet 3 demonstrates excellent probabilistic calibration and retains realis…

1 month, 3 weeks назад @ paperswithcode.com
/jingyanw/ Choosing the Better Bandit Algorithm under Data Sharing: When Do A/B Experiments Work?
/jingyanw/ Choosing the Better Bandit Algorithm under Data Sharing: When Do A/B Experiments Work? /jingyanw/ Choosing the Better Bandit Algorithm under Data Sharing: When Do A/B Experiments Work?

We study A/B experiments that are designed to compare the performance of two recommendation algorithms.

The bias arising from this type of data sharing is known as "symbiosis bias".

In this paper, we highlight that, for decision-making purposes, the sign of the GTE often matters more than its precise magnitude when selecting the better algorithm.

We formalize this insight under a multi-armed bandit framework and theoretically characterize when the sign of the expected GTE estimate under data sharing aligns with or contradicts the sign of the true GTE.

Our analysis identifies the level of exploration versus exploitation as a key determinant of how symbiosis bias impacts algorithm selection.

1 month, 3 weeks назад @ paperswithcode.com
/qqq-yi/ DAC: A Dynamic Attention-aware Approach for Task-Agnostic Prompt Compression
/qqq-yi/ DAC: A Dynamic Attention-aware Approach for Task-Agnostic Prompt Compression /qqq-yi/ DAC: A Dynamic Attention-aware Approach for Task-Agnostic Prompt Compression

Task-agnostic prompt compression leverages the redundancy in natural language to reduce computational overhead and enhance information density within prompts, especially in long-context scenarios.

Existing methods predominantly rely on information entropy as the metric to compress lexical units, aiming to achieve minimal information loss.

However, these approaches overlook two critical aspects: (i) the importance of attention-critical tokens at the algorithmic level, and (ii) shifts in information entropy during the compression process.

Motivated by these challenges, we propose a dynamic attention-aware approach for task-agnostic prompt compression (DAC).

This approach effectively integrate…

1 month, 3 weeks назад @ paperswithcode.com
/lukasellinger/ Simplifications are Absolutists: How Simplified Language Reduces Word Sense Awareness in LLM-Generated Definitions
/lukasellinger/ Simplifications are Absolutists: How Simplified Language Reduces Word Sense Awareness in LLM-Generated Definitions /lukasellinger/ Simplifications are Absolutists: How Simplified Language Reduces Word Sense Awareness in LLM-Generated Definitions

Large Language Models (LLMs) can provide accurate word definitions and explanations for any context.

However, the scope of the definition changes for different target groups, like children or language learners.

We investigate how simplification impacts homonym definition quality across three target groups: Normal, Simple, and ELI5.

Our results show that simplification drastically degrades definition completeness by neglecting polysemy, increasing the risk of misunderstanding.

Fine-tuning Llama 3.1 8B with Direct Preference Optimization substantially improves homonym response quality across all prompt types.

1 month, 3 weeks назад @ paperswithcode.com
/pspdada/ Mitigating Object Hallucinations via Sentence-Level Early Intervention
/pspdada/ Mitigating Object Hallucinations via Sentence-Level Early Intervention /pspdada/ Mitigating Object Hallucinations via Sentence-Level Early Intervention

Multimodal large language models (MLLMs) have revolutionized cross-modal understanding but continue to struggle with hallucinations - fabricated content contradicting visual inputs.

Existing hallucination mitigation methods either incur prohibitive computational costs or introduce distribution mismatches between training data and model outputs.

We identify a critical insight: hallucinations predominantly emerge at the early stages of text generation and propagate through subsequent outputs.

To address this, we propose **SENTINEL** (**S**entence-level **E**arly i**N**tervention **T**hrough **IN**-domain pr**E**ference **L**earning), a framework that eliminates dependency on human annotations…

1 month, 3 weeks назад @ paperswithcode.com
/owos/ FLEXITOKENS: Flexible Tokenization for Evolving Language Models
/owos/ FLEXITOKENS: Flexible Tokenization for Evolving Language Models /owos/ FLEXITOKENS: Flexible Tokenization for Evolving Language Models

Language models (LMs) are challenging to adapt to new data distributions by simple finetuning.

This is due to the rigidity of their subword tokenizers, which typically remain unchanged during adaptation.

This inflexibility often leads to inefficient tokenization, causing overfragmentation of out-of-distribution domains, unseen languages, or scripts.

In this work, we develop byte-level LMs with learnable tokenizers to make tokenization adaptive.

Our models include a submodule that learns to predict boundaries between the input byte sequence, encoding it into variable-length segments.

1 month, 3 weeks назад @ paperswithcode.com
/wojiufukele/ Graph-Structured Data Analysis of Component Failure in Autonomous Cargo Ships Based on Feature Fusion
/wojiufukele/ Graph-Structured Data Analysis of Component Failure in Autonomous Cargo Ships Based on Feature Fusion /wojiufukele/ Graph-Structured Data Analysis of Component Failure in Autonomous Cargo Ships Based on Feature Fusion

To address the challenges posed by cascading reactions caused by component failures in autonomous cargo ships (ACS) and the uncertainties in emergency decision-making, this paper proposes a novel hybrid feature fusion framework for constructing a graph-structured dataset of failure modes.

A hierarchical feature fusion framework is constructed, using Word2Vec encoding to encode subsystem/component features, BERT-KPCA to process failure modes/reasons, and Sentence-BERT to quantify the semantic association between failure impact and emergency decision-making.

The dataset covers 12 systems, 1,262 failure modes, and 6,150 propagation paths.

In the label prediction results, the Shore-based Meteor…

1 month, 3 weeks назад @ paperswithcode.com
/YF-W/ Tri-Learn Graph Fusion Network for Attributed Graph Clustering
/YF-W/ Tri-Learn Graph Fusion Network for Attributed Graph Clustering /YF-W/ Tri-Learn Graph Fusion Network for Attributed Graph Clustering

In recent years, models based on Graph Convolutional Networks (GCN) have made significant strides in the field of graph data analysis.

Although the Graph Transformer architecture has mitigated some of these issues, its performance is still limited when processing heterogeneous graph data.

To address these challenges, this study proposes a novel deep clustering framework that comprising GCN, Autoencoder (AE), and Graph Transformer, termed the Tri-Learn Graph Fusion Network (Tri-GFN).

The tri-learning mechanism allows mutual learning among these modules, while the feature fusion strategy enables the model to capture complex relationships, yielding highly discriminative representations for gra…

1 month, 3 weeks назад @ paperswithcode.com
/mr-ravin/ APTx Neuron: A Unified Trainable Neuron Architecture Integrating Activation and Computation
/mr-ravin/ APTx Neuron: A Unified Trainable Neuron Architecture Integrating Activation and Computation /mr-ravin/ APTx Neuron: A Unified Trainable Neuron Architecture Integrating Activation and Computation

We propose the APTx Neuron, a novel, unified neural computation unit that integrates non-linear activation and linear transformation into a single trainable expression.

The APTx Neuron is derived from the APTx activation function, thereby eliminating the need for separate activation layers and making the architecture both computationally efficient and elegant.

The proposed neuron follows the functional form $y = \sum_{i=1}^{n} ((\alpha_i + \tanh(\beta_i x_i)) \cdot \gamma_i x_i) + \delta$, where all parameters $\alpha_i$, $\beta_i$, $\gamma_i$, and $\delta$ are trainable.

We validate our APTx Neuron-based architecture on the MNIST dataset, achieving up to 96.69\% test accuracy in just 20 ep…

1 month, 3 weeks назад @ paperswithcode.com
/Rec4Fun/ A Reproducibility Study of Product-side Fairness in Bundle Recommendation
/Rec4Fun/ A Reproducibility Study of Product-side Fairness in Bundle Recommendation /Rec4Fun/ A Reproducibility Study of Product-side Fairness in Bundle Recommendation

While this problem has been widely studied in traditional recommendation settings, its implications for bundle recommendation (BR) remain largely unexplored.

Existing fairness frameworks and metrics designed for traditional recommender systems may not directly translate to this multi-layered setting.

In this paper, we conduct a comprehensive reproducibility study of product-side fairness in BR across three real-world datasets using four state-of-the-art BR methods.

We analyze exposure disparities at both the bundle and item levels using multiple fairness metrics, uncovering important patterns.

Overall, our findings offer actionable insights for building fairer bundle recommender systems and…

1 month, 3 weeks назад @ paperswithcode.com
/cbobed/ OntView: What you See is What you Meant
/cbobed/ OntView: What you See is What you Meant /cbobed/ OntView: What you See is What you Meant

However, the lack of tools that provide effective visualization is still a significant challenge.

In this paper, we present OntView, an ontology viewer that is designed to provide users with an intuitive visual representation of ontology concepts and their formal definitions through a user-friendly interface.

Building on the use of a DL reasoner, OntView follows a "What you see is what you meant" paradigm, showing the actual inferred knowledge.

One key aspect for this is its ability to visualize General Concept Inclusions (GCI), a feature absent in existing visualization tools.

OntView has been released with an open-source license for the whole community.

1 month, 3 weeks назад @ paperswithcode.com
/Rec4Fun/ RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction
/Rec4Fun/ RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction /Rec4Fun/ RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction

These approaches fail to capture elaborate relations hidden in real-world bundle structures, resulting in suboptimal bundle representations.

To overcome this limitation, we propose RaMen, a novel method that provides a holistic multi-strategy approach for bundle construction.

RaMen utilizes both intrinsic (characteristics) and extrinsic (collaborative signals) information to model bundle structures through Explicit Strategy-aware Learning (ESL) and Implicit Strategy-aware Learning (ISL).

Integrating diverse strategies enables RaMen to learn more comprehensive and robust bundle representations.

Meanwhile, Multi-strategy Alignment & Discrimination module is employed to facilitate knowledge tr…

1 month, 3 weeks назад @ paperswithcode.com
Papers With Code Papers With Code
последний пост 1 month, 3 weeks назад
/PrimisAI/ Adaptive Multi-Agent Reasoning via Automated Workflow Generation
/PrimisAI/ Adaptive Multi-Agent Reasoning via Automated Workflow Generation /PrimisAI/ Adaptive Multi-Agent Reasoning via Automated Workflow Generation

The rise of Large Reasoning Models (LRMs) promises a significant leap forward in language model capabilities, aiming to tackle increasingly sophisticated tasks with unprecedented efficiency and accuracy.

However, despite their impressive performance, recent studies have highlighted how current reasoning models frequently fail to generalize to novel, unseen problems, often resorting to memorized solutions rather than genuine inferential reasoning.

In this paper, we introduce Nexus Architect, an enhanced iteration of our multi-agent system framework, Nexus, equipped with a novel automated workflow synthesis mechanism.

Given a user's prompt and a small set of representative examples, the Archi…

1 month, 3 weeks назад @ paperswithcode.com
/sharanya02/ Real Time Captioning of Sign Language Gestures in Video Meetings
/sharanya02/ Real Time Captioning of Sign Language Gestures in Video Meetings /sharanya02/ Real Time Captioning of Sign Language Gestures in Video Meetings

One of the most tested ways to establish such a communication is through the use of sign based languages.

However, not many people are aware of the smaller intricacies involved with sign language.

Sign language recognition using computer vision aims at eliminating the communication barrier between deaf-mute and ordinary people so that they can properly communicate with others.

In recent studies, it has been found that people with hearing disabilities prefer to sign over typing during these video calls.

In this paper, we are proposing a browser extension that will automatically translate sign language to subtitles for everyone else in the video call.

1 month, 3 weeks назад @ paperswithcode.com
/alessiopittiglio/ Leveraging Context for Multimodal Fallacy Classification in Political Debates
/alessiopittiglio/ Leveraging Context for Multimodal Fallacy Classification in Political Debates /alessiopittiglio/ Leveraging Context for Multimodal Fallacy Classification in Political Debates

In this paper, we present our submission to the MM-ArgFallacy2025 shared task, which aims to advance research in multimodal argument mining, focusing on logical fallacies in political debates.

Our approach uses pretrained Transformer-based models and proposes several ways to leverage context.

In the fallacy classification subtask, our models achieved macro F1-scores of 0.4444 (text), 0.3559 (audio), and 0.4403 (multimodal).

Our multimodal model showed performance comparable to the text-only model, suggesting potential for improvements.

PDFAbstract

1 month, 3 weeks назад @ paperswithcode.com
/RS2002/ One Step is Enough: Multi-Agent Reinforcement Learning based on One-Step Policy Optimization for Order Dispatch on Ride-Sharing Platforms
/RS2002/ One Step is Enough: Multi-Agent Reinforcement Learning based on One-Step Policy Optimization for Order Dispatch on Ride-Sharing Platforms /RS2002/ One Step is Enough: Multi-Agent Reinforcement Learning based on One-Step Policy Optimization for Order Dispatch on Ride-Sharing Platforms

On-demand ride-sharing platforms face the fundamental challenge of dynamically bundling passengers with diverse origins and destinations and matching them with vehicles in real time, all under significant uncertainty.

However, conventional MARL-based ride-sharing approaches heavily rely on the accurate estimation of Q-values or V-values, which becomes problematic in large-scale, highly uncertain environments.

To address these challenges, we propose two novel alternative methods that bypass value function estimation.

First, we adapt GRPO to ride-sharing, replacing the PPO baseline with the group average reward to eliminate critic estimation errors and reduce training bias.

Second, inspired b…

1 month, 3 weeks назад @ paperswithcode.com
/LiXinran6/ Long-Short Distance Graph Neural Networks and Improved Curriculum Learning for Emotion Recognition in Conversation
/LiXinran6/ Long-Short Distance Graph Neural Networks and Improved Curriculum Learning for Emotion Recognition in Conversation /LiXinran6/ Long-Short Distance Graph Neural Networks and Improved Curriculum Learning for Emotion Recognition in Conversation

Include the markdown at the top of your GitHub README.md file to showcase the performance of the model.

Badges are live and will be dynamically updated with the latest ranking of this paper.

1 month, 3 weeks назад @ paperswithcode.com
/ShimSoonYong/ ZClassifier: Temperature Tuning and Manifold Approximation via KL Divergence on Logit Space
/ShimSoonYong/ ZClassifier: Temperature Tuning and Manifold Approximation via KL Divergence on Logit Space

We introduce a novel classification framework, ZClassifier, that replaces conventional deterministic logits with diagonal Gaussian-distributed logits. Code: https://github.com/ShimSoonYong/ZClassifier

2 months назад @ paperswithcode.com
/briziorusso/ On Gradual Semantics for Assumption-Based Argumentation
/briziorusso/ On Gradual Semantics for Assumption-Based Argumentation

In this paper, we fill this gap and propose a family of novel gradual semantics for equipping assumptions, which are the core components in ABA frameworks, with dialectical strengths. Code: https://github.com/briziorusso/GradualABA

2 months назад @ paperswithcode.com
/wumingqi/ Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination
/wumingqi/ Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Cloudflare is unable to establish an SSL connection to the origin server.

If you're a visitor of this website:Please try again in a few minutes.

If you're the owner of this website:It appears that the SSL configuration used is not compatible with Cloudflare.

This could happen for a several reasons, including no shared cipher suites.

Additional troubleshooting information here.

2 months назад @ paperswithcode.com
/IsaacYQH/ WildFX: A DAW-Powered Pipeline for In-the-Wild Audio FX Graph Modeling
/IsaacYQH/ WildFX: A DAW-Powered Pipeline for In-the-Wild Audio FX Graph Modeling

Despite rapid progress in end-to-end AI music generation, AI-driven modeling of professional Digital Signal Processing (DSP) workflows remains challenging. Code: https://github.com/IsaacYQH/WildFX

2 months назад @ paperswithcode.com
/summer1278/ Addressing Data Imbalance in Transformer-Based Multi-Label Emotion Detection with Weighted Loss
/summer1278/ Addressing Data Imbalance in Transformer-Based Multi-Label Emotion Detection with Weighted Loss

This paper explores the application of a simple weighted loss function to Transformer-based models for multi-label emotion detection in SemEval-2025 Shared Task 11. Code: https://github.com/summer1278/semeval2025-task11

2 months назад @ paperswithcode.com
/gabrielkmbo/ Step-wise Policy for Rare-tool Knowledge (SPaRK): Offline RL that Drives Diverse Tool Use in LLMs
/gabrielkmbo/ Step-wise Policy for Rare-tool Knowledge (SPaRK): Offline RL that Drives Diverse Tool Use in LLMs

We present Step-wise Policy for Rare-tool Knowledge (SPaRK), a novel reinforcement learning framework that teaches large language models to explore diverse tool usage patterns beyond conventional high-temperature sampling. Code: https://github.com/gabrielkmbo/explore-rl

2 months назад @ paperswithcode.com
/Cavendish518/ Learning to Tune Like an Expert: Interpretable and Scene-Aware Navigation via MLLM Reasoning and CVAE-Based Adaptation
/Cavendish518/ Learning to Tune Like an Expert: Interpretable and Scene-Aware Navigation via MLLM Reasoning and CVAE-Based Adaptation

Service robots are increasingly deployed in diverse and dynamic environments, where both physical layouts and social contexts change over time and across locations. Code: https://github.com/Cavendish518/LE-Nav

2 months назад @ paperswithcode.com
/MatteoFasulo/ AI Wizards at CheckThat! 2025: Enhancing Transformer-Based Embeddings with Sentiment for Subjectivity Detection in News Articles
/MatteoFasulo/ AI Wizards at CheckThat! 2025: Enhancing Transformer-Based Embeddings with Sentiment for Subjectivity Detection in News Articles

Cloudflare is unable to establish an SSL connection to the origin server.

If you're a visitor of this website:Please try again in a few minutes.

If you're the owner of this website:It appears that the SSL configuration used is not compatible with Cloudflare.

This could happen for a several reasons, including no shared cipher suites.

Additional troubleshooting information here.

2 months назад @ paperswithcode.com
/VCA-EPFL/ SystolicAttention: Fusing FlashAttention within a Single Systolic Array
/VCA-EPFL/ SystolicAttention: Fusing FlashAttention within a Single Systolic Array

The frequent data swaps between the systolic array and external vector units result in low systolic array utilization. Code: https://github.com/VCA-EPFL/FSA

2 months назад @ paperswithcode.com
/Buddhi19/ Precision Spatio-Temporal Feature Fusion for Robust Remote Sensing Change Detection
/Buddhi19/ Precision Spatio-Temporal Feature Fusion for Robust Remote Sensing Change Detection

Cloudflare is unable to establish an SSL connection to the origin server.

If you're a visitor of this website:Please try again in a few minutes.

If you're the owner of this website:It appears that the SSL configuration used is not compatible with Cloudflare.

This could happen for a several reasons, including no shared cipher suites.

Additional troubleshooting information here.

2 months назад @ paperswithcode.com
Papers With Code Papers With Code
последний пост 1 month, 3 weeks назад
/fudanvi/ Beyond Task-Specific Reasoning: A Unified Conditional Generative Framework for Abstract Visual Reasoning
/fudanvi/ Beyond Task-Specific Reasoning: A Unified Conditional Generative Framework for Abstract Visual Reasoning

Cloudflare is unable to establish an SSL connection to the origin server.

If you're a visitor of this website:Please try again in a few minutes.

If you're the owner of this website:It appears that the SSL configuration used is not compatible with Cloudflare.

This could happen for a several reasons, including no shared cipher suites.

Additional troubleshooting information here.

2 months назад @ paperswithcode.com
/benedekrozemberczki/ PGT-I: Scaling Spatiotemporal GNNs with Memory-Efficient Distributed Training
/benedekrozemberczki/ PGT-I: Scaling Spatiotemporal GNNs with Memory-Efficient Distributed Training

Spatiotemporal graph neural networks (ST-GNNs) are powerful tools for modeling spatial and temporal data dependencies. Code: https://github.com/benedekrozemberczki/pytorch_geometric_temporal

2 months назад @ paperswithcode.com
/chengxuphd/ DCR: Quantifying Data Contamination in LLMs Evaluation
/chengxuphd/ DCR: Quantifying Data Contamination in LLMs Evaluation

Cloudflare is unable to establish an SSL connection to the origin server.

If you're a visitor of this website:Please try again in a few minutes.

If you're the owner of this website:It appears that the SSL configuration used is not compatible with Cloudflare.

This could happen for a several reasons, including no shared cipher suites.

Additional troubleshooting information here.

2 months назад @ paperswithcode.com
/gitter-lab/ Assay2Mol: large language model-based drug design using BioAssay context
/gitter-lab/ Assay2Mol: large language model-based drug design using BioAssay context

Scientific databases aggregate vast amounts of quantitative data alongside descriptive text. Code: https://github.com/gitter-lab/Assay2Mol

2 months назад @ paperswithcode.com
/hayatkhan8660-maker/ DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition
/hayatkhan8660-maker/ DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition

We employ forward Kullback-Leibler (KL) divergence alongside spatio-temporal focal modulation to effectively transfer both local and global context from the Video-FocalNet Base (teacher) to the proposed VFL-Net (student). Code: https://github.com/hayatkhan8660-maker/DVFL-Net

2 months назад @ paperswithcode.com
/JudyJuezhuLong/ Best Practices for Large-Scale, Pixel-Wise Crop Mapping and Transfer Learning Workflows
/JudyJuezhuLong/ Best Practices for Large-Scale, Pixel-Wise Crop Mapping and Transfer Learning Workflows

Cloudflare is unable to establish an SSL connection to the origin server.

If you're a visitor of this website:Please try again in a few minutes.

If you're the owner of this website:It appears that the SSL configuration used is not compatible with Cloudflare.

This could happen for a several reasons, including no shared cipher suites.

Additional troubleshooting information here.

2 months назад @ paperswithcode.com
/joaojcorreia/ A Fuzzy Approach to Project Success: Measuring What Matters
/joaojcorreia/ A Fuzzy Approach to Project Success: Measuring What Matters

This paper introduces a novel approach to project success evaluation by integrating fuzzy logic into an existing construct. Code: https://github.com/joaojcorreia/FuzzyLogic_ProjectSuccess

2 months назад @ paperswithcode.com
/kunkunlin1221/ InstructFLIP: Exploring Unified Vision-Language Model for Face Anti-spoofing
/kunkunlin1221/ InstructFLIP: Exploring Unified Vision-Language Model for Face Anti-spoofing

Extensive experiments demonstrate the effectiveness of InstructFLIP by outperforming SOTA models in accuracy and substantially reducing training redundancy across diverse domains in FAS. Code: https://github.com/kunkunlin1221/InstructFLIP

2 months назад @ paperswithcode.com
/Linvyl/ Describe Anything Model for Visual Question Answering on Text-rich Images
/Linvyl/ Describe Anything Model for Visual Question Answering on Text-rich Images

Recent progress has been made in region-aware vision-language modeling, particularly with the emergence of the Describe Anything Model (DAM). Code: https://github.com/Linvyl/DAM-QA

2 months назад @ paperswithcode.com
/abhijeet3922/ Developing Visual Augmented Q&A System using Scalable Vision Embedding Retrieval & Late Interaction Re-ranker
/abhijeet3922/ Developing Visual Augmented Q&A System using Scalable Vision Embedding Retrieval & Late Interaction Re-ranker

We propose multi-step custom implementation utilizing widely adopted hybrid search (metadata & embedding) and state of the art late interaction re-ranker to retrieve best matching pages. Code: https://github.com/abhijeet3922/vision-RAG

2 months назад @ paperswithcode.com
/ziangcao0312/ PhysX: Physical-Grounded 3D Asset Generation
/ziangcao0312/ PhysX: Physical-Grounded 3D Asset Generation

3D modeling is moving from virtual to physical. Code: https://github.com/ziangcao0312/PhysX

2 months назад @ paperswithcode.com
/henry123-boy/ SpatialTrackerV2: 3D Point Tracking Made Easy
/henry123-boy/ SpatialTrackerV2: 3D Point Tracking Made Easy

We present SpatialTrackerV2, a feed-forward 3D point tracking method for monocular videos. Code: https://github.com/henry123-boy/SpaTrackerV2

2 months назад @ paperswithcode.com
/cncs-fit/ Emergence of Functionally Differentiated Structures via Mutual Information Optimization in Recurrent Neural Networks
/cncs-fit/ Emergence of Functionally Differentiated Structures via Mutual Information Optimization in Recurrent Neural Networks

Analysis of network performance, correlation patterns, and weight matrices reveals that mutual information minimization yields high task performance alongside clear functional modularity and moderate structural modularity. Code: https://github.com/cncs-fit/mio_rnn

2 months назад @ paperswithcode.com
/coswindywang/ Making Language Model a Hierarchical Classifier and Generator
/coswindywang/ Making Language Model a Hierarchical Classifier and Generator

Language heads of the last layer are copied to different selected intermediate layers, and fine-tuned with different task inputs. Code: https://github.com/coswindywang/HdLM

2 months назад @ paperswithcode.com
/ahmedehabb/ From Roots to Rewards: Dynamic Tree Reasoning with RL
/ahmedehabb/ From Roots to Rewards: Dynamic Tree Reasoning with RL

Modern language models address complex questions through chain-of-thought (CoT) reasoning (Wei et al., 2023) and retrieval augmentation (Lewis et al., 2021), yet struggle with error propagation and knowledge integration. Code: https://github.com/ahmedehabb/From-Roots-to-Rewards-Dynamic-Tree-Reasoning-with-RL

2 months назад @ paperswithcode.com
💼 University and corporation labs
DeepMind DeepMind
последний пост 8 часов назад
Gemini achieves gold-level performance at the International Collegiate Programming Contest World Finals
Gemini achieves gold-level performance at the International Collegiate Programming Contest World Finals Gemini achieves gold-level performance at the International Collegiate Programming Contest World Finals

Gemini 2.5 Deep Think achieves breakthrough performance at the world’s most prestigious computer programming competition, demonstrating a profound leap in abstract problem solving.

An advanced version of Gemini 2.5 Deep Think has achieved gold-medal level performance at the 2025 International Collegiate Programming Contest (ICPC) World Finals.

This milestone builds directly on Gemini 2.5 Deep Think's gold-medal win at the International Mathematical Olympiad (IMO) just two months ago.

Innovations from these efforts will continue to be integrated into future versions of Gemini Deep Think, expanding the frontier of advanced AI capabilities accessible to students and researchers.

Gemini solved …

8 часов назад @ deepmind.google
Using AI to perceive the universe in greater depth
Using AI to perceive the universe in greater depth Using AI to perceive the universe in greater depth

Our novel Deep Loop Shaping method improves control of gravitational wave observatories, helping astronomers better understand the dynamics and formation of the universe.

In a paper published today in Science, we introduce Deep Loop Shaping, a novel AI method that will unlock next-generation gravitational-wave science.

Deep Loop Shaping reduces the noise level in the most unstable and difficult feedback loop at LIGO by 30 to 100 times, improving the stability of its highly-sensitive interferometer mirrors.

In the future, Deep Loop Shaping could also be applied to many other engineering problems involving vibration suppression, noise cancellation and highly dynamic or unstable systems import…

1 week, 6 days назад @ deepmind.google
Image editing in Gemini just got a major upgrade
Image editing in Gemini just got a major upgrade Image editing in Gemini just got a major upgrade

Today in the Gemini app, we're unveiling a new image editing model from Google DeepMind.

People have been going bananas over it already in early previews — it's the top-rated image editing model in the world.

Now, we're excited to share that it's integrated into the Gemini app so you have more control than ever to create the perfect picture.

Maintain your look as you editWe launched native image editing in the Gemini app earlier this year, and we’ve been working hard to improve it, with particular focus on maintaining a character's likeness from one image to the next.

Bring your vision to life with advanced editingHere are a few things to try as you explore this new image editing capability:

3 weeks, 1 day назад @ blog.google
Introducing Gemma 3 270M: The compact model for hyper-efficient AI
Introducing Gemma 3 270M: The compact model for hyper-efficient AI Introducing Gemma 3 270M: The compact model for hyper-efficient AI

Gemma 3 270M brings strong instruction-following capabilities to a small-footprint model.

Internal tests on a Pixel 9 Pro SoC show the INT4-quantized model used just 0.75% of the battery for 25 conversations, making it our most power-efficient Gemma model.

While this model is not designed for complex conversational use cases, it’s a strong model that follows general instructions right out of the box.

The results were stunning: the specialized Gemma model not only met but exceeded the performance of much larger proprietary models on its specific task.

It's the perfect starting point for creating a fleet of small, specialized models, each an expert at its own task.

1 month назад @ developers.googleblog.com
Introducing Gemma 3 270M: The compact model for hyper-efficient AI
Introducing Gemma 3 270M: The compact model for hyper-efficient AI Introducing Gemma 3 270M: The compact model for hyper-efficient AI

Gemma 3 270M brings strong instruction-following capabilities to a small-footprint model.

Internal tests on a Pixel 9 Pro SoC show the INT4-quantized model used just 0.75% of the battery for 25 conversations, making it our most power-efficient Gemma model.

While this model is not designed for complex conversational use cases, it’s a strong model that follows general instructions right out of the box.

The results were stunning: the specialized Gemma model not only met but exceeded the performance of much larger proprietary models on its specific task.

It's the perfect starting point for creating a fleet of small, specialized models, each an expert at its own task.

1 month назад @ developers.googleblog.com
Introducing Gemma 3 270M: The compact model for hyper-efficient AI
Introducing Gemma 3 270M: The compact model for hyper-efficient AI Introducing Gemma 3 270M: The compact model for hyper-efficient AI

Gemma 3 270M brings strong instruction-following capabilities to a small-footprint model.

Internal tests on a Pixel 9 Pro SoC show the INT4-quantized model used just 0.75% of the battery for 25 conversations, making it our most power-efficient Gemma model.

While this model is not designed for complex conversational use cases, it’s a strong model that follows general instructions right out of the box.

The results were stunning: the specialized Gemma model not only met but exceeded the performance of much larger proprietary models on its specific task.

It's the perfect starting point for creating a fleet of small, specialized models, each an expert at its own task.

1 month назад @ developers.googleblog.com
How AI is helping advance the science of bioacoustics to save endangered species
How AI is helping advance the science of bioacoustics to save endangered species How AI is helping advance the science of bioacoustics to save endangered species

Our new Perch model helps conservationists analyze audio faster to protect endangered species, from Hawaiian honeycreepers to coral reefs.

These recordings can tell us a lot about the animals present in a given area, along with other clues about the health of that ecosystem.

Today, we are releasing an update to Perch, our AI model designed to help conservationists analyze bioacoustic data.

This new model has better state-of-the-art off-the-shelf bird species predictions than the previous model.

It can disentangle complex acoustic scenes over thousands or even millions of hours of audio data.

1 month, 1 week назад @ deepmind.google
How AI is helping advance the science of bioacoustics to save endangered species
How AI is helping advance the science of bioacoustics to save endangered species How AI is helping advance the science of bioacoustics to save endangered species

Our new Perch model helps conservationists analyze audio faster to protect endangered species, from Hawaiian honeycreepers to coral reefs.

These recordings can tell us a lot about the animals present in a given area, along with other clues about the health of that ecosystem.

Today, we are releasing an update to Perch, our AI model designed to help conservationists analyze bioacoustic data.

This new model has better state-of-the-art off-the-shelf bird species predictions than the previous model.

It can disentangle complex acoustic scenes over thousands or even millions of hours of audio data.

1 month, 1 week назад @ 9e7ba71-dot-gdm-deepmind-com-prod.appspot.com
How AI is helping advance the science of bioacoustics to save endangered species
How AI is helping advance the science of bioacoustics to save endangered species How AI is helping advance the science of bioacoustics to save endangered species

Our new Perch model helps conservationists analyze audio faster to protect endangered species, from Hawaiian honeycreepers to coral reefs.

These recordings can tell us a lot about the animals present in a given area, along with other clues about the health of that ecosystem.

Today, we are releasing an update to Perch, our AI model designed to help conservationists analyze bioacoustic data.

This new model has better state-of-the-art off-the-shelf bird species predictions than the previous model.

It can disentangle complex acoustic scenes over thousands or even millions of hours of audio data.

1 month, 1 week назад @ 8fffa84-dot-gdm-deepmind-com-prod.appspot.com
Genie 3: A new frontier for world models
Genie 3: A new frontier for world models Genie 3: A new frontier for world models

World models are also a key stepping stone on the path to AGI, since they make it possible to train AI agents in an unlimited curriculum of rich simulation environments.

Last year we introduced the first foundation world models with Genie 1 and Genie 2, which could generate new environments for agents.

We have also continued to push the state of the art in video generation with our models Veo 2 and Veo 3, which exhibit a deep understanding of intuitive physics.

Each of these models marks progress along different capabilities of world simulation.

Genie 3 is our first world model to allow interaction in real-time, while also improving consistency and realism compared to Genie 2.

1 month, 1 week назад @ deepmind.google
Genie 3: A new frontier for world models
Genie 3: A new frontier for world models Genie 3: A new frontier for world models

World models are also a key stepping stone on the path to AGI, since they make it possible to train AI agents in an unlimited curriculum of rich simulation environments.

Last year we introduced the first foundation world models with Genie 1 and Genie 2, which could generate new environments for agents.

We have also continued to push the state of the art in video generation with our models Veo 2 and Veo 3, which exhibit a deep understanding of intuitive physics.

Each of these models marks progress along different capabilities of world simulation.

Genie 3 is our first world model to allow interaction in real-time, while also improving consistency and realism compared to Genie 2.

1 month, 1 week назад @ 8fffa84-dot-gdm-deepmind-com-prod.appspot.com
Genie 3: A new frontier for world models
Genie 3: A new frontier for world models Genie 3: A new frontier for world models

World models are also a key stepping stone on the path to AGI, since they make it possible to train AI agents in an unlimited curriculum of rich simulation environments.

Last year we introduced the first foundation world models with Genie 1 and Genie 2, which could generate new environments for agents.

We have also continued to push the state of the art in video generation with our models Veo 2 and Veo 3, which exhibit a deep understanding of intuitive physics.

Each of these models marks progress along different capabilities of world simulation.

Genie 3 is our first world model to allow interaction in real-time, while also improving consistency and realism compared to Genie 2.

1 month, 1 week назад @ 9e7ba71-dot-gdm-deepmind-com-prod.appspot.com
Rethinking how we measure AI intelligence
Rethinking how we measure AI intelligence Rethinking how we measure AI intelligence

Current AI benchmarks are struggling to keep pace with modern models.

As models reach closer to 100% on certain benchmarks, they also become less effective at revealing meaningful performance differences.

We continue to invest in new and more challenging benchmarks, but on the path to general intelligence, we need to continue to look for new ways to evaluate.

While we continue to evolve and pursue current AI benchmarks, we’re also consistently looking to test new approaches to evaluating models.

That’s why today, we're introducing the Kaggle Game Arena: a new, public AI benchmarking platform where AI models compete head-to-head in strategic games, providing a verifiable, and dynamic measure…

1 month, 2 weeks назад @ blog.google
Rethinking how we measure AI intelligence
Rethinking how we measure AI intelligence Rethinking how we measure AI intelligence

Current AI benchmarks are struggling to keep pace with modern models.

As models reach closer to 100% on certain benchmarks, they also become less effective at revealing meaningful performance differences.

We continue to invest in new and more challenging benchmarks, but on the path to general intelligence, we need to continue to look for new ways to evaluate.

While we continue to evolve and pursue current AI benchmarks, we’re also consistently looking to test new approaches to evaluating models.

That’s why today, we're introducing the Kaggle Game Arena: a new, public AI benchmarking platform where AI models compete head-to-head in strategic games, providing a verifiable, and dynamic measure…

1 month, 2 weeks назад @ blog.google
Rethinking how we measure AI intelligence
Rethinking how we measure AI intelligence Rethinking how we measure AI intelligence

Current AI benchmarks are struggling to keep pace with modern models.

As models reach closer to 100% on certain benchmarks, they also become less effective at revealing meaningful performance differences.

We continue to invest in new and more challenging benchmarks, but on the path to general intelligence, we need to continue to look for new ways to evaluate.

While we continue to evolve and pursue current AI benchmarks, we’re also consistently looking to test new approaches to evaluating models.

That’s why today, we're introducing the Kaggle Game Arena: a new, public AI benchmarking platform where AI models compete head-to-head in strategic games, providing a verifiable, and dynamic measure…

1 month, 2 weeks назад @ blog.google
Google
последний пост 9 часов назад
BigQuery under the hood: Scalability, reliability and usability enhancements for gen AI inference
BigQuery under the hood: Scalability, reliability and usability enhancements for gen AI inference BigQuery under the hood: Scalability, reliability and usability enhancements for gen AI inference

People often think of BigQuery in the context of data warehousing and analytics, but it is a crucial part of the AI ecosystem as well.

And today, we’re excited to share significant performance improvements to BigQuery that make it even easier to extract insights from your data with generative AI.

All these functions are compatible with text data in managed tables and unstructured object files, such as images and documents.

Reliability: Over 99.99% LLM inference query completion without any row failures; and over 99.99% row-level success rate across all jobs, with the rare per-row failures being easily retriable without failing the query.

We also enabled a single place for quota management (…

9 часов назад @ cloud.google.com
Announcing MCP Toolbox support for Firestore
Announcing MCP Toolbox support for Firestore Announcing MCP Toolbox support for Firestore

Our new pre-built tools for Firestore enable you to do just that, directly from your Gemini CLI or other AI-powered development environment.

She uses Gemini CLI to help her code, debug, and test.

Alex opens her terminal and asks:"Hey, show me the Firestore data for the test users qa_user_123 and qa_user_456 from the users-staging collection."

Gemini CLI understands this, calls the firestore-get-documents tool, and instantly displays the JSON for both user documents.

Gemini CLI confirms the plan and uses the firestore-update-document tool to perform the cleanup, clearing the way for a successful re-test.

9 часов назад @ cloud.google.com
How to secure your remote MCP server on Google Cloud
How to secure your remote MCP server on Google Cloud How to secure your remote MCP server on Google Cloud

As enterprises increasingly adopt model context protocol (MCP) to extend capabilities of AI models to better integrate with external tools, databases, and APIs, it becomes even more important to ensure secure MCP deployment.

MCP unlocks new capabilities for AI systems; it can also introduce new risks, such as tool poisoning, prompt injection, and dynamic tool manipulation.

Securing an MCP deployment begins with a strong security foundation.

Here are five key MCP deployment risks you should be aware of, and how using a centralized proxy architecture on Google Cloud can help mitigate them.

Top five MCP deployment risks you should knowWhile there are some broader risks unique to AI, these five…

9 часов назад @ cloud.google.com
Gemini and OSS text embeddings are now in BigQuery ML
Gemini and OSS text embeddings are now in BigQuery ML Gemini and OSS text embeddings are now in BigQuery ML

The Gemini embedding model utilizes a new quota control method known as Tokens Per Minute (TPM).

OSS embedding models in BigQuery MLThe OSS community is rapidly evolving the text-embedding model landscape, offering a wide spectrum of choices to fit any need.

Offerings range from top-ranking models like the recent Qwen3-Embedding & EmbeddingGemma to small, efficient, and cost-effective small models such as multilingual-e5-small .

You can now use any Hugging Face text embedding models (13K+ options) deployed to Vertex AI Model Garden in BigQuery ML.

To use an open-source text-embedding model, follow these steps.

1 day, 9 hours назад @ cloud.google.com
Powering AI commerce with the new Agent Payments Protocol (AP2)
Powering AI commerce with the new Agent Payments Protocol (AP2) Powering AI commerce with the new Agent Payments Protocol (AP2)

Adyen’s collaboration on Google’s Agent Payments Protocol (AP2) is a natural extension of our mission to provide the merchants with the payments building blocks for tomorrow’s commerce.

- Ingo Uytdehaage, Co-CEO at AdyenAirwallex: "Airwallex is thrilled to support Google’s Agent Payments Protocol (AP2) .

Google's new Agent Payments Protocol (AP2) is a huge step forward, providing the foundational framework to make this possible.

Google's Agent Payments Protocol (AP2) combines programmable payments via modern blockchains like Sui with open protocols like A2A and MCP that are enjoying rapid growth.

We believe the Agent Payments Protocol (AP2) and extension to the Agent2Agent protocol represen…

1 day, 12 hours назад @ cloud.google.com
Cloud CISO Perspectives: APAC security leaders speak out on AI and key topics
Cloud CISO Perspectives: APAC security leaders speak out on AI and key topics Cloud CISO Perspectives: APAC security leaders speak out on AI and key topics

CISOs are hopeful that bringing AI-driven automation to security operations workflows can help tip the scales towards defenders, said Franck Vervial, Regional CISO, APAC and MENA, L'Oreal.

As CISOs look for automated, agile defenses that scale beyond their existing security operations center (SOC) capacity, we’ve introduced our vision of an agentic SOC to help address the biggest security operations bottlenecks.

Agentic AI governance should follow the same guardrails for traditional AI systems, while implementing further measures for evolving security, privacy, and compliance risks, as appropriate.

Naturally, CISOs want to learn more about how AI can improve detection engineering, particula…

2 days, 9 hours назад @ cloud.google.com
Scaling high-performance inference cost-effectively
Scaling high-performance inference cost-effectively Scaling high-performance inference cost-effectively

The re-computation of the prefill phase is very inefficient and adds unnecessary latency, with the user experiencing delays between each question.

With prefix-aware routing, the system intelligently reuses the data from the initial query by routing the request back to the same KV cache.

This bypasses the prefill phase, allowing the model to answer almost instantly.

Enhancements in GKE Inference Gateway, llm-d, and vLLM, work together to enable dynamic selection of prefill and decode nodes based on query size.

When a developer submits a completion request, the application must first process the input codebase; this is referred to as the prefill phase.

1 week назад @ cloud.google.com
Fast and efficient AI inference with new NVIDIA Dynamo recipe on AI Hypercomputer
Fast and efficient AI inference with new NVIDIA Dynamo recipe on AI Hypercomputer Fast and efficient AI inference with new NVIDIA Dynamo recipe on AI Hypercomputer

As generative AI becomes more widespread, it’s important for developers and ML engineers to be able to easily configure infrastructure that supports efficient AI inference, i.e., using a trained AI model to make predictions or decisions based on new, unseen data.

Further, deploying large generative AI models can be both complex and resource-intensive.

That's why we are excited to announce a new recipe for disaggregated inferencing with NVIDIA Dynamo, a high-performance, low-latency platform for a variety of AI models.

Specifically, this recipe makes it easy to deploy NVIDIA Dynamo on Google Cloud’s AI Hypercomputer, including Google Kubernetes Engine (GKE), vLLM inference engine, and A3 Ult…

1 week назад @ cloud.google.com
Deliver intuitive shopping experiences with Conversational Commerce agent
Deliver intuitive shopping experiences with Conversational Commerce agent Deliver intuitive shopping experiences with Conversational Commerce agent

Consumer search behavior is shifting, with users now entering longer, more complex questions into search bars in pursuit of more relevant results.

We are excited to announce the general availability of Google Cloud’s Conversational Commerce agent designed to engage shoppers in natural, human-like conversations to guide them from initial intent to a completed purchase.

Companies like Albertsons Cos., who was a marquee collaborator on this product and is using Conversational Commerce agent within their Ask AI tool, are already seeing an impact.

Early results show customers using Ask AI often add one or more additional items to their cart, uncovering products they might not have found otherwis…

1 week назад @ cloud.google.com
Automate app deployment and security analysis with new Gemini CLI extensions
Automate app deployment and security analysis with new Gemini CLI extensions Automate app deployment and security analysis with new Gemini CLI extensions

Today, we’re closing the gap between your terminal and the cloud with a first look at the future of Gemini CLI, delivered through two new extensions: security extension and Cloud Run extension.

These commands are the first expression of a new extensibility framework for Gemini CLI.

While we'll be sharing more about the full Gemini CLI extension world soon, we couldn't wait to get these capabilities into your hands.

Security extension: automate security analysis with /security:analyzeTo help teams address software vulnerabilities early in the development lifecycle, we are launching the Gemini CLI Security extension.

And after the report is generated, you can also ask Gemini CLI to save it to…

1 week назад @ cloud.google.com
Introducing no-cost, multicloud Data Transfer Essentials for EU and U.K. customers
Introducing no-cost, multicloud Data Transfer Essentials for EU and U.K. customers Introducing no-cost, multicloud Data Transfer Essentials for EU and U.K. customers

We pioneered a multicloud data warehouse, enabling workloads to run across clouds.

We continue this open approach with the launch today of our new Data Transfer Essentials service for customers in the European Union and the United Kingdom.

Built in response to the principles of cloud interoperability and choice outlined in the EU Data Act, Data Transfer Essentials is a new, simple solution for data transfers between Google Cloud and other cloud service providers.

Although the Act allows cloud providers to pass through costs to customers, Data Transfer Essentials is available today at no cost to customers.

Read more about Data Transfer Essentials here.

1 week назад @ cloud.google.com
Introducing the Agentic SOC Workshops for security professionals
Introducing the Agentic SOC Workshops for security professionals Introducing the Agentic SOC Workshops for security professionals

The security operations centers of the future will use agentic AI to enable intelligent automation of routine tasks, augment human decision-making, and streamline workflows.

At Google Cloud, we want to help prepare today’s security professionals to get the most out of tomorrow’s AI agents.

As we build our agentic vision, we’re also excited to invite you to the first Agentic SOC Workshop: Practical AI for Today's Security Teams.

Ultimately, we believe that agentic AI will empower security professionals to focus more on complex investigations and strategic initiatives, and drive better security outcomes and operational efficiency.

We are building the next class of security experts empowered b…

1 week, 1 day назад @ cloud.google.com
Announcing partner-built AI security innovations on Google Cloud
Announcing partner-built AI security innovations on Google Cloud Announcing partner-built AI security innovations on Google Cloud

Many of our partners are using Google Cloud's AI to build new defenses and automate security operations, transforming the security landscape.

Today, we’re excited to announce new security solutions from our partners, also available in the Google Cloud Marketplace.

CrowdStrike has enhanced security operations by embedding AI directly into its Falcon platform, and integrating platform data with Google Security Operations and Google Agentspace.

Fortinet has also unveiled how its product ecosystem can help secure AI workloads on Google Cloud.

Palo Alto Networks has designed Prisma AIRS to protect AI and agentic workloads on Google Cloud, including Vertex AI and Agentspace.

1 week, 1 day назад @ cloud.google.com
Registration now open: Our no-cost, generative AI training and certification program for veterans
Registration now open: Our no-cost, generative AI training and certification program for veterans Registration now open: Our no-cost, generative AI training and certification program for veterans

Introduced in 2024, this three-week, no-cost, virtual training program provides veterans with the foundational skills necessary to jump start rewarding careers with generative AI.

Become a leader in AI digital transformation with this virtual, no-cost programThe Gen AI Leader training, which does not require previous technical experience, kicks off with a two-day virtual event on November 13th and 14th.

After completing the program, you’ll receive a complimentary voucher to take the Gen AI Leader exam.

After successful completion, you’ll receive Google’s industry-recognized Gen AI Leader certification, a valuable credential to help you advance your career.

Learn more about Google Public Sec…

1 week, 2 days назад @ cloud.google.com
Calling all devs: Join the GKE Turns 10 Hackathon and build with agentic AI
Calling all devs: Join the GKE Turns 10 Hackathon and build with agentic AI Calling all devs: Join the GKE Turns 10 Hackathon and build with agentic AI

Hands-on learning with GKE: This is your shot to build the next evolution of applications by integrating agentic AI capabilities on GKE.

When you join the GKE Turns 10 Hackathon, your mission is to take pre-existing microservice applications (either Bank of Anthos or Online Boutique) and then integrate cutting-edge agentic AI capabilities.

Streamline maintenance and mitigation: Develop an agent that intelligently monitors microservice performance on GKE, suggests troubleshooting steps, and even automates remediation.

Crucial note: Your project must be built using GKE and Google AI models such as Gemini, focusing on how the agents interact with your chosen microservice application.

As long a…

1 week, 5 days назад @ cloud.google.com
OpenAI
последний пост None
Microsoft Microsoft
последний пост 6 days, 9 hours назад
Tool-space interference in the MCP era: Designing for agent compatibility at scale
Tool-space interference in the MCP era: Designing for agent compatibility at scale Tool-space interference in the MCP era: Designing for agent compatibility at scale

Hugging Face is now serving many Spaces apps over MCP (opens in new tab), and Shopify has enabled MCP for millions of storefronts (opens in new tab).

Figure 1: We can extend Magentic-One by adding an agent that equips the GitHub MCP server.

Likewise, Docker MCP Hub is a registry that distributes MCP servers as Docker images, and we manually collected popular entries.

To automate the inspection of running MCP servers, we developed an MCP Interviewer tool.

One-size fits all (but some more than others)So, what does our survey of MCP servers tell us about the MCP ecosystem?

6 days, 9 hours назад @ microsoft.com
RenderFormer: How neural networks are reshaping 3D rendering
RenderFormer: How neural networks are reshaping 3D rendering RenderFormer: How neural networks are reshaping 3D rendering

This shift is giving rise to a new field known as neural rendering.

Neural rendering combines deep learning with traditional graphics techniques, allowing models to simulate complex light transport without explicitly modeling physical optics.

RenderFormer: Toward a general-purpose neural rendering modelTo overcome these limitations, researchers at Microsoft Research have developed RenderFormer, a new neural architecture designed to support full-featured 3D rendering using only ML—no traditional graphics computation required.

View-independent rendering effects decoded directly from the view-independent transformer, including diffuse lighting and coarse shadow effects.

3D animation sequence r…

1 week назад @ microsoft.com
Breaking the networking wall in AI infrastructure
Breaking the networking wall in AI infrastructure Breaking the networking wall in AI infrastructure

Traditional copper links are power-efficient and reliable, but limited to very short distances (< 2 meters) that restrict their use to within a single GPU rack.

With copper links, higher channel speeds lead to greater signal integrity challenges, which limits their reach.

With optical links, high-speed transmission is inherently inefficient, requiring power-hungry laser drivers and complex electronics to compensate for transmission impairments.

This imbalance ultimately erects a networking wall akin to the memory wall, in which CPU speeds have outstripped memory speeds, creating performance bottlenecks.

Operating at low speed improves power efficiency by eliminating the need for complex ele…

1 week, 1 day назад @ microsoft.com
Crescent library brings privacy to digital identity systems
Crescent library brings privacy to digital identity systems Crescent library brings privacy to digital identity systems

To address this, we have released Crescent (opens in new tab), a cryptographic library that adds unlinkability to widely used identity formats, protecting privacy.

These include JSON Web Tokens (the authentication standard behind many app logins) and mobile driver’s licenses.

Some digital identity systems already offer selective disclosure, allowing users to share only specific pieces of information in each interaction.

SetupA Crescent service pre-generates the zero-knowledge parameters for creating and verifying proofs from JSON Web Tokens and mobile driver’s licenses.

Age verification: The user presents their mobile driver’s license to a social network, proving they are over 18.

3 weeks, 1 day назад @ microsoft.com
Applicability vs. job displacement: further notes on our recent research on AI and occupations
Applicability vs. job displacement: further notes on our recent research on AI and occupations Applicability vs. job displacement: further notes on our recent research on AI and occupations

Recently, we released a paper (Working with AI: Measuring the Occupational Implications of Generative AI) that studied what occupations might find AI chatbots useful, and to what degree.

We set out to better understand how people are using AI, highlighting where AI might be useful in different occupations.

So, to summarize, our paper is about identifying the occupations where AI may be most useful, by assisting or performing subtasks.

Finally, we only evaluated AI chatbot usage, so this study does not evaluate the impact or applicability of other forms of AI.

The public interest in our research is based, in large part, on the topic of AI and job displacement.

3 weeks, 6 days назад @ microsoft.com
Coauthor roundtable: Reflecting on healthcare economics, biomedical research, and medical education
Coauthor roundtable: Reflecting on healthcare economics, biomedical research, and medical education Coauthor roundtable: Reflecting on healthcare economics, biomedical research, and medical education

KOHANE: So I think you’ve “nerd sniped” me because you [LAUGHTER]—which is all too easy—but I think there’s a central issue here.

But I actually think this is dark matter of human organizational technology that is not well understood.

AZEEM AZHAR: We didn’t talk about, you know, AI in its ability to potentially do this, which is to extend the clinician’s presence throughout the week.

And so I think there’s always going to be an opening for either differences of opinion or agreeing with you too much.

And this gets into whether AI is really going to get almost to the ab initio understanding of human biology.

3 weeks, 6 days назад @ microsoft.com
MindJourney enables AI to explore simulated 3D worlds to improve spatial interpretation
MindJourney enables AI to explore simulated 3D worlds to improve spatial interpretation MindJourney enables AI to explore simulated 3D worlds to improve spatial interpretation

A new research framework helps AI agents explore three-dimensional spaces they can’t directly detect.

Called MindJourney, the approach addresses a key limitation in vision-language models (VLMs), which give AI agents their ability to interpret and describe visual scenes.

MindJourney applies the same process to AI agents, letting them roam a virtual space before answering spatial questions.

Given a spatial reasoning query, MindJourney searches through the imagined 3D space using a world model and improves the VLM’s spatial interpretation through generated observations when encountering new challenges.

This enhancement could enable agents more accurately interpret spatial relationships and ph…

4 weeks назад @ microsoft.com
Dion: the distributed orthonormal update revolution is here
Dion: the distributed orthonormal update revolution is here Dion: the distributed orthonormal update revolution is here

And yet, last December, a new optimizer called Muon (opens in new tab) showed serious promise by powering a nanoGPT speedrun (opens in new tab).

Going back to the inspiration for Muon, the key idea is an orthonormal update, which sparked the search for more scalable alternative linear algebras realizing the same goal.

This is achieved by enforcing orthonormality (opens in new tab) on the update matrix, thereby equalizing its effect across all input directions.

Wall-clock time speedup of Dion for 3B model trainingWhy would adding a constraint improve the update rule?

Using Dion with rank fraction 1/16 or lower offers an order-of-magnitude speedup over Muon.

1 month назад @ microsoft.com
Reimagining healthcare delivery and public health with AI
Reimagining healthcare delivery and public health with AI Reimagining healthcare delivery and public health with AI

All this is critical for maintaining good public health, as well as aligning better health and financial outcomes.

Umair and Gianrico are CEO-level leaders representing some of the best of the worlds of public health, healthcare delivery, medical research, and medical education.

So there’s so many use cases that AI has for population health or public health.

LEE: I find it really interesting that you are using the terms “public health” and “population health” …SHAH: Yeah.

As Umair explained in our discussion, the core of public health is the idea of population health, the idea of extracting new health insights from signals from population-scale data.

1 month, 1 week назад @ microsoft.com
Self-adaptive reasoning for science
Self-adaptive reasoning for science Self-adaptive reasoning for science

Recent approaches show promise in the utilization of intrinsic rewards for training reasoning models (RLIR).

While today’s reasoning models require additional data in the training phase and limit user control during the reasoning generation process, CLIO’s approach enables users to steer reasoning from scratch without additional data.

Through this open architecture approach to reasoning, we alleviate the necessity for further model post-training to achieve desired reasoning behavior.

Capabilities such as explaining the outcomes of internal reasoning are standard in the scientific field and are present in current reasoning model approaches.

Implications for science and trustworthy discoveryT…

1 month, 1 week назад @ microsoft.com
VeriTrail: Detecting hallucination and tracing provenance in multi-step AI workflows
VeriTrail: Detecting hallucination and tracing provenance in multi-step AI workflows VeriTrail: Detecting hallucination and tracing provenance in multi-step AI workflows

Instead, an LM was prompted to classify the claim as “Fully Supported,” “Not Fully Supported,” or “Inconclusive” based on the evidence.

In this case, the verdict was “Fully Supported.”Since the verdict in Iteration 1 was “Fully Supported,” VeriTrail proceeded to Iteration 2.

Right: VeriTrail’s hallucination detection process for a “Not Fully Supported” claim, where the maximum number of consecutive “Not Fully Supported” verdicts was set to 2.

Once again, the verdict was “Not Fully Supported.” Since this was the second consecutive “Not Fully Supported” verdict, verification terminated and the verdict was deemed final.

A higher maximum increases confidence that a flagged claim is truly halluc…

1 month, 1 week назад @ microsoft.com
Project Ire autonomously identifies malware at scale
Project Ire autonomously identifies malware at scale Project Ire autonomously identifies malware at scale

To verify its findings, Project Ire can invoke a validator tool that cross-checks claims in the report against the chain of evidence.

This tool draws on expert statements from malware reverse engineers on the Project Ire team.

For each file it analyzes, Project Ire generates a report that includes an evidence section, summaries of all examined code functions, and other technical artifacts.

Project Ire correctly identified the code that locates and disables antivirus programs, providing evidence that the file was malicious.

The issue was later resolved by updating decompiler rules, but this example illustrates how Project Ire navigates uncertainty during analysis.

1 month, 3 weeks назад @ microsoft.com
Navigating medical education in the era of generative AI
Navigating medical education in the era of generative AI Navigating medical education in the era of generative AI

Prior to med school, Daniel pursued experiences that cultivated his interest in the application of AI in medical practice and education.

Really, really looking forward to this chat.

There’s AI before ChatGPT and before, you know, generative AI really became a big thing, and then afterwards.

And then after we talk about what’s really happening, what do you think should happen in medical education given the reality of generative AI?

And I do agree [that] AI really gives us real hope that we can make it true.

1 month, 3 weeks назад @ microsoft.com
Xinxing Xu bridges AI research and real-world impact at Microsoft Research Asia – Singapore
Xinxing Xu bridges AI research and real-world impact at Microsoft Research Asia – Singapore Xinxing Xu bridges AI research and real-world impact at Microsoft Research Asia – Singapore

In 2024, Xu joined Microsoft Research Asia where he began a new chapter focused on bridging between academic research and real-world AI applications.

“Microsoft Research Asia is committed to integrating scientific exploration with real-world applications, which creates a unique research environment,” Xu says.

Microsoft Research Asia – Singapore: Expanding global reach, connecting regional innovationRealizing AI’s full potential requires more than technical breakthroughs.

This is why Microsoft Research Asia continues to collaborate closely with Singapore’s top universities, research institutions, and industry partners.

– “Core knowledge in machine learning, linear algebra, and probability an…

1 month, 3 weeks назад @ microsoft.com
Technical approach for classifying human-AI interactions at scale
Technical approach for classifying human-AI interactions at scale Technical approach for classifying human-AI interactions at scale

From batching strategies and token optimization and orchestration, we’ll share what it takes to build a real-time LLM classification pipeline.

Engineering challenges & solutionsBuilding a high-throughput, LLM-powered classification pipeline at scale introduced a range of engineering challenges—from managing latency and token limits to ensuring system resilience.

Evolving LLM models & prompt alignmentChallenge: Each new LLM release—such as Phi, Mistral, DeepSeek, and successive generations of GPT (e.g., GPT-3.5, GPT-4, GPT-4 Turbo, GPT-4o)—brings improvements, but also subtle behavioral shifts.

Dynamic concurrency scaling for LLM callsChallenge: LLM endpoints frequently encounter rate limits…

1 month, 3 weeks назад @ microsoft.com
MIT AI MIT AI
последний пост 1 day, 10 hours назад
How to build AI scaling laws for efficient LLM training and budget maximization
How to build AI scaling laws for efficient LLM training and budget maximization How to build AI scaling laws for efficient LLM training and budget maximization

According to Choshen, elucidating scaling laws not only enable better pre-training decisions, but also democratize the field by enabling researchers without vast resources to understand and build effective scaling laws.

In general, scaling laws aren’t new; however, in the field of AI, they emerged as models grew and costs skyrocketed.

“There hadn’t really been a lot of systematic meta-analysis, as everybody is individually training their own scaling laws.

With this, the team compared the scaling laws, and after analysis, distilled practical recommendations for AI practitioners about what makes effective scaling laws.

Unexpectedly, the researchers found that it’s possible to utilize the scal…

1 day, 10 hours назад @ news.mit.edu
Machine-learning tool gives doctors a more detailed 3D picture of fetal health
Machine-learning tool gives doctors a more detailed 3D picture of fetal health Machine-learning tool gives doctors a more detailed 3D picture of fetal health

That is, until a new approach called “Fetal SMPL” from MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL), Boston Children’s Hospital (BCH), and Harvard Medical School presented clinicians with a more detailed picture of fetal health.

Fetal SMPL was then trained on 20,000 MRI volumes to predict the location and size of a fetus and create sculpture-like 3D representations.

The extensive, real-world scans that Fetal SMPL learned from helped it develop pinpoint accuracy.

Fetal SMPL was only misaligned by an average of about 3.1 millimeters, a gap smaller than a single grain of rice.

Such upgrades would make the models more human-like, but the current version of Fetal SMPL al…

2 days, 11 hours назад @ news.mit.edu
DoE selects MIT to establish a Center for the Exascale Simulation of Coupled High-Enthalpy Fluid–Solid Interactions
DoE selects MIT to establish a Center for the Exascale Simulation of Coupled High-Enthalpy Fluid–Solid Interactions DoE selects MIT to establish a Center for the Exascale Simulation of Coupled High-Enthalpy Fluid–Solid Interactions

The U.S. Department of Energy’s National Nuclear Security Administration (DoE/NNSA) recently announced that it has selected MIT to establish a new research center dedicated to advancing the predictive simulation of extreme environments, such as those encountered in hypersonic flight and atmospheric re-entry.

The Center for the Exascale Simulation of Coupled High-Enthalpy Fluid–Solid Interactions (CHEFSI) — a joint effort of the MIT Center for Computational Science and Engineering, the MIT Schwarzman College of Computing, and the MIT Institute for Soldier Nanotechnologies (ISN) — plans to harness cutting-edge exascale supercomputers and next-generation algorithms to simulate with unprecedent…

1 week назад @ news.mit.edu
AI and machine learning for engineering design
AI and machine learning for engineering design AI and machine learning for engineering design

In Ahmed’s course, 2.155/156 (AI and Machine Learning for Engineering Design), students use tools and techniques from artificial intelligence and machine learning for mechanical engineering design, focusing on the creation of new products and addressing engineering design challenges.

“When people think about mechanical engineering, they're thinking about basic mechanical tools like hammers and … hardware like cars, robots, cranes, but mechanical engineering is very broad,” says Faez Ahmed, the Doherty Chair in Ocean Utilization and associate professor of mechanical engineering at MIT.

“There’s a lot of reason for mechanical engineers to think about machine learning and AI to essentially exp…

1 week, 3 days назад @ news.mit.edu
A greener way to 3D print stronger stuff
A greener way to 3D print stronger stuff A greener way to 3D print stronger stuff

3D printing has come a long way since its invention in 1983 by Chuck Hull, who pioneered stereolithography, a technique that solidifies liquid resin into solid objects using ultraviolet lasers.

The vast majority of consumer and industrial 3D printing still relies on petroleum-based plastic filament.

The rest of the part can be printed using greener, weaker filament, reducing plastic use while preserving structural integrity.

Using this ratio, SustainaPrint was able to recover up to 70 percent of the strength of an object printed entirely with high-performance plastic.

“It turns these abstract concepts into something tangible.”As 3D printing becomes more embedded in how we manufacture and pr…

1 week, 6 days назад @ news.mit.edu
A new generative AI approach to predicting chemical reactions
A new generative AI approach to predicting chemical reactions A new generative AI approach to predicting chemical reactions

Many attempts have been made to harness the power of new artificial intelligence and large language models (LLMs) to try to predict the outcomes of new chemical reactions.

Now, a team of researchers at MIT has come up with a way of incorporating these physical constraints on a reaction prediction model, and thus greatly improving the accuracy and reliability of its outputs.

So, this requires us to know what product is likely” to result from a given set of chemical inputs to a reaction.

This representation, he says, was one of the key elements to including mass conservation in their prediction system.

They say the model could potentially be relevant for predicting reactions for medicinal che…

2 weeks назад @ news.mit.edu
3 Questions: The pros and cons of synthetic data in AI
3 Questions: The pros and cons of synthetic data in AI 3 Questions: The pros and cons of synthetic data in AI

Synthetic data are artificially generated by algorithms to mimic the statistical properties of actual data, without containing any information from real-world sources.

To unpack some pros and cons of using synthetic data, MIT News spoke with Kalyan Veeramachaneni, a principal research scientist in the Laboratory for Information and Decision Systems and co-founder of DataCebo whose open-core platform, the Synthetic Data Vault, helps users generate and test synthetic data.

Plus, the model creates synthetic data in a way that captures all the underlying rules and infinite patterns that exist in the real data.

Synthetic data provide data augmentation — additional data examples that are similar …

2 weeks назад @ news.mit.edu
3 Questions: On biology and medicine’s “data revolution”
3 Questions: On biology and medicine’s “data revolution” 3 Questions: On biology and medicine’s “data revolution”

In this interview, she discusses machine learning in biology, areas that are ripe for problem-solving, and cutting-edge research coming out of the Schmidt Center.

What, within the current landscape of machine learning, makes now the right time to work on these specific problem classes?

Importantly, biology is poised to be not just a beneficiary of machine learning, but also a significant source of inspiration for new ML research.

A: Machine learning has demonstrated remarkable success in predictive tasks across domains such as image classification, natural language processing, and clinical risk modeling.

I believe that addressing these challenges will not only unlock new insights into the m…

2 weeks, 1 day назад @ news.mit.edu
MIT researchers develop AI tool to improve flu vaccine strain selection
MIT researchers develop AI tool to improve flu vaccine strain selection MIT researchers develop AI tool to improve flu vaccine strain selection

But if the prediction is off, protection can drop significantly, leading to (potentially preventable) illness and strain on health care systems.

They created an AI system called VaxSeer, designed to predict dominant flu strains and identify the most protective vaccine candidates, months ahead of time.

For antigenicity, the system estimates how well a given vaccine strain will perform in a common lab test called the hemagglutination inhibition assay.

Applying the system to other viruses would also require large, high-quality datasets that track both viral evolution and immune responses — data that aren’t always publicly available.

The team, however is currently working on the methods that ca…

2 weeks, 6 days назад @ news.mit.edu
Simpler models can outperform deep learning at climate prediction
Simpler models can outperform deep learning at climate prediction Simpler models can outperform deep learning at climate prediction

The team demonstrates that, in certain climate scenarios, much simpler, physics-based models can generate more accurate predictions than state-of-the-art deep-learning models.

This could lead someone to believe a deep-learning model makes more accurate predictions when that is not the case.

The researchers see their work as a “cautionary tale” about the risk of deploying large AI models for climate science.

Scientists often create climate emulators, simpler approximations of a state-of-the art climate model, which are faster and more accessible.

They found that the high amount of natural variability in climate model runs can cause the deep learning model to perform poorly on unpredictable l…

3 weeks, 1 day назад @ news.mit.edu
New technologies tackle brain health assessment for the military
New technologies tackle brain health assessment for the military New technologies tackle brain health assessment for the military

Cognitive readiness denotes a person's ability to respond and adapt to the changes around them.

For military service members, cognitive readiness is crucial for their health and safety, as well as mission success.

"Unfortunately, the cumulative effects of these exposures are often not well-documented during military service or after transition to Veterans Affairs, making it challenging to provide effective support."

Smalt is part of a team at the laboratory developing a suite of portable diagnostic tests that provide near-real-time screening for brain injury and cognitive health.

Both READY and MINDSCAPE are a response to a series of Congressional legislation mandates, military program requ…

3 weeks, 2 days назад @ news.mit.edu
Can large language models figure out the real world?
Can large language models figure out the real world? Can large language models figure out the real world?

“Humans all the time have been able to make this transition from good predictions to world models,” says Vafa, the study’s lead author.

So the question their team was addressing was, “have foundation models — has AI — been able to make that leap from predictions to world models?

“It would need to adapt, have a world model to adapt to any possible task.”Are AI systems anywhere near the ability to reach such generalizations?

To test the question, the team looked at different examples of predictive AI systems, at different levels of complexity.

The answer is yes: Predictive models do well at reconstructing the “world” in such a simple case.

3 weeks, 2 days назад @ news.mit.edu
A new model predicts how molecules will dissolve in different solvents
A new model predicts how molecules will dissolve in different solvents A new model predicts how molecules will dissolve in different solvents

Common organic solvents include ethanol and acetone, and there are hundreds of others that can also be used in chemical reactions.

The model could be particularly useful for identifying solvents that are less hazardous than some of the most commonly used industrial solvents, the researchers say.

Before Burns and Attia began working on their new model, the state-of-the-art model for predicting solubility was a model developed in Green’s lab in 2022.

Part of the reason that existing solubility models haven’t worked well is because there wasn’t a comprehensive dataset to train them on.

“ChemProp should always outperform any static embedding when you have sufficient data,” Burns says.

4 weeks, 1 day назад @ news.mit.edu
Researchers glimpse the inner workings of protein language models
Researchers glimpse the inner workings of protein language models Researchers glimpse the inner workings of protein language models

These models, which are based on large language models (LLMs), can make very accurate predictions of a protein’s suitability for a given application.

Opening the black boxIn 2018, Berger and former MIT graduate student Tristan Bepler PhD ’20 introduced the first protein language model.

Protein language models use a similar approach, but instead of analyzing words, they analyze amino acid sequences.

In the new study, the researchers wanted to dig into how protein language models make their predictions.

The new study from Berger’s lab is the first to use this algorithm on protein language models.

1 month назад @ news.mit.edu
How AI could speed the development of RNA vaccines and other RNA therapies
How AI could speed the development of RNA vaccines and other RNA therapies How AI could speed the development of RNA vaccines and other RNA therapies

Using artificial intelligence, MIT researchers have come up with a new way to design nanoparticles that can more efficiently deliver RNA vaccines and other types of RNA therapies.

This approach could dramatically speed the process of developing new RNA vaccines, as well as therapies that could be used to treat obesity, diabetes, and other metabolic disorders, the researchers say.

Particle predictionsRNA vaccines, such as the vaccines for SARS-CoV-2, are usually packaged in lipid nanoparticles (LNPs) for delivery.

Creating particles that handle these jobs more efficiently could help researchers to develop even more effective vaccines.

“Most AI models in drug discovery focus on optimizing a s…

1 month назад @ news.mit.edu
Berkeley AI
последний пост 5 months, 1 week назад
Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign)
Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign) Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign)

Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign)Recent advances in Large Language Models (LLMs) enable exciting LLM-integrated applications.

To mitigate the imminent prompt injection threat, we propose two fine-tuning-defenses, StruQ and SecAlign.

Prompt Injection Attack: CausesBelow is the threat model of prompt injection attacks.

Prompt injection threat model in LLM-integrated applicationsWe propose that prompt injection has two causes.

Below are resources to learn more and keep updated on prompt injection attacks and defenses.

5 months, 1 week назад @ bair.berkeley.edu
Repurposing Protein Folding Models for Generation with Latent Diffusion
Repurposing Protein Folding Models for Generation with Latent Diffusion Repurposing Protein Folding Models for Generation with Latent Diffusion

Repurposing Protein Folding Models for Generation with Latent DiffusionPLAID is a multimodal generative model that simultaneously generates protein 1D sequence and 3D structure, by learning the latent space of protein folding models.

In PLAID, we develop a method that learns to sample from the latent space of protein folding models to generate new proteins.

Unlike many previous protein structure generative models, PLAID addresses the multimodal co-generation problem setting: simultaneously generating both discrete sequence and continuous all-atom structural coordinates.

In this way, we can use structural understanding information in the weights of pretrained protein folding models for the p…

5 months, 1 week назад @ bair.berkeley.edu
Scaling Up Reinforcement Learning for Traffic Smoothing: A 100-AV Highway Deployment
Scaling Up Reinforcement Learning for Traffic Smoothing: A 100-AV Highway Deployment Scaling Up Reinforcement Learning for Traffic Smoothing: A 100-AV Highway Deployment

Scaling Up Reinforcement Learning for Traffic Smoothing: A 100-AV Highway DeploymentTraining Diffusion Models with Reinforcement LearningWe deployed 100 reinforcement learning (RL)-controlled cars into rush-hour highway traffic to smooth congestion and reduce fuel consumption for everyone.

The challenges of phantom jamsA stop-and-go wave moving backwards through highway traffic.

Smoothing behavior of RL AVs.

Overall, the steps towards deployment involved:Training in data-driven simulations: We used highway traffic data from I-24 to create a training environment with realistic wave dynamics, then validate the trained agent’s performance and robustness in a variety of new traffic scenarios.…

5 months, 3 weeks назад @ bair.berkeley.edu
AWS Machine Learning AWS Machine Learning
последний пост 5 часов назад
Supercharge your organization’s productivity with the Amazon Q Business browser extension
Supercharge your organization’s productivity with the Amazon Q Business browser extension Supercharge your organization’s productivity with the Amazon Q Business browser extension

We recently launched the Amazon Q Business browser extension in Amazon Q Business, and it is now available to Amazon Q Business subscribers (Lite and Pro).

Create an Amazon Q Business application and subscribe your usersThe Amazon Q Business browser extension is a feature of Amazon Q Business and requires customers to first create an Amazon Q Business application and subscribe their users before the browser extension can be enabled.

However, if you have developed a custom web experience using the Amazon Q Business APIs, complete the following steps to create an Amazon Q Business web experience:On the Amazon Q Business console, go to your Amazon Q Business application.

To deploy the Amazon Q…

5 часов назад @ aws.amazon.com
Build Agentic Workflows with OpenAI GPT OSS on Amazon SageMaker AI and Amazon Bedrock AgentCore
Build Agentic Workflows with OpenAI GPT OSS on Amazon SageMaker AI and Amazon Bedrock AgentCore Build Agentic Workflows with OpenAI GPT OSS on Amazon SageMaker AI and Amazon Bedrock AgentCore

These agents collaborate within Amazon Bedrock AgentCore Runtime, and when language understanding or generation is required, they invoke a GPT OSS model hosted on SageMaker AI.

If this is your first time working with Amazon SageMaker Studio, you first need to create a SageMaker domain.

For more information, see How Amazon SageMaker AI works with IAM in the SageMaker Developer Guide.

Deploy to Amazon Bedrock AgentCoreAfter you have developed and tested your LangGraph framework locally, you can deploy it to Amazon Bedrock AgentCore Runtime.

After you deploy to Amazon Bedrock AgentCore Runtime, you will be able to see the status show as Ready on the Amazon Bedrock AgentCore console.

5 часов назад @ aws.amazon.com
Streamline access to ISO-rating content changes with Verisk rating insights and Amazon Bedrock
Streamline access to ISO-rating content changes with Verisk rating insights and Amazon Bedrock Streamline access to ISO-rating content changes with Verisk rating insights and Amazon Bedrock

Verisk Rating Insights as a feature of ISO Electronic Rating Content (ERC) is a powerful tool designed to provide summaries of ISO Rating changes between two releases.

By integrating Anthropic’s Claude, available in Amazon Bedrock, and Amazon OpenSearch Service, Verisk created a sophisticated conversational platform where users can effortlessly access and analyze rating content changes.

Verisk’s generative AI solution is a comprehensive, secure, and flexible service for building generative AI applications and agents.

Business impact and opportunityBy integrating generative AI into Verisk Rating Insights, the business has seen a remarkable transformation.

Verisk Rating Insights, now powered …

1 day, 8 hours назад @ aws.amazon.com
Unified multimodal access layer for Quora’s Poe using Amazon Bedrock
Unified multimodal access layer for Quora’s Poe using Amazon Bedrock Unified multimodal access layer for Quora’s Poe using Amazon Bedrock

Together, they developed a unified wrapper API framework that streamlines the deployment of Amazon Bedrock FMs on Quora’s Poe system.

Initially, integrating the diverse FMs available through Amazon Bedrock presented significant technical challenges for the Poe.com team.

Technical challenge: Bridging different systemsThe integration between Poe and Amazon Bedrock presented fundamental architectural challenges that required innovative solutions.

Solution overviewThe wrapper API framework provides a unified interface between Poe and Amazon Bedrock models.

– Converts responses from Amazon Bedrock format to Poe’s expected format and vice-versa, handling streaming chunks, image data, and video ou…

1 day, 8 hours назад @ aws.amazon.com
Schedule topology-aware workloads using Amazon SageMaker HyperPod task governance
Schedule topology-aware workloads using Amazon SageMaker HyperPod task governance Schedule topology-aware workloads using Amazon SageMaker HyperPod task governance

SageMaker HyperPod task governance streamlines resource allocation and facilitates efficient compute resource utilization across teams and projects on Amazon Elastic Kubernetes Service (Amazon EKS) clusters.

Refer to Best practices for Amazon SageMaker HyperPod task governance for more information.

In this post, we introduce topology-aware scheduling with SageMaker HyperPod task governance by submitting jobs that represent hierarchical network information.

In this post, we discussed how SageMaker HyperPod task governance helps schedule workloads to enable job efficiency by optimizing throughput and latency.

We also walked through how to schedule jobs using SageMaker HyperPod topology networ…

2 days, 8 hours назад @ aws.amazon.com
How msg enhanced HR workforce transformation with Amazon Bedrock and msg.ProfileMap
How msg enhanced HR workforce transformation with Amazon Bedrock and msg.ProfileMap How msg enhanced HR workforce transformation with Amazon Bedrock and msg.ProfileMap

msg.ProfileMap is a cloud-based application that uses several AWS services, notably Amazon Neptune, Amazon DynamoDB, and Amazon Bedrock.

Amazon Bedrock met these needs by providing a fully managed, serverless interface to leading foundation models—without the need to manage infrastructure or deploy custom machine learning stacks.

Unlike hosting models on Amazon Elastic Compute Cloud (Amazon EC2) or Amazon SageMaker, Amazon Bedrock abstracts away provisioning, versioning, scaling, and model selection.

Conclusionmsg’s successful integration of Amazon Bedrock into msg.ProfileMap demonstrates that large-scale AI adoption doesn’t require complex infrastructure or specialized model training.

With…

2 days, 8 hours назад @ aws.amazon.com
Automate advanced agentic RAG pipeline with Amazon SageMaker AI
Automate advanced agentic RAG pipeline with Amazon SageMaker AI Automate advanced agentic RAG pipeline with Amazon SageMaker AI

Furthermore, management of high-performing RAG pipeline involves complex deployment, with teams often using manual RAG pipeline management, leading to inconsistent results, time-consuming troubleshooting, and difficulty in reproducing successful configurations.

The following diagram illustrates the architecture of a scalable RAG pipeline built on SageMaker AI, with MLflow experiment tracking seamlessly integrated at every stage and the RAG pipeline automated using SageMaker Pipelines.

A SageMaker Studio notebook for a development environment to experiment and automate the RAG pipelines with SageMaker managed MLflow and SageMaker Pipelines.

CI/CD for an agentic RAG pipelineNow we integrate t…

5 days, 7 hours назад @ aws.amazon.com
Unlock model insights with log probability support for Amazon Bedrock Custom Model Import
Unlock model insights with log probability support for Amazon Bedrock Custom Model Import Unlock model insights with log probability support for Amazon Bedrock Custom Model Import

PrerequisitesTo use log probability support with custom model import in Amazon Bedrock, you need:An active AWS account with access to Amazon BedrockA custom model created in Amazon Bedrock using the Custom Model Import feature after July 31, 2025, when the log probabilities support was releasedAppropriate AWS Identity and Access Management (IAM) permissions to invoke models through the Amazon Bedrock RuntimeIntroducing log probabilities support in Amazon BedrockWith this release, Amazon Bedrock now allows models imported using the Custom Model Import feature to return token-level log probabilities as part of the inference response.

Let’s walk through an example of invoking a custom model on…

5 days, 7 hours назад @ aws.amazon.com
Migrate from Anthropic’s Claude 3.5 Sonnet to Claude 4 Sonnet on Amazon Bedrock
Migrate from Anthropic’s Claude 3.5 Sonnet to Claude 4 Sonnet on Amazon Bedrock Migrate from Anthropic’s Claude 3.5 Sonnet to Claude 4 Sonnet on Amazon Bedrock

Anthropic’s Claude 4 Sonnet model has launched on Amazon Bedrock, marking a significant advancement in foundation model capabilities.

This post provides a systematic approach to migrating from Anthropic’s Claude 3.5 Sonnet to Claude 4 Sonnet on Amazon Bedrock.

The migration from Anthropic’s Claude Sonnet 3.5 Sonnet to Claude 4 Sonnet introduces capability and behavioral shifts that you can take advantage of:Increased context window – Anthropic’s Claude 4 Sonnet expands the context window from 200,000 tokens to 1 million tokens (beta).

PrerequisitesBefore you can start using Anthropic’s Claude 4 Sonnet model, you must enable access to these models in Amazon Bedrock.

For more details, refer t…

5 days, 7 hours назад @ aws.amazon.com
Enhance video understanding with Amazon Bedrock Data Automation and open-set object detection
Enhance video understanding with Amazon Bedrock Data Automation and open-set object detection Enhance video understanding with Amazon Bedrock Data Automation and open-set object detection

In this post, we explore how Amazon Bedrock Data Automation uses OSOD to enhance video understanding.

Amazon Bedrock Data Automation and video blueprints with OSODAmazon Bedrock Data Automation is a cloud-based service that extracts insights from unstructured content like documents, images, video and audio.

For more information about Amazon Bedrock Data Automation, see Automate video insights for contextual advertising using Amazon Bedrock Data Automation.

Example use casesIn this section, we explore some examples of different use cases for Amazon Bedrock Data Automation video blueprints using OSOD.

To learn more about Amazon Bedrock Data Automation video and audio analysis, see New Amazon …

6 days, 6 hours назад @ aws.amazon.com
How Skello uses Amazon Bedrock to query data in a multi-tenant environment while keeping logical boundaries
How Skello uses Amazon Bedrock to query data in a multi-tenant environment while keeping logical boundaries How Skello uses Amazon Bedrock to query data in a multi-tenant environment while keeping logical boundaries

In this post, we explain how Skello used Amazon Bedrock to create this AI assistant for end-users while maintaining customer data safety in a multi-tenant environment.

This led us to identify two key areas for improvement:Quick access to non-structured data – Our users needed to find specific information across various data types—employee records, scheduling data, attendance logs, and performance metrics.

Their ability to understand natural language and context, combined with their capability to generate structured outputs, made them perfectly suited for translating user questions into precise database queries.

The streamlined process makes sure your visualizations remain consistently clear…

6 days, 7 hours назад @ aws.amazon.com
Create a private workforce on Amazon SageMaker Ground Truth with the AWS CDK
Create a private workforce on Amazon SageMaker Ground Truth with the AWS CDK Create a private workforce on Amazon SageMaker Ground Truth with the AWS CDK

Solution overviewThis solution demonstrates how to create a private workforce and a coupled Amazon Cognito user pool and its dependent resources.

Amazon Cognito user pool app client – The user pool app client configures the client application that will interact with the user pool.

Amazon Cognito user pool domain – The user pool domain defines the domain name for the managed login experience provided by the user pool.

Private workforce – Implemented using a custom resource backing the CreateWorkforce API call, the private workforce is the foundation to manage labeling activities.

ConclusionThis solution demonstrates how to programmatically create a private workforce on SageMaker Ground Truth…

6 days, 7 hours назад @ aws.amazon.com
TII Falcon-H1 models now available on Amazon Bedrock Marketplace and Amazon SageMaker JumpStart
TII Falcon-H1 models now available on Amazon Bedrock Marketplace and Amazon SageMaker JumpStart TII Falcon-H1 models now available on Amazon Bedrock Marketplace and Amazon SageMaker JumpStart

We are excited to announce the availability of the Technology Innovation Institute (TII)’s Falcon-H1 models on Amazon Bedrock Marketplace and Amazon SageMaker JumpStart.

For help determining which deployment option—Amazon Bedrock Marketplace or SageMaker JumpStart—best suits your specific requirements, see Amazon Bedrock or Amazon SageMaker AI?

Deploy the model using the Amazon Bedrock Marketplace UITo deploy the model using Amazon Bedrock Marketplace, complete the following steps:On the Amazon Bedrock console, under Discover in the navigation pane, choose Model catalog.

To learn more about how IAM works with SageMaker AI, see Identity and Access Management for Amazon SageMaker AI.

To do so…

1 week назад @ aws.amazon.com
Oldcastle accelerates document processing with Amazon Bedrock
Oldcastle accelerates document processing with Amazon Bedrock Oldcastle accelerates document processing with Amazon Bedrock

The company was processing 100,000–300,000 ship tickets per month across more than 200 facilities.

This post explores how Oldcastle partnered with AWS to transform their document processing workflow using Amazon Bedrock with Amazon Textract.

The end-to-end workflow uses Amazon Simple Email Service (Amazon SES) to receive ship tickets, which are sent directly from drivers in the field.

This strategic move towards AI-powered document processing is positioning Oldcastle for improved efficiency and scalability in its operations.

Review your current manual document processing procedures and identify where intelligent document processing can help you automate these workflows for your business.

1 week назад @ aws.amazon.com
How London Stock Exchange Group is detecting market abuse with their AI-powered Surveillance Guide on Amazon Bedrock
How London Stock Exchange Group is detecting market abuse with their AI-powered Surveillance Guide on Amazon Bedrock How London Stock Exchange Group is detecting market abuse with their AI-powered Surveillance Guide on Amazon Bedrock

Accordingly, regulators place great emphasis on the ability of market surveillance teams to keep pace with evolving risk profiles.

Effective monitoring must cover all MiFID asset classes, markets and jurisdictions to detect market abuse, while also giving weight to participant relationships, and market surveillance systems must scale with volumes and volatility.

In this post, we explore how LSEG used Amazon Bedrock and Anthropic’s Claude foundation models to build an automated system that significantly improves the efficiency and accuracy of market surveillance operations.

Powered by Amazon Bedrock, the solution improves efficiency and enhances the quality and consistency of market abuse de…

1 week назад @ aws.amazon.com
NVIDIA
последний пост 2 часа назад
NVIDIA RAPIDS 25.08 Adds New Profiler for cuML, Updates to the Polars GPU Engine, Additional Algorithm Support, and More
NVIDIA RAPIDS 25.08 Adds New Profiler for cuML, Updates to the Polars GPU Engine, Additional Algorithm Support, and More NVIDIA RAPIDS 25.08 Adds New Profiler for cuML, Updates to the Polars GPU Engine, Additional Algorithm Support, and More

Performance comparison of the Polars GPU engine’s in-memory and streaming execution modes.

For more information on the Polars GPU streaming executor, visit our documentation.

Keep complex data like structs and string operations on the GPUThe Polars GPU engine now supports struct data in columns.

With the introduction of the cuml.accel profiler, developers now have powerful tools to diagnose and improve the performance of their machine learning code.

Updates to the Polars GPU engine like the streaming executor and expanded data type support enable efficient processing of large datasets, enhancing scalability and performance.

2 часа назад @ developer.nvidia.com
Meet the Streamlabs Streaming Assistant, Accelerated by NVIDIA RTX
Meet the Streamlabs Streaming Assistant, Accelerated by NVIDIA RTX Meet the Streamlabs Streaming Assistant, Accelerated by NVIDIA RTX

From Solo Producer to AI-Powered StreamsFirst previewed at the CES trade show in January, the Streamlabs Intelligent Streaming Agent, now in Streamlabs Desktop, acts as a cohost, producer and technical expert.

NVIDIA has steadily lowered the barrier of entry to livestreaming with enhancements to:Encoding: The NVIDIA encoder and optimizations to livestreaming apps enable high-quality streaming from a single NVIDIA RTX GPU system.

The NVIDIA encoder and optimizations to livestreaming apps enable high-quality streaming from a single NVIDIA RTX GPU system.

The Intelligent Streaming Assistant builds on these enhancements, making it easier for newcomers to start streaming and helping creators ele…

7 часов назад @ blogs.nvidia.com
The AI Makers: NVIDIA Partners in UK Advance Physical and Agentic AI, Robotics, Life Sciences and More
The AI Makers: NVIDIA Partners in UK Advance Physical and Agentic AI, Robotics, Life Sciences and More The AI Makers: NVIDIA Partners in UK Advance Physical and Agentic AI, Robotics, Life Sciences and More

The AI Makers: NVIDIA Partners in UK Advance Physical and Agentic AI, Robotics, Life Sciences and MoreThe U.K. is driving investments in sovereign AI, using the technology to advance industries like manufacturing, life sciences and more.

ElevenLabs develops AI voice technology that generates natural, ultrarealistic speech in over 70 languages using NVIDIA software and NVIDIA DGX B200 systems.

PolyAI deployed advanced conversational AI agents using NVIDIA Riva automatic speech recognition NVIDIA NIM microservices.

Speechmatics developed speech-to-text software using NVIDIA Dynamo-Triton and NVIDIA cuDNN software.

Synthesia built an enterprise-focused AI video platform using NVIDIA Dynamo-Tri…

1 day, 3 hours назад @ blogs.nvidia.com
Reaching Across the Isles: UK-LLM Brings AI to UK Languages With NVIDIA Nemotron
Reaching Across the Isles: UK-LLM Brings AI to UK Languages With NVIDIA Nemotron Reaching Across the Isles: UK-LLM Brings AI to UK Languages With NVIDIA Nemotron

The UK-LLM development team has tapped the 49-billion-parameter Llama Nemotron Super model and 9-billion-parameter Nemotron Nano model, post-training them on Welsh-language data.

Bydd darparwr cwmwl Deallusrwydd Artiffisial yn y DU, Nscale, yn sicrhau bod y model newydd ar gael i ddatblygwyr trwy ei ryngwyneb rhaglennu rhaglenni (API).

O’i gymharu ag ieithoedd fel Saesneg neu Sbaeneg, mae llai o ddata ffynhonnell ar gael yn y Gymraeg ar gyfer hyfforddiant Deallusrwydd Artiffisial.

“Mae’n un peth cael y gallu Deallusrwydd Artiffisial hwn yn bodoli yn y Gymraeg, ond mae’n beth arall ei wneud yn agored ac yn hygyrch i bawb,” meddai Prys.

Dewch i ddechrau arni gyda NVIDIA Nemotron.

4 days назад @ blogs.nvidia.com
Accelerate Protein Structure Inference Over 100x with NVIDIA RTX PRO 6000 Blackwell Server Edition
Accelerate Protein Structure Inference Over 100x with NVIDIA RTX PRO 6000 Blackwell Server Edition Accelerate Protein Structure Inference Over 100x with NVIDIA RTX PRO 6000 Blackwell Server Edition

The new NVIDIA RTX PRO 6000 Blackwell Server Edition GPU fundamentally changes this.

The NVIDIA RTX PRO 6000 Blackwell Server Edition GPU sets a new benchmark for protein structure inferenceWhy do speed and scale matter in protein structure prediction?

Validated on 20 CASP14 protein targets, these benchmarks establish RTX PRO 6000 Blackwell as a breakthrough solution for end-to-end protein structure prediction.

The power of RTX PRO 6000 Blackwell Server Edition is available today in NVIDIA RTX PRO Servers from global system makers as well as in cloud instances from leading cloud service providers.

Find a partner for NVIDIA RTX PRO 6000 Blackwell Server Edition and experience protein folding…

1 week назад @ developer.nvidia.com
Paint It Blackwell: GeForce RTX 5080 SuperPOD Rollout Begins
Paint It Blackwell: GeForce RTX 5080 SuperPOD Rollout Begins Paint It Blackwell: GeForce RTX 5080 SuperPOD Rollout Begins

GeForce NOW Blackwell RTX 5080-class SuperPODs are now rolling out, unlocking a new level of ultra high-performance, cinematic cloud gaming.

The new Install-to-Play feature is expanding the cloud library to nearly 4,500 games for Ultimate and Performance members.

With the NVIDIA Blackwell RTX upgrade, GeForce NOW brings GeForce RTX 5080-class performance to the cloud for the first time.

Look for the new “GeForce RTX 5080 Ready” row in the app for the full list of GeForce RTX 5080-optimized games, updated weekly with fresh additions.

Enjoy NVIDIA DLSS 4-fueled graphics and low-latency gameplay streaming from GeForce RTX 5080 gaming rigs in the cloud.

1 week назад @ blogs.nvidia.com
‘Safety First, Always,’ NVIDIA VP of Automotive Says, Unveiling the Future of AI-Defined Vehicles at IAA Mobility
‘Safety First, Always,’ NVIDIA VP of Automotive Says, Unveiling the Future of AI-Defined Vehicles at IAA Mobility ‘Safety First, Always,’ NVIDIA VP of Automotive Says, Unveiling the Future of AI-Defined Vehicles at IAA Mobility

Expanding the Ecosystem: Automotive Leaders Embrace Cloud-to-Car AIAutomotive leaders are embracing NVIDIA’s cloud-to-car AI platform to transform their next-generation vehicles.

Lucid headlined the IAA showcase with its all-electric Lucid Gravity SUV, which is accelerated by the NVIDIA DRIVE AGX platform, uses the NVIDIA Blackwell architecture and operates on NVIDIA DriveOS.

Global Tech Leaders Accelerate Software-Defined Vehicles With NVIDIABeyond automakers, technology leaders across the globe are building on NVIDIA AI to accelerate the development of software-defined vehicles.

Cerence is presenting its xUI AI assistant at IAA, built on CaLLM models and running on NVIDIA DRIVE AGX with D…

1 week, 1 day назад @ blogs.nvidia.com
NVIDIA Blackwell Ultra Sets the Bar in New MLPerf Inference Benchmark
NVIDIA Blackwell Ultra Sets the Bar in New MLPerf Inference Benchmark NVIDIA Blackwell Ultra Sets the Bar in New MLPerf Inference Benchmark

Less than half a year since its debut at NVIDIA GTC, the NVIDIA GB300 NVL72 rack-scale system — powered by the NVIDIA Blackwell Ultra architecture — set records on the new reasoning inference benchmark in MLPerf Inference v5.1, delivering up to 1.4x more DeepSeek-R1 inference throughput compared with NVIDIA Blackwell-based GB200 NVL72 systems.

Blackwell Ultra builds on the success of the Blackwell architecture, with the Blackwell Ultra architecture featuring 1.5x more NVFP4 AI compute and 2x more attention-layer acceleration than Blackwell, as well as up to 288GB of HBM3e memory per GPU.

NVIDIA TensorRT Model Optimizer software quantized DeepSeek-R1, Llama 3.1 405B, Llama 2 70B and Llama 3.…

1 week, 1 day назад @ blogs.nvidia.com
NVIDIA Partners With AI Infrastructure Ecosystem to Unveil Reference Design for Giga-Scale AI Factories
NVIDIA Partners With AI Infrastructure Ecosystem to Unveil Reference Design for Giga-Scale AI Factories NVIDIA Partners With AI Infrastructure Ecosystem to Unveil Reference Design for Giga-Scale AI Factories

At the AI Infrastructure Summit, NVIDIA’s Ian Buck introduces a reference design and partner-driven strategy to transform global infrastructure for high-performance, energy-efficient AI.

At this week’s AI Infrastructure Summit in Silicon Valley, NVIDIA’s VP of Accelerated Computing Ian Buck unveiled a bold new vision: the transformation of traditional data centers into fully integrated AI factories.

Already, NVIDIA is collaborating with scores of companies across every layer of the stack, from building design and grid integration to power, cooling and orchestration.

NVIDIA, along with a deep bench of industrial and technology partners, is reactivating decades of infrastructure expertise to …

1 week, 1 day назад @ blogs.nvidia.com
Get Started Using Generative AI for Content Creation With ComfyUI and NVIDIA RTX AI PCs
Get Started Using Generative AI for Content Creation With ComfyUI and NVIDIA RTX AI PCs Get Started Using Generative AI for Content Creation With ComfyUI and NVIDIA RTX AI PCs

State-of-the-Art AI Models, Accelerated by RTXIncredible models for AI content creation have been released in the last weeks, all of which are now available in ComfyUI.

The Blender plug-in — featured in the NVIDIA AI Blueprint for 3D-guided generative AI — allows users to connect 2D and 3D workflows.

Run Hyper-Optimized Models for NVIDIA RTX GPUs in ComfyUIThe best way to use NVIDIA RTX GPUs is with the TensorRT library — a high-performance deep learning inference engine designed to squeeze maximum speed out of the Tensor Cores in NVIDIA RTX GPUs.

Plug in to NVIDIA AI PC on Facebook, Instagram, TikTok and X — and stay informed by subscribing to the RTX AI PC newsletter.

Join NVIDIA’s Discor…

1 week, 1 day назад @ blogs.nvidia.com
Now Live: Europe’s First Exascale Supercomputer, JUPITER, Accelerates Climate Research, Neuroscience, Quantum Simulation
Now Live: Europe’s First Exascale Supercomputer, JUPITER, Accelerates Climate Research, Neuroscience, Quantum Simulation Now Live: Europe’s First Exascale Supercomputer, JUPITER, Accelerates Climate Research, Neuroscience, Quantum Simulation

“JUPITER marks the culmination of more than a decade of research and development,” said Thomas Lippert, director of the Jülich Supercomputing Centre.

“As the world’s most advanced and versatile exascale system, it represents a unique innovation, opening up completely new possibilities for science and industry in Europe.”“With JUPITER, Europe gains its most advanced AI supercomputer, built for large-scale simulation and AI, powered by NVIDIA Grace Hopper Superchips and Quantum-2 InfiniBand,” said Jensen Huang, founder and CEO of NVIDIA.

“JUPITER fuses high-performance computing and AI into a single architecture.

“Jupiter is the first European supercomputer, and the first outside the U.S., to…

1 week, 5 days назад @ blogs.nvidia.com
NVIDIA Pledges AI Education Funding for K-12 Programs
NVIDIA Pledges AI Education Funding for K-12 Programs NVIDIA Pledges AI Education Funding for K-12 Programs

NVIDIA today announced new AI education support for K-12 programs at a White House event to celebrate public-private partnerships that advance artificial intelligence education for America’s youth.

Pledging $25 million in support with AI education programs, NVIDIA is partnering with Study Fetch and CK-12 — two leading K-12 learning platforms — to tailor the NVIDIA Deep Learning Institute (DLI) and NVIDIA Academy content offerings to meet the instructional needs of U.S. K-12 classrooms.

The NVIDIA effort aligns with the White House executive order Advancing Artificial Intelligence Education for American Youth, announced in April.

Additionally, in support of the executive order, NVIDIA signed…

1 week, 6 days назад @ blogs.nvidia.com
AI On: 6 Ways AI Agents Are Raising Team Performance — and How to Measure It
AI On: 6 Ways AI Agents Are Raising Team Performance — and How to Measure It AI On: 6 Ways AI Agents Are Raising Team Performance — and How to Measure It

AI On: 6 Ways AI Agents Are Raising Team Performance — and How to Measure ItEditor’s note: This post is part of the AI On blog series, which explores the latest techniques and real-world applications of agentic AI, chatbots and copilots.

AI agents can help achieve and exceed efficiency goals as they learn, reason and adjust based on context and outcomes.

AI agents in IT operations offer:In fast-paced telco environments, agents can help manage networks by analyzing real-time performance indicators and predicting service failures.

Pegatron developed the PEGA AI Factory platform to accelerate the development of AI agents across the company by 400% in the last four years.

Stay up to date on age…

1 week, 6 days назад @ blogs.nvidia.com
Cloud Gaming to Reach New Heights: GeForce NOW’s Blackwell RTX Upgrade Begins Next Week
Cloud Gaming to Reach New Heights: GeForce NOW’s Blackwell RTX Upgrade Begins Next Week Cloud Gaming to Reach New Heights: GeForce NOW’s Blackwell RTX Upgrade Begins Next Week

Don’t miss a special early GFN Thursday next Wednesday as GeForce NOW begins lighting up the globe with GeForce RTX 5080-class power streaming from the cloud.

With this upgrade, cloud gaming is about to level up.

NVIDIA Blackwell RTX brings GeForce RTX 5080-powered streaming to GeForce NOW, unlocking the highest resolutions and frame rates ever in the cloud.

Look out for Hell Is Us, announced at Gamescom and the first AAA title to hit the cloud at launch, along with the eagerly awaited Hollow Knight: Silksong, part of the five new games in the cloud available this week.

Play it on GeForce NOW and experience every tank clash in crystal-clear detail and fluid motion, streamed effortlessly to …

1 week, 6 days назад @ blogs.nvidia.com
Scene It to Believe It: Populate 3D Worlds Quickly With NVIDIA AI Blueprints
Scene It to Believe It: Populate 3D Worlds Quickly With NVIDIA AI Blueprints Scene It to Believe It: Populate 3D Worlds Quickly With NVIDIA AI Blueprints

The new NVIDIA AI Blueprint for 3D object generation simplifies scene creation, and the new Microsoft TRELLIS NVIDIA NIM microservice accelerates performance by 20%, all powered by NVIDIA RTX.

NVIDIA AI Blueprints help address this challenge by providing sample workflows so users can skip the hypertechnical stages and quickly benefit from advanced generative AI techniques.

The new NVIDIA AI Blueprint for 3D object generation gives users a pipeline to automate the prototyping process.

Ready, Aim, DeployGet started with the NVIDIA AI Blueprint for 3D object generation by following these instructions:Load up the blueprint, including models.

Explore a variety of expertly curated AI Blueprints, …

2 weeks назад @ blogs.nvidia.com
Facebook
последний пост 2 weeks, 1 day назад
A New Ranking Framework for Better Notification Quality on Instagram
A New Ranking Framework for Better Notification Quality on Instagram A New Ranking Framework for Better Notification Quality on Instagram

We’ve introduced a diversity-aware notification ranking framework to reduce uniformity and deliver a more varied and engaging mix of notifications.

Instagram leverages machine learning (ML) models to decide who should get a notification, when to send it, and what content to include.

To tackle this, we’ve introduced a diversity-aware notification ranking framework that helps deliver more diverse, better curated, and less repetitive notifications.

Introducing Instagram’s Diversity-Aware Notification Ranking FrameworkInstagram’s diversity-aware notification ranking framework is designed to enhance the notification experience by balancing the predicted potential for user engagement with the nee…

2 weeks, 1 day назад @ engineering.fb.com
Federation Platform and Privacy Waves: How Meta distributes compliance-related tasks at scale
Federation Platform and Privacy Waves: How Meta distributes compliance-related tasks at scale Federation Platform and Privacy Waves: How Meta distributes compliance-related tasks at scale

We’re exploring Meta’s Federation Platform, a scalable set of tools for managing compliance-related tasks, along with Privacy Waves, our method for batching these tasks and ensuring accountability.

To facilitate this, we developed the Federation Platform and Privacy Waves program:The Federation Platform breaks down large compliance-related initiatives into smaller, manageable workstreams.

Internal surveys reveal significantly higher positive sentiment for Privacy Waves tasks compared to ad-hoc tasks.

Step 6: Reporting and recognitionThe centralized distribution of tasks via Federation Platform and Privacy Waves streamline operational effectiveness and verification.

Expansions for the Federa…

1 month, 1 week назад @ engineering.fb.com
Diff Risk Score: AI-driven risk-aware software development
Diff Risk Score: AI-driven risk-aware software development Diff Risk Score: AI-driven risk-aware software development

Built on a fine-tuned Llama LLM, DRS evaluates code changes and metadata to produce a risk score and highlight potentially risky code snippets.

Production risk was one of the areas we tackled first.

The demand to build such features also led us to build the Risk Awareness Platform to provide risk analysis APIs and tool integrations.

We believe code risk can play a significant role in improving this tradeoff, so we will build more risk-aware features while improving their quality.

While code changes cause the plurality of SEVs at Meta, configuration changes are another large category.

1 month, 1 week назад @ engineering.fb.com
Building a human-computer interface for everyone
Building a human-computer interface for everyone Building a human-computer interface for everyone

What if you could control any device using only subtle hand movements?

New research from Meta’s Reality Labs is pointing even more firmly toward wrist-worn devices using surface electromyography (sEMG) becoming the future of human-computer interaction.

Generalization has been one of the most significant challenges in the field of human-computer interaction (HCI).

They discuss the road to creating a first-of-its-kind, generic human-computer neuromotor interface, what happens when software and hardware engineering meet neuroscience, and more!

And if you’re interested in learning more about career opportunities at Meta visit the Meta Careers page.

1 month, 2 weeks назад @ engineering.fb.com
Using AI to make lower-carbon, faster-curing concrete
Using AI to make lower-carbon, faster-curing concrete Using AI to make lower-carbon, faster-curing concrete

But concrete suppliers can utilize AI to develop and scale innovative concrete mixes as drop-in replacements, accelerating the discovery and integration of sustainable materials for large-scale use.

Meta’s AI model for green concreteDesigning concrete formulas is a complex, multi-objective problem.

To accelerate the concrete mix design process, Meta developed an AI model for sustainable concrete using BoTorch and Ax, Meta’s open-source software for Bayesian optimization and adaptive experimentation, respectively.

Our AI pipeline consists of the workflow of generating baseline data, training an AI model, using it to develop and validate new hypotheses, and then improving the baseline data an…

2 months назад @ engineering.fb.com
Accelerating GPU indexes in Faiss with NVIDIA cuVS
Accelerating GPU indexes in Faiss with NVIDIA cuVS Accelerating GPU indexes in Faiss with NVIDIA cuVS

Meta and NVIDIA collaborated to accelerate vector search on GPUs by integrating NVIDIA cuVS into Faiss v1.10 , Meta’s open source library for similarity search.

In its latest release, Faiss 1.10.0 officially includes these algorithms from the NVIDIA cuVS library.

Faiss 1.10.0 also includes a new conda package that unlocks the ability to choose between the classic Faiss GPU implementations and the newer NVIDIA cuVS algorithms, making it easy for users to switch between GPU and CPU.

Build time (95% recall@10)Index Embeddings100M x 96(seconds) Embeddings5M x 1536(seconds) Faiss Classic Faiss cuVS Faiss Classic Faiss cuVS Faiss Classic Faiss cuVS IVF Flat IVF Flat 101.4 37.9 (2.7x) 24.4 15.2 (1…

4 months, 1 week назад @ engineering.fb.com
Introducing AutoPatchBench: A Benchmark for AI-Powered Security Fixes
Introducing AutoPatchBench: A Benchmark for AI-Powered Security Fixes Introducing AutoPatchBench: A Benchmark for AI-Powered Security Fixes

We are introducing AutoPatchBench, a benchmark for the automated repair of vulnerabilities identified through fuzzing.

As illustrated, fixing a fuzzing crash involves:Analyzing the crash stack trace and the target code.

Inside AutoPatchBenchWe’re making AutoPatchBench publicly available as part of CyberSecEval 4 to encourage community collaboration in tackling the challenge of automating fuzzing crash repairs.

Then we find the lowest common ancestor (LCA) across all pairs of stacktraces offered by the groundtruth patch and the LLM patch.

As an experienced Security Engineer at Meta, your task is to address the following security-critical fuzzing crash.

4 months, 3 weeks назад @ engineering.fb.com
Building multimodal AI for Ray-Ban Meta glasses
Building multimodal AI for Ray-Ban Meta glasses Building multimodal AI for Ray-Ban Meta glasses

With our Ray-Ban Meta glasses, multimodal AI helps the glasses see what the wearer is seeing.

This means anyone wearing Ray-Ban Meta glasses can ask them questions about what they’re looking at.

On this episode of the Meta Tech Podcast, meet Shane, a research scientist at Meta who has spent the last seven years focusing on computer vision and multimodal AI for wearables.

Shane sits down with Pascal Hartig to share how his team is building foundational models for the Ray-Ban Meta glasses.

They talk about the unique challenges of AI glasses and pushing the boundaries of AI-driven wearable technology.

6 months, 2 weeks назад @ engineering.fb.com
Revolutionizing software testing: Introducing LLM-powered bug catchers
Revolutionizing software testing: Introducing LLM-powered bug catchers Revolutionizing software testing: Introducing LLM-powered bug catchers

WHAT IT ISMeta’s Automated Compliance Hardening (ACH) tool is a system for mutation-guided, LLM-based test generation.

Traditionally, automated test generation techniques sought merely to increase code coverage.

LLM-based test generation and LLM-based mutant generation are not new, but this is the first time they’ve been combined and deployed in large-scaled industrial systems.

WHAT’S NEXTOur novel approach combines LLM-based test generation and mutant generation to help automate complex technical organizational workflows in this space.

READ THE PAPERMutation-Guided LLM-based Test Generation at Meta

7 months, 2 weeks назад @ engineering.fb.com
Meta Andromeda: Supercharging Advantage+ automation with the next-gen personalized ads retrieval engine
Meta Andromeda: Supercharging Advantage+ automation with the next-gen personalized ads retrieval engine Meta Andromeda: Supercharging Advantage+ automation with the next-gen personalized ads retrieval engine

Unlocking advertiser value through industry-leading ML innovationMeta Andromeda is a personalized ads retrieval engine that leverages the NVIDIA Grace Hopper Superchip, to enable cutting edge ML innovation in the Ads retrieval stage to drive efficiency and advertiser performance.

Its deployment across Instagram and Facebook applications has achieved +6% recall improvement to the retrieval system, delivering +8% ads quality improvement on selected segments.

Andromeda is designed to maximize ads performance by utilizing the exponential growth in volume of eligible ads available to the retrieval stage.

The design is optimized for AI hardware, minimizing memory bandwidth bottlenecks and enablin…

9 months, 2 weeks назад @ engineering.fb.com
Uber Engineering
последний пост None
neptune.ai neptune.ai
последний пост 1 month, 1 week назад
Understanding Prompt Injection: Risks, Methods, and Defense Measures
Understanding Prompt Injection: Risks, Methods, and Defense Measures Understanding Prompt Injection: Risks, Methods, and Defense Measures

Prompt injection 101: When prompts go rogueThe term ‘Prompt Injection’ comes from SQL injection attacks.

There is another claim of the independent discovery of prompt injection attacks, which suggests that Riley Goodside publicly exhibited a prompt injection in a tweet back in September 2022.

The indirect prompt injection attacks are classified into active, passive, user-driven and virtual prompt attacks.

Virtual prompt injection attacksThis injection type is closely related to passive injection attacks previously described.

Prompt injection: current challenges & lessons learnedThe arms race between prompt injection attacks and defenses is a challenge for researchers, developers, and users.

1 month, 1 week назад @ neptune.ai
SabiYarn: Advancing Low-Resource Languages With Multitask NLP Pre-Training [Paper Reflections]
SabiYarn: Advancing Low-Resource Languages With Multitask NLP Pre-Training [Paper Reflections] SabiYarn: Advancing Low-Resource Languages With Multitask NLP Pre-Training [Paper Reflections]

This simple idea avoids computing loss on input prompt tokens the model already knows.

Prompt tokens are (too) expensive in low-resource settingsDuring pre-training, LLMs are trained in causal language modeling through a next-token prediction task.

=> Mo fẹ́ràn ìrẹsì,” the model is trained to predict every token, from the prompt to the actual answer:Step Prompt Next token 1 Translate English Static prompt 2 Translate English to Static prompt 3 Translate English to Yoruba: Static prompt 4 Translate English to Yoruba: I 5 Translate English to Yoruba: I love 6 Translate English to Yoruba: I love rice.

This is straightforward to implement in PyTorch by masking out the prompt tokens in the label …

1 month, 2 weeks назад @ neptune.ai
How to Monitor, Diagnose, and Solve Gradient Issues in Foundation Models
How to Monitor, Diagnose, and Solve Gradient Issues in Foundation Models How to Monitor, Diagnose, and Solve Gradient Issues in Foundation Models

What gradient issues occur during foundation model training?

During training, gradient descent updates model parameters by computing the gradients of the loss function via forward and backward passes.

The green line corresponds to a learning rate of 10, while the orange line has a learning rate of 0.1.

The gradient norm for the orange line with LR = 0.1 is very high in the first steps, while the gradient norm of the green line with LR = 10 diverges to NaN after a few steps.

Techniques for gradient stabilizationMonitoring gradient norms and training loss provides insights into the learning dynamics of the foundation models.

2 months, 2 weeks назад @ neptune.ai
STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning [Paper Reflection]
STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning [Paper Reflection] STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning [Paper Reflection]

Unstructured pruning removes individual weights, while structured pruning removes entire model components.

In the context of MoEs, as expert structures from training MoEs correspond to such patterns, pruning experts is a natural fit for structured pruning.

Thus, structured pruning does not significantly decrease kurtosis, leaving plenty of margin for unstructured pruning.

Since structured pruning primarily reduces architectural redundancy rather than reshaping the underlying weight distribution, our two-phase approach—leveraging unstructured pruning after structured pruning—outperforms unstructured-only pruning.

Since STUN does not make any assumption about base MoE models, it is generaliza…

3 months, 2 weeks назад @ neptune.ai
Evaluating RAG Pipelines
Evaluating RAG Pipelines Evaluating RAG Pipelines

Related Building LLM Applications With Vector Databases Read moreDimensions of RAG evaluationEvaluating a RAG pipeline means assessing its behavior across three dimensions:1.

The evaluation of the RAG pipeline is a multi-step process, starting with creating an evaluation dataset, then evaluating the individual components (retriever, generator, etc.

Curating an evaluation datasetThe first step in the RAG evaluation process is the creation of a ground truth dataset.

MAP considers both the presence and rank of relevant chunks but fails to consider the relative position of relevant chunks.

However, not all retrieved chunks are equally relevant and sometimes, the most relevant chunks might not b…

4 months назад @ neptune.ai
How to Build an LLM Agent With AutoGen: Step-by-Step Guide
How to Build an LLM Agent With AutoGen: Step-by-Step Guide How to Build an LLM Agent With AutoGen: Step-by-Step Guide

The efficiency of an LLM agent depends on the selection of the right LLM model.

In this article, we’ll introduce the fundamental building blocks of LLM agents and then walk through the process of building an LLM agent step by step.

Building an LLM agent from scratchIn the following, we’ll build a trip-planning LLM agent from scratch.

Using AutoGen’s OpenAI Assistant Agent, we instantiate a prompt that the LLM agent will follow throughout its interactions.

Related Ethical Considerations and Best Practices in LLM Development Read moreEnhancing LLM agent performanceWhile architecting an LLM agent, you have to keep in mind opportunities to improve the performance of the LLM agent.

6 months назад @ neptune.ai
Bayesian Deep Learning is Needed in the Age of Large-Scale AI [Paper Reflection]
Bayesian Deep Learning is Needed in the Age of Large-Scale AI [Paper Reflection] Bayesian Deep Learning is Needed in the Age of Large-Scale AI [Paper Reflection]

Moreover, I will make the case for why Bayesian deep learning can satisfy these desiderata and briefly review recent advances in the field.

The case for Bayesian deep learningBayesian deep learning uses the foundational statistical principles of Bayesian inference to endow deep learning systems with the ability to make probabilistic predictions.

However, Bayesian deep learning is unfortunately still not as easy to use as standard deep learning, which you can do these days in a few lines of PyTorch code.

If you want to use a Bayesian deep learning model, first, you have to think about specifying the prior.

If this is the case, trying out Bayesian deep learning is likely worth your while.

6 months, 1 week назад @ neptune.ai
Introduction to State Space Models as Natural Language Models
Introduction to State Space Models as Natural Language Models Introduction to State Space Models as Natural Language Models

TL;DR State Space Models (SSMs) use first-order differential equations to represent dynamic systems.

Understanding state space modelsBefore exploring how State Space Models (SSMs) can function as components of large language models (LLMs), we’ll examine their foundational mechanics.

State space models for natural language processingState Space Models (SSMs), long established in time series analysis, have been utilized as trainable sequence models for decades.

Linear state space layers (LSSLs)So far, we’ve seen that State Space Models are efficient sequence models.

Improvements on the state matrix AIn the previous section, we explored how the original LSSL relied on a fixed, predefined form …

6 months, 2 weeks назад @ neptune.ai
Ethical Considerations and Best Practices in LLM Development
Ethical Considerations and Best Practices in LLM Development Ethical Considerations and Best Practices in LLM Development

To keep data secure throughout the model’s lifecycle, implement these practices: data anonymization, secure model serving and privacy penetration tests.

For example, a recruitment LLM favoring male applicants due to biased training data reflects a harmful bias that requires correction.

Monitor bias continuouslyMitigating bias isn’t a one-time effort—it requires ongoing monitoring to ensure that your LLM remains fair and effective across iterations.

Although these contributions are publicly available, the move opened up debates about the ethics of reusing community-contributed content for proprietary AI training.

Best practices for ethical LLM developmentNavigating the regulatory landscape r…

6 months, 3 weeks назад @ neptune.ai
Open LLMs are Necessary For Current Private Adaptations and Outperform Their Closed Alternatives [Paper Reflection]
Open LLMs are Necessary For Current Private Adaptations and Outperform Their Closed Alternatives [Paper Reflection] Open LLMs are Necessary For Current Private Adaptations and Outperform Their Closed Alternatives [Paper Reflection]

While much of the discussion around LLMs centers on task and computational performance, in our paper Open LLMs are Necessary for Current Private Adaptations and Outperform their Closed Alternatives, we focus on the privacy implications of using Open and Closed LLMs.

The threat space in adapting LLMs to private dataThe adaptation of Closed LLMs to private datasets introduces a multifaceted threat space.

Related Zero-Shot and Few-Shot Learning with LLMs Read morePrivate adaptation methods for Open LLMsUnlike Closed LLMs, Open LLMs provide access to their parameters, enabling more flexible and parameter-centric private adaptation methods.

Performance: All adaptation methods for Closed LLMs ach…

6 months, 4 weeks назад @ neptune.ai
Learnings From Teams Training Large-Scale Models: Challenges and Solutions For Monitoring at Hyperscale
Learnings From Teams Training Large-Scale Models: Challenges and Solutions For Monitoring at Hyperscale Learnings From Teams Training Large-Scale Models: Challenges and Solutions For Monitoring at Hyperscale

“What is not measured, cannot be improved.” This quote has become a guiding principle for teams training foundation models.

During my talk at NeurIPS, I broke down five key lessons learned from teams facing large-scale model training and monitoring.

Waabi’s teams, running large-scale ML experiments, needed a way to organize and share their experiment data efficiently.

Visualizing large datasetsWe generally do not think of dataset visualization as part of experiment monitoring.

Moving forwardThe path to efficient hyperscale training lies in combining robust monitoring, advanced debugging tools, and comprehensive experiment tracking.

7 months назад @ neptune.ai
Mixture of Experts LLMs: Key Concepts Explained
Mixture of Experts LLMs: Key Concepts Explained Mixture of Experts LLMs: Key Concepts Explained

TL;DR Mixture of Experts (MoE) is a type of neural network architecture that employs sub-networks (experts) to process specific input parts.

This is the key idea behind Mixture of Expert LLMs.

The Switch-Language Transformer, Mixtral, GLaM, GShard, and DeepSeekMoE are Mixture of Experts LLMs (MoEs), which require only executing a portion of the model’s computational graph during inference.

Optimization strategies for MoE LLMs are discussed comprehensively in the papers introducing the Switch Transformer, GShard, and GLaM.

Mixture of Experts (MoE) is an approach to scaling LLMs to trillions of parameters with conditional computation while avoiding exploding computational costs.

7 months, 1 week назад @ neptune.ai
Hyperparameter Optimization For LLMs: Advanced Strategies
Hyperparameter Optimization For LLMs: Advanced Strategies Hyperparameter Optimization For LLMs: Advanced Strategies

Advanced hyperparameter optimization strategies, like population-based training, Bayesian optimization, and adaptive LoRA, promise to balance computational effort and outcome.

To avoid this, learning rate schedules for LLMs start with a small learning rate and slowly ramp it up to its maximum value.

Can we use traditional machine learning hyperparameter optimization methods for LLMs?

| Modified based on: sourceHands-on: LLM hyperparameter optimization with neptune.aiOptuna is a framework for optimizing hyperparameter search using Bayesian optimization.

See the docs or watch a short product demo (2 min)Play with a live Neptune Scale projectRequest your early accessWhat’s next in LLM hyperpar…

7 months, 2 weeks назад @ neptune.ai
Multimodal Large Language Models
Multimodal Large Language Models Multimodal Large Language Models

TL;DR Multimodal Large Language Models (MLLMs) process data from different modalities like text, audio, image, and video.

This article explores Multimodal Large Language Models, exploring their core functionalities, challenges, and potential for various machine-learning domains.

Let’s break down the concept of Multimodal Large Language Models (MLLMs) by first understanding the terms “modal” and “multimodal:”“Modal” refers to a particular way of communicating or perceiving information.

| SourceGoogle: PaLM-EGoogle developed an embodied language model, PaLM-E, to incorporate continuous sensor modalities into language models and establish the link between words and perceptions.

Improving how t…

7 months, 3 weeks назад @ neptune.ai
How to Build and Evaluate a RAG System Using LangChain, Ragas, and neptune.ai
How to Build and Evaluate a RAG System Using LangChain, Ragas, and neptune.ai How to Build and Evaluate a RAG System Using LangChain, Ragas, and neptune.ai

In this guide, we’ll show you how to build a RAG system using the LangChain framework, evaluate its performance using Ragas, and track your experiments with neptune.ai.

Part 1: Building a baseline RAG system with LangChainIn the first part of this guide, we’ll use LangChain to build a RAG system for the blog posts in the LLMOps category on Neptune’s blog.

Ragas works smoothly with LangChain, making it a great choice for evaluating our RAG system.

Step 1: Generate a RAG evaluation datasetAn evaluation set for RAG tasks is similar to a question-answering task dataset.

Step 2: Choose RAG evaluation metricsAs mentioned earlier, Ragas offers both LLM-based and non-LLM-based metrics for RAG syste…

8 months, 3 weeks назад @ neptune.ai
▶️ YouTube
Yannic Kilcher Yannic Kilcher
последний пост 1 month, 1 week назад
AGI is not coming!
AGI is not coming! AGI is not coming!

jack Morris's investigation into GPT-OSS training data https://x.com/jxmnop/status/1953899426075816164?t=3YRhVQDwQLk2gouTSACoqA&s=09

1 month, 1 week назад @ youtube.com
Context Rot: How Increasing Input Tokens Impacts LLM Performance (Paper Analysis)
Context Rot: How Increasing Input Tokens Impacts LLM Performance (Paper Analysis) Context Rot: How Increasing Input Tokens Impacts LLM Performance (Paper Analysis)

Paper: https://research.trychroma.com/context-rot Abstract:

Large Language Models (LLMs) are typically presumed to process context uniformly—that is, the model should handle the 10,000th token just as reliably as the 100th. However, in practice, this assumption does not hold. We observe that model performance varies significantly as input length changes, even on simple tasks.

In this report, we evaluate 18 LLMs, including the state-of-the-art GPT-4.1, Claude 4, Gemini 2.5, and Qwen3 models. Our results reveal that models do not use their context uniformly; instead, their performance grows increasingly unreliable as input length grows. Authors: Kelly Hong, Anton Troynikov, Jeff Huber Links:

1 month, 3 weeks назад @ youtube.com
Energy-Based Transformers are Scalable Learners and Thinkers (Paper Review)
Energy-Based Transformers are Scalable Learners and Thinkers (Paper Review) Energy-Based Transformers are Scalable Learners and Thinkers (Paper Review)

Paper: https://arxiv.org/abs/2507.02092

Code: https://github.com/alexiglad/EBT

Website: https://energy-based-transformers.github.io/ Abstract:

Inference-time computation techniques, analogous to human System 2 Thinking, have recently become popular for improving model performances. However, most existing approaches suffer from several limitations: they are modality-specific (e.g., working only in text), problem-specific (e.g., verifiable domains like math and coding), or require additional supervision/training on top of unsupervised pretraining (e.g., verifiers or verifiable rewards). In this paper, we ask the question "Is it possible to generalize these System 2 Thinking approaches, and de…

2 months назад @ youtube.com
On the Biology of a Large Language Model (Part 2)
On the Biology of a Large Language Model (Part 2) On the Biology of a Large Language Model (Part 2)

An in-depth look at Anthropic's Transformer Circuit Blog Post

Part 1 here: https://youtu.be/mU3g2YPKlsA

Discord here: https;//ykilcher.com/discord https://transformer-circuits.pub/2025/attribution-graphs/biology.html Abstract:

We investigate the internal mechanisms used by Claude 3.5 Haiku — Anthropic's lightweight production model — in a variety of contexts, using our circuit tracing methodology. Authors:

Jack Lindsey†, Wes Gurnee*, Emmanuel Ameisen*, Brian Chen*, Adam Pearce*, Nicholas L. Turner*, Craig Citro*,

David Abrahams, Shan Carter, Basil Hosmer, Jonathan Marcus, Michael Sklar, Adly Templeton,

Trenton Bricken, Callum McDougall◊, Hoagy Cunningham, Thomas Henighan, Adam Jermyn, Andy …

4 months, 2 weeks назад @ youtube.com
On the Biology of a Large Language Model (Part 1)
On the Biology of a Large Language Model (Part 1) On the Biology of a Large Language Model (Part 1)

An in-depth look at Anthropic's Transformer Circuit Blog Post https://transformer-circuits.pub/2025/attribution-graphs/biology.html Abstract:

We investigate the internal mechanisms used by Claude 3.5 Haiku — Anthropic's lightweight production model — in a variety of contexts, using our circuit tracing methodology. Authors:

Jack Lindsey†, Wes Gurnee*, Emmanuel Ameisen*, Brian Chen*, Adam Pearce*, Nicholas L. Turner*, Craig Citro*,

David Abrahams, Shan Carter, Basil Hosmer, Jonathan Marcus, Michael Sklar, Adly Templeton,

Trenton Bricken, Callum McDougall◊, Hoagy Cunningham, Thomas Henighan, Adam Jermyn, Andy Jones, Andrew Persic, Zhenyi Qi, T. Ben Thompson,

Sam Zimmerman, Kelley Rivoire, Thom…

5 months, 2 weeks назад @ youtube.com
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models (Paper Explained)
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models (Paper Explained) DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models (Paper Explained)

#deepseek #llm #reinforcementlearning GRPO is one of the core advancements used in Deepseek-R1, but was introduced already last year in this paper that uses a combination of new RL techniques and iterative data collection to achieve remarkable performance on mathematics benchmarks with just a 7B model. Paper: https://arxiv.org/abs/2402.03300 Abstract:

Mathematical reasoning poses a significant challenge for language models due to its complex and structured nature. In this paper, we introduce DeepSeekMath 7B, which continues pre-training DeepSeek-Coder-Base-v1.5 7B with 120B math-related tokens sourced from Common Crawl, together with natural language and code data. DeepSeekMath 7B has achie…

7 months, 3 weeks назад @ youtube.com
Traditional Holiday Live Stream
Traditional Holiday Live Stream Traditional Holiday Live Stream

https://ykilcher.com/discord Links:

TabNine Code Completion (Referral): http://bit.ly/tabnine-yannick

YouTube: https://www.youtube.com/c/yannickilcher

Twitter: https://twitter.com/ykilcher

Discord: https://discord.gg/4H8xxDF

BitChute: https://www.bitchute.com/channel/yannic-kilcher

Minds: https://www.minds.com/ykilcher

Parler: https://parler.com/profile/YannicKilcher

LinkedIn: https://www.linkedin.com/in/yannic-kilcher-488534136/

BiliBili: https://space.bilibili.com/1824646584 If you want to support me, the best thing to do is to share out the content :) If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):

SubscribeStar: https:/…

8 months, 3 weeks назад @ youtube.com
Byte Latent Transformer: Patches Scale Better Than Tokens (Paper Explained)
Byte Latent Transformer: Patches Scale Better Than Tokens (Paper Explained) Byte Latent Transformer: Patches Scale Better Than Tokens (Paper Explained)

#tokenization #llm #meta This paper does away with tokenization and creates an LLM architecture that operates on dynamically sized "patches" instead of tokens. By controlling the patch size, they gain a level of control over the tradeoff between model size and FLOPs and use that to achieve more favorable scaling behavior than classically tokenized LLMs. Paper: https://ai.meta.com/research/publications/byte-latent-transformer-patches-scale-better-than-tokens/

Code: https://github.com/facebookresearch/blt Abstract:

We introduce the Byte Latent Transformer (BLT), a new byte-level LLM architecture that, for the first time, matches tokenization-based LLM performance at scale with significant imp…

8 months, 3 weeks назад @ youtube.com
Safety Alignment Should be Made More Than Just a Few Tokens Deep (Paper Explained)
Safety Alignment Should be Made More Than Just a Few Tokens Deep (Paper Explained) Safety Alignment Should be Made More Than Just a Few Tokens Deep (Paper Explained)

This paper demonstrates in a series of experiments that current safety alignment techniques of LLMs, as well as corresponding jailbreaking attacks, are in large part focusing on modulating the distribution of the first few tokens of the LLM response. Paper: https://openreview.net/forum?id=6Mxhg9PtDE&s=09 Abstract:

The safety alignment of current Large Language Models (LLMs) is vulnerable. Simple attacks, or even benign fine-tuning, can jailbreak aligned models. We note that many of these vulnerabilities are related to a shared underlying issue: safety alignment can take shortcuts, wherein the alignment adapts a model's generative distribution primarily over only its very first few output to…

9 months, 1 week назад @ youtube.com
TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters (Paper Explained)
TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters (Paper Explained) TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters (Paper Explained)

A deep dive into the TokenFormer and an opinion about its impact, novelty, and relation to prior work. Paper: https://arxiv.org/abs/2410.23168 Abstract:

Transformers have become the predominant architecture in foundation models due to their excellent performance across various domains. However, the substantial cost of scaling these models remains a significant concern. This problem arises primarily from their dependence on a fixed number of parameters within linear projections. When architectural modifications (e.g., channel dimensions) are introduced, the entire model typically requires retraining from scratch. As model sizes continue growing, this strategy results in increasingly high com…

9 months, 4 weeks назад @ youtube.com
Henry AI Labs Henry AI Labs
последний пост None
3blue1brown 3blue1brown
последний пост 1 week, 3 days назад
Incomplete open cubes
Incomplete open cubes Incomplete open cubes

Full video: https://youtu.be/_BrFKp-U8GI

1 week, 3 days назад @ youtube.com
Exploration & Epiphany
Exploration & Epiphany Exploration & Epiphany

Sol Lewitt's "Incomplete Open Cubes" and rediscovering Burnside's lemma in group theory

This is a guest video by Paul Dancstep: https://youtu.be/JEeM2ABUMoo

Instead of sponsored ad reads, these lessons are funded directly by viewers: https://3b1b.co/support

An equally valuable form of support is to share the videos.

Home page: https://www.3blue1brown.com Thanks to the Wadsworth Atheneum for granting permission to use LeWitt's notebooks. Talks by Paul you can find online: What is Category Theory:

https://www.youtube.com/watch?app=desktop&v=eXBwU9ieLL0 How to Predict Eclipses:

https://www.exploratorium.edu/eclipse/video/how-predict-eclipses Theo Jansen's Strandbeests

https://www.youtube.com/w…

1 week, 3 days назад @ youtube.com
Simulating Phase Change | Guest video by Vilas Winstein
Simulating Phase Change | Guest video by Vilas Winstein Simulating Phase Change | Guest video by Vilas Winstein

Deriving the Boltzmann formula, defining temperature, and simulating liquid/vapor.

@SpectralCollective has the second part: https://youtu.be/yEcysu5xZH0

You can play with a simulation of this model here: https://vilas.us/simulations/liquidvapor/

These lessons are funded directly by viewers: https://3b1b.co/support

Home page: https://www.3blue1brown.com Notes from Vilas:

1) This open problem is to prove the ergodicity of the deterministic dynamical systems that are used to model the molecule-level physics. A good example of such a dynamical system is the box with particles evolving according to Newton's laws with elastic collisions, like in the video. 2) This video assumes that all probabili…

2 weeks, 6 days назад @ youtube.com
How AI connects text and images
How AI connects text and images How AI connects text and images

From this guest video by @WelchLabsVideo on how diffusion models work: https://youtu.be/iv-5mZ_9CPY

3 weeks, 6 days назад @ youtube.com
The AI that solved IMO Geometry Problems | Guest video by @Aleph0
The AI that solved IMO Geometry Problems | Guest video by @Aleph0 The AI that solved IMO Geometry Problems | Guest video by @Aleph0

How AlphaGeometry combines logic and intuition.

Share stories about AI in math research for an upcoming video: https://forms.gle/gr9aZVdUrW5T3yDg9

Instead of sponsored ad reads, these lessons are funded directly by viewers: https://3b1b.co/support

An equally valuable form of support is to simply share the videos.

Home page: https://www.3blue1brown.com AlphaGeometry announcement:

https://deepmind.google/discover/blog/alphageometry-an-olympiad-level-ai-system-for-geometry/ Similar open-source model, Newclid, by Harmonic:

https://harmonic.fun/news#blog-post-geometry Timestamps:

0:00 - What's surprising

1:33 - Solve without AI

7:10 - Where AI comes in

12:48 - Grant's comments ------------------…

1 month назад @ youtube.com
But how do AI videos actually work? | Guest video by @WelchLabsVideo
But how do AI videos actually work? | Guest video by @WelchLabsVideo But how do AI videos actually work? | Guest video by @WelchLabsVideo

Diffusion models, CLIP, and the math of turning text into images

Welch Labs Book: https://www.welchlabs.com/resources/imaginary-numbers-book Sections

0:00 - Intro

3:37 - CLIP

6:25 - Shared Embedding Space

8:16 - Diffusion Models & DDPM

11:44 - Learning Vector Fields

22:00 - DDIM

25:25 Dall E 2

26:37 - Conditioning

30:02 - Guidance

33:39 - Negative Prompts

34:27 - Outro

35:32 - About guest videos + Grant’s Reaction Special Thanks to:

Jonathan Ho - Jonathan is the Author of the DDPM paper and the Classifier Free Guidance Paper.

https://arxiv.org/pdf/2006.11239

https://arxiv.org/pdf/2207.12598 Preetum Nakkiran - Preetum has an excellent introductory diffusion tutorial:

https://arxiv.org/pdf/24…

1 month, 3 weeks назад @ youtube.com
Summer of Math Exposition #4 | Teachers, I'd love to hear from you
Summer of Math Exposition #4 | Teachers, I'd love to hear from you Summer of Math Exposition #4 | Teachers, I'd love to hear from you

Make a math explainer, get feedback, and receive prizes: https://some.3b1b.co

Instead of sponsored ad reads, these lessons are funded directly by viewers: https://3b1b.co/support

An equally valuable form of support is to simply share the videos. ------------------ These animations are largely made using a custom Python library, manim. See the FAQ comments here:

https://3b1b.co/faq#manim

https://github.com/3b1b/manim

https://github.com/ManimCommunity/manim/ All code for specific videos is visible here:

https://github.com/3b1b/videos/ The music is by Vincent Rubinetti.

https://www.vincentrubinetti.com

https://vincerubinetti.bandcamp.com/album/the-music-of-3blue1brown

https://open.spotify.com/…

4 months, 2 weeks назад @ youtube.com
Where my explanation of Grover’s algorithm failed
Where my explanation of Grover’s algorithm failed Where my explanation of Grover’s algorithm failed

Addressing viewer questions from the last video.

These lessons are funded directly by viewers: https://3b1b.co/support

An equally valuable form of support is to share the videos. ------------------ These animations are largely made using a custom Python library, manim. See the FAQ comments here:

https://3b1b.co/faq#manim

https://github.com/3b1b/manim

https://github.com/ManimCommunity/manim/ All code for specific videos is visible here:

https://github.com/3b1b/videos/ The music is by Vincent Rubinetti.

https://www.vincentrubinetti.com

https://vincerubinetti.bandcamp.com/album/the-music-of-3blue1brown

https://open.spotify.com/album/1dVyjwS8FBqXhRunaG5W5u ------------------ 3blue1brown is a ch…

4 months, 2 weeks назад @ youtube.com
But what is Quantum Computing? (Grover's Algorithm)
But what is Quantum Computing?  (Grover's Algorithm) But what is Quantum Computing? (Grover's Algorithm)

Qubits, state vectors, and Grover's algorithm for search.

Instead of sponsored ad reads, these lessons are funded directly by viewers: https://3b1b.co/support

An equally valuable form of support is to share the videos. The subtitles on this video were done using AI, and are likely imperfect, but they are open for community corrections at https://criblate.com/ Adam Brown's paper on the connection between Grover's Algorithm and block collisions:

https://arxiv.org/pdf/1912.02207 If you want to learn the relevant underlying quantum mechanics here, a very friendly resource is the course Mithuna at Looking Glass Universe is currently putting together. See, for instance, this explainer of a qubit:…

4 months, 2 weeks назад @ youtube.com
Testing your intuition for quantum computing
Testing your intuition for quantum computing Testing your intuition for quantum computing

Full video: https://youtu.be/RQWpF2Gb-gU

4 months, 2 weeks назад @ youtube.com
How to measure nearby galaxies
How to measure nearby galaxies How to measure nearby galaxies

From this video: https://youtu.be/hFMaT9oRbs4

5 months назад @ youtube.com
Measuring the distance to Venus without radar
Measuring the distance to Venus without radar Measuring the distance to Venus without radar

From this video with Terry Tao: https://youtu.be/hFMaT9oRbs4

5 months, 1 week назад @ youtube.com
Measuring the speed of light using Jupiter's moons
Measuring the speed of light using Jupiter's moons Measuring the speed of light using Jupiter's moons

From this video with Terry Tao: https://youtu.be/hFMaT9oRbs4

5 months, 1 week назад @ youtube.com
The tragic tale of Guillaume Le Gentil
The tragic tale of Guillaume Le Gentil The tragic tale of Guillaume Le Gentil

From this video: https://youtu.be/hFMaT9oRbs4 Artwork by Kurt Bruns

5 months, 3 weeks назад @ youtube.com
Zooming out by powers of 10
Zooming out by powers of 10 Zooming out by powers of 10

From this video: https://youtu.be/YdOXS_9_P4U

5 months, 3 weeks назад @ youtube.com
Two Minute Papers Two Minute Papers
последний пост 1 day, 10 hours назад
This Free AI Generates Video FASTER Than Real Life 🤯
This Free AI Generates Video FASTER Than Real Life 🤯 This Free AI Generates Video FASTER Than Real Life 🤯

❤️ Check out the Fully Connected Conference by Weights & Biases - https://wandb.me/fclon2025-2min

20% discount code: FCLON2025-2MIN 📝 The paper is available here:

https://github.com/Lightricks/LTX-Video 📝 My paper on simulations that look almost like reality is available for free here:

https://rdcu.be/cWPfD Or this is the orig. Nature Physics link with clickable citations:

https://www.nature.com/articles/s41567-022-01788-5 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Benji Rabhan, B Shang, Christian Ahlin, Gordon Child, John Le, Juan Benet, Kyle Davis, Loyal Alchemist, Lukas Biewald, Michael Tedder, Owen Skarpness, Richard Sundvall, Steef, Sv…

1 day, 10 hours назад @ youtube.com
Intel Just Changed Computer Graphics Forever!
Intel Just Changed Computer Graphics Forever! Intel Just Changed Computer Graphics Forever!

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambda.ai/papers Guide:

Rent one of their GPU's with over 16GB of VRAM

Open a terminal

Just get Ollama with this command - https://ollama.com/download/linux

Then run ollama run gpt-oss:120b - https://ollama.com/library/gpt-oss:120b 📝 The paper is available here:

https://www.sdiolatz.info/publications/00ImageGS.html Genetic algorithm for the Mona Lisa:

https://users.cg.tuwien.ac.at/zsolnai/gfx/mona_lisa_parallel_genetic_algorithm/ 📝 My paper on simulations that look almost like reality is available for free here:

https://rdcu.be/cWPfD Or this is the orig. Nature Physics link with clickable citations:

https://www.nature.com/art…

6 days, 9 hours назад @ youtube.com
Google’s New AI Fixes The #1 Problem With Your Photos!
Google’s New AI Fixes The #1 Problem With Your Photos! Google’s New AI Fixes The #1 Problem With Your Photos!

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambda.ai/papers Guide:

Rent one of their GPU's with over 16GB of VRAM

Open a terminal

Just get Ollama with this command - https://ollama.com/download/linux

Then run ollama run gpt-oss:120b - https://ollama.com/library/gpt-oss:120b 📝 The paper is available here:

https://nadmag.github.io/LightLab/ 📝 My paper on simulations that look almost like reality is available for free here:

https://rdcu.be/cWPfD Or this is the orig. Nature Physics link with clickable citations:

https://www.nature.com/articles/s41567-022-01788-5 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Benji Rabhan, B S…

1 week, 2 days назад @ youtube.com
NVIDIA’s Tech: The Physics Engine That Fooled Everyone’s Ears!
NVIDIA’s Tech: The Physics Engine That Fooled Everyone’s Ears! NVIDIA’s Tech: The Physics Engine That Fooled Everyone’s Ears!

❤️ Check out DeepInfra and run DeepSeek or many other AI projects: https://deepinfra.com/papers 📝 The paper is available here:

https://graphics.stanford.edu/papers/waveblender/ 📝 My paper on simulations that look almost like reality is available for free here:

https://rdcu.be/cWPfD Or this is the orig. Nature Physics link with clickable citations:

https://www.nature.com/articles/s41567-022-01788-5 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Benji Rabhan, B Shang, Christian Ahlin, Gordon Child, John Le, Juan Benet, Kyle Davis, Loyal Alchemist, Lukas Biewald, Michael Tedder, Owen Skarpness, Richard Sundvall, Steef, Sven Pfiffner, Taras Bobrovy…

1 week, 5 days назад @ youtube.com
New AI Finally Solved The Hardest Animation Problem!
New AI Finally Solved The Hardest Animation Problem! New AI Finally Solved The Hardest Animation Problem!

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambda.ai/papers Guide:

Rent one of their GPUs with over 16GB of VRAM

Open a terminal

Just get Ollama with this command - https://ollama.com/download/linux

Then run ollama run gpt-oss:120b - https://ollama.com/library/gpt-oss:120b 📝 The paper is available here:

https://diffusecloc.github.io/website/ 📝 My paper on simulations that look almost like reality is available for free here:

https://rdcu.be/cWPfD Or this is the orig. Nature Physics link with clickable citations:

https://www.nature.com/articles/s41567-022-01788-5 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Benji Rabhan, …

2 weeks, 3 days назад @ youtube.com
This New Physics Engine Lets Jelly Move Like Humans!
This New Physics Engine Lets Jelly Move Like Humans! This New Physics Engine Lets Jelly Move Like Humans!

❤️ Check out Weights & Biases and sign up for a free demo here: https://wandb.me/papers 📝 The paper is available here:

https://arxiv.org/abs/2405.14595 📝 My paper on simulations that look almost like reality is available for free here:

https://rdcu.be/cWPfD Or this is the orig. Nature Physics link with clickable citations:

https://www.nature.com/articles/s41567-022-01788-5 Sources:

https://www.youtube.com/shorts/Mq7zzK-ZiWI

https://www.youtube.com/watch?v=A_Cdz-QBlT4 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Benji Rabhan, B Shang, Christian Ahlin, Gordon Child, John Le, Juan Benet, Kyle Davis, Loyal Alchemist, Lukas Biewald, Michael Tedder…

2 weeks, 6 days назад @ youtube.com
DeepMind Just Made The Most Powerful Game AI Engine!
DeepMind Just Made The Most Powerful Game AI Engine! DeepMind Just Made The Most Powerful Game AI Engine!

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambda.ai/papers Guide:

Rent one of their GPU's with over 16GB of VRAM

Open a terminal

Just get Ollama with this command - https://ollama.com/download/linux

Then run ollama run gpt-oss:120b - https://ollama.com/library/gpt-oss:120b Genie 3:

https://deepmind.google/discover/blog/genie-3-a-new-frontier-for-world-models/ Sources:

https://x.com/amoufarek/status/1955776162447102238

https://x.com/amoufarek/status/1955299375548076382

https://x.com/holynski_/status/1953882726656094622

https://x.com/holynski_/status/1953879983535141043

https://x.com/RuiHuang_art/status/1954716703340048877

https://x.com/mattmcgill_/status/1953827141700…

1 month назад @ youtube.com
Forgotten AI Research Solved The Problem Photoshop Never Could!
Forgotten AI Research Solved The Problem Photoshop Never Could! Forgotten AI Research Solved The Problem Photoshop Never Could!

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambda.ai/papers Guide:

Rent one of their GPU's with over 16GB of VRAM

Open a terminal

Just get Ollama with this command - https://ollama.com/download/linux

Then run ollama run gpt-oss:120b - https://ollama.com/library/gpt-oss:120b 📝 The paper "Physically Controllable Relighting of Photographs" is available here:

https://yaksoy.github.io/PhysicalRelighting/ 📝 My paper on simulations that look almost like reality is available for free here:

https://rdcu.be/cWPfD Or this is the orig. Nature Physics link with clickable citations:

https://www.nature.com/articles/s41567-022-01788-5 🙏 We would like to thank our generous Patreon sup…

1 month назад @ youtube.com
OpenAI’s New Free AI: The Good, The Bad, The Unexpected!
OpenAI’s New Free AI: The Good, The Bad, The Unexpected! OpenAI’s New Free AI: The Good, The Bad, The Unexpected!

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambda.ai/papers Guide:

Rent one of their GPU's with over 16GB of VRAM

Open a terminal

Just get Ollama with this command - https://ollama.com/download/linux

Then run ollama run gpt-oss:120b - https://ollama.com/library/gpt-oss:120b Try it online:

https://gpt-oss.com/ 📝 My paper on simulations that look almost like reality is available for free here:

https://rdcu.be/cWPfD Or this is the orig. Nature Physics link with clickable citations:

https://www.nature.com/articles/s41567-022-01788-5 Humanity's Last Exam:

https://agi.safe.ai/ Sources:

https://x.com/flavioad/status/1952792389636198489

https://x.com/kwindla/status/1952947685…

1 month, 1 week назад @ youtube.com
New Game AI Turns Photos Into Playable Worlds! | Celebrating 10 Years Of Papers! 🎂
New Game AI Turns Photos Into Playable Worlds!  | Celebrating 10 Years Of Papers! 🎂 New Game AI Turns Photos Into Playable Worlds! | Celebrating 10 Years Of Papers! 🎂

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambda.ai/papers Guide for using DeepSeek on Lambda:

https://docs.lambdalabs.com/education/large-language-models/deepseek-r1-ollama/?utm_source=two-minute-papers&utm_campaign=relevant-videos&utm_medium=video 📝 The paper is available here:

https://hunyuan-gamecraft.github.io/ 📝 My paper on simulations that look almost like reality is available for free here:

https://rdcu.be/cWPfD Or this is the orig. Nature Physics link with clickable citations:

https://www.nature.com/articles/s41567-022-01788-5 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Benji Rabhan, B Shang, Christian Ahlin,…

1 month, 1 week назад @ youtube.com
The Forgotten Research That Fixed The Worst Physics Bug!
The Forgotten Research That Fixed The Worst Physics Bug! The Forgotten Research That Fixed The Worst Physics Bug!

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambda.ai/papers Guide for using DeepSeek on Lambda:

https://docs.lambdalabs.com/education/large-language-models/deepseek-r1-ollama/?utm_source=two-minute-papers&utm_campaign=relevant-videos&utm_medium=video 📝 The paper is available here:

https://graphics.cs.utah.edu/research/projects/merging-and-splitting/ 📝 My paper on simulations that look almost like reality is available for free here:

https://rdcu.be/cWPfD Or this is the orig. Nature Physics link with clickable citations:

https://www.nature.com/articles/s41567-022-01788-5 Video game glitch: https://www.youtube.com/watch?v=fZgRVatBXTE 🙏 We would like to thank our generous…

1 month, 2 weeks назад @ youtube.com
This AI Learns Faster Than Anything We’ve Seen!
This AI Learns Faster Than Anything We’ve Seen! This AI Learns Faster Than Anything We’ve Seen!

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambda.ai/papers Guide for using DeepSeek on Lambda:

https://docs.lambdalabs.com/education/large-language-models/deepseek-r1-ollama/?utm_source=two-minute-papers&utm_campaign=relevant-videos&utm_medium=video 📝 "Genesis: A Generative and Universal Physics Engine for Robotics and Beyond" is available here:

https://genesis-embodied-ai.github.io/ Tech report direct link: https://placid-walkover-0cc.notion.site/genesis-performance-benchmarking Criticism: https://stoneztao.substack.com/p/the-new-hyped-genesis-simulator-is

Their answer to criticism: https://placid-walkover-0cc.notion.site/genesis-performance-benchmarking (at 4.1 Res…

1 month, 3 weeks назад @ youtube.com
Blender 4.5 - How My Dream Just Came True!
Blender 4.5 - How My Dream Just Came True! Blender 4.5 - How My Dream Just Came True!

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambda.ai/papers Guide for using DeepSeek on Lambda:

https://docs.lambdalabs.com/education/large-language-models/deepseek-r1-ollama/?utm_source=two-minute-papers&utm_campaign=relevant-videos&utm_medium=video My part at the official Blender 4.5 announcement video:

https://www.youtube.com/watch?v=wPhA0imjvVs&t=1474s Get Blender: https://www.blender.org/

Demo files: https://www.blender.org/download/demo-files/

Tutorial: https://www.youtube.com/watch?v=4haAdmHqGOw Or this is the orig. Nature Physics link with clickable citations:

https://www.nature.com/articles/s41567-022-01788-5 🙏 We would like to thank our generous Patreon supp…

2 months назад @ youtube.com
The “Biggest” AI That Came Out Of Nowhere!
The “Biggest” AI That Came Out Of Nowhere! The “Biggest” AI That Came Out Of Nowhere!

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambda.ai/papers Guide for using DeepSeek on Lambda:

https://docs.lambdalabs.com/education/large-language-models/deepseek-r1-ollama/?utm_source=two-minute-papers&utm_campaign=relevant-videos&utm_medium=video Kimi K2:

https://moonshotai.github.io/Kimi-K2/

API: https://platform.moonshot.ai Run it yourself locally: https://x.com/unslothai/status/1944780685409165589 Sources:

https://x.com/chetaslua/status/1943681568549052458

https://x.com/satvikps/status/1944861384573169929 📝 My paper on simulations that look almost like reality is available for free here:

https://rdcu.be/cWPfD Or this is the orig. Nature Physics link with clicka…

2 months назад @ youtube.com
Roblox Solved The Physics Problem That Stumped Everyone!
Roblox Solved The Physics Problem That Stumped Everyone! Roblox Solved The Physics Problem That Stumped Everyone!

❤️ Check out Vast.ai and run DeepSeek or any AI project: https://vast.ai/papers 📝 The paper is available here:

https://graphics.cs.utah.edu/research/projects/avbd/ Play with it!

https://graphics.cs.utah.edu/research/projects/avbd/avbd_demo2d.html 📝 My paper on simulations that look almost like reality is available for free here:

https://rdcu.be/cWPfD Or this is the orig. Nature Physics link with clickable citations:

https://www.nature.com/articles/s41567-022-01788-5 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Benji Rabhan, B Shang, Christian Ahlin, Gordon Child, John Le, Juan Benet, Kyle Davis, Loyal Alchemist, Lukas Biewald, Michael Tedder,…

2 months назад @ youtube.com
DataFest Video DataFest Video
последний пост None
Семинары JetBrains Research Семинары JetBrains Research
последний пост None
Яндекс. Компьютерные науки Яндекс. Компьютерные науки
последний пост 1 day, 10 hours назад
Как мы закрываем наши боли от прогнозов в Лавке
Как мы закрываем наши боли от прогнозов в Лавке Как мы закрываем наши боли от прогнозов в Лавке

Это фрагмент из доклада Александра Исакова, руководителя группы прогнозирования в Яндекс Лавке, который он прочитал на конференции AHA!25. Александр рассказал о создании и внедрении системы среднесрочного прогнозирования для Яндекс Лавки. Такое решение точно прогнозирует заказы на полугодовой период и помогает планировать ресурсы в условиях быстрого роста сервиса и высокой волатильности спроса. А больше подробностей вы найдёте в полной записи выступления — смотрите её на нашем канале. #яндекс #яндекславка #aha25 #прогнозирование #datascience #ml #ai #bigdata #аналитика

1 day, 10 hours назад @ youtube.com
Этапы прогнозирования в Яндекс Лавке
Этапы прогнозирования в Яндекс Лавке Этапы прогнозирования в Яндекс Лавке

Это фрагмент из доклада Александра Исакова, руководителя группы прогнозирования в Яндекс Лавке, который он прочитал на конференции AHA!25. Александр рассказал о создании и внедрении системы среднесрочного прогнозирования для Яндекс Лавки. Такое решение точно прогнозирует заказы на полугодовой период и помогает планировать ресурсы в условиях быстрого роста сервиса и высокой волатильности спроса. А больше подробностей вы найдёте в полной записи выступления — смотрите её на нашем канале. #яндекс #яндекславка #aha25 #datascience #ml #ai #прогнозирование #аналитика #bigdata #прогнозспроса

1 week назад @ youtube.com
Размеры рекомендательных моделей
Размеры рекомендательных моделей Размеры рекомендательных моделей

Это фрагмент доклада Николая Савушкина, руководителя службы рекомендательных технологий в Яндексе, с конференции AHA!25. Николай рассказал, как устроена современная персонализация, как большие генеративные модели внедряют в рекомендательные системы Яндекса и что ждёт нас в будущем. Полное выступление смотрите на нашем канале. #яндекс #aha25 #рекомендательныесистемы #рекомендации #персонализация #ai #ml #искусственныйинтеллект #нейросети #машиннообучение #generativeai #bigmodels #recommendations #datascience #технологиибудущего #conference

1 week, 4 days назад @ youtube.com
Почему генеративное обучение важно для RecSys
Почему генеративное обучение важно для RecSys Почему генеративное обучение важно для RecSys

Это фрагмент доклада Николая Савушкина, руководителя службы рекомендательных технологий в Яндексе, с конференции AHA!25. Николай рассказал, как устроена современная персонализация, как большие генеративные модели внедряют в рекомендательные системы Яндекса и что ждёт нас в будущем. Полное выступление смотрите на нашем канале. #яндекс #aha25 #рекомендательныесистемы #рекомендации #персонализация #ai #ml #искусственныйинтеллект #нейросети #машиннообучение #generativeai #bigmodels #recommendations #datascience #технологиибудущего #conference

2 weeks назад @ youtube.com
Как понять, чего хочет ваш пользователь
Как понять, чего хочет ваш пользователь Как понять, чего хочет ваш пользователь

Это фрагмент доклада Николая Савушкина, руководителя службы рекомендательных технологий в Яндексе, с конференции AHA!25. Николай рассказал, как устроена современная персонализация, как большие генеративные модели внедряют в рекомендательные системы Яндекса и что ждёт нас в будущем. Полное выступление смотрите на нашем канале. #яндекс #aha25 #рекомендательныесистемы #рекомендации #персонализация #ai #ml #искусственныйинтеллект #нейросети #машиннообучение #generativeai #bigmodels #recommendations #datascience #технологиибудущего #conference

3 weeks назад @ youtube.com
Data Dojo — встреча ML-сообщества в Питере
Data Dojo — встреча ML-сообщества в Питере Data Dojo — встреча ML-сообщества в Питере

Data Dojo — это сообщество ML-экспертов. Здесь обсуждают тренды, разбирают реальные задачи, делятся опытом и практикуются. Додзё в японской культуре — место Пути, где совершенствуют не только мастерство, но и дух. Мы перенесли этот принцип в мир данных. Программа:

17:00 Приветственное слово | Владислав Офицеров, модератор встречи, руководитель команды развития нейронных технологий международного поиска, и Пётр Ермаков, ML-бренд-директор 17:05 Лекция: R&D в Яндексе | Алексей Колесов, руководитель команды NLP 17:35 Лекция: Актуальные технологии в рекомендательных системах | Николай Савушкин, руководитель команды рекомендательных технологий в Поиске 19:05 Разбор ML-задачи 19:50 Лекция: Как ML-…

3 weeks, 5 days назад @ youtube.com
Мы будем меньше писать код руками — и это хорошо
Мы будем меньше писать код руками — и это хорошо Мы будем меньше писать код руками — и это хорошо

В этом году конференция Data Fest отмечает юбилей — 10 лет. За это время машинное обучение добралось до многих сфер нашей жизни. Нам стало интересно, как развитие ML изменит привычную реальность ещё через 10 лет. Вот мы и спросили об этом у гостей конференции! Полное видео ищите на нашем канале. #автоматизация #lowcode #nocode #будущееразработки #нейросети #AIвразработке #программирование #ИИ #разработчики #машинноеобучение #генерациякода #технологии

1 month, 3 weeks назад @ youtube.com
Как ИИ и нейросети заставят нас БОЛЬШЕ думать
Как ИИ и нейросети заставят нас БОЛЬШЕ думать Как ИИ и нейросети заставят нас БОЛЬШЕ думать

В этом году конференция Data Fest отмечает юбилей — 10 лет. За это время машинное обучение добралось до многих сфер нашей жизни. Нам стало интересно, как развитие ML изменит привычную реальность ещё через 10 лет. Вот мы и спросили об этом у гостей конференции! Полное видео ищите на нашем канале. #искусственныйинтеллект #нейросети #машинноеобучение #DataFest #будущеетехнологий #AI #ИИ #глубокообучение #технологиибудущего #GPT #этикаИИ #развитиеИИ #автоматизация

1 month, 3 weeks назад @ youtube.com
Как мы учим Алису видеть объекты
Как мы учим Алису видеть объекты Как мы учим Алису видеть объекты

Это фрагмент выступления Дарьи Виноградовой, руководителя команды стримингового зрения в Яндексе. На Data Fest Дарья рассказала о VLM в Алисе. #Алиса #Яндекс #нейросети #ИИ #технологии #VLM #компьютерноезрение #голосовойассистент #визуальныйинтеллект #AI #умныеустройства #обучениеИИ #распознаваниеобъектов #мультимодальность

2 months назад @ youtube.com
VLM: как видит мир Алиса от Яндекса
VLM: как видит мир Алиса от Яндекса VLM: как видит мир Алиса от Яндекса

Это фрагмент выступления Дарьи Виноградовой, руководителя команды стримингового зрения в Яндексе. На Data Fest Дарья рассказала о VLM в Алисе. #Алиса #Яндекс #нейросети #искусственныйинтеллект #VLM #AI #технологии #умныеустройства #мультимодальность #АлисаЯндекс #голосовойассистент #визуальноеAI #ЯндексАлиса #VLMтехнологии #ИИ

2 months, 1 week назад @ youtube.com
VLM: как нейросети связывают картинку и текст
VLM: как нейросети связывают картинку и текст VLM: как нейросети связывают картинку и текст

Это фрагмент выступления Дарьи Виноградовой, руководителя команды стримингового зрения в Яндексе. На Data Fest Дарья рассказала о VLM в Алисе. #VLM #нейросети #ИИ #искусственныйинтеллект #мультимодальность #AI #визуальныйИИ #нейросетитексткартинка #компьютерноеЗрение #моделиЯндекса #умныеустройства #технологиибудущего #визуальноятехнология #machinelearning #deeplearning #языкитехнологии #AIЯндекс #модельпокартинке #LLM #NLP #CV

2 months, 1 week назад @ youtube.com
Что за Муза в Алисе от Яндекса
Что за Муза в Алисе от Яндекса Что за Муза в Алисе от Яндекса

Это фрагмент выступления Дарьи Виноградовой, руководителя команды стримингового зрения в Яндексе. На Data Fest Дарья рассказала о VLM в Алисе. #Алиса #Муза #Яндекс #VLM #DataFest #ИИ #нейросети #AI #умныеустройства #визуальныемодели #визуальныйИИ #ЯндексТехнологии #визуальноятехнология #ассистентыбудущего #технологии #компьютерноеЗрение #LLM #интеллектуальныйассистент #стриминговоезрение #нейроассистент #техконференция

2 months, 2 weeks назад @ youtube.com
Как прогнозировать тысячи временных рядов и не сходить с ума / Александр Исаков
Как прогнозировать тысячи временных рядов и не сходить с ума / Александр Исаков Как прогнозировать тысячи временных рядов и не сходить с ума / Александр Исаков

Александр Исаков, руководитель группы прогнозирования в Яндекс Лавке, прочитал этот доклад на конференции AHA!25. Александр рассказал о создании и внедрении системы среднесрочного прогнозирования для Яндекс Лавки. Эта система точно прогнозирует ключевую бизнес-метрику (заказы) на полугодовой период и помогает планировать ресурсы в условиях быстрого роста сервиса и высокой волатильности спроса. Александр подробно остановился на том, как ребята решили проблему волатильности временных рядов с помощью приведения данных к стабильному виду и выделения ключевых влияющих факторов, что позволило не только улучшить прогноз, но и объяснить отклонения от плана. А ещё рассказал, как они встроили прогноз…

2 months, 3 weeks назад @ youtube.com
Как продукты на основе LLM повышают эффективность сотрудников и экономят миллионы / Эльвира Морозова
Как продукты на основе LLM повышают эффективность сотрудников и экономят миллионы / Эльвира Морозова Как продукты на основе LLM повышают эффективность сотрудников и экономят миллионы / Эльвира Морозова

Эльвира Морозова, ХХ в Яндексе, прочитала этот доклад на конференции AHA!25. Эльвира рассказала, как ребята внедрили YaGPT в формате подсказок оператора в одну из поддержек Яндекса. И в итоге получили доказанный в A/B-тестах экстраэффект как на скорости работы операторов, так и на качестве ответов. Эльвира показала, почему «экзоскелет» из GPT для оператора поддержки — это новая норма. Спойлер: применение GPT в саппорте сейчас приносит 15% чистой экономии в Яндекс Маркете. Больше классных материалов ищите в телеграм-канале Yandex for ML: https://t.me/yandexforml #GPT #Яндекс #поддержка #аналитика #AHA25 #YaGPT #искусственныйинтеллект #YandexForAnalytics #LLM #AI

2 months, 3 weeks назад @ youtube.com
Будущее рекомендательных систем / Николай Савушкин
Будущее рекомендательных систем / Николай Савушкин Будущее рекомендательных систем / Николай Савушкин

Николай Савушкин, руководитель службы рекомендательных технологий в Яндексе, прочитал этот доклад на конференции AHA!25. Николай рассказал, как персонализация устроена сейчас, как большие генеративные модели внедряют в рекомендательные системы Яндекса и что ждёт нас в будущем. Доклад направлен на широкую аудиторию с техническим бэкграундом, знакомит с основными понятиями и показывает технологические тренды. Больше классных материалов ищите в телеграм-канале Yandex for ML: https://t.me/yandexforml #персонализация #рекомендации #машинноеобучение #генеративныемодели #Яндекс #AI #LLM #AHA25 #аналитика #YandexForAnalytics

2 months, 3 weeks назад @ youtube.com
ML Trainings ML Trainings
последний пост 9 часов назад
Айрат Галямов | КиберФерма: как мы учим ИИ наблюдать за коровами
Айрат Галямов | КиберФерма: как мы учим ИИ наблюдать за коровами Айрат Галямов | КиберФерма: как мы учим ИИ наблюдать за коровами

Спикер: Айрат Галямов, Открытые Агро Инновации, CV team lead Data Fest 2025: https://ods.ai/events/datafest2025

Презентацию к докладу Вы можете скачать в треке секции ML in Manufacturing: https://ods.ai/tracks/df25-ml-in-manufacturing

______

Наши соц.сети:

Telegram: https://t.me/datafest

Вконтакте: https://vk.com/datafest

Канал с вакансиями в telegram: https://t.me/odsjobs

Канал с апдейтами по курсам: https://t.me/odscourses

Как попасть в чат сообщества ODS Mattermost: https://ods.ai/tracks/mattermost

9 часов назад @ youtube.com
Федор Жирков | Улучшение предсказания точки получения заказа
Федор Жирков | Улучшение предсказания точки получения заказа Федор Жирков | Улучшение предсказания точки получения заказа

Спикер: Федор Жирков Data Fest 2025: https://ods.ai/events/datafest2025 ____________________

Наши соц.сети:

Telegram: https://t.me/datafest

Вконтакте: https://vk.com/datafest

Канал с вакансиями в telegram: https://t.me/odsjobs

Канал с апдейтами по курсам: https://t.me/odscourses

Как попасть в чат сообщества ODS Mattermost: https://ods.ai/tracks/mattermost

9 часов назад @ youtube.com
Александр Туманов, Анна Пинигина | Doc2reactions — система извлечения химических реакций из текстов
Александр Туманов, Анна Пинигина | Doc2reactions — система извлечения химических реакций из текстов Александр Туманов, Анна Пинигина | Doc2reactions — система извлечения химических реакций из текстов

Спикер: Александр Туманов, Анна Пинигина, Syntelly Тема: Doc2reactions — система извлечения химических реакций из текстов научных документов Data Fest 2025: https://ods.ai/events/datafest2025

Презентацию к докладу Вы можете скачать в треке секции ML in Chemistry: https://ods.ai/tracks/df25-ml-in-chemistry

______

Наши соц.сети:

Telegram: https://t.me/datafest

Вконтакте: https://vk.com/datafest

Канал с вакансиями в telegram: https://t.me/odsjobs

Канал с апдейтами по курсам: https://t.me/odscourses

Как попасть в чат сообщества ODS Mattermost: https://ods.ai/tracks/mattermost

19 часов назад @ youtube.com
Антон Витвицкий | Умные конюшни: Мониторинг состояния и здоровья лошадей
Антон Витвицкий | Умные конюшни: Мониторинг состояния и здоровья лошадей Антон Витвицкий | Умные конюшни: Мониторинг состояния и здоровья лошадей

Спикер: Антон Витвицкий, ООО "Автоматизация видеоанализа" Data Fest 2025: https://ods.ai/events/datafest2025

Презентацию к докладу Вы можете скачать в треке секции Computer Vision: https://ods.ai/tracks/df25-cv

______

Наши соц.сети:

Telegram: https://t.me/datafest

Вконтакте: https://vk.com/datafest

Канал с вакансиями в telegram: https://t.me/odsjobs

Канал с апдейтами по курсам: https://t.me/odscourses

Как попасть в чат сообщества ODS Mattermost: https://ods.ai/tracks/mattermost

19 часов назад @ youtube.com
Александр Нозик | Экосистема открытого научного ПО KScience и возможности развития открытого ПО
Александр Нозик | Экосистема открытого научного ПО KScience и возможности развития открытого ПО Александр Нозик | Экосистема открытого научного ПО KScience и возможности развития открытого ПО

Спикер: Александр Нозик Тема: Экосистема открытого научного ПО KScience и возможности развития открытого ПО в вузах Data Fest 2025: https://ods.ai/events/datafest2025

Презентацию к докладу Вы можете скачать в треке секции Open Source: https://ods.ai/tracks/df25-opensource Telegram: https://t.me/datafest

Вконтакте: https://vk.com/datafest

Канал с вакансиями в telegram: https://t.me/odsjobs

Канал с апдейтами по курсам: https://t.me/odscourses

Как попасть в чат сообщества ODS Mattermost: https://ods.ai/tracks/mattermost

1 day, 7 hours назад @ youtube.com
Екатерина Лемдясова | Алгоритм рекомендательной системы с учётом культурных факторов
Екатерина Лемдясова | Алгоритм рекомендательной системы с учётом культурных факторов Екатерина Лемдясова | Алгоритм рекомендательной системы с учётом культурных факторов

Спикер: Екатерина Лемдясова, студентка ФГБОУ ВО "РОСБИОТЕХ" Тема: Алгоритм рекомендательной системы с учётом культурных факторов для решения проблем холодного старта. Data Fest 2025: https://ods.ai/events/datafest2025 Наши соц.сети:

Telegram: https://t.me/datafest

Вконтакте: https://vk.com/datafest

Канал с вакансиями в telegram: https://t.me/odsjobs

Канал с апдейтами по курсам: https://t.me/odscourses

Как попасть в чат сообщества ODS Mattermost: https://ods.ai/tracks/mattermost

1 day, 7 hours назад @ youtube.com
Артём Земляк | Холодный старт продаж комплементарных товаров с помощью LLM
Артём Земляк | Холодный старт продаж комплементарных товаров с помощью LLM Артём Земляк | Холодный старт продаж комплементарных товаров с помощью LLM

Спикер: Артём Земляк, Lead DS Поиска, ecom.tech Data Fest 2025: https://ods.ai/events/datafest2025

Презентацию к докладу можно скачать в треке секции Advanced LLM: https://ods.ai/tracks/df25-allm

______

Наши соц.сети:

Telegram: https://t.me/datafest

Вконтакте: https://vk.com/datafest

Канал с вакансиями в telegram: https://t.me/odsjobs

Канал с апдейтами по курсам: https://t.me/odscourses

Как попасть в чат сообщества ODS Mattermost: https://ods.ai/tracks/mattermost

1 day, 18 hours назад @ youtube.com
Антон Воронов | LLM-tools в поиске работы
Антон Воронов | LLM-tools в поиске работы Антон Воронов | LLM-tools в поиске работы

Спикер: Антон Воронов, Avito.tech Data Fest 2025: https://ods.ai/events/datafest2025

Презентацию к докладу Вы можете скачать в треке секции Career & TeamLead: https://ods.ai/tracks/df25-career

______

Наши соц.сети:

Telegram: https://t.me/datafest

Вконтакте: https://vk.com/datafest

Канал с вакансиями в telegram: https://t.me/odsjobs

Канал с апдейтами по курсам: https://t.me/odscourses

Как попасть в чат сообщества ODS Mattermost: https://ods.ai/tracks/mattermost

1 day, 18 hours назад @ youtube.com
Петр Сокерин | Cut Cross Entropy: Эффективное обучение трансформеров для рекомендательных систем
Петр Сокерин | Cut Cross Entropy: Эффективное обучение трансформеров для рекомендательных систем Петр Сокерин | Cut Cross Entropy: Эффективное обучение трансформеров для рекомендательных систем

Спикер: Петр Сокерин, Сколтех, МЛ инженер исследователь Data Fest 2025: https://ods.ai/events/datafest2025

Презентацию к докладу Вы можете скачать в треке секции Advanced LLM: https://ods.ai/tracks/df25-allm

______

Наши соц.сети:

Telegram: https://t.me/datafest

Вконтакте: https://vk.com/datafest

Канал с вакансиями в telegram: https://t.me/odsjobs

Канал с апдейтами по курсам: https://t.me/odscourses

Как попасть в чат сообщества ODS Mattermost: https://ods.ai/tracks/mattermost

2 days, 9 hours назад @ youtube.com
Игорь Чурин | LIBRA: как оценивать LLM на длинном контексте
Игорь Чурин | LIBRA: как оценивать LLM на длинном контексте Игорь Чурин | LIBRA: как оценивать LLM на длинном контексте

Спикер: Игорь Чурин Data Fest 2025: https://ods.ai/events/datafest2025

Презентацию к докладу Вы можете скачать в треке секции Data Collection & Labelling: https://ods.ai/tracks/df25-data-collection-labelling

______

Наши соц.сети:

Telegram: https://t.me/datafest

Вконтакте: https://vk.com/datafest

Канал с вакансиями в telegram: https://t.me/odsjobs

Канал с апдейтами по курсам: https://t.me/odscourses

Как попасть в чат сообщества ODS Mattermost: https://ods.ai/tracks/mattermost

2 days, 9 hours назад @ youtube.com
Сергей Абраменков | Co-designing with LLM: personalized cognitive engine architecture basis
Сергей Абраменков | Co-designing with LLM: personalized cognitive engine architecture basis Сергей Абраменков | Co-designing with LLM: personalized cognitive engine architecture basis

Спикер: Сергей Абраменков, Full-stack seismologist, PhD Data Fest 2025: https://ods.ai/events/datafest2025

Презентацию к докладу можно скачать в треке секции Advanced LLM: https://ods.ai/tracks/df25-allm

______

Наши соц.сети:

Telegram: https://t.me/datafest

Вконтакте: https://vk.com/datafest

Канал с вакансиями в telegram: https://t.me/odsjobs

Канал с апдейтами по курсам: https://t.me/odscourses

Как попасть в чат сообщества ODS Mattermost: https://ods.ai/tracks/mattermost

2 days, 18 hours назад @ youtube.com
Михаил Козак | Делаем трекинг с ReID на уровне SOTA
Михаил Козак | Делаем трекинг с ReID на уровне SOTA Михаил Козак | Делаем трекинг с ReID на уровне SOTA

Спикер: Михаил Козак, НПП "Югпромавтоматизация" Data Fest 2025: https://ods.ai/events/datafest2025

Презентацию к докладу Вы можете скачать в треке секции Computer Vision: https://ods.ai/tracks/df25-cv

______

Наши соц.сети:

Telegram: https://t.me/datafest

Вконтакте: https://vk.com/datafest

Канал с вакансиями в telegram: https://t.me/odsjobs

Канал с апдейтами по курсам: https://t.me/odscourses

Как попасть в чат сообщества ODS Mattermost: https://ods.ai/tracks/mattermost

2 days, 18 hours назад @ youtube.com
Капитанский мостик №11: ASML ивестирует в Mistral | внедрение ИИ падает | подружка для Хинтона
Капитанский мостик №11: ASML ивестирует в Mistral | внедрение ИИ падает | подружка для Хинтона Капитанский мостик №11: ASML ивестирует в Mistral | внедрение ИИ падает | подружка для Хинтона

Одиннадцатый выпуск подкаста "Капитанский мостик" традиционно ведут Валентин Малых и Дмитрий Колодезев. В этот раз снова вдвоем. Оставляйте новости к следующему выпуску в нашем канале в Mattermost. ИИ-саммари: В этом эпизоде подкаста обсуждаются последние новости в области технологий и инвестиций. Ведущие говорят о намерении компании SML стать крупным инвестором в Местраль, а также о том, как китайские компании, такие как Alibaba и Baidu, стремятся снизить зависимость от Nvidia, разрабатывая собственные чипы. В этом разговоре обсуждаются ключевые аспекты конкуренции и монополии в мире технологий, в частности, влияние компаний AMD и Intel на рынок процессоров, а также роль программного обесп…

3 days, 18 hours назад @ youtube.com
Малик Мохрат | Seeing Is Knowing: Vision–Language Knowledge SLAM for 3D Graph Generation with FMs
Малик Мохрат | Seeing Is Knowing: Vision–Language Knowledge SLAM for 3D Graph Generation with FMs Малик Мохрат | Seeing Is Knowing: Vision–Language Knowledge SLAM for 3D Graph Generation with FMs

Спикер: Малик Мохрат, Sber Robotics Center Data Fest 2025: https://ods.ai/events/datafest2025

Презентацию к докладу Вы можете скачать в треке секции Robotics: https://ods.ai/tracks/df25-robotics

______

Наши соц.сети:

Telegram: https://t.me/datafest

Вконтакте: https://vk.com/datafest

Канал с вакансиями в telegram: https://t.me/odsjobs

Канал с апдейтами по курсам: https://t.me/odscourses

Как попасть в чат сообщества ODS Mattermost: https://ods.ai/tracks/mattermost

4 days, 17 hours назад @ youtube.com
Алексей Сушков | Пайплайн подготовки данных для проверки ремней безопасности
Алексей Сушков | Пайплайн подготовки данных для проверки ремней безопасности Алексей Сушков | Пайплайн подготовки данных для проверки ремней безопасности

Спикер: Алексей Сушков Data Fest 2025: https://ods.ai/events/datafest2025

Презентацию к докладу Вы можете скачать в треке секции Data Collection & Labelling: https://ods.ai/tracks/df25-data-collection-labelling

______

Наши соц.сети:

Telegram: https://t.me/datafest

Вконтакте: https://vk.com/datafest

Канал с вакансиями в telegram: https://t.me/odsjobs

Канал с апдейтами по курсам: https://t.me/odscourses

Как попасть в чат сообщества ODS Mattermost: https://ods.ai/tracks/mattermost

4 days, 17 hours назад @ youtube.com
🎧 Podcasts
Lex Fridman AI Podcast Lex Fridman AI Podcast
последний пост 1 week, 6 days назад
#480 – Dave Hone: T-Rex, Dinosaurs, Extinction, Evolution, and Jurassic Park
#480 – Dave Hone: T-Rex, Dinosaurs, Extinction, Evolution, and Jurassic Park #480 – Dave Hone: T-Rex, Dinosaurs, Extinction, Evolution, and Jurassic Park

Dave Hone is a paleontologist, expert on dinosaurs, co-host of the Terrible Lizards podcast, and author of numerous scientific papers and books on the behavior and ecology of dinosaurs.

He lectures at Queen Mary University of London on topics of Ecology, Zoology, Biology, and Evolution.

Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep480-scSee below for timestamps, transcript, and to give feedback, submit questions, contact Lex, etc.

Go to https://go.lindy.ai/lexBetterHelp: Online therapy and counseling.

Go to https://shopify.com/lexLMNT: Zero-sugar electrolyte drink mix.

1 week, 6 days назад @ lexfridman.com
#479 – Dave Plummer: Programming, Autism, and Old-School Microsoft Stories
#479 – Dave Plummer: Programming, Autism, and Old-School Microsoft Stories #479 – Dave Plummer: Programming, Autism, and Old-School Microsoft Stories

Dave Plummer is a programmer, former Microsoft software engineer (Windows 95, NT, XP), creator of Task Manager, author of two books on autism, and host of the Dave’s Garage YouTube channel, where he shares stories from his career, insights on software development, and deep dives into technology.

Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep479-scSee below for timestamps, and to give feedback, submit questions, contact Lex, etc.

Go to https://upliftdesk.com/lexZocDoc: App that helps patients find healthcare providers.

Go to https://zocdoc.com/lexFin: AI agent for customer service.

Go to https://fin.ai/lexAllio Capital: AI-powered investment app that use…

2 weeks, 5 days назад @ lexfridman.com
#478 – Scott Horton: The Case Against War and the Military Industrial Complex
#478 – Scott Horton: The Case Against War and the Military Industrial Complex #478 – Scott Horton: The Case Against War and the Military Industrial Complex

Scott Horton is the director of the Libertarian Institute, editorial director of Antiwar.com, host of The Scott Horton Show, co-host of Provoked, and for the past three decades a staunch critic of U.S. military interventionism.

Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep478-scSee below for timestamps, and to give feedback, submit questions, contact Lex, etc.

Go to https://alliocapital.com/Hampton: Community for high-growth founders and CEOs.

Go to https://joinhampton.com/lexBetterHelp: Online therapy and counseling.

Go to https://drinkag1.com/lexOUTLINE:(00:00) – Introduction(00:35) – Sponsors, Comments, and Reflections(09:14) – From the Cold War to …

3 weeks, 3 days назад @ lexfridman.com
#477 – Keyu Jin: China’s Economy, Tariffs, Trade, Trump, Communism & Capitalism
#477 – Keyu Jin: China’s Economy, Tariffs, Trade, Trump, Communism & Capitalism #477 – Keyu Jin: China’s Economy, Tariffs, Trade, Trump, Communism & Capitalism

Keyu Jin is an economist specializing in China’s economy, international macroeconomics, global trade imbalances, and financial policy.

She is the author of The New China Playbook: Beyond Socialism and Capitalism.

Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep477-scSee below for timestamps, transcript, and to give feedback, submit questions, contact Lex, etc.

Go to https://alliocapital.com/UPLIFT Desk: Standing desks and office ergonomics.

Go to https://upliftdesk.com/lexHampton: Community for high-growth founders and CEOs.

1 month назад @ lexfridman.com
#476 – Jack Weatherford: Genghis Khan and the Mongol Empire
#476 – Jack Weatherford: Genghis Khan and the Mongol Empire #476 – Jack Weatherford: Genghis Khan and the Mongol Empire

Jack Weatherford is an anthropologist and historian specializing in Genghis Khan and the Mongol Empire.

Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep476-scSee below for timestamps, and to give feedback, submit questions, contact Lex, etc.

Go to https://alliocapital.com/ZocDoc: App that helps patients find healthcare providers.

Go to https://zocdoc.com/lexFin: AI agent for customer service.

Go to https://shopify.com/lexMasterClass: Online classes from world-class experts.

1 month, 2 weeks назад @ lexfridman.com
#475 – Demis Hassabis: Future of AI, Simulating Reality, Physics and Video Games
#475 – Demis Hassabis: Future of AI, Simulating Reality, Physics and Video Games #475 – Demis Hassabis: Future of AI, Simulating Reality, Physics and Video Games

Demis Hassabis is the CEO of Google DeepMind and Nobel Prize winner for his groundbreaking work in protein structure prediction using AI.

Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep475-scSee below for timestamps, transcript, and to give feedback, submit questions, contact Lex, etc.

Go to https://joinhampton.com/lexFin: AI agent for customer service.

Go to https://shopify.com/lexLMNT: Zero-sugar electrolyte drink mix.

Go to https://drinkLMNT.com/lexAG1: All-in-one daily nutrition drink.

1 month, 3 weeks назад @ lexfridman.com
#474 – DHH: Future of Programming, AI, Ruby on Rails, Productivity & Parenting
#474 – DHH: Future of Programming, AI, Ruby on Rails, Productivity & Parenting #474 – DHH: Future of Programming, AI, Ruby on Rails, Productivity & Parenting

David Heinemeier Hansson (aka DHH) is a legendary programmer, creator of Ruby on Rails, co-owner & CTO of 37signals that created Basecamp, HEY, & ONCE, and is a NYT-best-selling author (with Jason Fried) of 4 books: REWORK, REMOTE, Getting Real, and It Doesn’t Have To Be Crazy At Work.

He is also a race car driver, including a class-winning performance at the 24 hour Le Mans race.

Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep474-scSee below for timestamps, transcript, and to give feedback, submit questions, contact Lex, etc.

Go to https://upliftdesk.com/lexLindy: No-code AI agent builder.

Go to https://go.lindy.ai/lexLMNT: Zero-sugar electrolyte drink …

2 months, 1 week назад @ lexfridman.com
#473 – Iran War Debate: Nuclear Weapons, Trump, Peace, Power & the Middle East
#473 – Iran War Debate: Nuclear Weapons, Trump, Peace, Power & the Middle East #473 – Iran War Debate: Nuclear Weapons, Trump, Peace, Power & the Middle East

Debate on Iran war between Scott Horton and Mark Dubowitz.

Scott Horton is the author and director of the Libertarian Institute, editorial director of Antiwar.com, host of The Scott Horton Show, and for the past three decades, a staunch critic of U.S. foreign policy and military interventionism.

Mark Dubowitz is the chief executive of the Foundation for Defense of Democracies, host of the Iran Breakdown podcast, and a leading expert on Iran and its nuclear program for over 20 years.

Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep473-scSee below for timestamps, transcript, and to give feedback, submit questions, contact Lex, etc.

Go to https://drinkLMNT.c…

2 months, 3 weeks назад @ lexfridman.com
#472 – Terence Tao: Hardest Problems in Mathematics, Physics & the Future of AI
#472 – Terence Tao: Hardest Problems in Mathematics, Physics & the Future of AI #472 – Terence Tao: Hardest Problems in Mathematics, Physics & the Future of AI

Terence Tao is widely considered to be one of the greatest mathematicians in history.

Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep472-scSee below for timestamps, transcript, and to give feedback, submit questions, contact Lex, etc.

Go to https://shopify.com/lexNetSuite: Business management software.

Go to http://netsuite.com/lexLMNT: Zero-sugar electrolyte drink mix.

Go to https://drinkLMNT.com/lexAG1: All-in-one daily nutrition drink.

3 months назад @ lexfridman.com
#471 – Sundar Pichai: CEO of Google and Alphabet
#471 – Sundar Pichai: CEO of Google and Alphabet #471 – Sundar Pichai: CEO of Google and Alphabet

Sundar Pichai is CEO of Google and Alphabet.

Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep471-sc

See below for timestamps, transcript, and to give feedback, submit questions, contact Lex, etc. Transcript:

https://lexfridman.com/sundar-pichai-transcript CONTACT LEX:

Feedback - give feedback to Lex: https://lexfridman.com/survey

AMA - submit questions, videos or call-in: https://lexfridman.com/ama

Hiring - join our team: https://lexfridman.com/hiring

Other - other ways to get in touch: https://lexfridman.com/contact EPISODE LINKS:

Sundar's X: https://x.com/sundarpichai

Sundar's Instagram: https://instagram.com/sundarpichai

Sundar's Blog: https://blog.goo…

3 months, 2 weeks назад @ lexfridman.com
#470 – James Holland: World War II, Hitler, Churchill, Stalin & Biggest Battles
#470 – James Holland: World War II, Hitler, Churchill, Stalin & Biggest Battles #470 – James Holland: World War II, Hitler, Churchill, Stalin & Biggest Battles

James Holland is a historian specializing in World War II.

He hosts a podcast called WW2 Pod: We Have Ways of Making You Talk.

Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep470-scSee below for timestamps, and to give feedback, submit questions, contact Lex, etc.

Go to https://shopify.com/lexLMNT: Zero-sugar electrolyte drink mix.

Go to https://drinkag1.com/lexNotion: Note-taking and team collaboration.

3 months, 3 weeks назад @ lexfridman.com
#469 – Oliver Anthony: Country Music, Blue-Collar America, Fame, Money, and Pain
#469 – Oliver Anthony: Country Music, Blue-Collar America, Fame, Money, and Pain #469 – Oliver Anthony: Country Music, Blue-Collar America, Fame, Money, and Pain

Oliver Anthony is singer-songwriter who first gained worldwide fame with his viral hit Rich Men North of Richmond.

He became a voice for many who are voiceless, with many of his songs speaking to the struggle of the working class in modern American life.

Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep469-scSee below for timestamps, transcript, and to give feedback, submit questions, contact Lex, etc.

Go to https://oracle.com/lexTax Network USA: Full-service tax firm.

Go to https://drinkLMNT.com/lexOUTLINE:(00:00) – Introduction(09:00) – Open mics(13:03) – Mainstream country music(22:10) – Fame(28:06) – Music vs politics(36:56) – Rich Men North of Richmon…

4 months назад @ lexfridman.com
#468 – Janna Levin: Black Holes, Wormholes, Aliens, Paradoxes & Extra Dimensions
#468 – Janna Levin: Black Holes, Wormholes, Aliens, Paradoxes & Extra Dimensions #468 – Janna Levin: Black Holes, Wormholes, Aliens, Paradoxes & Extra Dimensions

Janna Levin is a theoretical physicist and cosmologist specializing in black holes, cosmology of extra dimensions, topology of the universe, and gravitational waves.

Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep468-scSee below for timestamps, transcript, and to give feedback, submit questions, contact Lex, etc.

Go to https://brain.fm/lexBetterHelp: Online therapy and counseling.

Go to https://betterhelp.com/lexNetSuite: Business management software.

Go to https://shopify.com/lexAG1: All-in-one daily nutrition drink.

4 months, 2 weeks назад @ lexfridman.com
#467 – Tim Sweeney: Fortnite, Unreal Engine, and the Future of Gaming
#467 – Tim Sweeney: Fortnite, Unreal Engine, and the Future of Gaming #467 – Tim Sweeney: Fortnite, Unreal Engine, and the Future of Gaming

Tim Sweeney is a legendary video game programmer, founder and CEO of Epic Games that created the Unreal Engine, Fortnite, Gears of War, Unreal Tournament, and many other groundbreaking and influential video games.

Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep467-scSee below for timestamps, and to give feedback, submit questions, contact Lex, etc.

Go to https://notion.com/lexMasterClass: Online classes from world-class experts.

Go to https://shopify.com/lexAG1: All-in-one daily nutrition drink.

Go to https://drinkag1.com/lexLMNT: Zero-sugar electrolyte drink mix.

4 months, 2 weeks назад @ lexfridman.com
#466 – Jeffrey Wasserstrom: China, Xi Jinping, Trade War, Taiwan, Hong Kong, Mao
#466 – Jeffrey Wasserstrom: China, Xi Jinping, Trade War, Taiwan, Hong Kong, Mao #466 – Jeffrey Wasserstrom: China, Xi Jinping, Trade War, Taiwan, Hong Kong, Mao

Jeffrey Wasserstrom is a historian of modern China.

Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep466-scSee below for timestamps, transcript, and to give feedback, submit questions, contact Lex, etc.

Go to https://oracle.com/lexTax Network USA: Full-service tax firm.

Go to https://shopify.com/lexLMNT: Zero-sugar electrolyte drink mix.

Go to https://drinkLMNT.com/lexAG1: All-in-one daily nutrition drink.

4 months, 3 weeks назад @ lexfridman.com
Microsoft Research Podcast Microsoft Research Podcast
последний пост 3 weeks, 6 days назад
Coauthor roundtable: Reflecting on healthcare economics, biomedical research, and medical education
Coauthor roundtable: Reflecting on healthcare economics, biomedical research, and medical education Coauthor roundtable: Reflecting on healthcare economics, biomedical research, and medical education

KOHANE: So I think you’ve “nerd sniped” me because you [LAUGHTER]—which is all too easy—but I think there’s a central issue here.

But I actually think this is dark matter of human organizational technology that is not well understood.

AZEEM AZHAR: We didn’t talk about, you know, AI in its ability to potentially do this, which is to extend the clinician’s presence throughout the week.

And so I think there’s always going to be an opening for either differences of opinion or agreeing with you too much.

And this gets into whether AI is really going to get almost to the ab initio understanding of human biology.

3 weeks, 6 days назад @ microsoft.com
Reimagining healthcare delivery and public health with AI
Reimagining healthcare delivery and public health with AI Reimagining healthcare delivery and public health with AI

We are sorry, the page you requested cannot be found.

The page you are looking for could not be found or is no longer available.

1 month, 1 week назад @ microsoft.com
Navigating medical education in the era of generative AI
Navigating medical education in the era of generative AI Navigating medical education in the era of generative AI

Prior to med school, Daniel pursued experiences that cultivated his interest in the application of AI in medical practice and education.

Really, really looking forward to this chat.

There’s AI before ChatGPT and before, you know, generative AI really became a big thing, and then afterwards.

And then after we talk about what’s really happening, what do you think should happen in medical education given the reality of generative AI?

And I do agree [that] AI really gives us real hope that we can make it true.

1 month, 3 weeks назад @ microsoft.com
AI Testing and Evaluation: Reflections
AI Testing and Evaluation: Reflections AI Testing and Evaluation: Reflections

Our goal is to learn from their successes and their stumbles to move the science and practice of AI testing forward.

We have examples, like the pharmaceutical or medical device industry experts with whom you spoke, that’s really, you know, testing … there is a pre-deployment requirement.

And the third is just how rigid versus adaptive these testing and evaluation regimes or frameworks are in these different domains.

I really agree that there has been a lot of emphasis to date on, sort of, testing models upstream, the AI model evaluation.

You know, I think there’s been real progress already in the AI evaluation and testing ecosystem in the public-private partnership context.

1 month, 4 weeks назад @ microsoft.com
AI Testing and Evaluation: Learnings from cybersecurity
AI Testing and Evaluation: Learnings from cybersecurity AI Testing and Evaluation: Learnings from cybersecurity

Absolutely, I really, really was.

As a principal director on the Microsoft AI Red Team, Tori leads all AI security and safety red team operations, as well as dangerous capability testing, to directly inform C-suite decision-makers.

This year, we’ve pulled a lot of those assets and insights into the Azure [AI] Foundry AI Red Teaming Agent (opens in new tab).

So you can get a little taste of what we do day to day in the AI Red Teaming Agent.

WESTERHOFF: I think the most important takeaway from those lessons is that AI security is truly a team sport.

2 months назад @ microsoft.com
How AI will accelerate biomedical research and discovery
How AI will accelerate biomedical research and discovery How AI will accelerate biomedical research and discovery

Dr. Eric Topol is the executive vice president of the biomedical research non-profit Scripps Research, where he founded and now directs the Scripps Research Translational Institute.

Let’s continue our deep dive on AI and biomedical research with this conversation with Noubar Afeyan:LEE: Noubar, thanks so much for joining.

And there’s the origin story of contact with AI, you know, before the emergence of generative AI and afterwards.

What is going on today with respect to AI really being used for something meaningful in the design and development of drugs?

TOPOL: You would read about how, you know, data is the new oil and, you know, gold and whatnot.

2 months, 1 week назад @ microsoft.com
AI Testing and Evaluation: Learnings from pharmaceuticals and medical devices
AI Testing and Evaluation: Learnings from pharmaceuticals and medical devices AI Testing and Evaluation: Learnings from pharmaceuticals and medical devices

Our goal is to learn from their successes and their stumbles to move the science and practice of AI testing forward.

During the pre-market phase, medical testing establishes baseline safety and effectiveness metrics through bench testing, performance standards, and clinical studies.

SULLIVAN: So medical devices face a pretty prescriptive multi-level testing path before they hit the market.

We are looking into medical devices, as well, obviously, but also other technologies in advanced medical computing.

So we see Phase 3 trials as something that occurs in the medical devices and pharmaceuticals field.

2 months, 1 week назад @ microsoft.com
AI Testing and Evaluation: Learnings from genome editing
AI Testing and Evaluation: Learnings from genome editing AI Testing and Evaluation: Learnings from genome editing

As generative AI continues to advance, Microsoft has gathered a range of experts—from genome editing to cybersecurity—to share how their fields approach evaluation and risk assessment.

CHARO: Well, you know, genome editing is both very old and very new.

Now the earliest forms of genome editing were very inefficient, and so we didn’t worry that much.

But the bottom-line thing to remember, the way to really think about it is, we don’t regulate genome editing; we regulate the things that use genome editing.

And she said, you know, we don’t regulate genome editing; we regulate the things that use genome editing.

2 months, 2 weeks назад @ microsoft.com
AI Testing and Evaluation: Learnings from Science and Industry
AI Testing and Evaluation: Learnings from Science and Industry AI Testing and Evaluation: Learnings from Science and Industry

Our goal is to learn from their successes and their stumbles to move the science and practice of AI testing forward.

And I think, really, there are two reasons why tech is so, kind of, representative of that kind of challenge that I’ve always found fascinating.

Continues to be a really important topic in the AI policy conversation right now, I think, for really good reason.

Testing is an important component for governance and AI and, of course, in all of these other domains, as well.

I think about almost, like, in the near to mid-term, like three issues that we need to address in the AI, kind of, policy and testing context.

2 months, 3 weeks назад @ microsoft.com
The AI Revolution in Medicine, Revisited: How AI is reshaping the future of healthcare and medical research
The AI Revolution in Medicine, Revisited: How AI is reshaping the future of healthcare and medical research The AI Revolution in Medicine, Revisited: How AI is reshaping the future of healthcare and medical research

LEE: Yeah, yeah.

It cannot—as, you know, Bill was saying—it cannot learn from your document.

And I don’t know if the two of you remember, but I ended up doing a lot of tests.

I don’t know if you know, but just recently, there was a paper that was published on a scientific discovery using o3- mini (opens in new tab).

Like, if you have a human trained for one task and you put them into another task, then you don’t … you often don’t know.

3 months, 1 week назад @ microsoft.com
What AI's impact on individuals means for the health workforce and industry
What AI's impact on individuals means for the health workforce and industry What AI's impact on individuals means for the health workforce and industry

So I don’t think we should be surprised that business schools matter on this because we care about management.

That’s really going to change the way, like, middle school works, was my thinking at the time.

We’ve gone from AI being highly discriminative to AI that’s able to explore the world in particular ways.

The symptoms that they’re showing are quite different, and also their compliance is really, really different.

LEE: Yeah, really, really interesting.

3 months, 3 weeks назад @ microsoft.com
Abstracts: Zero-shot models in single-cell biology with Alex Lu
Abstracts: Zero-shot models in single-cell biology with Alex Lu Abstracts: Zero-shot models in single-cell biology with Alex Lu

And single-cell foundation models claim to be capable of unraveling deeper insights than ever before.

Basically, we showed that single-cell foundation models perform worse in settings that are fundamental to biological discovery than much simpler machine learning and statistical methods that were used in the field before single-cell foundation models emerged and are the go-to standard for unpacking meaning from these complicated experiments.

And the way to understand this is because single-cell foundation models are trained in a way that tries to expose these models to millions of single-cells.

But let’s also talk about the impact for methodologists, people who are trying to improve these s…

3 months, 4 weeks назад @ microsoft.com
Abstracts: Aurora with Megan Stanley and Wessel Bruinsma
Abstracts: Aurora with Megan Stanley and Wessel Bruinsma Abstracts: Aurora with Megan Stanley and Wessel Bruinsma

This is such exciting work about environmental forecasting, so we’re happy to have the two of you join us today.

Mostly because AI weather forecasting models are computationally much more efficient and can even be more accurate.

What’s unfortunate though, about this big step forward, is that these developments are mostly limited to the setting of weather forecasting.

Weather forecasting is very important, obviously, but there are many other important environmental forecasting problems out there, such as air pollution forecasting or ocean wave forecasting.

STANLEY: Current approaches have really focused training very specifically on weather forecasting models.

3 months, 4 weeks назад @ microsoft.com
Collaborators: Healthcare Innovation to Impact
Collaborators: Healthcare Innovation to Impact Collaborators: Healthcare Innovation to Impact

LUNGREN: And now it really feels like this collaborative effort, you know, really can help start to extend that mission.

I think, you know, Will and Smitha, that we definitely feel the passion and the innovation.

Again, you know, in text, you refer to that earlier and certainly off the shelf, there’s really powerful applications.

LUNGREN: So, I think AI has always been thought of as a savior kind of technology.

And I guess for my part, I think really what we’re going to see is a massive unleash of creativity.

4 months назад @ microsoft.com
Coauthor roundtable: Reflecting on real world of doctors, developers, patients, and policymakers
Coauthor roundtable: Reflecting on real world of doctors, developers, patients, and policymakers Coauthor roundtable: Reflecting on real world of doctors, developers, patients, and policymakers

LEE: Yeah, yeah.

LEE: Yeah, yeah.

LEE: Yeah, yeah.

[LAUGHS]GOLDBERG: Right, right, right, yeah.

Yeah, yeah.

4 months назад @ microsoft.com
NLP Highlights NLP Highlights
последний пост None
Data Skeptic
последний пост 1 week, 2 days назад
Why Am I Seeing This?
Why Am I Seeing This? Why Am I Seeing This?

In this episode of Data Skeptic, we explore the challenges of studying social media recommender systems when exposure data isn't accessible. Our guests Sabrina Guidotti, Gregor Donabauer, and Dimitri Ognibene introduce their innovative "recommender neutral user model" for inferring the influence of opaque algorithms.

1 week, 2 days назад @ dataskeptic.com
Eco-aware GNN Recommenders
Eco-aware GNN Recommenders Eco-aware GNN Recommenders

In this episode of Data Skeptic, we dive into eco-friendly AI with Antonio Purificato, a PhD student from Sapienza University of Rome. Antonio discusses his research on "EcoAware Graph Neural Networks for Sustainable Recommendations" and explores how we can measure and reduce the environmental impact of recommender systems without sacrificing performance.

2 weeks, 4 days назад @ dataskeptic.com
Networks and Recommender Systems
Networks and Recommender Systems Networks and Recommender Systems

Kyle reveals the next season's topic will be "Recommender Systems". Asaf shares insights on how network science contributes to the recommender system field.

1 month назад @ dataskeptic.com
Network of Past Guests Collaborations
Network of Past Guests Collaborations Network of Past Guests Collaborations

Kyle and Asaf discuss a project in which we link former guests of the podcast based on their co-authorship of academic papers.

1 month, 4 weeks назад @ dataskeptic.com
The Network Diversion Problem
The Network Diversion Problem The Network Diversion Problem

In this episode, Professor Pål Grønås Drange from the University of Bergen, introduces the field of Parameterized Complexity - a powerful framework for tackling hard computational problems by focusing on specific structural aspects of the input. This framework allows researchers to solve NP-complete problems more efficiently when certain parameters, like the structure of the graph, are "well-behaved". At the center of the discussion is the network diversion problem, where the goal isn’t to block all routes between two points in a network, but to force flow - such as traffic, electricity, or data - through a specific path. While this problem appears deceptively similar to the classic "Min.Cu…

2 months, 1 week назад @ dataskeptic.com
Complex Dynamic in Networks
Complex Dynamic in Networks Complex Dynamic in Networks

In this episode, we learn why simply analyzing the structure of a network is not enough, and how the dynamics - the actual mechanisms of interaction between components - can drastically change how information or influence spreads. Our guest, Professor Baruch Barzel of Bar-Ilan University, is a leading researcher in network dynamics and complex systems ranging from biology to infrastructure and beyond. BarzelLab BarzelLab on Youtube Paper in focus: Universality in network dynamics, 2013

2 months, 3 weeks назад @ dataskeptic.com
Github Network Analysis
Github Network Analysis Github Network Analysis 2 months, 3 weeks назад @ dataskeptic.com
Networks and Complexity
Networks and Complexity Networks and Complexity

In this episode, Kyle does an overview of the intersection of graph theory and computational complexity theory. In complexity theory, we are about the runtime of an algorithm based on its input size. For many graph problems, the interesting questions we want to ask take longer and longer to answer! This episode provides the fundamental vocabulary and signposts along the path of exploring the intersection of graph theory and computational complexity theory.

3 months назад @ dataskeptic.com
Actantial Networks
Actantial Networks Actantial Networks

In this episode, listeners will learn about Actantial Networks—graph-based representations of narratives where nodes are actors (such as people, institutions, or abstract entities) and edges represent the actions or relationships between them. The one who will present these networks is our guest Armin Pournaki, a joint PhD candidate at the Max Planck Institute and Sciences, who specializes in computational social science, where he develops methods to extract and analyze political narratives using natural language processing and network science. Armin explains how these methods can expose conflicting narratives around the same events, as seen in debates on COVID-19, climate change, or the wa…

3 months, 2 weeks назад @ dataskeptic.com
Graphs for Causal AI
Graphs for Causal AI Graphs for Causal AI

How to build artificial intelligence systems that understand cause and effect, moving beyond simple correlations? As we all know, correlation is not causation. "Spurious correlations" can show, for example, how rising ice cream sales might statistically link to more drownings, not because one causes the other, but due to an unobserved common cause like warm weather. Our guest, Utkarshani Jaimini, a researcher from the University of South Carolina's Artificial Intelligence Institute, tries to tackle this problem by using knowledge graphs that incorporate domain expertise. Knowledge graphs (structured representations of information) are combined with neural networks in the field of neurosymbo…

3 months, 3 weeks назад @ dataskeptic.com
Power Networks
Power Networks Power Networks 4 months назад @ dataskeptic.com
Unveiling Graph Datasets
Unveiling Graph Datasets Unveiling Graph Datasets 4 months, 1 week назад @ dataskeptic.com
Network Manipulation
Network Manipulation Network Manipulation

In this episode we talk with Manita Pote, a PhD student at Indiana University Bloomington, specializing in online trust and safety, with a focus on detecting coordinated manipulation campaigns on social media. Key insights include how coordinated reply attacks target influential figures like journalists and politicians, how machine learning models can detect these inauthentic campaigns using structural and behavioral features, and how deletion patterns reveal efforts to evade moderation or manipulate engagement metrics. Follow our guest X/Twitter Google Scholar Papers in focus Coordinated Reply Attacks in Influence Operations: Characterization and Detection ,2025 Manipulating Twitter throug…

4 months, 3 weeks назад @ dataskeptic.com
The Small World Hypothesis
The Small World Hypothesis The Small World Hypothesis

Kyle discusses the history and proof for the small world hypothesis.

4 months, 4 weeks назад @ dataskeptic.com
Thinking in Networks
Thinking in Networks Thinking in Networks

Kyle asks Asaf questions about the new network science course he is now teaching. The conversation delves into topics such as contact tracing, tools for analyzing networks, example use cases, and the importance of thinking in networks.

5 months, 1 week назад @ dataskeptic.com
SuperDataScience SuperDataScience
последний пост 1 day, 14 hours назад
923: Graph Algorithms, GraphRAG and Causal Graphs, with Graph Guru Amy Hodler
923: Graph Algorithms, GraphRAG and Causal Graphs, with Graph Guru Amy Hodler 923: Graph Algorithms, GraphRAG and Causal Graphs, with Graph Guru Amy Hodler

Graphs, but not as you would expect them: Graph analytics guru Amy Hodler speaks to Jon Krohn about the graph data structure and graph applications, graph algorithms, graph RAG, and graphs as memory systems for AI agents. We can use graphs in a surprising number of ways. Money laundering and fraud, as well as supply-chain crime, leave breadcrumbs at multiple “touch-points” over time, behaviors that graphs are better suited to reveal than rows and tables. Amy sees that most interest in graphs has been in the cybersecurity space. But this work isn’t only restricted to fighting crime! Listen to the episode to hear more case examples and how to get into graph work. This episode is brought to yo…

1 day, 14 hours назад @ podtrac.com
922: AI for Manufacturing and Industry, with Hugo Dozois-Caouette
922: AI for Manufacturing and Industry, with Hugo Dozois-Caouette 922: AI for Manufacturing and Industry, with Hugo Dozois-Caouette

Hugo Dozois-Caouette speaks to Jon Krohn about his startup MaintainX and how he secured $100 million in venture capital. MaintainX manages and maintains computerized maintenance management systems (CMSs), or work-execution software, for the industrial and manufacturing industries. This “digitized version of a clipboard” with the help of web and mobile applications, provide a list of procedures, guidelines and regulations to help increase worker productivity and give a company the data-driven insights it needs to refine its processes. Listen to the episode to hear Hugo’s thoughts on the gaps in the manufacturing industry that technology can fill, the tech stack used by MaintainX, and the dis…

5 days, 14 hours назад @ podtrac.com
921: NPUs vs GPUs vs CPUs for Local AI Workloads, with Dell’s Ish Shah and Shirish Gupta
921: NPUs vs GPUs vs CPUs for Local AI Workloads, with Dell’s Ish Shah and Shirish Gupta 921: NPUs vs GPUs vs CPUs for Local AI Workloads, with Dell’s Ish Shah and Shirish Gupta

Using Windows for AI development and the bleeding edge of NPUs: Shirish Gupta and Ish Shah from Dell Technologies speak to Jon Krohn about the latest products from Dell, the future of neural-processing units (NPUs), and how AI developers can make sound hardware investments. This episode is brought to you by the Trainium2, the latest AI chip from AWS, by ODSC, the Open Data Science Conference and by Gurobi. Additional materials: ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠www.superdatascience.com/921⁠⁠⁠⁠⁠ Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (04:18) Why Windows still outranks other operating systems (20…

1 week, 1 day назад @ podtrac.com
920: In Case You Missed It in August 2025
920: In Case You Missed It in August 2025 920: In Case You Missed It in August 2025

This month’s episode of In Case You Missed It gives us reasons to be cautiously optimistic about the future of large language models (LLMs), with guests discussing what to do about recent reports that found AI agents blackmailed human users when threatened, the importance of post-training LLMs, and the training we have available for data and AI engineers to create robust, secure, and useful AI. Jon Krohn includes clips from his interviews with Akshay Agrawal (Episode 911), Julien Launay (Episode 913), Michelle Yi (Episode 915), and Kirill Eremenko (Episode 917). Additional materials: ⁠⁠⁠⁠⁠www.superdatascience.com/920⁠ Interested in sponsoring a SuperDataScience Podcast episode? Email natali…

1 week, 5 days назад @ podtrac.com
919: Hopes and Fears of AGI, with All-Time Bestselling ML Author Aurélien Géron
919: Hopes and Fears of AGI, with All-Time Bestselling ML Author Aurélien Géron 919: Hopes and Fears of AGI, with All-Time Bestselling ML Author Aurélien Géron

PyTorch, AGI, and the future of alignment research: Aurélien Géron joins Jon Krohn in this live interview to talk about the fourth edition of his bestselling Hands-On Machine Learning as well as what superintelligence makes him hopeful for, as well as what concerns him about machines surpassing human intelligence. This episode is brought to you by Gurobi and by the Dell AI Factory with NVIDIA⁠ Additional materials: ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠www.superdatascience.com/919⁠⁠⁠⁠⁠⁠⁠ Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (02:04) Why Aurélien wrote Hands-On Machine Learning (20:54) How Aurélien …

2 weeks, 1 day назад @ podtrac.com
918: Multi-Agent Systems with CrewAI
918: Multi-Agent Systems with CrewAI 918: Multi-Agent Systems with CrewAI

In this Five-Minute Friday, Jon Krohn introduces listeners to CrewAI, an open-source Python framework that can create and manage multi-agent teams. The clue is in the title: CrewAI assembles specialized agents into single “crews” that achieve complex goals between them. CrewAI’s agent teams can also learn and iterate, meaning that after the crew has achieved its goals for the first time, they can refine and tailor their approach to future goals. Additional materials: ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠www.superdatascience.com/918⁠⁠⁠⁠ Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.

2 weeks, 5 days назад @ podtrac.com
917: 8 Steps to Becoming an AI Engineer, with Kirill Eremenko
917: 8 Steps to Becoming an AI Engineer, with Kirill Eremenko 917: 8 Steps to Becoming an AI Engineer, with Kirill Eremenko

Founder of SuperDataScience, Kirill Eremenko, talks to Jon Krohn about how he found the best tools and approaches to help launch his 8-week AI engineering bootcamp. He breaks down the topics participants cover each week, and he also shares his tips with listeners who might want to start their own tech bootcamp or sign up for SuperDataScience’s September 2025 cohort. This episode is brought to you by the Dell AI Factory with NVIDIA and by ODSC, the Open Data Science Conference Additional materials: ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠www.superdatascience.com/917⁠⁠⁠⁠⁠⁠ Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will…

3 weeks, 1 day назад @ podtrac.com
916: The 5 Key GPT-5 Takeaways
916: The 5 Key GPT-5 Takeaways 916: The 5 Key GPT-5 Takeaways

GPT-5 has just been released, but with not very much fanfare. In this Five-Minute Friday, Jon Krohn asks if GPT-5 deserves the community’s underwhelmed response to its release. He outlines five features of the model and explains why people might be feeling less than enthusiastic in the broader context of LLM development. Which LLMs are leading the way, and which are still playing the game of catch-up? Additional materials: ⁠⁠⁠⁠⁠⁠⁠⁠⁠www.superdatascience.com/916⁠⁠⁠ Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.

3 weeks, 5 days назад @ podtrac.com
915: How to Jailbreak LLMs (and How to Prevent It), with Michelle Yi
915: How to Jailbreak LLMs (and How to Prevent It), with Michelle Yi 915: How to Jailbreak LLMs (and How to Prevent It), with Michelle Yi

Tech leader, investor, and Generationship cofounder Michelle Yi talks to Jon Krohn about finding ways to trust and secure AI systems, the methods that hackers use to jailbreak code, and what users can do to build their own trustworthy AI systems. Learn all about “red teaming” and how tech teams can handle other key technical terms like data poisoning, prompt stealing, jailbreaking and slop squatting. This episode is brought to you by ⁠Trainium2, the latest AI chip from AWS⁠ and by the ⁠Dell AI Factory with NVIDIA⁠. Additional materials: ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠www.superdatascience.com/915⁠⁠⁠⁠⁠ Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship…

4 weeks, 1 day назад @ podtrac.com
914: Data Lakes 101 (and Why They’re Key for AI Models), with Oz Katz
914: Data Lakes 101 (and Why They’re Key for AI Models), with Oz Katz 914: Data Lakes 101 (and Why They’re Key for AI Models), with Oz Katz

In this Five-Minute Friday, Cofounder and CTO of lakeFS Oz Katz talks to Jon Krohn about data warehouses, data lakes, and how companies can handle increasingly complex data infrastructures and formats. Hear about lakeFS’s collaboration with Legofest, lakeFS’s approach to helping users collaborate on data lakes, and how to overcome the challenges of working with multimodal data. Additional materials: ⁠www.superdatascience.com/914⁠ This episode is brought to you by the ⁠Dell AI Factory with NVIDIA⁠.

1 month назад @ podtrac.com
913: LLM Pre-Training and Post-Training 101, with Julien Launay
913: LLM Pre-Training and Post-Training 101, with Julien Launay 913: LLM Pre-Training and Post-Training 101, with Julien Launay

Julien Launay launched Adaptive to give data science teams in business enterprises their “RLOps tooling” to make reinforcement learning easier. Talking to Jon Krohn, Julien says, “Most of our users are data scientists who write Python codes to interface with the system”. Adaptive is also able to work with companies without data science teams, collaborating with partners like Deloitte to add the necessary personnel. Julien is currently working on making his platform more widely available. Additional materials: ⁠⁠⁠⁠⁠⁠⁠⁠⁠www.superdatascience.com/913⁠⁠⁠ Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.

1 month назад @ podtrac.com
912: In Case You Missed It in July 2025
912: In Case You Missed It in July 2025 912: In Case You Missed It in July 2025

In this episode of In Case You Missed It, we look back on five great interview episodes from July. Hear from Lilith Bat-Leah (Episode 901), Sinan Ozdemir (Episode 903), Sebastian Gehrmann (Episode 905), Zohar Bronfman (Episode 907) and Robert Ness (Episode 909). They’ll tell you why data-centric machine learning is so important across disciplines, starting with law, and how we can use AI benchmarks and “red teaming” to refine our search for the best AI models. Additional materials: ⁠⁠⁠⁠www.superdatascience.com/912 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.

1 month, 1 week назад @ podtrac.com
911: The Future of Python Notebooks is Here, with Marimo’s Dr. Akshay Agrawal
911: The Future of Python Notebooks is Here, with Marimo’s Dr. Akshay Agrawal 911: The Future of Python Notebooks is Here, with Marimo’s Dr. Akshay Agrawal

Reproducibility, Python notebooks, and data science communities: Software developer Akshay Agrawal speaks to Jon Krohn about Marimo, the next-generation computational notebook for Python, how he built and fostered a thriving community around the product, and what makes this notebook so versatile and accessible for users. Additional materials: ⁠⁠⁠⁠⁠⁠⁠⁠⁠www.superdatascience.com/911⁠⁠⁠⁠⁠ This episode is brought to you by ⁠Trainium2, the latest AI chip from AWS ⁠and by the ⁠Dell AI Factory with NVIDIA⁠. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.

1 month, 1 week назад @ podtrac.com
910: AI is Disrupting Journalism: The Good, The Bad and The Opportunity
910: AI is Disrupting Journalism: The Good, The Bad and The Opportunity 910: AI is Disrupting Journalism: The Good, The Bad and The Opportunity

In this Five-Minute Friday, Jon Krohn looks into AI’s disruption of the journalism industry and how it has fundamentally reshaped news production. Multiple news outlets’ suing of ChatGPT over its use of copyrighted materials may have taken the most headlines to date, but this isn’t to say news media is rebuffing AI entirely. On the contrary, several outlets have launched summarization and analysis tools for both internal and external use, such as The New York Times’s Echo and The Washington Post’s Haystacker. This episode looks into the ways major news outlets are utilising AI, and what this means for journalists. Additional materials: ⁠⁠⁠⁠⁠⁠⁠⁠www.superdatascience.com/910⁠⁠ Interested in sp…

1 month, 2 weeks назад @ podtrac.com
909: Causal AI, with Dr. Robert Usazuwa Ness
909: Causal AI, with Dr. Robert Usazuwa Ness 909: Causal AI, with Dr. Robert Usazuwa Ness

Researcher at Microsoft Robert Usazuwa Ness talks to Jon Krohn about how to achieve causality in AI with correlation-based learning, the right libraries, and handling statistical inference. When dealing with causal AI, Robert notes how important it is to keep aware of variables in the data that may mislead us and force inaccurate assumptions. Not all variables will be useful. It is essential, then, that any assumptions are grounded in a deeper understanding of how the data were gathered, and not what appears in the dataset. Listen to the episode to hear how you can apply causal AI to your projects. Additional materials: ⁠⁠⁠⁠⁠⁠⁠⁠www.superdatascience.com/907⁠⁠⁠⁠ This episode is brought to you…

1 month, 2 weeks назад @ podtrac.com
Data Science at Home Data Science at Home
последний пост 14 часов назад
Why VCs Are Funding $100M Remote Control Toys (Ep. 290)
Why VCs Are Funding $100M Remote Control Toys (Ep. 290) Why VCs Are Funding $100M Remote Control Toys (Ep. 290)

ReferencesWar On The Rocks: https://warontherocks.com/2025/08/ukraine-isnt-the-model-for-winning-the-innovation-war/LinkedIn: https://www.linkedin.com/in/jonasrsinger/Spotify: https://tr.ee/Omy_1X8k1UApple Podcast: https://podcasts.apple.com/us/podcast/defence-innovation-podcast/id1797131332YouTube: https://youtube.com/@DefenceInnovationpodcast?si=cu2WlnVgL5XKnM0pSponsorsThis episode is proudly sponsored by Amethix Technologies.

At the intersection of ethics and engineering, Amethix creates AI systems that don’t just function—they adapt, learn, and serve.

Discover more at https://amethix.comThis episode is brought to you by Intrepid AI.

From drones to satellites, Intrepid AI gives engineers…

14 часов назад @ datascienceathome.com
How Hacker Culture Died (Ep. 289)
How Hacker Culture Died (Ep. 289) How Hacker Culture Died (Ep. 289)

At the intersection of ethics and engineering, Amethix creates AI systems that don’t just function—they adapt, learn, and serve.

Discover more at amethix.comDSH is brought to you by Intrepid AI.

🐦 Twitter: @DataScienceAtHome📘 LinkedIn: https://www.linkedin.com/in/fragadaleta/Instagram: https://www.instagram.com/datascienceathome/Facebook: https://www.facebook.com/datascienceAHLinkedIn: https://www.linkedin.com/company/data-science-at-home-podcastDiscord Channel: https://discord.gg/4UNKGf3NEW TO DATA SCIENCE AT HOME?

Data Science at Home explores the latest in AI, data science, and machine learning.

Send us mail at:[email protected]’t forget to like, subscribe, and hit the 🔔 for…

2 weeks, 4 days назад @ datascienceathome.com
Robots Suck (But It’s Not Their Fault) (Ep. 288)
Robots Suck (But It’s Not Their Fault) (Ep. 288) Robots Suck (But It’s Not Their Fault) (Ep. 288)

At the intersection of ethics and engineering, Amethix creates AI systems that don’t just function—they adapt, learn, and serve.

Discover more at amethix.comDSH is brought to you by Intrepid AI.

🐦 Twitter: @DataScienceAtHome📘 LinkedIn: https://www.linkedin.com/in/fragadaleta/Instagram: https://www.instagram.com/datascienceathome/Facebook: https://www.facebook.com/datascienceAHLinkedIn: https://www.linkedin.com/company/data-science-at-home-podcastDiscord Channel: https://discord.gg/4UNKGf3NEW TO DATA SCIENCE AT HOME?

Data Science at Home explores the latest in AI, data science, and machine learning.

Send us mail at:[email protected]’t forget to like, subscribe, and hit the 🔔 for…

1 month, 1 week назад @ datascienceathome.com
Your Favorite AI Startup is Probably Bullshit (Ep. 287)
Your Favorite AI Startup is Probably Bullshit (Ep. 287) Your Favorite AI Startup is Probably Bullshit (Ep. 287)

The brutal truth about why Silicon Valley is blowing billions on glorified autocomplete while pretending it’s the next iPhone.

We’re diving deep into the AI investment circus where VCs who can’t code are funding companies that barely understand their own technology.

From blockchain déjà vu to the “ChatGPT wrapper” economy—this episode will make you question every AI valuation you’ve ever seen.

Fair warning: We’re naming names and calling out the hype.

Don’t listen if you work at a “revolutionary AI startup” that’s just OpenAI’s API with a pretty interface.

1 month, 2 weeks назад @ datascienceathome.com
Tech’s Dumbest Mistake: Why Firing Programmers for AI Will Destroy Everything (Ep. 286) [RB]
Tech’s Dumbest Mistake: Why Firing Programmers for AI Will Destroy Everything (Ep. 286) [RB] Tech’s Dumbest Mistake: Why Firing Programmers for AI Will Destroy Everything (Ep. 286) [RB]

From the viral article “Tech’s Dumbest Mistake: Why Firing Programmers for AI Will Destroy Everything” on my newsletter at https://defragzone.substack.com/p/techs-dumbest-mistake-why-firinghere are my thoughts about AI replacing programmers…🎙️ Sponsors AGNTCY — The open source collective building the Internet of Agents🌐 https://www.agntcy.org✨ Connect with us!

🐦 Twitter: @DataScienceAtHome📘 LinkedIn: https://www.linkedin.com/in/fragadaleta/Instagram: https://www.instagram.com/datascienceathome/Facebook: https://www.facebook.com/datascienceAHLinkedIn: https://www.linkedin.com/company/data-science-at-home-podcastDiscord Channel: https://discord.gg/4UNKGf3NEW TO DATA SCIENCE AT HOME?

Data Scie…

2 months, 1 week назад @ datascienceathome.com
Brains in the Machine: The Rise of Neuromorphic Computing (Ep. 285)
Brains in the Machine: The Rise of Neuromorphic Computing (Ep. 285) Brains in the Machine: The Rise of Neuromorphic Computing (Ep. 285)

In this episode of Data Science at Home, we explore the fascinating world of neuromorphic computing — a brain-inspired approach to computation that could reshape the future of AI and robotics.

The episode breaks down how neuromorphic systems differ from conventional AI architectures like transformers and LLMs, diving into spiking neural networks (SNNs), their benefits in energy efficiency and real-time processing, and their limitations in training and scalability.

Real-world applications are highlighted, including low-power drones, hearing aids, and event-based cameras.

Francesco closes with a vision of hybrid systems where neuromorphic chips and LLMs coexist, blending biological inspiratio…

3 months назад @ datascienceathome.com
DSH/Warcoded – AI in the Invisible Battlespace (Ep. 284)
DSH/Warcoded – AI in the Invisible Battlespace (Ep. 284) DSH/Warcoded – AI in the Invisible Battlespace (Ep. 284)

This episode explores the invisible battlespace of cyber and electronic warfare, where AI takes center stage.

SponsorsBuilding multi-agent software is hard — agent-to-agent and agent-to-tool communication is still the wild west.

At the intersection of ethics and engineering, Amethix creates AI systems that don’t just function—they adapt, learn, and serve.

Discover more at amethix.comWarcoded is brought to you by Intrepid AI.

From drones to satellites, Intrepid AI gives engineers and defense innovators the tools to prototype, simulate, and deploy autonomous systems with confidence.

3 months, 2 weeks назад @ datascienceathome.com
DSH/Warcoded Swarming the Battlefield (Ep. 283)
DSH/Warcoded Swarming the Battlefield (Ep. 283) DSH/Warcoded Swarming the Battlefield (Ep. 283)

Swarming the Battlefield explores how artificial intelligence is revolutionizing combat through coordinated drone swarms.

This episode uncovers how these intelligent agents turn the chaos of the battlefield into a synchronized dance of machine warfare.

At the intersection of ethics and engineering, Amethix creates AI systems that don’t just function—they adapt, learn, and serve.

Discover more at amethix.comWarcoded is brought to you by Intrepid AI.

From drones to satellites, Intrepid AI gives engineers and defense innovators the tools to prototype, simulate, and deploy autonomous systems with confidence.

3 months, 3 weeks назад @ datascienceathome.com
DSH/Warcoded Kill Chains and Algorithmic Warfare – Autonomy in Targeting and Engagement (Ep. 282)
DSH/Warcoded Kill Chains and Algorithmic Warfare – Autonomy in Targeting and Engagement (Ep. 282) DSH/Warcoded Kill Chains and Algorithmic Warfare – Autonomy in Targeting and Engagement (Ep. 282)

In this gripping follow-up, we dive into how AI is transforming kinetic operations—from identifying a threat to executing a strike.

At the intersection of ethics and engineering, Amethix creates AI systems that don’t just function—they adapt, learn, and serve.

Discover more at amethix.comWarcoded is brought to you by Intrepid AI.

From drones to satellites, Intrepid AI gives engineers and defense innovators the tools to prototype, simulate, and deploy autonomous systems with confidence.

Whether it’s in the sky, on the ground, or in orbit—if it’s intelligent and mobile, Intrepid helps you build it.

4 months назад @ datascienceathome.com
DSH/Warcoded: Eyes and Ears of the Machine – AI Reconnaissance and Surveillance (Ep. 281)
DSH/Warcoded: Eyes and Ears of the Machine – AI Reconnaissance and Surveillance (Ep. 281) DSH/Warcoded: Eyes and Ears of the Machine – AI Reconnaissance and Surveillance (Ep. 281)

Welcome to DSH/WarcodedWe explore how AI is transforming ISR (Intelligence, Surveillance, Reconnaissance)—from satellite imagery to drone feeds.

At the intersection of ethics and engineering, Amethix creates AI systems that don’t just function—they adapt, learn, and serve.

Discover more at amethix.com.”Warcoded is brought to you by Intrepid AI.

From drones to satellites, Intrepid AI gives engineers and defense innovators the tools to prototype, simulate, and deploy autonomous systems with confidence.

Learn more at intrepid.ai.”#AI #defensetech #ISR #LLM #Warcoded #DataScienceAtHome #OSINT #SIGINT #dronewarfare

4 months, 1 week назад @ datascienceathome.com
AI Agents with Atomic Agents 🚀 with Kenny Vaneetvelde (Ep. 280)
AI Agents with Atomic Agents 🚀 with Kenny Vaneetvelde (Ep. 280) AI Agents with Atomic Agents 🚀 with Kenny Vaneetvelde (Ep. 280)

🎙️ In this episode of Data Science at Home, we sit down with Kenny Vaneetvelde, the mastermind behind Atomic Agents, a groundbreaking framework redefining AI development.

🔍 Discover how atomicity simplifies complex AI systems, why modularity matters more than ever, and how Atomic Agents is eliminating hidden assumptions and redundant complexity in AI workflows.

💡 From real-world applications to the tech stack behind the framework, Kenny takes us on a deep dive into this lightweight, powerful tool for creating consistent and brand-aligned AI.

📌 Timestamps:0:00 – Intro2:30 – Kenny’s journey in AI5:00 – What are Atomic Agents?

10:45 – Why atomicity matters in AI18:20 – The tech behind Atomic A…

5 months, 1 week назад @ datascienceathome.com
Run massive models on crappy machines (Ep. 279)
Run massive models on crappy machines (Ep. 279) Run massive models on crappy machines (Ep. 279)

This episode explores how to break down barriers by running massive AI models on “crappy machines”—affordable, low-spec devices.

🐦 Twitter: @DataScienceAtHome📘 LinkedIn: https://www.linkedin.com/in/fragadaleta/Instagram: https://www.instagram.com/datascienceathome/Facebook: https://www.facebook.com/datascienceAHLinkedIn: https://www.linkedin.com/company/data-science-at-home-podcastDiscord Channel: https://discord.gg/4UNKGf3NEW TO DATA SCIENCE AT HOME?

Data Science at Home explores the latest in AI, data science, and machine learning.

Whether you’re a data professional, tech enthusiast, or just curious about the field, our podcast delivers insights, interviews, and discussions.

Send us mail …

5 months, 2 weeks назад @ datascienceathome.com
WeightWatcher: The AI Detective for LLMs (DeepSeek & OpenAI included) (Ep. 278)
WeightWatcher: The AI Detective for LLMs (DeepSeek & OpenAI included) (Ep. 278) WeightWatcher: The AI Detective for LLMs (DeepSeek & OpenAI included) (Ep. 278)

Enter WeightWatcher—the AI detective tool that peeks inside neural networks without needing their data.

🐦 Twitter: @DataScienceAtHome📘 LinkedIn: https://www.linkedin.com/in/fragadaleta/Instagram: https://www.instagram.com/datascienceathome/Facebook: https://www.facebook.com/datascienceAHLinkedIn: https://www.linkedin.com/company/data-science-at-home-podcastDiscord Channel: https://discord.gg/4UNKGf3NEW TO DATA SCIENCE AT HOME?

Data Science at Home explores the latest in AI, data science, and machine learning.

Whether you’re a data professional, tech enthusiast, or just curious about the field, our podcast delivers insights, interviews, and discussions.

Send us mail at:hello@datascienceathom…

5 months, 2 weeks назад @ datascienceathome.com
Tech’s Dumbest Mistake: Why Firing Programmers for AI Will Destroy Everything (Ep. 278)
Tech’s Dumbest Mistake: Why Firing Programmers for AI Will Destroy Everything (Ep. 278) Tech’s Dumbest Mistake: Why Firing Programmers for AI Will Destroy Everything (Ep. 278)

From the viral article “Tech’s Dumbest Mistake: Why Firing Programmers for AI Will Destroy Everything” on my newsletter at https://defragzone.substack.com/p/techs-dumbest-mistake-why-firinghere are my thoughts about AI replacing programmers…✨ Connect with us!

🐦 Twitter: @DataScienceAtHome📘 LinkedIn: https://www.linkedin.com/in/fragadaleta/Instagram: https://www.instagram.com/datascienceathome/Facebook: https://www.facebook.com/datascienceAHLinkedIn: https://www.linkedin.com/company/data-science-at-home-podcastDiscord Channel: https://discord.gg/4UNKGf3NEW TO DATA SCIENCE AT HOME?

Data Science at Home explores the latest in AI, data science, and machine learning.

Whether you’re a data profes…

5 months, 3 weeks назад @ datascienceathome.com
Scaling Smart: AI, Data, and Building Future-Ready Enterprises with Josh Miramant (Ep. 276)
Scaling Smart: AI, Data, and Building Future-Ready Enterprises with Josh Miramant (Ep. 276) Scaling Smart: AI, Data, and Building Future-Ready Enterprises with Josh Miramant (Ep. 276)

In this episode, we dive into the transformative world of AI, data analytics, and cloud infrastructure with Josh Miramant, CEO of Blue Orange Digital.

As a seasoned entrepreneur with over $25 million raised across ventures and two successful exits, Josh shares invaluable insights on scaling data-driven businesses, integrating machine learning frameworks, and navigating the rapidly evolving landscape of cloud data architecture.

From generative AI to large language models, Josh explores cutting-edge trends shaping financial services, real estate, and consumer goods.

Tune in for a masterclass in leveraging data for impact and innovation!

Linkshttps://blueorange.digital/https://blueorange.digit…

8 months, 4 weeks назад @ datascienceathome.com