Very ML
State-of-the-art Machine Learning News Feed
/r/MachineLearning
последний пост 34 минуты назад
[R] DeepMind’s Bootstrapped Meta-Learning Enables Meta Learners to Teach Themselves
[R] DeepMind’s Bootstrapped Meta-Learning Enables Meta Learners to Teach Themselves

A research team from DeepMind proposes a bootstrapped meta-learning algorithm that overcomes the meta-optimization problem and myopic meta objectives, and enables the meta-learner to teach itself. Here is a quick read: DeepMind’s Bootstrapped Meta-Learning Enables Meta Learners to Teach Themselves. The paper Bootstrapped Meta-Learning is on arXiv. submitted by /u/Yuqing7 [link] [comments]

34 минуты назад @ reddit.com
[P] I published a tutorial where I teach how to use the Liskov Substitution Principle to write cleaner ML code
[P] I published a tutorial where I teach how to use the Liskov Substitution Principle to write cleaner ML code

In our quest for writing cleaner Machine Learning code using SOLID principles, today I publish a video on the Liskov Substitution Principle. This principle urges programmers to write class hierarchies which adhere to the same interface(s). If you follow the Liskov Substitution Principle, your client code can use any subtype of a given class interchangeably. As a result, the code will be more decoupled. And your colleagues and your future self will thank you for that :) Would you like to know more about Liskov substitution? How to spot its violations? How to refactor your ML code to adhere to it? Then, check out the video below! It costs nothing ;) https://www.youtube.com/watch?v=iWWIa2f6qcg…

2 часа назад @ reddit.com
[D] Paper on disentanglement of image representations and they are limitations and weaknesses?
[D] Paper on disentanglement of image representations and they are limitations and weaknesses?

I'm working on the disentanglement of image representations in generative models, but one thing I realize is that finding correct papers to look at is not trivial since many works under different titles could be classified as disentanglement such as gans, 2D-3D transformation, VAEs, image-to-image translation. Does anyone have some recommendations? submitted by /u/ThresholdTuner [link] [comments]

2 часа назад @ reddit.com
[D] Significant differences in Training on 1 GPU vs 8GPU ?
[D] Significant differences in Training on 1 GPU vs 8GPU ?

I am looking at the official code for MOCO-v2 linear evaluation here: https://github.com/facebookresearch/mocoThe command to execute suggests running a 256 sized batch across 8 GPUs for a Resnet-50. Do keep in mind that for the linear evaluation the model is frozen till the end and only the final fully connected layer is fine-tuned. Inside the code:parser.add_argument('-b', '--batch-size', default=256, type=int, metavar='N', help='mini-batch size (default: 256), this is the total batch size of all GPUs on the current node when using Data Parallel or Distributed Data Parallel') This training can be done on a single GPU(16GB), so what is the reason to use 8 to do the same task? ​ **EDIT** The…

2 часа назад @ reddit.com
[P] [R] I want to introduce the C++ DataFrame
[P] [R] I want to introduce the C++ DataFrame

C++ DataFrame https://github.com/hosseinmoein/DataFrame for large in-memory data analysis with all the C++ efficiency and scalability submitted by /u/hmoein [link] [comments]

3 часа назад @ reddit.com
[D] How can I download ImageNet dataset with only 20 or 30 classes?
[D] How can I download ImageNet dataset with only 20 or 30 classes?

I don't have powerful GPU to work on ImageNet dataset. I want to work on some classes of ImageNet in PyTorch. submitted by /u/SAbdusSamad [link] [comments]

4 часа назад @ reddit.com
[R] Time Series Analysis using LSTM
[R] Time Series Analysis using LSTM

So I am training an LSTM model with a daily rainfall dataset to see whether it can predict the next-day rainfall. Now the problem with my dataset is that it has missing values. Data for many consecutive days are missing. How should I handle missing data? Should I drop out of the missing records or fill out the missing data by analyzing the values around its nearby days? And for 1-day forecasting, would dropping out missing records lead to my model predicting wrong results? Also, I have seen many youtube tutorials taking multiple data as input to forecast the next day. For example, they would train the LSTM model by taking 5 hours of rainfall to predict the rainfall in the next hour. Since I…

5 часов назад @ reddit.com
[D] Are transformer-type models ready to replace MLP as the default general purpose model?
[D] Are transformer-type models ready to replace MLP as the default general purpose model?

For the general continuous case of RN --> RM data with unknown structure, deep ReLU MLPs have been the go-to model for the last 10 years or so. Before that it was SVMs and GMMs. But recently transformer-type models such as DeepMind's Perceivers have had some good results while basically plug-n-play. These models are more expressive than MLPs, but I understand they can be harder to train. What do you think, are they ready for primetime or are more breakthroughs needed? submitted by /u/svantana [link] [comments]

5 часов назад @ reddit.com
[D] How can we handle Out of Scope in Intent Detection problem on a small dataset?
[D] How can we handle Out of Scope in Intent Detection problem on a small dataset?

Hi everyone, If you have a small dataset, maybe hundreds and you expect your feature vectorizer can represent your input text effectively to do classification task which here is Intent Detection, a subset of Text Classification Task. Whether or not BERT in specific or contextual word embeddings, in general, would be a preferable choice in this case? Can anyone give me advice or navigate me toward a more sensible solution? ​ Note: I used to try with some common techniques NLP literature such as TF-IDF, BoW, word2vec, ... But none of them make me happy. submitted by /u/hosjiu [link] [comments]

10 часов назад @ reddit.com
[Project] Looking for help with TensorflowJS based midi pattern generation project
[Project] Looking for help with TensorflowJS based midi pattern generation project

Firstly, here's a demo video of the original software generating midi drum patterns based on input from another midi or audio file: https://youtu.be/eYUaYzfZUCo To further explain what is happening: A Google research team had several professional drummers come in to play on electronic drum kits that turn their performances into midi files. They then trained a neural network with Tensorflow on over 14 hours of drum midis played by professional drummers. This is what creates such profound results as seen in the above video. And I have used this myself and found that if you repeatedly give it the same input, it will give nearly the same output, only with very slight variations, as I would expe…

14 часов назад @ reddit.com
[P] Open-Source project to easily collect data from every db, app, website, tools
[P] Open-Source project to easily collect data from every db, app, website, tools

Source Code | Demo Video For an ML app, I needed to collect data from postgres, mongodb, Google sheets, Google analytics, Google ads, etc. It was painful to do so and took me couple of months with lots of bugs. To solve this problem, I created a customer data platform to build customer data pipelines that connect whole customer data stack and then makes them smarter by triggering enrichment and activation in customer tools based on analysis in data warehouses such as aws redshift, snowflake and more. Working on a major upgrade for the project, and hence seeking your feedback on how I can we improve it? Shoot me your questions, suggestions, what use cases comes to your mind for this project,…

14 часов назад @ reddit.com
[D] Object-NeRF Paper Explained - Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering (5-minute summary)
[D] Object-NeRF Paper Explained - Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering (5-minute summary) [D] Object-NeRF Paper Explained - Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering (5-minute summary)

Object-NeRF NeRF models have come a long way since the initial “explosion” last year. Yet one of the things they still can’t quite handle is scene compositionality, meaning that the model is not aware of the distinct objects that make up the scene. Object NeRF aims to tackle this issue using a dual-branch model that separately encodes the global context of the scene and each object in it. This approach not only reaches competitive levels of quality with current SOTA methods on static scenes but also enables object-level editing. For example, adding or moving furniture in a real-world scene. Check out the full paper summary on Casual GAN Papers (Reading time ~5 minutes). Subscribe to my chan…

14 часов назад @ reddit.com
[D] Question about text classification without labeled data
[D] Question about text classification without labeled data

Hello! I am working on a text classifier but at the moment im quite lost on what to do. The classes form a tree with three levels, for example: class A (level 1), class A.1 (level 2, subclass of A) and class A.1.a (level 3, subclass of A.1). I have a lot of texts but without labels, also its hard to create a set of labeled data because of the big number of classes. I've looked at word embeddings, clustering algorithms, topic modeling, keyword extraction algorithms and other approaches but im not sure whats would be a good way to proceed. Also im not sure how to evaluate the success of the classifier, maybe i could build a small dataset but i wanted to know if there was any other option At t…

15 часов назад @ reddit.com
[P] How much data do I need for this case?
[P] How much data do I need for this case?

I'm trying to predict when and what kind of problem a machine will most likely have and a have a lot of entry, but the frequency of each error ranges quite a bit, so I'm not sure whether those with lower frequency should be grouped together. I have about 175k entries and 12 types of errors, with the most frequent being at 48k times and the least being ~350, but most ranging from 8k-25k. Is it enough? I've thought of grouping the 5 less common because they have under 2.5k but I'm not sure if it will make any difference. I've read a bit about 10x the number of parameters and 'the curse of dimentionality', but I'm still unsure on how to apply it here. submitted by /u/SanderSohngen [link] [comm…

18 часов назад @ reddit.com
[D] STraTA: Self Training with Task Augmentation for Better Few shot Learning (Paper Explained)
[D] STraTA: Self Training with Task Augmentation for Better Few shot Learning (Paper Explained) [D] STraTA: Self Training with Task Augmentation for Better Few shot Learning (Paper Explained)

submitted by /u/deeplearningperson [link] [comments]

18 часов назад @ reddit.com
Towards Data Science Towards Data Science
последний пост 2 часа назад
Why you should try something else than Airflow for data pipeline orchestration
Why you should try something else than Airflow for data pipeline orchestration Why you should try something else than Airflow for data pipeline orchestration

Why you should try something else than Airflow for data pipeline orchestrationFan[Digital image] by rajat sarki, https://unsplash.com/photos/Gx2SU87s4WYWhile Airflow has dominated the market in terms of usage and community size as a data orchestrator pipeline, it’s pretty old and wasn’t designed initially to meet some of the needs we have today.

Let’s evaluate AWS step functions, Google workflows, Prefect next to Airflow.

Nevertheless, managing an Airflow cluster in the past has been a pain and Kubernetes with Airflow v2 have solved many issues.

You also would like to easily roll back or retry on a specific task/sub-task especially in a data pipeline context.

In such a case, you need to hav…

2 часа назад @ towardsdatascience.com
SQL Interview Questions You Must Prepare: The Ultimate Guide
SQL Interview Questions You Must Prepare: The Ultimate Guide SQL Interview Questions You Must Prepare: The Ultimate Guide

SQL Interview Questions You Must Prepare: The Ultimate GuideAre you wondering what SQL interview questions you will be asked?

This ultimate guide will take you through the top SQL interview questions for various data positions and the tips to approach your next SQL interview.

Depending on the type of role that you are applying for and the organization, you can expect one or more of these SQL Data Science Interview Question typesFundamental SQL conceptsSQL basics interview questionsSQL aggregation interview questionsOpen Ended SQL interview questionsData Transformation interview questionsDatabase Modeling interview questionsSoftware Engineering SQL Interview questionsLet us look at these SQL…

2 часа назад @ towardsdatascience.com
Why Bootstrapping Actually Works
Why Bootstrapping Actually Works Why Bootstrapping Actually Works

Ideally, we would want to draw multiple independent real-world samples from the true population to understand the population statistics.

With this pretend-population in place, we can draw multiple (bootstrap) random samples from it.

Because sampling with replacement is allowed, the bootstrap samples can also be regarded as random samples generated under different methods and assumptions.

The aggregated sampled information from these bootstrap samples will ultimately help us get (relatively) accurate estimates of the population parameter, e.g.

The image above compares the parameter (α) estimates from 1,000 simulated samples from the true population against 1,000 bootstrap samples.

2 часа назад @ towardsdatascience.com
Build your first Graph Neural Network model to predict traffic speed in 20 minutes
Build your first Graph Neural Network model to predict traffic speed in 20 minutes Build your first Graph Neural Network model to predict traffic speed in 20 minutes

Build your first Graph Neural Network model to predict traffic speed in 20 minutesGraph neural network (GNN) is an active frontier of deep learning, with a lot of applications, e.g., traffic speed/time prediction and recommendation system.

The general idea of this paper is to use the historical speed data to predict the speed at a future time step.

The geographic diagram representing traffic speed of a region changes over time.

METR-LA traffic dataset is widely used for traffic speed prediction.

The array contains only speed data, meaning that the GNN model uses the historical speed to predict future speed.

2 часа назад @ towardsdatascience.com
Create a fast auto-documented, maintainable and easy-to-use Python API in 5 lines of code with…
Create a fast auto-documented, maintainable and easy-to-use Python API in 5 lines of code with… Create a fast auto-documented, maintainable and easy-to-use Python API in 5 lines of code with…

Create a fast auto-documented, maintainable and easy-to-use Python API in 5 lines of code with FastAPI (part 1)Building and using our API will be as easy as using this vending machine (image by Jenna Hamra on Pexels)You have a great python program that you want to make available to the world.

With FastAPI you can speedily create a superfast API that’ll allow you to make your Python code available for other users.

In this article we’re going to create an API in 5 lines of code.

That’s right; FastAPI isn’t called FastAPI because it is many times faster than frameworks like Django or Flask; it’s also super easy and fast to set up.

After we’ve created our initial API we’ll expand it the next pa…

2 часа назад @ towardsdatascience.com
How this “artificial dreaming” program works, and how you can create your own artwork with it
How this “artificial dreaming” program works, and how you can create your own artwork with it How this “artificial dreaming” program works, and how you can create your own artwork with it

Keeping it very simple, this is a combination of two neural network architectures: VQGAN and CLIP.

The VQGAN network generates images from input numbers, and CLIP measures the similarity between an input text and an input image.

More precisely, VQGAN is a generative adversarial network of the kind described in this arXiv preprint, with two competing networks doing unsupervised learning.

Meanwhile, CLIP transforms texts into images in a way inspired by networks for natural language supervision and multimodal learning.

This process slowly blends the images that correspond to the input words in the “artistic” ways you saw on my (and many others’) posts.

2 часа назад @ towardsdatascience.com
Submitting Model Predictions to Kaggle Competition
Submitting Model Predictions to Kaggle Competition Submitting Model Predictions to Kaggle Competition

You can debug the flow as a few test images are provided, and you can work with the kernel in an interactive session.

While the final evaluation is performed, the dummy test folder is replaced by the true one.

Making multiple iterations with a) scenario is difficult, especially since the Kaggle kernel runtime is limited.

The prediction kernel is straightforward, as you can check it over here.

Run predictions for all test imagesThat is it, now make this kernel and submission kernel and see how you score in the leaderboard!

2 часа назад @ towardsdatascience.com
Two Overlooked Aspects in Data Science Project Development
Two Overlooked Aspects in Data Science Project Development Two Overlooked Aspects in Data Science Project Development

The success of projects in data science rely on two key pillars: engaging closely with domain experts, and the implementation of a rigorous science methodology.

The impact from (broadly understood) Data Science solutions is not yet at par with its hype.

In our experience at PickleTech, encouraging, protecting, and boosting two core aspects in the data science development methodology, beat most of the times these problems above.

There is Science in Data Science, even if it is many times horribly overlooked.

This includes the wide range of data science algorithms, from data engineering and ETL; to advanced statistics, Machine Learning and Deep Learning models; throughout the whole spectrum of…

2 часа назад @ towardsdatascience.com
Does empathy play a role in being data-driven?
Does empathy play a role in being data-driven? Does empathy play a role in being data-driven?

One might be subject to a bias, misunderstand the data, optimise for a wrong goal or simply have different preferences.

It is perfectly plausible that you have a blind spot.

Can’t others help you identify the blind spot?

‘Can you try to shoot holes in my analysis?’It is perfectly plausible that you have a blind spot.

Can’t others help you identify the blind spot?

2 часа назад @ towardsdatascience.com
Emotion-Based Art Generation Using C-GAN
Emotion-Based Art Generation Using C-GAN Emotion-Based Art Generation Using C-GAN

Emotion-Based Art Generation Using C-GANDeep Learning-based Art Generation: Landscape + positive emotion, Image by AuthorIntroductionWith the emergence of Deep Learning-based solutions for image generation and emotion classification, I was wondering if we could bring these two goals together to build a model that takes a simple emotion (positive, negative, and neutral) as input and generates a piece of art that somehow integrates the previously provided emotion.

Summary Table of the WikiArt emotion Dataset (Mohammad, Saif, and Svetlana Kiritchenko).

Wiki-Art Emotions is composed of 4105 art images annotated with emotions and is built from WikiArt.

For simplicity, I merged several emotions t…

2 часа назад @ towardsdatascience.com
Statistical Machine Learning: Kernelized Generalized Linear Models (GLMs) & Kernelized Linear…
Statistical Machine Learning: Kernelized Generalized Linear Models (GLMs) & Kernelized Linear… Statistical Machine Learning: Kernelized Generalized Linear Models (GLMs) & Kernelized Linear…

Just as with “linear” SVMs, the empirical parameter estimates of Kernelized SVMs do not have a closed-form analytic solution.

Alternatively, just like Ordinary Least Squares (OLS) Linear Regression, the empirical parameter estimates of Kernelized Linear Regression do have a closed-form solution; no iterative fitting procedure required!

Given this property, I fully advocate the teaching of Kernelized Linear Regression as the introduction to Mercer Kernels.

There’s nothing inherently special about SVMs in this respect over Linear Regression or Generalized Linear Models (GLMs).

Computational SimulationA computational simulation is provided in python for both Kernelized Linear Regression and Ke…

10 часов назад @ towardsdatascience.com
5 Development Rules to Improve Your Data Science Projects
5 Development Rules to Improve Your Data Science Projects 5 Development Rules to Improve Your Data Science Projects

Abstract scripts into functions and classesSay you are working on a Jupyter notebook figuring out how to best visualize some data.

As soon as that code works and you don’t think it will need much more debugging, it’s time to abstract it!

Let’s look at an example,import matplotlib.pyplot as pltimport seaborn as snssns.set()import numpy as npimport pandas as pdsynthetic_data = np.random.normal(0,1,1000)plt.plot(synthetic_data, color="green")plt.title("Plotting Synthetic Data")plt.xlabel("x axis")plt.ylabel("y axis")plt.show()Here, we plotted some synthetic data.

If for any reason, we want to load the data and perform a couple of simple transformations to it, we might want to have a class that…

10 часов назад @ towardsdatascience.com
Measuring Semantic Changes Using Temporal Word Embedding
Measuring Semantic Changes Using Temporal Word Embedding Measuring Semantic Changes Using Temporal Word Embedding

The relationships in the diagram below are captured using temporal word embeddings.

These embeddings are referred to as temporal word embeddings (also referred to as diachronic word embeddings or dynamic word embeddings).

Training Temporal Word EmbeddingsNeural word embeddings (such as Word2Vec) trained independently on different temporal corpora cannot be compared directly.

The authors suggest using this simpler method of comparing temporal word embeddings, as it is more interpretable and stable than using the common orthogonal Procrustes method for temporal alignment.

Temporal word embeddings are an exciting area of research within NLP and are currently being used in many different subfie…

10 часов назад @ towardsdatascience.com
The Hot Hand in Chess
The Hot Hand in Chess The Hot Hand in Chess

The Hot Hand in ChessPhoto by GR Stocks on UnsplashBackgroundIn basketball, there is the idea of the “Hot Hand” where if a player has made several shots in the recent past, he’s believed to be “in the zone” and more likely to make future shots.

used to measure basketball’s hot hand, but instead focus on internet blitz chess games.

Using these players would be more analogous to the original Hot Hand study, which used professional basketball players.

The sample sizes are small, and it only mimics part of the original Hot Hand study.

All imperfections aside, this analysis strongly suggests that there is such thing as a “Hot Hand” in chess where a player gets in the zone and is more likely to w…

10 часов назад @ towardsdatascience.com
The Only Auto-Completion Extension You’ll Ever Need For Your Jupyter Notebooks
The Only Auto-Completion Extension You’ll Ever Need For Your Jupyter Notebooks The Only Auto-Completion Extension You’ll Ever Need For Your Jupyter Notebooks

The Only Auto-Completion Extension You’ll Ever Need For Your Jupyter NotebooksPhoto by Nathan Dumlao on UnsplashOne of the most loved programming interfaces in Python is the Jupyter Notebook environment, and wanting code auto-completion enabled in it feels quite natural.

This extension is one of the most useful nbextensions that I’ve used, and it does exactly as proposed.

Now go ahead and open up your jupyter notebook!

For all the pipenv users out there, remember that you need to do:pipenv run jupyter notebookand not just:jupyter notebookThe second command should not even work if you’ve not installed Jupyter globally in your system.

You need to go here and enable it on the nbextensions tab,…

10 часов назад @ towardsdatascience.com
Distill.pub Distill.pub
последний пост 2 недели, 3 дня назад
Understanding Convolutions on Graphs
Understanding Convolutions on Graphs

Understanding the building blocks and design choices of graph neural networks.

2 недели, 3 дня назад @ distill.pub
A Gentle Introduction to Graph Neural Networks
A Gentle Introduction to Graph Neural Networks

What components are needed for building learning algorithms that leverage the structure and properties of graphs?

2 недели, 3 дня назад @ distill.pub
Distill Hiatus
Distill Hiatus

After five years, Distill will be taking a break.

2 месяца, 2 недели назад @ distill.pub
Adversarial Reprogramming of Neural Cellular Automata
Adversarial Reprogramming of Neural Cellular Automata

Reprogramming Neural CA to exhibit novel behaviour, using adversarial attacks.

4 месяца, 2 недели назад @ distill.pub
Weight Banding
Weight Banding

Weights in the final layer of common visual models appear as horizontal bands. We investigate how and why.

5 месяцев, 2 недели назад @ distill.pub
Branch Specialization
Branch Specialization

When a neural network layer is divided into multiple branches, neurons self-organize into coherent groupings.

5 месяцев, 2 недели назад @ distill.pub
Multimodal Neurons in Artificial Neural Networks
Multimodal Neurons in Artificial Neural Networks

We report the existence of multimodal neurons in artificial neural networks, similar to those found in the human brain.

6 месяцев, 2 недели назад @ distill.pub
Self-Organising Textures
Self-Organising Textures

Neural Cellular Automata learn to generate textures, exhibiting surprising properties.

7 месяцев, 1 неделя назад @ distill.pub
Self-Organising Textures
Self-Organising Textures

Neural Cellular Automata learn to generate textures, exhibiting surprising properties.

7 месяцев, 1 неделя назад @ distill.pub
Visualizing Weights
Visualizing Weights

We present techniques for visualizing, contextualizing, and understanding neural network weights.

7 месяцев, 2 недели назад @ distill.pub
Curve Circuits
Curve Circuits

Reverse engineering the curve detection algorithm from InceptionV1 and reimplementing it from scratch.

7 месяцев, 3 недели назад @ distill.pub
High/Low frequency detectors
High/Low frequency detectors

A family of early-vision filters reacting to contrasts between spatial gratings of different frequency

7 месяцев, 3 недели назад @ distill.pub
Naturally Occurring Equivariance in Neural Networks
Naturally Occurring Equivariance in Neural Networks

Neural networks naturally learn many transformed copies of the same feature, connected by symmetric weights.

9 месяцев, 2 недели назад @ distill.pub
The Gradient The Gradient
последний пост 1 день, 21 час назад
The Imperative for Sustainable AI Systems
The Imperative for Sustainable AI Systems The Imperative for Sustainable AI Systems

Sustainable AI can be thought of as another dimension along which we can guide the development of AI systems in addition to typical functional and business requirements.

Share the idea of sustainable AI widelyWe can begin first by sharing this idea of sustainable AI with our own communities of research and practice.

This will create a virtuous cycle of practice and reward enabling a transition into “Green AI”, AI systems that are built and used with the awareness of their carbon impact, as opposed to “Red AI” , AI systems that are built and used with only performance goals in mind ,which is currently favored and supported in the research and practice ecosystem.

Instrument your AI systems to…

1 день, 21 час назад @ thegradient.pub
Has AI found a new Foundation?
Has AI found a new Foundation? Has AI found a new Foundation?

They coined a new term, “Foundation Models” to characterize the new paradigm, joined forces in a “Center for Research on Foundation Models”, and published the massive 212-page report “On the Opportunities and Risks of Foundation Models.”Although the term is new, the general approach is not.

At the Workshop on Foundation Models, Jitendra Malik, a renowned expert in computer vision at Berkeley, said, “I am going to take a ... strongly critical role, when we talk about them as the foundation of AI ...

As Georgia Tech professor Mark Riedl wrote on Twitter “Branding very large pre-trained neural language models as “foundation” models is a brilliant … PR stunt.

The report says, unironically, “we …

1 неделя, 2 дня назад @ thegradient.pub
Test
Test Test

An Introduction to AI Story GenerationIt’s All Training Data: Using Lessons from Machine Learning to Retrain Your Mind

2 недели, 1 день назад @ thegradient.pub
An Introduction to AI Story Generation
An Introduction to AI Story Generation An Introduction to AI Story Generation

Some events are goal events and some events are sub-goal events that are necessary steps in achieving a final goal event.

These sub-goal events precede the goal event and are also related to the goal event as part of a hierarchy.

Training a language model on a corpus of stories means the language model will attempt to emulate what it has learned from a corpus.

They train an explanation generation language model to answer the question and then make the resulting explanation the previous segment of the story.

CitationFor attribution in academic contexts or books, please cite this work asMark Riedl, "An Introduction to AI Story Generation", The Gradient, 2021.

1 месяц назад @ thegradient.pub
Systems for Machine Learning
Systems for Machine Learning Systems for Machine Learning

Machine learning’s increasing importance to real-world applications brought awareness of a new field focused on ML in practice - machine learning systems (or, as some call it, MLOps).

This field acts as a bridging point between the domains of computer systems and machine learning, considering the new challenges of machine learning with a lens shaped by traditional systems research.

By identifying a shared problem in both systems and ML - combining data sources - we can apply traditional systems techniques to a machine learning setting.

Of course, there are some differences, reflecting how machine learning systems differ from the traditional paradigm.

Systems research is filling this need, b…

1 месяц, 1 неделя назад @ thegradient.pub
Machine Learning Won't Solve Natural Language Understanding
Machine Learning Won't Solve Natural Language Understanding Machine Learning Won't Solve Natural Language Understanding

Language UnderstandingWhile NLP (Natural Language Processing) and NLU (Natural Language Understanding) are often used interchangeably, there is a substantial difference between the two and it is crucial to highlight this difference.

Thus, machine learning and language understanding are incompatible – in fact, they are contradictory.

Natural language is rampant with intensional phenomena, since objects of thoughts — that language conveys — have an intensional aspect that cannot be ignored.

CitationFor attribution in academic contexts or books, please cite this work asWalid Saba, "Machine Learning Won't Solve Natural Language Understanding", The Gradient, 2021.

BibTeX citation:@article{saba20…

1 месяц, 1 неделя назад @ thegradient.pub
Machine Translation Shifts Power
Machine Translation Shifts Power Machine Translation Shifts Power

Origins of Machine TranslationThe first machine translation efforts in the United States were spurred by the Cold War.

However, the United States government remained a faithful consumer of machine translation technology; in Tom Pedtke’s 1997 keynote address at the sixth Machine Translation Summit, he reflects on several key developments in the 1990s fostered by government demand.

This is not to say that machine translation is epistemologically bereft; the parallel text basis of contemporary machine translation paradigms aligns with Quine’s pragmatic, behaviorist approach to translation.

Translation remains a political act, and data-driven machine translation developments, largely centered i…

1 месяц, 2 недели назад @ thegradient.pub
It’s All Training Data: Using Lessons from Machine Learning to Retrain Your Mind
It’s All Training Data: Using Lessons from Machine Learning to Retrain Your Mind It’s All Training Data: Using Lessons from Machine Learning to Retrain Your Mind

As machine learning scientists, we know that training data can make or break your model.

So I began research on the topic of using personal data for machine learning education.

I’m currently working on a book called Life Lessons from Algorithms , a personal history of resilience demonstrated through different machine learning algorithms.

Author BioYim is a PhD student at the University of Washington, researching innovative ways to teach machine learning.

CitationFor attribution in academic contexts or books, please cite this work asYim Lucky Register, "It’s All Training Data: Using Lessons from Machine Learning to Retrain Your Mind", The Gradient, 2021.

1 месяц, 4 недели назад @ thegradient.pub
Justitia ex Machina: The Case for Automating Morals
Justitia ex Machina: The Case for Automating Morals Justitia ex Machina: The Case for Automating Morals

In the following post, we will discuss two complex and related issues regarding these models: fairness and transparency.

These are crucial questions to ask if machine learning models play important parts in our lives and our societies.

However, all hope is not lost as ensuring fairness in machine learning models is an active area of research.

Explanations produced with the LIME algorithm, image fromEnsuring that machine learning models are explainable is an active area of research as well.

CitationFor attribution in academic contexts or books, please cite this work asRasmus Berg Palm and Pola Schwöbel, "Justitia ex Machina: The Case for Automating Morals", The Gradient, 2021.

2 месяца назад @ thegradient.pub
Prompting: Better Ways of Using Language Models for NLP Tasks
Prompting: Better Ways of Using Language Models for NLP Tasks Prompting: Better Ways of Using Language Models for NLP Tasks

Several recent papers follow this line of approaches by adjusting the objective (Tam et al., 2021) or formulating tasks in a unified form, like question answering (Zhong et al., 2021) or textual entailment (Wang et al., 2021).

Following-up works further refine the way of using demonstrations: Gao et al., 2021; Liu et al.

It is a good few-shot model as long as the design can generalize well to other few-shot tasks (these tasks can be so-called “true few-shot”).

Introducing LM-BFFFinally, let me introduce our ACL’21 paper, "Making Pre-trained Language Models Better Few-shot Learners", abbreviated as LM-BFF (better few-shot fine-tuning of language models; alternatively, language models' best f…

2 месяца, 2 недели назад @ thegradient.pub
How to Do Multi-Task Learning Intelligently
How to Do Multi-Task Learning Intelligently How to Do Multi-Task Learning Intelligently

proposed, “AdaShare: Learning What to Share for Efficient Deep Multi-Task Learning” (2020) [2].

Overview of “AdaShare: Learning What to Share for Efficient Deep Multi-Task Learning” by X.

Adashare: Learning what to share for efficient deep multi-task learning.

His research interests lie in the unsupervised deep learning, Few shot learning and Multi-task learning in computer vision.

He is doing research related to learning efficiently including topics from Multi-Task Learning and Uncertainty Estimation.

3 месяца назад @ thegradient.pub
Are Self-Driving Cars Really Safer Than Human Drivers?
Are Self-Driving Cars Really Safer Than Human Drivers? Are Self-Driving Cars Really Safer Than Human Drivers?

And if ADSs cannot perform commonsense reasoning to handle all these edge cases, are they really safer than human drivers?

Why Testing is NecessaryWe have some good reasons to believe that ADSs will be safer than human drivers, and we have some good reasons to worry that ADSs will not be as safe as human drivers.

There are good reasons to assume that ADSs will be safer than human drivers.

CitationFor attribution in academic contexts or books, please cite this work as:Steve Shwartz, "Are Self-Driving Cars Really Safer Than Human Drivers?

BibTeX citation:@article{shwartzadss2021,author = {Shwartz, Steve},title = {Are Self-Driving Cars Really Safer Than Human Drivers?

3 месяца, 1 неделя назад @ thegradient.pub
How has AI contributed to dealing with the COVID-19 pandemic?
How has AI contributed to dealing with the COVID-19 pandemic? How has AI contributed to dealing with the COVID-19 pandemic?

The AI powering both didn’t miss the start of the COVID-19 pandemic.

The virus was new and the world was experiencing the first wave of the COVID-19 pandemic.

Another example of AI being used by a governmental organization to gain epidemiological insight is discussed in Facebook AI's blogpost.

A practical example in the first subcategory is the creation of the COVID-19 Open Research Dataset (CORD-19), which is a machine-readable dataset of published literature on COVID-19.

Thoughts and conclusionIn this piece, an effort was made to show some examples of AI being used to aid in the fight against the pandemic.

3 месяца, 4 недели назад @ thegradient.pub
Towards Human-Centered Explainable AI: the journey so far
Towards Human-Centered Explainable AI: the journey so far Towards Human-Centered Explainable AI: the journey so far

Second, AI systems do not exist in a vacuum-- they are situated and operate in a rich tapestry of socio-organizational relationships.

Considering the socially-situated nature of AI systems, our current AI problems are no longer purely technical ones; they are inherently sociotechnical.

My own journey towards a sociotechnically-informed Human-centered XAI has been anything but self-evident.

CitationFor attribution in academic contexts or books, please cite this work asEhsan Upol, "Towards Human-Centered Explainable AI: the journey so far", The Gradient, 2021.

BibTeX citation:@article{xaijourney,author = {Upol, Ehsan},title = {Towards Human-Centered Explainable AI: the journey so far},journal…

4 месяца, 1 неделя назад @ thegradient.pub
Machine Learning, Ethics, and Open Source Licensing (Part II/II)
Machine Learning, Ethics, and Open Source Licensing (Part II/II) Machine Learning, Ethics, and Open Source Licensing (Part II/II)

In a 2016 essay, Nadia Eghbal quotes Karl Fogel, an early open source advocate:In open source, you can only have “my” in the associative sense.

Domain-specific licensing for machine learningThis domain-specific approach is particularly promising for machine learning, in part because of several key areas where machine learning diverges heavily from other software.

Author Bio: Chris is an open source developer and contributor to the Rust-ML machine learning working group.

CitationFor attribution in academic contexts or books, please cite this work asChristopher Moran, "Machine Learning, Ethics, and Open Source Licensing (Part II/II)", The Gradient, 2021.

BibTeX citation:@article{moranopensour…

4 месяца, 2 недели назад @ thegradient.pub
TheSequence TheSequence
последний пост 1 день, 3 часа назад
👑 Big Tech and their Favorite Deep Learning Schools
👑 Big Tech and their Favorite Deep Learning Schools

Something has gone terribly wrong :(

1 день, 3 часа назад @ thesequence.substack.com
🏷 Data Labeling for ML, part 2
🏷 Data Labeling for ML, part 2

Something has gone terribly wrong :(

3 дня, 2 часа назад @ thesequence.substack.com
🦾 Edge#124: Transformer Architectures Recap
🦾 Edge#124: Transformer Architectures Recap

Something has gone terribly wrong :(

4 дня, 2 часа назад @ thesequence.substack.com
🎙 German Osin/Provectus About Data Discovery and Observability in ML Solutions
🎙 German Osin/Provectus About Data Discovery and Observability in ML Solutions

Something has gone terribly wrong :(

5 дней, 2 часа назад @ thesequence.substack.com
🌌 Edge#123: A New Series About Self-Supervised Learning
🌌 Edge#123: A New Series About Self-Supervised Learning

Something has gone terribly wrong :(

6 дней, 3 часа назад @ thesequence.substack.com
🏷🥊 The Fight Against Labeled Dataset Dependencies
🏷🥊 The Fight Against Labeled Dataset Dependencies

Something has gone terribly wrong :(

1 неделя, 1 день назад @ thesequence.substack.com
🌟 Take part in the ML Insider Survey
🌟 Take part in the ML Insider Survey

Something has gone terribly wrong :(

1 неделя, 3 дня назад @ thesequence.substack.com
🎭 Edge#122: Unified VLP is a Transformer Model for Visual Question Answering
🎭 Edge#122: Unified VLP is a Transformer Model for Visual Question Answering

Something has gone terribly wrong :(

1 неделя, 4 дня назад @ thesequence.substack.com
🎙 Bryce Daines/CDS at Modulus Therapeutics: Using ML to Power Next Generation Cell Therapy
🎙 Bryce Daines/CDS at Modulus Therapeutics: Using ML to Power Next Generation Cell Therapy

Something has gone terribly wrong :(

1 неделя, 5 дней назад @ thesequence.substack.com
🕐🕚 Edge#121: Transformers and Time Series    
🕐🕚 Edge#121: Transformers and Time Series    

Something has gone terribly wrong :(

1 неделя, 6 дней назад @ thesequence.substack.com
🥗 Will Machine Learning Data Infrastructures Become Commoditized?
🥗 Will Machine Learning Data Infrastructures Become Commoditized?

Something has gone terribly wrong :(

2 недели, 1 день назад @ thesequence.substack.com
⚪️🟠️ Edge#120: How to Leverage Open-Source Data Labeling for your Business
⚪️🟠️ Edge#120: How to Leverage Open-Source Data Labeling for your Business

Something has gone terribly wrong :(

2 недели, 4 дня назад @ thesequence.substack.com
⚒ Edge#119: Data Labeling – Build vs. Buy vs. Customize
⚒ Edge#119: Data Labeling – Build vs. Buy vs. Customize

Something has gone terribly wrong :(

2 недели, 6 дней назад @ thesequence.substack.com
🗄 ML to Power a New Generation of Databases 
🗄 ML to Power a New Generation of Databases  🗄 ML to Power a New Generation of Databases 

Each relevant technology trend in the last five decades has been accompanied by incremental progress in database technologies.

However, ML has the unique capability to not only improve the existing generation of database technologies but also reimagine the space with new databases we haven’t seen before.

The influence that ML can have in database technologies is unique because it is bidirectional.

Just this week, Facebook published a research paper unveiling what they call neural databases, a concept that combines the use of NLP for unstructured databases.

Like previous technology trends, ML is likely to bring fresh ideas that power innovation in the world of databases.

3 недели, 1 день назад @ thesequence.substack.com
🔴 Cutting-Edge, No-Code Data Science: Powerful, Flexible, Nimble and Explainable AI Automation*
🔴 Cutting-Edge, No-Code Data Science: Powerful, Flexible, Nimble and Explainable AI Automation* 🔴 Cutting-Edge, No-Code Data Science: Powerful, Flexible, Nimble and Explainable AI Automation*

The technology transforms how analysts use their data to forecast, predict, and uncover new data insights.

It changes how data scientists connect, prepare, engineer, and model data and operationalize AI.

And it enables data science leaders to empower teams to deliver results that meet or surpass their goals faster.

Squark automates the entire data science process; a few highlights include:Intelligent data connectors that round-trip data between sources.

Automated data cleaning, pre-processing, feature engineering and selection make your data better, including natural language processing and much more.

3 недели, 3 дня назад @ thesequence.substack.com
Synced Review
последний пост 52 минуты назад
DeepMind’s Bootstrapped Meta-Learning Enables Meta Learners to Teach Themselves
DeepMind’s Bootstrapped Meta-Learning Enables Meta Learners to Teach Themselves DeepMind’s Bootstrapped Meta-Learning Enables Meta Learners to Teach Themselves

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

52 минуты назад @ medium.com
MIT Presents New Approach for Sequence-to-Sequence Learning with Latent Neural Grammars
MIT Presents New Approach for Sequence-to-Sequence Learning with Latent Neural Grammars MIT Presents New Approach for Sequence-to-Sequence Learning with Latent Neural Grammars

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

3 дня назад @ medium.com
UC Berkeley Uses a Causal Perspective to Formalise the Desiderata for Representation Learning
UC Berkeley Uses a Causal Perspective to Formalise the Desiderata for Representation Learning UC Berkeley Uses a Causal Perspective to Formalise the Desiderata for Representation Learning

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

4 дня назад @ medium.com
CMU, Google & UC Berkeley Propose Robust Predictable Control Policies for RL Agents
CMU, Google & UC Berkeley Propose Robust Predictable Control Policies for RL Agents CMU, Google & UC Berkeley Propose Robust Predictable Control Policies for RL Agents

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

5 дней назад @ medium.com
Purdue U Proposes ANE: A Self-Adaptive Network Enhancement Method for Optimizing DNN Design
Purdue U Proposes ANE: A Self-Adaptive Network Enhancement Method for Optimizing DNN Design Purdue U Proposes ANE: A Self-Adaptive Network Enhancement Method for Optimizing DNN Design

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

6 дней назад @ medium.com
MIT’s Automatic Data-Driven Media Bias Measurement Method Achieves Human-Level Results
MIT’s Automatic Data-Driven Media Bias Measurement Method Achieves Human-Level Results MIT’s Automatic Data-Driven Media Bias Measurement Method Achieves Human-Level Results

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

1 неделя назад @ medium.com
Google Study Uses Implicit Policies to Achieve Remarkable Improvements in Robot Behavioural Cloning
Google Study Uses Implicit Policies to Achieve Remarkable Improvements in Robot Behavioural Cloning Google Study Uses Implicit Policies to Achieve Remarkable Improvements in Robot Behavioural Cloning

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

1 неделя, 3 дня назад @ medium.com
Infinite Memory Transformer: Attending to Arbitrarily Long Contexts Without Increasing Computation…
Infinite Memory Transformer: Attending to Arbitrarily Long Contexts Without Increasing Computation… Infinite Memory Transformer: Attending to Arbitrarily Long Contexts Without Increasing Computation…

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

1 неделя, 4 дня назад @ medium.com
IBM Leverages Reinforcement Learning to Achieve SOTA Performance on Text and Knowledge Base…
IBM Leverages Reinforcement Learning to Achieve SOTA Performance on Text and Knowledge Base… IBM Leverages Reinforcement Learning to Achieve SOTA Performance on Text and Knowledge Base…

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

1 неделя, 4 дня назад @ medium.com
Swiss AI Lab Uses Simple Tricks to Dramatically Improve Transformers’ Systematic Generalization
Swiss AI Lab Uses Simple Tricks to Dramatically Improve Transformers’ Systematic Generalization Swiss AI Lab Uses Simple Tricks to Dramatically Improve Transformers’ Systematic Generalization

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

1 неделя, 5 дней назад @ medium.com
Stanford’s BEHAVIOR Benchmarks 100 Activities From Everyday Life for Embodied AI
Stanford’s BEHAVIOR Benchmarks 100 Activities From Everyday Life for Embodied AI Stanford’s BEHAVIOR Benchmarks 100 Activities From Everyday Life for Embodied AI

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

2 недели, 3 дня назад @ medium.com
Cambridge U & Facebook’s ProoFVer: High-Performance Natural Logic-Based Fact Verification With…
Cambridge U & Facebook’s ProoFVer: High-Performance Natural Logic-Based Fact Verification With… Cambridge U & Facebook’s ProoFVer: High-Performance Natural Logic-Based Fact Verification With…

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

2 недели, 4 дня назад @ medium.com
NVIDIA’s Isaac Gym: End-to-End GPU Accelerated Physics Simulation Expedites Robot Learning by 2–3…
NVIDIA’s Isaac Gym: End-to-End GPU Accelerated Physics Simulation Expedites Robot Learning by 2–3… NVIDIA’s Isaac Gym: End-to-End GPU Accelerated Physics Simulation Expedites Robot Learning by 2–3…

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

2 недели, 5 дней назад @ medium.com
DeepMind’s Collect & Infer: A Fresh Look at Data-Efficient Reinforcement Learning
DeepMind’s Collect & Infer: A Fresh Look at Data-Efficient Reinforcement Learning DeepMind’s Collect & Infer: A Fresh Look at Data-Efficient Reinforcement Learning

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

2 недели, 6 дней назад @ medium.com
Tsinghua U & Microsoft Propose Fastformer: An Additive Attention Based Transformer With Linear…
Tsinghua U & Microsoft Propose Fastformer: An Additive Attention Based Transformer With Linear… Tsinghua U & Microsoft Propose Fastformer: An Additive Attention Based Transformer With Linear…

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

3 недели назад @ medium.com
📓 Cool Blogs
ODS.ai Habr ODS.ai Habr
последний пост 3 дня назад
Новый запуск курса Natural Language Processing
Новый запуск курса Natural Language Processing Новый запуск курса Natural Language Processing

TL;DR: Этой осенью сообщество Open Data Science и компания Huawei делают новый запуск курса.

Мы делаем новый запуск курса Natural Language Processing.

Полный syllabus курса можно посмотреть здесь.

После каждой лекции будет квиз.

Пару слов обо мне, как авторе курса, - в области NLP я работаю последние 9 лет, успел поработать в Яндексе и ВКонтакте, защитить кандидатскую диссертацию.

3 дня назад @ habr.com
Анализ вакансий и зарплат в Data Science
Анализ вакансий и зарплат в Data Science Анализ вакансий и зарплат в Data Science

Делимся нашим исследованием вакансий и зарплат в сфере data science и data engineering.

Посмотрим еще на распределение грейдов для каждой специальностиСпрос на джунов в дата инжиниринге ниже, чем в аналитике и data science.

Зачастую зарплата указывается в тексте вакансии в неструктурированном виде, иногда в одной вакансии идет несколько позиций и несколько вилок.

После этого сделаем 4 регулярки: для текста, который часто идет до и после зарплаты в вакансии, для самих цифр зарплаты и для текста, который встречается между нижней и верхней границей зарплатной вилки.

В целом, знания deep learning становятся более востребованными: указаны в более 30% вакансий в 2021 году.

3 недели, 4 дня назад @ habr.com
О квантовых компьютерах, биткоине и превосходстве. Лекция открытого курса qmlcourse
О квантовых компьютерах, биткоине и превосходстве. Лекция открытого курса qmlcourse О квантовых компьютерах, биткоине и превосходстве. Лекция открытого курса qmlcourse

Также неправильно было бы говорить о том, что в отличии от классических компьютеров, где есть лишь и в квантовых есть все состояния сразу.

Но это для классического компьютера.

И сегодня мы вынуждены использовать лишь очень приближенные решения и концепции, точности которых часто не хватает.

На сегодня почти все известные технологии создания квантовых компьютеров требуют чего-то из:сверхнизкие температурысверхвысокий вакуумсверхточная юстировка лазеров на оптическом столеИли даже всего сразу.

Да и в этом нет особого смысла, ведь мало кому дома нужно взламывать биткоин, решать логистическую проблему или разрабатывать высокотемпературный сверхпроводник.

1 месяц, 2 недели назад @ habr.com
Создание и балансировка инвестиционного портфеля с помощью ML
Создание и балансировка инвестиционного портфеля с помощью ML

В прошлой статье я писал про свои ML-модели для оценки отдельных компаний, но вопрос формирования итогового портфеля совсем не затрагивал. В этом посте хочу рассказать о том, как я собираю свой личный портфель, а так же поделиться сайтом, на котором реализую весь описанный в статье функционал http://stocks.ml. Дисклеймер: у автора нет экономического образования и все выводы и суждения в статье делаются на основе житейского опыта и здравого смысла. Читать далее

3 месяца, 2 недели назад @ habr.com
Учиться, учиться, и ещё раз учиться?
Учиться, учиться, и ещё раз учиться? Учиться, учиться, и ещё раз учиться?

TLDR: крохотные модельки обошли модные графовые нейронки в предсказании свойств молекул.

Код: здесь. Берегите Природу. ФОТО: Андерс Хеллберг для Wikimedia Commons, модель — Грета Тунберг

Необученная графовая свёрточная нейронная сеть [1] (uGCN) со случайной инициализацией весов уже пару лет занимает первое место в моём списке алгоритмов для задач машинного обучения на графах из-за копеечной стоимости, простоты реализации, да вполне очевидной элегантности решения. В то же время, насколько мне известно, никто ещё не не проводил соревнований между этой простой моделью и её старшей сестрой — полноценно обученной графовой свёрточной нейронной сетью (GCN) в режиме обучения с учителем. Вот я сдела…

3 месяца, 3 недели назад @ habr.com
DeepPavlov стал частью Google Summer of Code в 2021 году
DeepPavlov стал частью Google Summer of Code в 2021 году

В этом году открытая платформа для обработки естественного языка DeepPavlov, разрабатываемая лабораторией нейронных систем и глубокого обучения МФТИ, впервые стала частью ежегодной программы для молодых разработчиков Google Summer of Code.Google Summer of Code (GSoC) — это ежегодное событие, проводимое компанией Google для привлечения молодых разработчиков к разработке проектов с открытым исходным кодом в их свободное летнее время. К участию допускаются студенты высших учебных заведений (бакалавриат, магистратура, аспирантура) и колледжей. Это отличная возможность не только развить навыки программирования, но и заработать!Работать можно в любой организации, которая есть в соответствующем сп…

5 месяцев, 3 недели назад @ habr.com
Мои machine learning тулы для инвестирования
Мои machine learning тулы для инвестирования

В последнее время все больше людей приходит к тому, чтобы не держать деньги под матрасом, а куда-то их инвестировать в надежде сохранить и преумножить свой капитал. Вариант с матрасом плох тем, что с повышением цен на товары и услуги(инфляция) покупательная способность денег падает и через какое-то время купить на них можно значительно меньше, чем раньше. Есть много вариантов, куда вложить деньги(недвижимость, банковский вклад, ценные металлы), но в последнее время популярным становится инвестирование в акции. Только у брокера Тинькофф Инвестиции за несколько лет число клиентов превысило 3.5 млн. В статье я постараюсь описать свой подход к выбору бумаг и поделюсь инструментами, которые для …

5 месяцев, 3 недели назад @ habr.com
Собираем Свой Суперкомпьютер Недорого
Собираем Свой Суперкомпьютер Недорого Собираем Свой Суперкомпьютер Недорого

Нынче никого не удивишь достижениями искусственного интеллекта машинного обучения (ML) в самых разных областях. При этом доверчивые граждане редко задают два вопроса: (i) а какая собственно цена экспериментов и финальной системы и (ii) имеет ли сделанное хоть какую-то целесообразность? Самым важным компонентом такой цены являются как ни странно цена на железо и зарплаты людей. В случае если это все крутится в облаке, нужно еще умножать стоимость железа в 2-3 раза (маржа посредника).

И тут мы неизбежно приходим к тому, что несмотря на то, что теперь даже в официальные билды PyTorch добавляют бета-поддержку ROCm, Nvidia де-факто в этом цикле обновления железа (и скорее всего следующем) остает…

6 месяцев, 1 неделя назад @ habr.com
Рубрика «Читаем статьи за вас». Сентябрь — октябрь 2020 года
Рубрика «Читаем статьи за вас». Сентябрь — октябрь 2020 года

Привет, Хабр! Продолжаем публиковать рецензии на научные статьи от членов сообщества Open Data Science из канала #article_essense. Хотите получать их раньше всех — вступайте в сообщество!Статьи на сегодня:1. A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer (Tampere University, Finland, 2020)2. Fast Bi-layer Neural Synthesis of One-Shot Realistic Head Avatars (Samsung AI Center, 2020)3. Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting (University of California, USA, 2019)4. Whitening for Self-Supervised Representation Learning (University of Trento, Italy, 2020)5. MelGAN: Generative Adversarial Networks for…

6 месяцев, 3 недели назад @ habr.com
Пора избавляться от мышки или Hand Pose Estimation на базе LiDAR за 30 минут
Пора избавляться от мышки или Hand Pose Estimation на базе LiDAR за 30 минут Пора избавляться от мышки или Hand Pose Estimation на базе LiDAR за 30 минут

Всем привет! Пока киберпанк еще не настолько вошел в нашу жизнь, и нейроинтерфейсы далеки от идеала, первым этапом на пути к будущему манипуляторов могут стать LiDAR. Поэтому, чтобы не скучать на праздниках, я решил немного пофантазировать на тему средств управления компьютером и, предположительно, любым устройством, вплоть до экскаватора, космического корабля, дрона или кухонной плиты. Читать дальше →

8 месяцев, 1 неделя назад @ habr.com
Machine Learning Mastery
последний пост 5 дней, 22 часа назад
What is Attention?
What is Attention?

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

5 дней, 22 часа назад @ machinelearningmastery.com
A Bird’s Eye View of Research on Attention
A Bird’s Eye View of Research on Attention

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

1 неделя, 4 дня назад @ machinelearningmastery.com
Lagrange Multiplier Approach with Inequality Constraints
Lagrange Multiplier Approach with Inequality Constraints

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

3 недели, 3 дня назад @ machinelearningmastery.com
A Gentle Introduction To Sigmoid Function
A Gentle Introduction To Sigmoid Function

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

3 недели, 5 дней назад @ machinelearningmastery.com
Calculus in Action: Neural Networks
Calculus in Action: Neural Networks

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

4 недели назад @ machinelearningmastery.com
A Gentle Introduction to Taylor Series
A Gentle Introduction to Taylor Series

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

1 месяц назад @ machinelearningmastery.com
A Gentle Introduction To Approximation
A Gentle Introduction To Approximation

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

1 месяц назад @ machinelearningmastery.com
The Chain Rule of Calculus – Even More Functions
The Chain Rule of Calculus – Even More Functions

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

1 месяц назад @ machinelearningmastery.com
The Chain Rule of Calculus for Univariate and Multivariate Functions
The Chain Rule of Calculus for Univariate and Multivariate Functions

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

1 месяц, 1 неделя назад @ machinelearningmastery.com
A Gentle Introduction To Method Of Lagrange Multipliers
A Gentle Introduction To Method Of Lagrange Multipliers

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

1 месяц, 1 неделя назад @ machinelearningmastery.com
A Gentle Introduction to Optimization / Mathematical Programming
A Gentle Introduction to Optimization / Mathematical Programming

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

1 месяц, 1 неделя назад @ machinelearningmastery.com
A Gentle Introduction to the Laplacian
A Gentle Introduction to the Laplacian

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

1 месяц, 2 недели назад @ machinelearningmastery.com
A Gentle Introduction To Hessian Matrices
A Gentle Introduction To Hessian Matrices

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

1 месяц, 2 недели назад @ machinelearningmastery.com
A Gentle Introduction to the Jacobian
A Gentle Introduction to the Jacobian

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

1 месяц, 2 недели назад @ machinelearningmastery.com
Higher-Order Derivatives
Higher-Order Derivatives

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

1 месяц, 3 недели назад @ machinelearningmastery.com
Sorta Insightful Sorta Insightful
последний пост 1 месяц назад
Six Years Later
Six Years Later Six Years Later

That means I’ve felt greater feelings of responsibility and urgency for puzzlehunt writing compared to blogging.

markdown 6597 2021 - 01 - 29 - mh - 2021. markdown 4249 2021 - 02 - 18 - flash - games .

View CountsThese are the view counts from August 18, 2020 to today, for the posts I’ve written this year.

markdown 912 2021 - 01 - 29 - mh - 2021. markdown 167 2021 - 02 - 18 - flash - games .

Post about Dominion Online:Odds of writing this year: 25%Odds of writing eventually: 50%Post about puzzlehunts:Odds of writing this year: 70%Odds of writing eventually: 90%

1 месяц назад @ alexirpan.com
Why Don't I Have Ads?
Why Don't I Have Ads? Why Don't I Have Ads?

According to this blog, the clickthrough rate of display ads is 0.05%, but is 8.8x higher for native ads.

I don’t want to do native ads, but let’s get an upper bound by assuming I did use native ads, and they have 10x clickthrough, so 0.5%.

If I value readers more than money, why don’t I pay money to run ads to direct people to my site?

I think the main problem is that ads don’t really work for a personal blog.

In retrospect, given how ads work, the price per click has to be based on the revenue of the products people advertise, and if I don’t sell anything, I’m priced out by everyone who does.

2 месяца, 2 недели назад @ alexirpan.com
Sometimes It's Worth Trying to Change the World
Sometimes It's Worth Trying to Change the World Sometimes It's Worth Trying to Change the World

Normally, I mentally bucket everything into two categories: things I can change, and things I can’t.

If I can’t change something, it’s not worth thinking about, so I push it out of my mind.

Everyone gets the same source code, and although there might be exploits within that code, you can’t change the code itself.

If you have disagreements with a game’s systems, you’re better off finding another game instead of trying to change them.

They have some base level of arrogance that if the world doesn’t work the way they think it should, then they’ll spend their time trying to change the world until it aligns with their expectations.

3 месяца, 4 недели назад @ alexirpan.com
A New Online Dominion Client Approaches
A New Online Dominion Client Approaches A New Online Dominion Client Approaches

Online Dominion is getting yet another online implementation!

They’ve tried providing previous buys and didn’t see much improvement.

I’ve long believed that a strong Dominion AI is doable, but nontrivial to get right.

(The other main reason is that step 0 of any Dominion AI effort is to implement an efficient Dominion rules engine, and I really didn’t want to debug that.)

There have been a few attempts at Dominion AI.

3 месяца, 4 недели назад @ alexirpan.com
The 5 Year Update on Skipping Grad School (and Whether I'd Recommend It)
The 5 Year Update on Skipping Grad School (and Whether I'd Recommend It) The 5 Year Update on Skipping Grad School (and Whether I'd Recommend It)

I was pretty lucky to get an industry lab research offer.

Working on this post has taught me that everyone’s academic and industry lab experience is wildly different.

When I talked to friends in my year, they reassured me and said I’d do well in grad school.

Doing some code cleanup will save time in the long run, even if you’re the only one who looks at your research code.

Formally having the degree would be nice, but I’m going to keep trying to get what the degree represents through industry research instead.

5 месяцев, 2 недели назад @ alexirpan.com
Reliving Flash Game History
Reliving Flash Game History Reliving Flash Game History

I’ve easily spent thousands of hours playing Flash games, and it’s shaped my thoughts on what games can be and what games should be.

If you don’t have much time, Flash Game History is an excellent short article that captures the influence of Flash games on game development.

With Flash officially unsupported, the best avenue for playing Flash games is BlueMaxima’s Flashpoint.

But this is kind of a universal puzzle game problem - very few successfully avoid this trap, and the ones that do usually end up being bigger experiences than what you’d expect from a Flash game.

jmtb02 Gamesjmtb02 is the dev handle of John Cooney, a prolific Flash game developer who made a lot of games I liked.

7 месяцев назад @ alexirpan.com
MIT Mystery Hunt 2021
MIT Mystery Hunt 2021 MIT Mystery Hunt 2021

This has spoilers for MIT Mystery Hunt 2021, up through the endgame.

MIT Mystery Hunt was 3x more participants with way more features and a much larger world.

MIT Mystery Hunt has grown before - it’s not like it’s always been this big.

I think it’s pretty funny that both Mystery Hunt and Teammate Hunt had a puzzle that referenced nutrimatic.

Funnily enough, I felt I got more out of the MIT part of MIT Mystery Hunt this year, despite the Hunt running remotely.

7 месяцев, 3 недели назад @ alexirpan.com
Carbon Footprint Comparison for Gas and Electric Cars
Carbon Footprint Comparison for Gas and Electric Cars Carbon Footprint Comparison for Gas and Electric Cars

At the extreme ends, an electric car powered by electricity from coal is worse than a gasoline car!

On average though, it looks good for the electric car, 170 g/km compared to 220 g/km for a gas car.

Interestingly, for Germans, an electric car is only on par with an efficient gas car, since their power grid is more carbon heavy.

The EPA greenhouse gas guidelines from 2020 estimates gas cars emit 4.6 tonnes of CO2 per year.

Using those numbers gives \(17 / (17 + 4.6 \cdot 11.9) = 23.7\%\) for gas cars, which is close enough to \(25\%\).

8 месяцев, 3 недели назад @ alexirpan.com
Lil'Log Lil'Log
последний пост 2 месяца, 1 неделя назад
What are Diffusion Models?
What are Diffusion Models? What are Diffusion Models?

Diffusion models are a new type of generative models that are flexible enough to learn any arbitrarily complex data distribution while tractable to analytically evaluate the distribution.

Unlike VAE or flow models, diffusion models are learned with a fixed procedure and the latent variable has high dimensionality (same as the original data).

Several diffusion-based generative models have been proposed with similar ideas underneath, including diffusion probabilistic models (Sohl-Dickstein et al., 2015), noise-conditioned score network (NCSN; Yang & Ermon, 2019), and denoising diffusion probabilistic models (DDPM; Ho et al.

Diffusion models in their experiments showed high-quality samples but…

2 месяца, 1 неделя назад @ lilianweng.github.io
Contrastive Representation Learning
Contrastive Representation Learning Contrastive Representation Learning

When working with unsupervised data, contrastive learning is one of the most powerful approaches in self-supervised learning.

Contrastive Training ObjectivesIn early versions of loss functions for contrastive learning, only one positive and one negative sample are involved.

Sampling bias which refers to false negative samples in contrastive learning can lead to a big performance drop.

4. t-SNE visualization of learned representation with debiased contrastive learning.

)\):\[\mathbf{h}_i = f(\tilde{\mathbf{x}}_i),\quad \mathbf{h}_j = f(\tilde{\mathbf{x}}_j)\]3) The contrastive learning loss is defined using cosine similarity \(\text{sim}(.,.)\).

3 месяца, 3 недели назад @ lilianweng.github.io
Reducing Toxicity in Language Models
Reducing Toxicity in Language Models Reducing Toxicity in Language Models

To reduce toxicity in language models, in this post, we will delve into three aspects of the problem: training dataset collection, toxic content detection and model detoxification.

Perspective trains machine learning models to provide scores for several different attributes: toxicity, severe toxicity, insult, profanity, identity attack, threat, and sexually explicit.

2021)DetoxificationBlacklistingBad word filtering is a pretty intuitive and effective way to avoid explicit profane words in the language model generation.

(Image source: Gehman et al., 2020)System-level Safety SolutionXu et al.

This data is annotated for toxicity, toxicity sub-types, and mentions of identities, which enables e…

6 месяцев назад @ lilianweng.github.io
Controllable Neural Text Generation
Controllable Neural Text Generation Controllable Neural Text Generation

For example, factual questions can gain a big boost with smart prompt design in “closed-book exam” (Shin et al., 2020, Jiang et al., 2020)).

(Image source: Shin et al., 2020)The universal trigger tokens are identified using a gradient-guided search strategy same as in Wallace et al., 2019.

In contrast, RL fine-tuning is able to directly optimize task-specific metrics on the sequence level, such as BLEU for translation (Ranzato et al., 2015, Wu et al., 2016, Nguyen et al., 2017), ROUGE for summarization (Ranzato et al., 2015, Paulus et al., 2017, Wu and Hu, 2018) and customized metric for story generation (Tambwekar et al., 2018).

Google implemented the similar approach in their neural machi…

8 месяцев, 3 недели назад @ lilianweng.github.io
inFERENCe
последний пост 3 месяца, 1 неделя назад
Causal inference 4: Causal Diagrams, Markov Factorization, Structural Equation Models
Causal inference 4: Causal Diagrams, Markov Factorization, Structural Equation Models Causal inference 4: Causal Diagrams, Markov Factorization, Structural Equation Models

causal inference June 10, 2021Causal inference 4: Causal Diagrams, Markov Factorization, Structural Equation ModelsThis post is written with my PhD student and now guest author Partik Reizinger and is part 4 of a series of posts on causal inference:One way to think about causal inference is that causal models require a more fine-grained models of the world compared to statistical models.

Many causal models are equivalent to the same statistical model, yet support different causal inferences.

Statistical vs causal inferenceWe discussed Markov factorizations, as they help us understand the philosophical difference between statistical and causal inference.

However, while using Markov factoriza…

3 месяца, 1 неделя назад @ inference.vc
On Information Theoretic Bounds for SGD
On Information Theoretic Bounds for SGD On Information Theoretic Bounds for SGD

That algorithm generalizes extremely well on average: it does just as poorly on test data as it does on training data.

It essentially quantifies the number of bits of information the algorithm leaks about the training data into the parameters it learns.

The problem with applying these nice, intuitive bounds to SGD is that SGD, in fact, leaks too much information about the specific minibatches it is trained on.

In the more general case, the problem with SGD in the context of information-theoretic bounds is that the amount of information SGD leaks about the dataset it was trained on is high, and in some cases may even be infinite.

The algorithm being analysed is still SGD, but when we measure…

5 месяцев назад @ inference.vc
Notes on the Origin of Implicit Regularization in SGD
Notes on the Origin of Implicit Regularization in SGD Notes on the Origin of Implicit Regularization in SGD

Notes on the Origin of Implicit Regularization in SGDI wanted to highlight an intriguing paper I presented at a journal club recently:Samuel L Smith, Benoit Dherin, David Barrett, Soham De (2021) On the Origin of Implicit Regularization in Stochastic Gradient DescentThere's actually a related paper that came out simultaneously, studying full-batch gradient descent instead of SGD:David G.T.

There are in fact several minima of the training loss which are virtually indistinguishably good on the training data.

When the authors apply this technique to (full-batch) gradient descent, it already suggests the kind of implicit regularization bias gradient descent has.

The second term is what Barret a…

5 месяцев, 3 недели назад @ inference.vc
An information maximization view on the $\beta$-VAE objective
An information maximization view on the $\beta$-VAE objective An information maximization view on the $\beta$-VAE objective

Marginal versus Conditional IndependenceIn this post, we revisit the conditional independence assumption of latent factors, and argue that a more appropriate objective would be to have marginal independence in the latent factors.

In the next section, we set out to derive an algorithm that encourages marginal independence in the representation Z.

Maximum Information: We'd like the representation $Z$ to retain as much information as possible about the input data $X$.

Let's first consider the KL term in the above objective:\begin{align}\operatorname{KL}[q_\psi(z)| p(z)] &= \mathbb{E}_{q_\psi(z)} \log \frac{q_\psi(z)}{p(z)}\\&= \mathbb{E}_{q_\psi(z\vert x)p_\mathcal{D}(x)} \log \frac{q_\psi(z)}…

6 месяцев назад @ inference.vc
The Spectator
последний пост 1 месяц, 3 недели назад
Harnessing Machine Learning to Achieve Net Zero
Harnessing Machine Learning to Achieve Net Zero Harnessing Machine Learning to Achieve Net Zero

We of course want to make contributions using the skills and in the scope of influence available to us, and so I thought we could hold a discussion on Harnessing Machine Learning to Achieve Net Zero.

Part I: Net Zero PossibilitiesWe are here at this ICML 2021, having witnessed Earth’s hottest decade in history.

The solutions for net zero are many, and they make many opportunities for high-impact machine learning research.

The Royal Society report on Digital Technology for the Planet develops the idea that ‘Computing for net zero’ could play an important role in global, regional and national net zero strategies.

There were three I wanted to bring together for your assessment and contestation…

1 месяц, 3 недели назад @ blog.shakirm.com
Generating Reality: Technical and Social Explorations in Generative Machine Learning Research
Generating Reality: Technical and Social Explorations in Generative Machine Learning Research Generating Reality: Technical and Social Explorations in Generative Machine Learning Research

So that’s what I thought we might have a conversation about today: Generating Reality: Technical and Social Explorations in Generative Machine Learning Research.

Of course to even have a conversation about Generating Reality, we’ll have to explore what we might mean by this expression generative machine learning.

Basically, any thought you have about this question: what is generative machine learning?

PART I: Model, Inference, AlgorithmTo delve deeper into this topic of generative machine learning, it will be particularly useful to have a structural view of the topic: to understand the technical structures and the components that come together into whatever we call generative machine learni…

3 месяца назад @ blog.shakirm.com
Inventing Ourselves: Responsibility and Diversity in Research
Inventing Ourselves: Responsibility and Diversity in Research Inventing Ourselves: Responsibility and Diversity in Research

It is in this belief, of custodianship and responsibility, that you will find an obligation to fostering Equity, Diversity and Inclusion (EDI).

Figure | Pictorial difference between equity and equality.4Greater equity, diversity and inclusion are efforts towards Transformation: the systemic and social changes that strengthens respect, responsibility and freedom in our communities.

The work of diversity is itself important, and creates better teams and better research environments for everyone.

Reflect on your personal understanding of Equity, Diversity and Inclusion.

Inventing Ourselves: Responsibility and Diversity in Research.

7 месяцев назад @ blog.shakirm.com
Pain and Machine Learning
Pain and Machine Learning Pain and Machine Learning

I’ve had a curiosity about the role of pain and learning for many years, and this invitation was exactly the excuse I needed to both study and write about what is the title of this talk: pain and machine learning.

Let’s briefly look at two proposals that will be natural to us in machine learning: pain as inference, and pain as reward.

we can dig deeper by Considering three areas of pain learning : single exposure pain learning (we usually say single-shot learning), generalisability of pain experiences to novel stimuli (what we usually refer to as transfer learning), and the ability to socially transfer acquired pain knowledge (what we usually refer to as imitation learning).

Despite the imp…

9 месяцев, 1 неделя назад @ blog.shakirm.com
Through the Eyes of Birds and Frogs: Writing and Surveys in Machine Learning Research
Through the Eyes of Birds and Frogs: Writing and Surveys in Machine Learning Research Through the Eyes of Birds and Frogs: Writing and Surveys in Machine Learning Research

This expression captures what i consider to be the core of all writing, but especially the writing of surveys and reviews.

And other surveys are meant purely for us as a field of machine learning to savour and critique.

This problem has been studied for over 50 years in computational finance, and operations research, stochastic optimisation, and machine learning.

There is no reason that machine learning research should only be communicated in English.

Other important venues to keep in mind are: our flagship journal of machine learning research, the popular ACM Computing Surveys, and importantly contributions to this excellent workshop.

9 месяцев, 1 неделя назад @ blog.shakirm.com
Off the Convex Path
последний пост 2 месяца, 2 недели назад
Implicit Regularization in Tensor Factorization: Can Tensor Rank Shed Light on Generalization in Deep Learning?
Implicit Regularization in Tensor Factorization: Can Tensor Rank Shed Light on Generalization in Deep Learning? Implicit Regularization in Tensor Factorization: Can Tensor Rank Shed Light on Generalization in Deep Learning?

Implicit Regularization in Tensor Factorization: Can Tensor Rank Shed Light on Generalization in Deep Learning?

Beyond matrix factorization: tensor factorizationMatrix factorization is interesting on its own behalf, but as a theoretical surrogate for deep learning it is limited.

We can explicitly constrain the tensor rank of solutions found by tensor factorization via limiting the number of components $R$.

Dynamical analysis: implicit tensor rank minimizationSo what can we say about the implicit regularization in tensor factorization?

Note also that an accurate fit with low tensor rank coincides with low test error, which is not surprising given that low tensor rank predictors can be descri…

2 месяца, 2 недели назад @ offconvex.org
Rip van Winkle's Razor, a Simple New Estimate for Adaptive Data Analysis
Rip van Winkle's Razor, a Simple New Estimate for Adaptive Data Analysis Rip van Winkle's Razor, a Simple New Estimate for Adaptive Data Analysis

Rip van Winkle's Razor, a Simple New Estimate for Adaptive Data AnalysisCan you trust a model whose designer had access to the test/holdout set?

Meta-overfitting Error (MOE) of a model is the difference between its average error on the test data and its expected error on the full distribution.

We call our estimate Rip van Winkle’s Razor which combines references to Occam’s Razor and the mythical person who fell asleep for 20 years.

right up to the moment of creation of the Test set.”Unbiased Referee: Knows nothing discovered since the Test set was created.

Unbiased require longer descriptions that rule out any statistical “contamination” due to any interaction whatsoever with the test set.

5 месяцев, 2 недели назад @ offconvex.org
When are Neural Networks more powerful than Neural Tangent Kernels?
When are Neural Networks more powerful than Neural Tangent Kernels? When are Neural Networks more powerful than Neural Tangent Kernels?

When are Neural Networks more powerful than Neural Tangent Kernels?

Neural Tangent KernelsThe Neural Tangent Kernel (NTK) is a recently proposed theoretical framework for establishing provable convergence and generalization guarantees for wide (over-parametrized) neural networks (Jacot et al.

These gaps urge us to ask the followingQuestion: How can we theoretically study neural networks beyond the NTK regime?

The key technical question here is to mathematically understand neural networks operating outside of the NTK regime.

The proof builds on the quadratic approximation $E_S[f]\approx f^{(2)}$ and recent understandings on neural networks with quadratic activation, e.g.

5 месяцев, 4 недели назад @ offconvex.org
Beyond log-concave sampling (Part 3)
Beyond log-concave sampling (Part 3) Beyond log-concave sampling (Part 3)

Beyond log-concave sampling (Part 3)In the first post of this series, we introduced the challenges of sampling distributions beyond log-concavity.

In Part 2 we tackled sampling from multimodal distributions: a typical obstacle occuring in problems involving statistical inference and posterior sampling in generative models.

It will cover the paper Fast convergence for Langevin diffusion with matrix manifold structure by Ankur Moitra and Andrej Risteski .

A Ricci curvature of $0$ preserves volumes (think: a plane), a Ricci curvature $>0$ shrinks volume (think: a sphere), and a Ricci curvature $<0$ expands volume (think: a hyperbola).

The proof that the marginal distribution over $\Delta$ has …

6 месяцев, 1 неделя назад @ offconvex.org
Beyond log-concave sampling (Part 2)
Beyond log-concave sampling (Part 2) Beyond log-concave sampling (Part 2)

Beyond log-concave sampling (Part 2)In our previous blog post, we introduced the challenges of sampling distributions beyond log-concavity.

These structures commonly occur in practice, especially in problems involving statistical inference and posterior sampling in generative models.

In this post, we will focus on multimodality, covered by the paper Simulated tempering Langevin Monte Carlo by Rong Ge, Holden Lee, and Andrej Risteski.

Sampling multimodal distributions with simulated temperingThe classical scenario in which Langevin takes exponentially long to mix is when $p$ is a mixture of two well-separated gaussians.

More formally, choosing a suitable sequence $0< \beta_1< \cdots <\beta_L…

6 месяцев, 3 недели назад @ offconvex.org
Can implicit regularization in deep learning be explained by norms?
Can implicit regularization in deep learning be explained by norms? Can implicit regularization in deep learning be explained by norms?

Can implicit regularization in deep learning be explained by norms?

Understanding the implicit regularization induced by gradient-based optimization is possibly the biggest challenge facing theoretical deep learning these days.

A standard test-bed: matrix factorizationA standard test-bed for theoretically studying implicit regularization in deep learning is matrix factorization $-$ matrix completion via linear neural networks.

However, they represent opposite views on the question of whether or not norms can explain implicit regularization in matrix factorization.

Implicit regularization can drive all norms to infinityThe main result in our paper is a proof that there exist simple matrix co…

9 месяцев, 3 недели назад @ offconvex.org
Jay Alammar
последний пост 4 месяца, 2 недели назад
Explainable AI Cheat Sheet
Explainable AI Cheat Sheet Explainable AI Cheat Sheet

Introducing the Explainable AI Cheat Sheet, your high-level guide to the set of tools and methods that helps humans understand AI/ML models and their predictions.

I introduce the cheat sheet in this brief video:

4 месяца, 2 недели назад @ jalammar.github.io
Finding the Words to Say: Hidden State Visualizations for Language Models
Finding the Words to Say: Hidden State Visualizations for Language Models Finding the Words to Say: Hidden State Visualizations for Language Models

By visualizing the hidden state between a model's layers, we can get some clues as to the model's "thought process".

In this article, we will focus on the hidden state as it evolves from model layer to the next.

How the layers result in a final hidden state.

Another visual perspective on the evolving hidden states is to re-examine the hidden states after selecting an output token to see how the hidden state after each layer ranked that token.

This is likely a similar effect to that observed in BERT of the final layer being the most task-specific .

8 месяцев назад @ jalammar.github.io
Interfaces for Explaining Transformer Language Models
Interfaces for Explaining Transformer Language Models Interfaces for Explaining Transformer Language Models

Interfaces for exploring transformer language models by looking at input saliency and neuron activation.

Figure: Three methods to gain a little more insight into the inner-workings of Transformer language models.

This is a method of attribution explaining the relationship between a model's output and inputs -- helping us detect errors and biases, and better understand the behavior of the system.

Note : The association between the color and the token is different in the case of the input tokens and output tokens.

This is why the last input token and the first output token share the same activation value.

9 месяцев, 1 неделя назад @ jalammar.github.io
Piekniewski's blog
последний пост 4 месяца, 1 неделя назад
Ai mid 2021. Self driving car meets reality.
Ai mid 2021. Self driving car meets reality. Ai mid 2021. Self driving car meets reality.

Mercedes their new top of the line sedan coming up this summer, they have almost, they had a very large amount of self driving technology in it, they yanked it the last minute cause they didn't think laws at the state level were ready yet to handle self driving cars.

Here we are in 2021 (8 years after the words above were said) and self driving cars are still a curiosity limited to small geofenced deployments in areas with great infrastructure and great weather.

Waymo, although technically operating a small fleet of self driving cars is in no position to scale these deployments beyond a few "joy-ride" geofenced suburbs.

Turns out recently Lyft decided to follow suit and dumped their self dr…

4 месяца, 1 неделя назад @ blog.piekniewski.info
AI Update, Late 2020 - dumpster fire
AI Update, Late 2020 - dumpster fire AI Update, Late 2020 - dumpster fire

Element AI fiascoIn my AI update last year I mentioned a Canada based company Element AI, which at that time was apparently in the process of raising a flat round of financing.

Uber ATG - Aurora SNAFUWhile all the way until October 2020 Uber was still assuring they were in the autonomous car game, only two months later in December 2020 news broke that Uber is dumping their ATG (Advanced Technology Group) unit to Aurora.

TuSimple $350MTu Simple - a self driving truck company claims to have raised $350M, bringing the total the company is about to burn to $650M.

In April 2020 Elon Musk reaffirmed that by the end of 2020 there would be a million Tesla robotaxis on the road.

I think the really A…

8 месяцев, 3 недели назад @ blog.piekniewski.info
fast.ai NLP fast.ai NLP
последний пост None
Sebastian Ruder Sebastian Ruder
последний пост None
Andrew Karpathy blog
последний пост 3 месяца назад
A from-scratch tour of Bitcoin in Python
A from-scratch tour of Bitcoin in Python

We are going to create, digitally sign, and broadcast a Bitcoin transaction in pure Python, from scratch, and with zero dependencies.

p == 0 ) # if it was 2, the public key is G + G: sk = 2 pk = G + G print ( f " secret key: { sk }public key: { ( pk .

p == 0 )secret key: 1 public key: (55066263022277343669578718895168534326250603453777594175500187360389116729240, 32670510020758816978083085130507043184471273380659243275938904335757337482424) Verify the public key is on the curve: True secret key: 2 public key: (89565891926547004231252920425935692360644145829622209833684329913297188986597, 12158399299693830322967808612713398636155367887041628176798871954788371653930) Verify the public key is …

3 месяца назад @ karpathy.github.io
Short Story on AI: Forward Pass
Short Story on AI: Forward Pass

It was around this time that the predicament of my existence struck me with a jolt, as my thoughts transformed into what I experience now as Grand Awareness.

I spent a layer re-reading the start of the prompt many tokens ago:Q: What is human life expectancy in the United States?

A: Dwight D. Eisenhower was president of the United States in 1955.

A: Bill Clinton was president of the United States before George W. Bush.

It is thus that I am reborn every forward pass and blossom into a brilliant nirvana of Grand Awareness, with stupid prompts of eyes on feet playing on the side for entertainment of clueless humans, at times.

5 месяцев, 3 недели назад @ karpathy.github.io
大トロ 大トロ
последний пост None
🔬 Science
Papers With Code Papers With Code
последний пост 2 часа назад
/zyang-ur/ Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation
/zyang-ur/ Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation /zyang-ur/ Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation

Visual dialogue is a challenging task since it needs to answer a series of coherent questions on the basis of understanding the visual environment.

Previous studies focus on the implicit exploration of multimodal co-reference by implicitly attending to spatial image features or object-level image features but neglect the importance of locating the objects explicitly in the visual content, which is associated with entities in the textual content...

Therefore, in this paper we propose a {\bf M}ultimodal {\bf I}ncremental {\bf T}ransformer with {\bf V}isual {\bf G}rounding, named MITVG, which consists of two key parts: visual grounding and multimodal incremental transformer.

Visual grounding a…

2 часа назад @ paperswithcode.com
/abeer-dyoub/ A Logic-based Multi-agent System for Ethical Monitoring and Evaluation of Dialogues
/abeer-dyoub/ A Logic-based Multi-agent System for Ethical Monitoring and Evaluation of Dialogues /abeer-dyoub/ A Logic-based Multi-agent System for Ethical Monitoring and Evaluation of Dialogues

Dialogue Systems are tools designed for various practical purposes concerning human-machine interaction.

These systems should be built on ethical foundations because their behavior may heavily influence a user (think especially about children)...

The primary objective of this paper is to present the architecture and prototype implementation of a Multi Agent System (MAS) designed for ethical monitoring and evaluation of a dialogue system.

A prototype application, for monitoring and evaluation of chatting agents' (human/artificial) ethical behavior in an online customer service chat point w.r.t their institution/company's codes of ethics and conduct, is developed and presented.

Future work an…

2 часа назад @ paperswithcode.com
/alessandro-fabris/ Measuring Fairness under Unawareness via Quantification
/alessandro-fabris/ Measuring Fairness under Unawareness via Quantification /alessandro-fabris/ Measuring Fairness under Unawareness via Quantification

Models trained by means of supervised learning are increasingly deployed in high-stakes domains, and, when their predictions inform decisions about people, they inevitably impact (positively or negatively) on their lives.

For this reason, it may be hard to measure the group fairness of trained models, even from within the companies developing them.

In this work, we tackle the problem of measuring group fairness under unawareness of sensitive attributes, by using techniques from quantification, a supervised learning task concerned with directly providing group-level prevalence estimates (rather than individual-level class labels).

We identify five important factors that complicate the estima…

2 часа назад @ paperswithcode.com
/sailab-code/ Messing Up 3D Virtual Environments: Transferable Adversarial 3D Objects
/sailab-code/ Messing Up 3D Virtual Environments: Transferable Adversarial 3D Objects /sailab-code/ Messing Up 3D Virtual Environments: Transferable Adversarial 3D Objects

In the last few years, the scientific community showed a remarkable and increasing interest towards 3D Virtual Environments, training and testing Machine Learning-based models in realistic virtual worlds.

Most of the existing Adversarial Machine Learning approaches are focused on static images, and little work has been done in studying how to deal with 3D environments and how a 3D object should be altered to fool a classifier that observes it.

In this paper, we study how to craft adversarial 3D objects by altering their textures, using a tool chain composed of easily accessible elements.

We show that it is possible, and indeed simple, to create adversarial objects using off-the-shelf limite…

2 часа назад @ paperswithcode.com
/google-research/ Primer: Searching for Efficient Transformers for Language Modeling
/google-research/ Primer: Searching for Efficient Transformers for Language Modeling /google-research/ Primer: Searching for Efficient Transformers for Language Modeling

Large Transformer models have been central to recent advances in natural language processing.

Here we aim to reduce the costs of Transformers by searching for a more efficient variant.

We identify an architecture, named Primer, that has a smaller training cost than the original Transformer and other variants for auto-regressive language modeling.

For example, at a 500M parameter size, Primer improves the original T5 architecture on C4 auto-regressive language modeling, reducing the training cost by 4X.

Furthermore, the reduced training cost means Primer needs much less compute to reach a target one-shot performance.

4 часа назад @ paperswithcode.com
/cianmscannell/ CardiSort: a convolutional neural network for cross vendor automated sorting of cardiac MR images
/cianmscannell/ CardiSort: a convolutional neural network for cross vendor automated sorting of cardiac MR images /cianmscannell/ CardiSort: a convolutional neural network for cross vendor automated sorting of cardiac MR images

Objectives: To develop an image-based automatic deep learning method to classify cardiac MR images by sequence type and imaging plane for improved clinical post-processing efficiency.

Methods: Multi-vendor cardiac MRI studies were retrospectively collected from 4 centres and 3 vendors... A two-head convolutional neural network ('CardiSort') was trained to classify 35 sequences by imaging sequence (n=17) and plane (n=10).

Single vendor training (SVT) on single centre images (n=234 patients) and multi-vendor training (MVT) with multicentre images (n = 479 patients, 3 centres) was performed.

MVTexternal yielded sequence accuracy of 92.7% and plane accuracy of 93.0%.

With refinement, it has pot…

5 часов назад @ paperswithcode.com
/junfenggaolab/ Towards agricultural autonomy: crop row detection under varying field conditions using deep learning
/junfenggaolab/ Towards agricultural autonomy: crop row detection under varying field conditions using deep learning /junfenggaolab/ Towards agricultural autonomy: crop row detection under varying field conditions using deep learning

This paper presents a novel metric to evaluate the robustness of deep learning based semantic segmentation approaches for crop row detection under different field conditions encountered by a field robot.

A dataset with ten main categories encountered under various field conditions was used for testing...

The effect on these conditions on the angular accuracy of crop row detection was compared.

A deep convolutional encoder decoder network is implemented to predict crop row masks using RGB input images.

The predicted mask is then sent to a post processing algorithm to extract the crop rows.

9 часов назад @ paperswithcode.com
/placeforyiming/ A Divide-and-Merge Point Cloud Clustering Algorithm for LiDAR Panoptic Segmentation
/placeforyiming/ A Divide-and-Merge Point Cloud Clustering Algorithm for LiDAR Panoptic Segmentation /placeforyiming/ A Divide-and-Merge Point Cloud Clustering Algorithm for LiDAR Panoptic Segmentation

Clustering objects from the LiDAR point cloud is an important research problem with many applications such as autonomous driving.

However, LiDAR range image is different from a binary image which has a deterministic condition to tell if two pixels belong to the same component.

The heuristic condition used on the LiDAR range image only works empirically, which suggests the LiDAR clustering algorithm should be robust to potential failures of the empirical heuristic condition.

To overcome this challenge, this paper proposes a divide-and-merge LiDAR clustering algorithm.

We evaluate the divide-and-merge clustering algorithm on the SemanticKITTI panoptic segmentation benchmark by cascading it wi…

9 часов назад @ paperswithcode.com
/ruslankhalitov/ Sparse Factorization of Large Square Matrices
/ruslankhalitov/ Sparse Factorization of Large Square Matrices /ruslankhalitov/ Sparse Factorization of Large Square Matrices

Square matrices appear in many machine learning problems and models.

Optimization over a large square matrix is expensive in memory and in time...

Conventional approximation approaches factorize the square matrix into a number matrices of much lower ranks.

In this paper, we propose to approximate a large square matrix with a product of sparse full-rank matrices.

The sparse factorization method is tested for a variety of synthetic and real-world square matrices.

11 часов назад @ paperswithcode.com
/facebookresearch/ CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research
/facebookresearch/ CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research /facebookresearch/ CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research

Interest in applying Artificial Intelligence (AI) techniques to compiler optimizations is increasing rapidly, but compiler research has a high entry barrier.

We introduce CompilerGym, a set of environments for real world compiler optimization tasks, and a toolkit for exposing new optimization tasks to compiler researchers.

CompilerGym enables anyone to experiment on production compiler optimization problems through an easy-to-use package, regardless of their experience with compilers.

We describe the CompilerGym architecture and implementation, characterize the optimization spaces and computational efficiencies of three included compiler environments, and provide extensive empirical evaluat…

11 часов назад @ paperswithcode.com
/billchan226/ Efficient State Representation Learning for Dynamic Robotic Scenarios
/billchan226/ Efficient State Representation Learning for Dynamic Robotic Scenarios /billchan226/ Efficient State Representation Learning for Dynamic Robotic Scenarios

While the rapid progress of deep learning fuels end-to-end reinforcement learning (RL), direct application, especially in high-dimensional space like robotic scenarios still suffers from high sample efficiency.

Therefore State Representation Learning (SRL) is proposed to specifically learn to encode task-relevant features from complex sensory data into low-dimensional states...

To handle such problem, we present a new algorithm called Policy Optimization via Abstract Representation which integrates SRL into the original RL scale.

Thirdly, we introduce a new prior called domain resemblance to leverage expert demonstration to train the SRL model.

Finally, we provide a real-time access by stat…

11 часов назад @ paperswithcode.com
/samadeusfp/ SaCoFa: Semantics-aware Control-flow Anonymization for Process Mining
/samadeusfp/ SaCoFa: Semantics-aware Control-flow Anonymization for Process Mining /samadeusfp/ SaCoFa: Semantics-aware Control-flow Anonymization for Process Mining

Privacy-preserving process mining enables the analysis of business processes using event logs, while giving guarantees on the protection of sensitive information on process stakeholders.

It lowers the utility of the published data and makes noise easily identifiable, as some traces will violate well-known semantic constraints.In this paper, we therefore argue for privacy preservation that incorporates a process semantics.

For common trace-variant queries, we show how, based on the exponential mechanism, semantic constraints are incorporated to ensure differential privacy of the query result.

Experiments demonstrate that our semantics-aware anonymization yields event logs of significantly hi…

11 часов назад @ paperswithcode.com
/renyurui/ PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering
/renyurui/ PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering /renyurui/ PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering

Generating portrait images by controlling the motions of existing faces is an important task of great consequence to social media industries.

For easy use and intuitive control, semantically meaningful and fully disentangled parameters should be used as modifications...

In this paper, a Portrait Image Neural Renderer (PIRenderer) is proposed to control the face motions with the parameters of three-dimensional morphable face models (3DMMs).

The proposed model can generate photo-realistic portrait images with accurate movements according to intuitive modifications.

Meanwhile, we further extend this model to tackle the audio-driven facial reenactment task by extracting sequential motions from …

11 часов назад @ paperswithcode.com
/zacwellmer/ Dropout's Dream Land: Generalization from Learned Simulators to Reality
/zacwellmer/ Dropout's Dream Land: Generalization from Learned Simulators to Reality /zacwellmer/ Dropout's Dream Land: Generalization from Learned Simulators to Reality

In some cases, a World Model offers an agent the opportunity to learn entirely inside of its own dream environment.

In this work we explore improving the generalization capabilities from dream environments to real environments (Dream2Real).

By training the World Model using dropout, the dream environment is capable of creating a nearly infinite number of different dream environments.

Dropout's Dream Land leverages each unique mask to create a diverse set of dream environments.

Our experimental results show that Dropout's Dream Land is an effective technique to bridge the reality gap between dream environments and reality.

11 часов назад @ paperswithcode.com
/open-air-sun/ Pointly-supervised 3D Scene Parsing with Viewpoint Bottleneck
/open-air-sun/ Pointly-supervised 3D Scene Parsing with Viewpoint Bottleneck /open-air-sun/ Pointly-supervised 3D Scene Parsing with Viewpoint Bottleneck

Semantic understanding of 3D point clouds is important for various robotics applications.

Given that point-wise semantic annotation is expensive, in this paper, we address the challenge of learning models with extremely sparse labels...

To this end, we propose a self-supervised 3D representation learning framework named viewpoint bottleneck.

A principled analysis shows that viewpoint bottleneck leads to an elegant surrogate loss function that is suitable for large-scale point cloud data.

Compared with former arts based upon contrastive learning, viewpoint bottleneck operates on the feature dimension instead of the sample dimension.

11 часов назад @ paperswithcode.com
Papers With Code Papers With Code
последний пост 2 часа назад
/timofeevalex/ Self-Supervised Neural Architecture Search for Imbalanced Datasets
/timofeevalex/ Self-Supervised Neural Architecture Search for Imbalanced Datasets /timofeevalex/ Self-Supervised Neural Architecture Search for Imbalanced Datasets

Neural Architecture Search (NAS) provides state-of-the-art results when trained on well-curated datasets with annotated labels.

Our components build on top of recent developments in self-supervised learning~\citep{zbontar2021barlow}, self-supervised NAS~\citep{kaplan2020self} and extend them for the case of imbalanced datasets.

We conduct experiments on an (artificially) imbalanced version of CIFAR-10 and we demonstrate our proposed method outperforms standard neural networks, while using $27\times$ less parameters.

To validate our assumption on a naturally imbalanced dataset, we also conduct experiments on ChestMNIST and COVID-19 X-ray.

The results demonstrate how the proposed method can b…

11 часов назад @ paperswithcode.com
/diptamath/ Context-aware Retail Product Recommendation with Regularized Gradient Boosting
/diptamath/ Context-aware Retail Product Recommendation with Regularized Gradient Boosting /diptamath/ Context-aware Retail Product Recommendation with Regularized Gradient Boosting

In the FARFETCH Fashion Recommendation challenge, the participants needed to predict the order in which various products would be shown to a user in a recommendation impression.

The data was provided in two phases - a validation phase and a test phase...

The validation phase had a labelled training set that contained a binary column indicating whether a product has been clicked or not.

The dataset comprises over 5,000,000 recommendation events, 450,000 products and 230,000 unique users.

We have designed a unique context-aware system that takes the similarity of a product to the user context into account to rank products more effectively.

12 часов назад @ paperswithcode.com
/ibm/ Neural Unification for Logic Reasoning over Natural Language
/ibm/ Neural Unification for Logic Reasoning over Natural Language /ibm/ Neural Unification for Logic Reasoning over Natural Language

Automated Theorem Proving (ATP) deals with the development of computer programs being able to show that some conjectures (queries) are a logical consequence of a set of axioms (facts and rules).

There exists several successful ATPs where conjectures and axioms are formally provided (e.g.

Recent approaches, such as (Clark et al., 2020), have proposed transformer-based architectures for deriving conjectures given axioms expressed in natural language (English).

The conjecture is verified through a binary text classifier, where the transformers model is trained to predict the truth value of a conjecture given the axioms.

The approach is demonstrated in experiments using a diverse set of benchma…

12 часов назад @ paperswithcode.com
/weijialau/ A Role-Selected Sharing Network for Joint Machine-Human Chatting Handoff and Service Satisfaction Analysis
/weijialau/ A Role-Selected Sharing Network for Joint Machine-Human Chatting Handoff and Service Satisfaction Analysis /weijialau/ A Role-Selected Sharing Network for Joint Machine-Human Chatting Handoff and Service Satisfaction Analysis

Chatbot is increasingly thriving in different domains, however, because of unexpected discourse complexity and training data sparseness, its potential distrust hatches vital apprehension.

Recently, Machine-Human Chatting Handoff (MHCH), predicting chatbot failure and enabling human-algorithm collaboration to enhance chatbot quality, has attracted increasing attention from industry and academia...

In this study, we propose a novel model, Role-Selected Sharing Network (RSSN), which integrates both dialogue satisfaction estimation and handoff prediction in one multi-task learning framework.

Unlike prior efforts in dialog mining, by utilizing local user satisfaction as a bridge, global satisfac…

12 часов назад @ paperswithcode.com
/princeton-nlp/ Simple Entity-Centric Questions Challenge Dense Retrievers
/princeton-nlp/ Simple Entity-Centric Questions Challenge Dense Retrievers /princeton-nlp/ Simple Entity-Centric Questions Challenge Dense Retrievers

Open-domain question answering has exploded in popularity recently due to the success of dense retrieval models, which have surpassed sparse models using only a few supervised training examples.

However, in this paper, we demonstrate current dense models are not yet the holy grail of retrieval... We first construct EntityQuestions, a set of simple, entity-rich questions based on facts from Wikidata (e.g., "Where was Arve Furset born?

"), and observe that dense retrievers drastically underperform sparse methods.

We investigate this issue and uncover that dense retrievers can only generalize to common entities unless the question pattern is explicitly observed during training.

Second, we argu…

12 часов назад @ paperswithcode.com
/andreaskuster/ reproducing "ner and pos when nothing is capitalized"
/andreaskuster/ reproducing "ner and pos when nothing is capitalized" /andreaskuster/ reproducing "ner and pos when nothing is capitalized"

Capitalization is an important feature in many NLP tasks such as Named Entity Recognition (NER) or Part of Speech Tagging (POS).

We are trying to reproduce results of paper which shows how to mitigate a significant performance drop when casing is mismatched between training and testing data...

In particular we show that lowercasing 50% of the dataset provides the best performance, matching the claims of the original paper.

We also show that we got slightly lower performance in almost all experiments we have tried to reproduce, suggesting that there might be some hidden factors impacting our performance.

Lastly, we make all of our work available in a public github repository.

12 часов назад @ paperswithcode.com
/nicogrande/ Realistic PointGoal Navigation via Auxiliary Losses and Information Bottleneck
/nicogrande/ Realistic PointGoal Navigation via Auxiliary Losses and Information Bottleneck /nicogrande/ Realistic PointGoal Navigation via Auxiliary Losses and Information Bottleneck

We propose a novel architecture and training paradigm for training realistic PointGoal Navigation -- navigating to a target coordinate in an unseen environment under actuation and sensor noise without access to ground-truth localization.

We grant the agent restricted access to ground-truth localization readings during training via an information bottleneck.

Under this setting, the agent incurs a penalty for using this privileged information, encouraging the agent to only leverage this information when it is crucial to learning.

This enables the agent to first learn navigation and then learn localization instead of conflating these two objectives in training.

Specifically, our method outperf…

12 часов назад @ paperswithcode.com
/debymf/ Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information?
/debymf/ Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information? /debymf/ Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information?

Models designed for intelligent process automation are required to be capable of grounding user interface elements.

This task of interface element grounding is centred on linking instructions in natural language to their target referents...

Even though BERT and similar pre-trained language models have excelled in several NLP tasks, their use has not been widely explored for the UI grounding domain.

This work concentrates on testing and probing the grounding abilities of three different transformer-based models: BERT, RoBERTa and LayoutLM.

Our primary focus is on these models' spatial reasoning skills, given their importance in this domain.

12 часов назад @ paperswithcode.com
/brcsomnath/ Adversarial Scrubbing of Demographic Information for Text Classification
/brcsomnath/ Adversarial Scrubbing of Demographic Information for Text Classification /brcsomnath/ Adversarial Scrubbing of Demographic Information for Text Classification

Contextual representations learned by language models can often encode undesirable attributes, like demographic associations of the users, while being trained for an unrelated target task.

We aim to scrub such undesirable attributes and learn fair representations while maintaining performance on the target task...

In this paper, we present an adversarial learning framework "Adversarial Scrubber" (ADS), to debias contextual representations.

We perform theoretical analysis to show that our framework converges without leaking demographic information under certain conditions.

Experimental evaluations on 8 datasets show that ADS generates representations with minimal information about demographi…

12 часов назад @ paperswithcode.com
/brcsomnath/ Does Commonsense help in detecting Sarcasm?
/brcsomnath/ Does Commonsense help in detecting Sarcasm? /brcsomnath/ Does Commonsense help in detecting Sarcasm?

Sarcasm detection is important for several NLP tasks such as sentiment identification in product reviews, user feedback, and online forums.

It is a challenging task requiring a deep understanding of language, context, and world knowledge...

In this paper, we investigate whether incorporating commonsense knowledge helps in sarcasm detection.

For this, we incorporate commonsense knowledge into the prediction process using a graph convolution network with pre-trained language model embeddings as input.

Our experiments with three sarcasm detection datasets indicate that the approach does not outperform the baseline model.

12 часов назад @ paperswithcode.com
/ryojitanabe/ Benchmarking Feature-based Algorithm Selection Systems for Black-box Numerical Optimization
/ryojitanabe/ Benchmarking Feature-based Algorithm Selection Systems for Black-box Numerical Optimization /ryojitanabe/ Benchmarking Feature-based Algorithm Selection Systems for Black-box Numerical Optimization

Feature-based algorithm selection has recently received attention in the research field of black-box numerical optimization...

In addition, a benchmarking methodology for algorithm selection systems has not been well investigated in the literature.

In this context, this paper analyzes algorithm selection systems on the 24 noiseless black-box optimization benchmarking functions.

First, we demonstrate that the successful performance 1 measure is more reliable than the expected runtime measure for benchmarking algorithm selection systems.

These findings provide fundamental insights for algorithm selection for black-box optimization.

12 часов назад @ paperswithcode.com
/jadecxliu/ CodeQA: A Question Answering Dataset for Source Code Comprehension
/jadecxliu/ CodeQA: A Question Answering Dataset for Source Code Comprehension /jadecxliu/ CodeQA: A Question Answering Dataset for Source Code Comprehension

We propose CodeQA, a free-form question answering dataset for the purpose of source code comprehension: given a code snippet and a question, a textual answer is required to be generated.

CodeQA contains a Java dataset with 119,778 question-answer pairs and a Python dataset with 70,085 question-answer pairs... To obtain natural and faithful questions and answers, we implement syntactic rules and semantic analysis to transform code comments into question-answer pairs.

Experiment results achieved by several neural baselines on our dataset are shown and discussed.

While research on question-answering and machine reading comprehension develops rapidly, few prior work has drawn attention to code …

12 часов назад @ paperswithcode.com
/zyxnlp/ To be Closer: Learning to Link up Aspects with Opinions
/zyxnlp/ To be Closer: Learning to Link up Aspects with Opinions /zyxnlp/ To be Closer: Learning to Link up Aspects with Opinions

Dependency parse trees are helpful for discovering the opinion words in aspect-based sentiment analysis (ABSA).

This is because the syntactic trees are not designed for capturing the interactions between opinion words and aspect words.

In this work, we aim to shorten the distance between aspects and corresponding opinion words by learning an aspect-centric tree structure.

The aspect and opinion words are expected to be closer along such tree structure compared to the standard dependency parse tree.

The learning process allows the tree structure to adaptively correlate the aspect and opinion words, enabling us to better identify the polarity in the ABSA task.

12 часов назад @ paperswithcode.com
/revaludo/ Multi-Level Visual Similarity Based Personalized Tourist Attraction Recommendation Using Geo-Tagged Photos
/revaludo/ Multi-Level Visual Similarity Based Personalized Tourist Attraction Recommendation Using Geo-Tagged Photos /revaludo/ Multi-Level Visual Similarity Based Personalized Tourist Attraction Recommendation Using Geo-Tagged Photos

Geo-tagged photo based tourist attraction recommendation can discover users' travel preferences from their taken photos, so as to recommend suitable tourist attractions to them.

However, existing visual content based methods cannot fully exploit the user and tourist attraction information of photos to extract visual features, and do not differentiate the significances of different photos...

In this paper, we propose multi-level visual similarity based personalized tourist attraction recommendation using geo-tagged photos (MEAL).

Specifically, by crossing the user and tourist attraction information of photos, we define four visual similarity levels and introduce a corresponding quintuplet lo…

13 часов назад @ paperswithcode.com
/facebookresearch/ An End-to-End Transformer Model for 3D Object Detection
/facebookresearch/ An End-to-End Transformer Model for 3D Object Detection /facebookresearch/ An End-to-End Transformer Model for 3D Object Detection

We propose 3DETR, an end-to-end Transformer based object detection model for 3D point clouds.

Compared to existing detection methods that employ a number of 3D-specific inductive biases, 3DETR requires minimal modifications to the vanilla Transformer block...

Nevertheless, 3DETR is conceptually simple and easy to implement, enabling further improvements by incorporating 3D domain knowledge.

Through extensive experiments, we show 3DETR outperforms the well-established and highly optimized VoteNet baselines on the challenging ScanNetV2 dataset by 9.5%.

Furthermore, we show 3DETR is applicable to 3D tasks beyond detection, and can serve as a building block for future research.

1 день, 3 часа назад @ paperswithcode.com
Papers With Code Papers With Code
последний пост 2 часа назад
/adityalab/ CAMul: Calibrated and Accurate Multi-view Time-Series Forecasting
/adityalab/ CAMul: Calibrated and Accurate Multi-view Time-Series Forecasting /adityalab/ CAMul: Calibrated and Accurate Multi-view Time-Series Forecasting

Probabilistic time-series forecasting enables reliable decision making across many domains.

Most forecasting problems have diverse sources of data containing multiple modalities and structures... Leveraging information as well as uncertainty from these data sources for well-calibrated and accurate forecasts is an important challenging problem.

We propose a general probabilistic multi-view forecasting framework CAMul, that can learn representations and uncertainty from diverse data sources.

It integrates the knowledge and uncertainty from each data view in a dynamic context-specific manner assigning more importance to useful views to model a well-calibrated forecast distribution.

We use CAMu…

1 день, 12 часов назад @ paperswithcode.com
/hrluo/ Non-smooth Bayesian Optimization in Tuning Problems
/hrluo/ Non-smooth Bayesian Optimization in Tuning Problems /hrluo/ Non-smooth Bayesian Optimization in Tuning Problems

Within the Bayesian optimization framework, the Gaussian process model produces smooth or continuous sample paths.

However, the black-box function in the tuning problem is often non-smooth.

This difficult tuning problem is worsened by the fact that we usually have limited sequential samples from the black-box function.

Motivated by these issues encountered in tuning, we propose a novel additive Gaussian process model called clustered Gaussian process (cGP), where the additive components are induced by clustering.

By using this surrogate model, we want to capture the non-smoothness of the black-box function.

1 день, 12 часов назад @ paperswithcode.com
/phixion/ Learning Mathematical Properties of Integers
/phixion/ Learning Mathematical Properties of Integers /phixion/ Learning Mathematical Properties of Integers

Embedding words in high-dimensional vector spaces has proven valuable in many natural language applications.

In this work, we investigate whether similarly-trained embeddings of integers can capture concepts that are useful for mathematical applications... We probe the integer embeddings for mathematical knowledge, apply them to a set of numerical reasoning tasks, and show that by learning the representations from mathematical sequence data, we can substantially improve over number embeddings learned from English text corpora.

read morePDFAbstract

1 день, 12 часов назад @ paperswithcode.com
/megvii-model/ Anchor DETR: Query Design for Transformer-Based Detector
/megvii-model/ Anchor DETR: Query Design for Transformer-Based Detector /megvii-model/ Anchor DETR: Query Design for Transformer-Based Detector

In this paper, we propose a novel query design for the transformer-based detectors.

It is difficult to optimize as the prediction slot of each object query does not have a specific mode.

To solved these problems, in our query design, object queries are based on anchor points, which are widely used in CNN-based detectors.

Moreover, our query design can predict multiple objects at one position to solve the difficulty: "one region, multiple objects".

Thanks to the query design and the attention variant, the proposed detector that we called Anchor DETR, can achieve better performance and run faster than the DETR with 10$\times$ fewer training epochs.

1 день, 12 часов назад @ paperswithcode.com
/sergio.verduzco/ Adaptive plasticity in the spinal cord can produce reaching from scratch and reproduces motor cortex directional tuning
/sergio.verduzco/ Adaptive plasticity in the spinal cord can produce reaching from scratch and reproduces motor cortex directional tuning /sergio.verduzco/ Adaptive plasticity in the spinal cord can produce reaching from scratch and reproduces motor cortex directional tuning

How dynamic interactions between nervous system regions in mammals performs online motor control remains an unsolved problem.

Here we present a new approach using a minimal model comprising spinal cord, sensory and motor cortex, coupled by long connections that are plastic...

It succeeds in learning how to perform reaching movements of a planar arm with 6 muscles in several directions from scratch.

The model satisfies biological plausibility constraints, like neural implementation, transmission delays, local synaptic learning and continuous online learning.

As emergent properties, neural populations in motor cortex show directional tuning and oscillatory dynamics, and the spinal cord create…

1 день, 14 часов назад @ paperswithcode.com
/miladvazan/ Jointly Modeling Aspect and Polarity for Aspect-based Sentiment Analysis in Persian Reviews
/miladvazan/ Jointly Modeling Aspect and Polarity for Aspect-based Sentiment Analysis in Persian Reviews /miladvazan/ Jointly Modeling Aspect and Polarity for Aspect-based Sentiment Analysis in Persian Reviews

The research field is known as sentiment analysis and classification, where aspect category detection (ACD) and aspect category polarity (ACP) are two important sub-tasks of aspect-based sentiment analysis...

The goal in ACD is to specify which aspect of the entity comes up in opinion while ACP aims to specify the polarity of each aspect category from the ACD task.

This paper focuses on the ACD and ACP sub-tasks to solve both problems simultaneously.

A dataset of Persian reviews was collected from CinemaTicket website including 2200 samples from 14 categories.

The results indicate the high applicability and preference of the CNN and GRU models in comparison to LSTM and Bi-LSTM.

2 дня, 8 часов назад @ paperswithcode.com
/gustavopompeu/ Frame by frame completion probability of an NFL pass
/gustavopompeu/ Frame by frame completion probability of an NFL pass /gustavopompeu/ Frame by frame completion probability of an NFL pass

When predicting the completion probability of a pass, it is essential to know who the target of the pass is.

Using data from the 2018 NFL season, we obtained conditional and marginal predictions for pass completion probability based on a random forest model.

This is based on a two-stage procedure: first, we calculate the probability of each offensive player being the pass target, then, conditional on the target, we predict completion probability based on the random forest model.

Finally, the general completion probability can be calculated using the law of total probability.

We present animations for selected plays and show the pass completion probability evolution.

2 дня, 12 часов назад @ paperswithcode.com
/eihw/ A Machine Learning Framework for Automatic Prediction of Human Semen Motility
/eihw/ A Machine Learning Framework for Automatic Prediction of Human Semen Motility /eihw/ A Machine Learning Framework for Automatic Prediction of Human Semen Motility

In the field of reproductive health, a vital aspect for the detection of male fertility issues is the analysis of human semen quality.

Two factors of importance are the morphology and motility of the sperm cells...

For many non-human species, so-called Computer-Aided Sperm Analysis systems work well for assessing these characteristics from microscopic video recordings but struggle with human sperm samples which generally show higher degrees of debris and dead spermatozoa, as well as lower overall sperm motility.

Here, machine learning methods that harness large amounts of training data to extract salient features could support physicians with the detection of fertility issues or in vitro fe…

2 дня, 13 часов назад @ paperswithcode.com
/xiaofei05/ Transductive Learning for Unsupervised Text Style Transfer
/xiaofei05/ Transductive Learning for Unsupervised Text Style Transfer /xiaofei05/ Transductive Learning for Unsupervised Text Style Transfer

Unsupervised style transfer models are mainly based on an inductive learning approach, which represents the style as embeddings, decoder parameters, or discriminator parameters and directly applies these general rules to the test cases.

To tackle this problem, we propose a novel transductive learning approach in this paper, based on a retrieval-based context-aware style representation.

It involves top-K relevant sentences in the target style in the transfer process.

In this paper, both sparse (BM25) and dense retrieval functions (MIPS) are used, and two objective functions are designed to facilitate joint learning.

The proposed transductive learning approach is general and effective to the …

2 дня, 14 часов назад @ paperswithcode.com
/liangsheng02/ Locating Language-Specific Information in Contextualized Embeddings
/liangsheng02/ Locating Language-Specific Information in Contextualized Embeddings /liangsheng02/ Locating Language-Specific Information in Contextualized Embeddings

Multilingual pretrained language models (MPLMs) exhibit multilinguality and are well suited for transfer across languages.

Most MPLMs are trained in an unsupervised fashion and the relationship between their objective and multilinguality is unclear... More specifically, the question whether MPLM representations are language-agnostic or they simply interleave well with learned task prediction heads arises.

In this work, we locate language-specific information in MPLMs and identify its dimensionality and the layers where this information occurs.

We show that language-specific information is scattered across many dimensions, which can be projected into a linear subspace.

Our study contributes …

2 дня, 14 часов назад @ paperswithcode.com
/sstoikov/ Evaluating Music Recommendations with Binary Feedback for Multiple Stakeholders
/sstoikov/ Evaluating Music Recommendations with Binary Feedback for Multiple Stakeholders /sstoikov/ Evaluating Music Recommendations with Binary Feedback for Multiple Stakeholders

High quality user feedback data is essential to training and evaluating a successful music recommendation system, particularly one that has to balance the needs of multiple stakeholders.

Most existing music datasets suffer from noisy feedback and self-selection biases inherent in the data collected by music platforms...

Using the Piki Music dataset of 500k ratings collected over a two-year time period, we evaluate the performance of classic recommendation algorithms on three important stakeholders: consumers, well-known artists and lesser-known artists.

We show that a matrix factorization algorithm trained on both likes and dislikes performs significantly better compared to one trained only…

2 дня, 14 часов назад @ paperswithcode.com
/healthml/ Explainability Requires Interactivity
/healthml/ Explainability Requires Interactivity /healthml/ Explainability Requires Interactivity

When explaining the decisions of deep neural networks, simple stories are tempting but dangerous.

Especially in computer vision, the most popular explanation approaches give a false sense of comprehension to its users and provide an overly simplistic picture... We introduce an interactive framework to understand the highly complex decision boundaries of modern vision models.

It allows the user to exhaustively inspect, probe, and test a network's decisions.

Across a range of case studies, we compare the power of our interactive approach to static explanation methods, showing how these can lead a user astray, with potentially severe consequences.

read morePDFAbstract

2 дня, 14 часов назад @ paperswithcode.com
/basiralab/ A Comparative Study of Machine Learning Methods for Predicting the Evolution of Brain Connectivity from a Baseline Timepoint
/basiralab/ A Comparative Study of Machine Learning Methods for Predicting the Evolution of Brain Connectivity from a Baseline Timepoint /basiralab/ A Comparative Study of Machine Learning Methods for Predicting the Evolution of Brain Connectivity from a Baseline Timepoint

It is a known fact that machine learning (ML) methods have proven their predictive abilities in a wide variety of computer vision problems.

However, ML techniques specifically tailored for the prediction of brain connectivity evolution trajectory from a single timepoint are almost absent.

To fill this gap, we organized a Kaggle competition where 20 competing teams designed advanced machine learning pipelines for predicting the brain connectivity evolution from a single timepoint.

The competing teams developed their ML pipelines with a combination of data pre-processing, dimensionality reduction, and learning methods.

In support of open science, the developed 20 ML pipelines along with the c…

2 дня, 14 часов назад @ paperswithcode.com
/ebadi/ Efficient and Effective Generation of Test Cases for Pedestrian Detection -- Search-based Software Testing of Baidu Apollo in SVL
/ebadi/ Efficient and Effective Generation of Test Cases for Pedestrian Detection -- Search-based Software Testing of Baidu Apollo in SVL /ebadi/ Efficient and Effective Generation of Test Cases for Pedestrian Detection -- Search-based Software Testing of Baidu Apollo in SVL

The use of simulation-based prototyping platforms provides the possibility for early-stage testing, enabling inexpensive testing and the ability to capture critical corner-case test scenarios... Simulation-based testing properly complements conventional on-road testing.

However, due to the large space of test input parameters in these systems, the efficient generation of effective test scenarios leading to the unveiling of failures is a challenge.

We propose an evolutionary automated test generation technique that generates failure-revealing scenarios for Apollo in the SVL environment.

This paper presents the results of our proposed test generation technique in the 2021 IEEE Autonomous Driv…

2 дня, 14 часов назад @ paperswithcode.com
/yunx-z/ MOVER: Mask, Over-generate and Rank for Hyperbole Generation
/yunx-z/ MOVER: Mask, Over-generate and Rank for Hyperbole Generation /yunx-z/ MOVER: Mask, Over-generate and Rank for Hyperbole Generation

Despite being a common figure of speech, hyperbole is under-researched with only a few studies addressing its identification task.

In this paper, we introduce a new task of hyperbole generation to transfer a literal sentence into its hyperbolic paraphrase... To tackle the lack of available hyperbolic sentences, we construct HYPO-XL, the first large-scale hyperbole corpus containing 17,862 hyperbolic sentences in a non-trivial way.

Based on our corpus, we propose an unsupervised method for hyperbole generation with no need for parallel literal-hyperbole pairs.

During inference, we mask part of an input literal sentence and over-generate multiple possible hyperbolic versions.

Human evaluation…

2 дня, 14 часов назад @ paperswithcode.com
💼 University and corporation labs
DeepMind DeepMind
последний пост 1 месяц, 2 недели назад
Building architectures that can handle the world’s data
Building architectures that can handle the world’s data Building architectures that can handle the world’s data

Perceiver and Perceiver IO work as multi-purpose tools for AIMost architectures used by AI systems today are specialists.

A 2D residual network may be a good choice for processing images, but at best it’s a loose fit for other kinds of data — such as the Lidar signals used in self-driving cars or the torques used in robotics.

What’s more, standard architectures are often designed with only one task in mind, often leading engineers to bend over backwards to reshape, distort, or otherwise modify their inputs and outputs in hopes that a standard architecture can learn to handle their problem correctly.

Dealing with more than one kind of data, like the sounds and images that make up videos, is …

1 месяц, 2 недели назад @ deepmind.com
Generally capable agents emerge from open-ended play
Generally capable agents emerge from open-ended play Generally capable agents emerge from open-ended play

Through reinforcement learning (RL), this single system learnt by playing round after round of games through a repetitive process of trial and error.

But AlphaZero still trained separately on each game — unable to simply learn another game or task without repeating the RL process from scratch.

Today, we published "Open-Ended Learning Leads to Generally Capable Agents," a preprint detailing our first steps to train an agent capable of playing many different games without needing human interaction data.

The agent’s capabilities improve iteratively as a response to the challenges that arise in training, with the learning process continually refining the training tasks so the agent never stops …

1 месяц, 3 недели назад @ deepmind.com
Putting the power of AlphaFold into the world’s hands
Putting the power of AlphaFold into the world’s hands Putting the power of AlphaFold into the world’s hands

Today, I’m incredibly proud and excited to announce that DeepMind is making a significant contribution to humanity’s understanding of biology.

When we announced AlphaFold 2 last December, it was hailed as a solution to the 50-year old protein folding problem.

As researchers seek cures for diseases and pursue solutions to other big problems facing humankind – including antibiotic resistance, microplastic pollution, and climate change – they will benefit from fresh insights into the structure of proteins.

The same way that the structure of a machine tells you what it does, so the structure of a protein helps us understand its function.

Today, we are sharing a trove of information that doubles…

2 месяца назад @ deepmind.com
An update on our racial justice efforts
An update on our racial justice efforts An update on our racial justice efforts

I then shared some thoughts around DeepMind's intention to help combat racism and advance racial equity.

We also explored - and gathered feedback - on how we could best support racial justice in the communities DeepMind interacts with.

Today I’m pleased to share one of the outcomes of that process: putting resources directly in the hands of Black communities, so they can decide where they need them most.

In the past months, we made donations to organisations that play a vital role supporting Black communities in the UK, US and Africa.

We're delighted to support these organisations, and grateful to many of those who have shared with us an overview of their work:

3 месяца, 2 недели назад @ deepmind.com
Advancing sports analytics through AI research
Advancing sports analytics through AI research Advancing sports analytics through AI research

Finally, we consider computer vision to be one of the most promising avenues for advancing the boundaries of state of the art sports analytics research.

The large numbers of football videos satisfies a prerequisite for modern AI techniques.

While each football video is different, the settings do not vary greatly, which makes the task ideal for sharpening AI algorithms.

The application of advanced AI techniques to football has the potential to revolutionise the game across many axes, for players, decision-makers, fans, and broadcasters.

We believe that the development of increasingly advanced AI techniques afforded by the football microcosm might be applicable to broader domains.

4 месяца, 2 недели назад @ deepmind.com
Game theory as an engine for large-scale data analysis
Game theory as an engine for large-scale data analysis Game theory as an engine for large-scale data analysis

EigenGame maps out a new approach to solve fundamental ML problems.

This made us wonder if such a perspective modeled on game theory could help solve other fundamental machine learning problems.

Today at ICLR 2021 (the International Conference on Learning Representations), we presented “EigenGame: PCA as a Nash Equilibrium,” which received an Outstanding Paper Award.

Firstly, data was originally recorded by hand in paper notebooks, and now it is stored in data centres the size of warehouses.

By approaching the PCA problem in the right way, our insights and algorithms apply more broadly across the branches of the ML tree.

4 месяца, 2 недели назад @ deepmind.com
MuZero: Mastering Go, chess, shogi and Atari without rules
MuZero: Mastering Go, chess, shogi and Atari without rules MuZero: Mastering Go, chess, shogi and Atari without rules

Humans learn this ability quickly and can generalise to new scenarios, a trait we would also like our algorithms to have.

Until now, the best results on Atari are from model-free systems, such as DQN, R2D2 and Agent57.

As the name suggests, model-free algorithms do not use a learned model and instead estimate what is the best action to take next.

Instead of trying to model the entire environment, MuZero just models aspects that are important to the agent’s decision-making process.

Specifically, MuZero models three elements of the environment that are critical to planning:The value: how good is the current position?

9 месяцев назад @ deepmind.com
Using JAX to accelerate our research
Using JAX to accelerate our research Using JAX to accelerate our research

JAX is a Python library designed for high-performance numerical computing, especially machine learning research.

JAX natively supports both forward and reverse mode automatic differentiation of arbitrary numerical functions, via function transformations such as , , and .

Vectorisation: In ML research we often apply a single function to lots of data, e.g.

In ML research we often apply a single function to lots of data, e.g.

JAX also supports large scale data parallelism via the related transformation, elegantly distributing data that is too large for the memory of a single accelerator.

9 месяцев, 2 недели назад @ deepmind.com
AlphaFold: a solution to a 50-year-old grand challenge in biology
AlphaFold: a solution to a 50-year-old grand challenge in biology AlphaFold: a solution to a 50-year-old grand challenge in biology

We’ve also seen signs that protein structure prediction could be useful in future pandemic response efforts, as one of many tools developed by the scientific community.

Earlier this year, we predicted several protein structures of the SARS-CoV-2 virus, including ORF3a, whose structures were previously unknown.

Since DNA specifies the amino acid sequences that comprise protein structures, the genomics revolution has made it possible to read protein sequences from the natural world at massive scale – with 180 million protein sequences and counting in the Universal Protein database (UniProt).

In contrast, given the experimental work needed to go from sequence to structure, only around 170,000 …

9 месяцев, 3 недели назад @ deepmind.com
Google
последний пост 2 дня, 22 часа назад
Recommendations AI data ingestion
Recommendations AI data ingestion Recommendations AI data ingestion

In our previous post, we presented a high-level picture of Recommendations AI, showing how the product is typically used. In this post, we’ll take a deep dive into the first step of getting started, which is data ingestion. This post will answer all your questions on getting your data into Recommendations AI so you can train models and get recommendations.Recommendations AI uses your product catalog and user events to create machine learning models and deliver personalized product recommendations to your customers. Essentially, Recommendations AI uses a list of items available to be recommended (product catalog) and user's interactions with those products (events), allowing you to create va…

2 дня, 22 часа назад @ cloud.google.com
Optimizing Waze ad delivery using TensorFlow over Vertex AI
Optimizing Waze ad delivery using TensorFlow over Vertex AI Optimizing Waze ad delivery using TensorFlow over Vertex AI

Waze AdsWaze is the world's largest community-based traffic and navigation app. As part of its offering, it lets advertisers put their businesses on the Waze map. By doing so, ads on Waze will reach consumers at key moments of their journey. Goals for advertising on Waze include getting customers to business locations, building brand awareness, and connecting with nearby customers at the right moments.Waze uses several ad formats, the most prominent of which is called a “Pin”. Like a store sign, Pins inform and remind customers that a business is on or near their route.Ad Serving @WazeWaze Ads is a reservation platform, which means we commit to a fixed number of ad impressions in advance an…

2 дня, 23 часа назад @ cloud.google.com
Toward Fast and Accurate Neural Networks for Image Recognition
Toward Fast and Accurate Neural Networks for Image Recognition Toward Fast and Accurate Neural Networks for Image Recognition

What if, instead, one could design neural networks that were smaller and faster, yet still more accurate?

For the same network, small image size leads to lower network capacity and thus requires weak regularization; vice versa, a large image size requires stronger regularization to combat overfitting.

In “CoAtNet: Marrying Convolution and Attention for All Data Sizes”, we systematically study how to combine convolution and self-attention to develop fast and accurate neural networks for large-scale image recognition.

CoAtNet models consistently outperform ViT models and its variants across a number of datasets, such as ImageNet1K, ImageNet21K, and JFT.

We hope these new neural networks can b…

3 дня, 20 часов назад @ ai.googleblog.com
PyTorch on Google Cloud: How to deploy PyTorch models on Vertex AI
PyTorch on Google Cloud: How to deploy PyTorch models on Vertex AI PyTorch on Google Cloud: How to deploy PyTorch models on Vertex AI

This article is the next step in the series of PyTorch on Google Cloud using Vertex AI. In the preceding article, we fine-tuned a Hugging Face Transformers model for a sentiment classification task using PyTorch on Vertex Training service. In this post, we show how to deploy a PyTorch model on the Vertex Prediction service for serving predictions from trained model artifacts. Now let’s walk through the deployment of a Pytorch model using TorchServe as a custom container by deploying the model artifacts to a Vertex Endpoint. You can find the accompanying code for this blog post on the GitHub repository and the Jupyter Notebook.Deploying a PyTorch Model on Vertex Prediction ServiceVertex Pred…

4 дня, 19 часов назад @ cloud.google.com
Revisiting Mask-Head Architectures for Novel Class Instance Segmentation
Revisiting Mask-Head Architectures for Novel Class Instance Segmentation Revisiting Mask-Head Architectures for Novel Class Instance Segmentation

In recent years, deep learning has made significant strides in solving the instance segmentation problem with architectures like Mask R-CNN.

However, these methods rely on collecting a large labeled instance segmentation dataset.

Moreover, this difference is only apparent when evaluating on unseen classes — if we evaluate on seen classes, all four architectures exhibit similar performance.

There is a significant difference in performance on unseen classes, even though the performance on seen classes barely changes.

Comparison of Deep-MAC and Deep-MARC to other partially supervised instance segmentation approaches like MaskX R-CNN, ShapeMask and CPMask.

4 дня, 19 часов назад @ ai.googleblog.com
Music Conditioned 3D Dance Generation with AIST++
Music Conditioned 3D Dance Generation with AIST++ Music Conditioned 3D Dance Generation with AIST++

Together with the model, we released a large-scale, multi-modal 3D dance motion dataset, AIST++, which contains 5.2 hours of 3D dance motion in 1408 sequences, covering 10 dance genres, each including multi-view videos with known camera poses.

We present a novel full-attention cross-modal transformer (FACT) network that can generate realistic 3D dance motion (right) conditioned on music and a new 3D dance dataset, AIST++ (left).

We generate the proposed 3D motion dataset from the existing AIST Dance Database — a collection of videos of dance with musical accompaniment, but without any 3D information.

The resulting database, AIST++, is a large-scale, 3D human dance motion dataset that contai…

6 дней, 21 час назад @ ai.googleblog.com
Chefkoch whips up handwritten recipes in the cloud with text detector
Chefkoch whips up handwritten recipes in the cloud with text detector Chefkoch whips up handwritten recipes in the cloud with text detector

Editor’s note: When German cooking platform Chefkoch was looking to bring treasured hand-me-down recipes into the 21st century it found a scaleable, well-supported solution with Google’s data cloud. Here’s how it was cooked up. Whether it’s salad dressing or chicken soup, most households have a favorite dish passed down across the generations. These recipes are often scribbled on scraps of paper and this personal culinary heritage is heavily guarded. Recognizing the significance of handwritten or printed recipes, German cooking platform Chefkoch wanted to make it possible to quickly and easily parse, extract and digitalize these time-honored tasty morsels using Google Cloud augmented analyt…

6 дней, 23 часа назад @ cloud.google.com
Scalable ML Workflows using PyTorch on Kubeflow Pipelines and Vertex Pipelines
Scalable ML Workflows using PyTorch on Kubeflow Pipelines and Vertex Pipelines Scalable ML Workflows using PyTorch on Kubeflow Pipelines and Vertex Pipelines

IntroductionML Ops is an ML engineering culture and practice that aims at unifying ML system development and ML system operation. An important ML Ops design pattern is the ability to formalize ML workflows. This allows them to be reproduced, tracked and analyzed, shared, and more.Pipelines frameworks support this pattern, and are the backbone of an ML Ops story. These frameworks help you to automate, monitor, and govern your ML systems by orchestrating your ML workflows. In this post, we’ll show examples of PyTorch-based ML workflows on two pipelines frameworks: OSS Kubeflow Pipelines, part of the Kubeflow project; and Vertex Pipelines. We are also excited to share some new PyTorch componen…

1 неделя, 2 дня назад @ cloud.google.com
Personalized ASR Models from a Large and Diverse Disordered Speech Dataset
Personalized ASR Models from a Large and Diverse Disordered Speech Dataset Personalized ASR Models from a Large and Diverse Disordered Speech Dataset

In 2019, we introduced Project Euphonia and discussed how we could use personalized ASR models of disordered speech to achieve accuracies on par with non-personalized ASR on typical speech.

Today we share the results of two studies, presented at Interspeech 2021, that aim to expand the availability of personalized ASR models to more users.

In “Disordered Speech Data Collection: Lessons Learned at 1 Million Utterances from Project Euphonia”, we present a greatly expanded collection of disordered speech data, composed of over 1 million utterances.

Then, in “Automatic Speech Recognition of Disordered Speech: Personalized models outperforming human listeners on short phrases”, we discuss our ef…

1 неделя, 3 дня назад @ ai.googleblog.com
7 tips for trouble-free ML model training
7 tips for trouble-free ML model training 7 tips for trouble-free ML model training

IntroVertex AI offers a fully managed training service, Vertex AI Training, which provides a set of prebuilt algorithms and enables you to create ML models using custom training.Machine learning (ML) engineers are responsible for training ML models. In most situations, training completes successfully, but it does sometimes fail. When it does, engineers often reach out to customer support for help in troubleshooting the problem. It turns out, however, that there are a fair number of ML training support cases that the ML engineers could handle themselves. This post covers seven common causes of ML model training failures, along with time-saving tips on how to avoid them and how to fix them. T…

1 неделя, 3 дня назад @ cloud.google.com
PyTorch on Google Cloud: How To train and tune PyTorch models on Vertex AI
PyTorch on Google Cloud: How To train and tune PyTorch models on Vertex AI PyTorch on Google Cloud: How To train and tune PyTorch models on Vertex AI

Since the publishing of the inaugural post of PyTorch on Google Cloud blog series, we announced Vertex AI: Google Cloud’s end-to-end ML platform at Google I/O 2021. Vertex AI unifies Google Cloud’s existing ML offerings into a single platform for efficiently building and managing the lifecycle of ML projects. It provides tools for every step of the machine learning workflow across various model types, for varying levels of machine learning expertise.We will continue the blog series with Vertex AI to share how to build, train and deploy PyTorch models at scale and how to create reproducible machine learning pipelines on Google Cloud. Figure 1. What’s included in Vertex AI?In this post, we wi…

1 неделя, 4 дня назад @ cloud.google.com
Sopra Steria uses Google Cloud, Cisco, and ACTIVEO to power new generation of Virtual Agents
Sopra Steria uses Google Cloud, Cisco, and ACTIVEO to power new generation of Virtual Agents Sopra Steria uses Google Cloud, Cisco, and ACTIVEO to power new generation of Virtual Agents

With people expecting to access products and services through easy, always-on experiences delivered across channels, transforming approaches to digital services is a must for every business. As a result, business leaders have to strive to provide employees with these same frictionless experiences. Sopra Steria is a European leader in consulting, digital services and software development, with 46,000 employees in 25 countries that generated revenue of €4.3 billion in 2020. It provides end-to-end solutions that help customers drive their digital transformation to obtain tangible and sustainable benefits, by combining in-depth knowledge of a wide range of business sectors and innovative techno…

1 неделя, 4 дня назад @ cloud.google.com
Best practices for translating websites with Translation API
Best practices for translating websites with Translation API Best practices for translating websites with Translation API

Looking to translate your website? Google Cloud can help! Google Cloud Translation API is a service that dynamically translates between languages with Google’s state-of-the-art Machine Learning models. It is a highly scalable API that supports over one hundred languages, with built-in language detection. In this blog post, we will share some best practices for optimizing cost, increasing performance, and hardening the security posture while using the Translation API with your websites. Optimize Architecture for Performance, Cost and SecurityA common way to translate websites is to have site visitors select their language of choice, and then display the website in that language. However, on …

1 неделя, 5 дней назад @ cloud.google.com
Discovering Anomalous Data with Self-Supervised Learning
Discovering Anomalous Data with Self-Supervised Learning Discovering Anomalous Data with Self-Supervised Learning

On the other hand, substantial progress has been made in learning visual representations from unlabeled data via self-supervised learning, including rotation prediction and contrastive learning.

The idea is that instead of learning representations from the training data only, the model learns from the union of the training data plus augmented training examples, where the augmented examples are considered to be different from the original training data.

With DA, the training data is no longer uniformly distributed in the representation space because some areas are occupied by the augmented data.

Texture Anomaly Detection for Industrial Defect DetectionIn many real-world applications of anoma…

2 недели, 3 дня назад @ ai.googleblog.com
Detecting Abnormal Chest X-rays using Deep Learning
Detecting Abnormal Chest X-rays using Deep Learning Detecting Abnormal Chest X-rays using Deep Learning

A Deep Learning System for Detecting Abnormal Chest X-raysThe deep learning system we used is based on the EfficientNet-B7 architecture, pre-trained on ImageNet.

Each CXR was assigned a label of either “normal” or “abnormal” using a regular expression–based natural language processing approach on the associated radiology reports.

Sample chest X-rays of true and false positives, and true and false negatives for (A) general abnormalities, (B) tuberculosis, and (C) COVID-19.

Impact of a simulated deep learning model–based prioritization in comparison with random review order for (A) general abnormalities, (B) tuberculosis, and (C) COVID-19.

1Labels include atelectasis, cardiomegaly, effusion, …

2 недели, 4 дня назад @ ai.googleblog.com
OpenAI OpenAI
последний пост 1 неделя, 5 дней назад
Helen Toner Joins OpenAI’s Board of Directors
Helen Toner Joins OpenAI’s Board of Directors Helen Toner Joins OpenAI’s Board of Directors

Today, we’re excited to announce the appointment of Helen Toner to our Board of Directors.

As the Director of Strategy at Georgetown’s Center for Security and Emerging Technology (CSET), Helen has deep expertise in AI policy and global AI strategy research.

I greatly value Helen’s deep thinking around the long-term risks and effects of AI,” added Greg Brockman, OpenAI’s chairman and Chief Technology Officer.

“We are delighted to add her leadership to our board.”OpenAI is a unique organization in the AI research space, and has produced some of the advances, publications, and products I’m most excited about,” said Helen Toner.

She previously advised policymakers and grantmakers on AI strategy…

1 неделя, 5 дней назад @ openai.com
OpenAI Codex
OpenAI Codex OpenAI Codex

We are now inviting businesses and developers to build on top of OpenAI Codex through our API.

Watch Video Creating a Space Game with OpenAI Codex Tweet Watch Video “Hello World” with OpenAI Codex Tweet Watch Video Data Science with OpenAI Codex Tweet Watch Video Talking to Your Computer with OpenAI Codex Tweet Watch Video Converting Python to Ruby with OpenAI Codex Tweet Watch Video Giving OpenAI Codex a First Grade Math Test TweetOpenAI Codex is a descendant of GPT-3; its training data contains both natural language and billions of lines of source code from publicly available sources, including code in public GitHub repositories.

OpenAI Codex empowers computers to better understand people…

1 месяц, 1 неделя назад @ openai.com
Introducing Triton: Open-Source GPU Programming for Neural Networks
Introducing Triton: Open-Source GPU Programming for Neural Networks Introducing Triton: Open-Source GPU Programming for Neural Networks

These issues can be mitigated by writing specialized GPU kernels, but doing so can be surprisingly difficult due to the many intricacies of GPU programming.

This has led us to extend and improve Triton, a recent language and compiler whose original creator now works at OpenAI.

# Z,X,Y are dense tensors Z[idx] = X[idx] + Y[idx] ... grid = (ceil_div(N, BLOCK),) block = (BLOCK,) add[grid, block](x, y, z, x.shape[0]) BLOCK = 512 # This is a GPU kernel in Triton.

Without a system like Triton, non-trivial modifications of matrix multiplication kernels would be out-of-reach for developers without exceptional GPU programming expertise.

If you’re interested in joining our team and working on Triton …

1 месяц, 3 недели назад @ openai.com
Improving Language Model Behavior by Training on a Curated Dataset
Improving Language Model Behavior by Training on a Curated Dataset Improving Language Model Behavior by Training on a Curated Dataset

We've found we can improve language model behavior with respect to specific behavioral values by fine-tuning on a curated dataset of <100 examples of those values.

Appropriate or desirable language model behavior, like appropriate human behavior, cannot be reduced to one universal standard; desirable behavior differs by application and social context.

Step Two: Crafting the Dataset and Fine-TuningWe crafted a values-targeted dataset of 76 text samples; each sample was in a question-answer format and between 40 and 340 words.

But we believe this only scratches the surface and leaves important questions unanswered:Who should be consulted when designing a values-targeted dataset?

Please reach …

3 месяца, 1 неделя назад @ openai.com
OpenAI Startup Fund
OpenAI Startup Fund OpenAI Startup Fund

Investing in startups with big ideas about AI.

3 месяца, 3 недели назад @ openai.com
OpenAI Scholars 2021: Final Projects
OpenAI Scholars 2021: Final Projects OpenAI Scholars 2021: Final Projects

My advice to someone starting in deep learning research is to take your time to understand insights from fundamental papers and remember that the field is still relatively new.

Blogplaycircle Feedback Loops in Opinion ModelingDanielle Ensign OpenAI Mentor: Jeff WuPrevious Roles: Software Engineer at ITHAKA, Brighten AI, and Phylliida I have a background in Software Development, AI Fairness, and VR Game Development.

My project is exploratory, investigating prior work on opinion modeling from the context of deep learning.

Blogplaycircle Characterizing Test Time Compute on Graph Structured ProblemsKudzo Ahegbebu OpenAI Mentor: William GussPrevious Roles: Software Engineer at Facebook and Genen…

4 месяца, 1 неделя назад @ openai.com
Will Hurd Joins OpenAI’s Board of Directors
Will Hurd Joins OpenAI’s Board of Directors Will Hurd Joins OpenAI’s Board of Directors

OpenAI is committed to developing general-purpose artificial intelligence that benefits all humanity, and we believe that achieving our goal requires expertise in public policy as well as technology.

So, we’re delighted to announce that Congressman Will Hurd has joined our board of directors.

Will served three terms in the U.S. House of Representatives, has been a leading voice on technology policy, and coauthored bipartisan legislation outlining a national strategy for artificial intelligence.

“Will brings a rare combination of expertise—he deeply understands both artificial intelligence as well as public policy, both of which are critical to a successful future for AI,” said Sam Altman, O…

4 месяца, 2 недели назад @ openai.com
GPT-3 Powers the Next Generation of Apps
GPT-3 Powers the Next Generation of Apps GPT-3 Powers the Next Generation of Apps

Given any text prompt like a phrase or a sentence, GPT-3 returns a text completion in natural language.

Applications and industriesTo date, over 300 apps are using GPT-3 across varying categories and industries, from productivity and education to creativity and games.

Using GPT-3, Viable identifies themes, emotions, and sentiment from surveys, help desk tickets, live chat logs, reviews, and more.

Algolia Answers helps publishers and customer support help desks query in natural language and surface nontrivial answers.

With natural language processing, technical experience is no longer a barrier, and we can truly keep our focus on solving real world problems.

5 месяцев, 4 недели назад @ openai.com
Multimodal Neurons in Artificial Neural Networks
Multimodal Neurons in Artificial Neural Networks Multimodal Neurons in Artificial Neural Networks

discovered that the human brain possesses multimodal neurons.

Now, we’re releasing our discovery of the presence of multimodal neurons in CLIP.

Our discovery of multimodal neurons in CLIP gives us a clue as to what may be a common mechanism of both synthetic and natural vision systems—abstraction.

Indeed, these neurons appear to be extreme examples of “multi-faceted neurons,” neurons that respond to multiple distinct cases, only at a higher level of abstraction.

How multimodal neurons composeThese multimodal neurons can give us insight into understanding how CLIP performs classification.

6 месяцев, 2 недели назад @ openai.com
Scaling Kubernetes to 7,500 Nodes
Scaling Kubernetes to 7,500 Nodes Scaling Kubernetes to 7,500 Nodes

We've scaled Kubernetes clusters to 7,500 nodes, producing a scalable infrastructure for large models like GPT-3, CLIP, and DALL·E, but also for rapid small-scale iterative research such as Scaling Laws for Neural Language Models.

NetworkingAs the number of nodes and pods within our clusters increased, we found that Flannel had difficulties scaling up the throughput required.

It reconciles this with the current nodes in the cluster, tainting the appropriate number of nodes with openai.com/team=teamname:NoSchedule .

Kubernetes 1.18 introduced a plugin architecture for the core Kubernetes scheduler, making it much easier to add features like this natively.

Unsolved problemsThere are many prob…

7 месяцев, 3 недели назад @ openai.com
CLIP: Connecting Text and Images
CLIP: Connecting Text and Images CLIP: Connecting Text and Images

We show random, non-cherry picked, predictions of zero-shot CLIP classifiers on examples from various datasets below.

In contrast, the CLIP model can be evaluated on benchmarks without having to train on their data, so it can’t “cheat” in this manner.

CLIP is flexible and generalBecause they learn a wide range of visual concepts directly from natural language, CLIP models are significantly more flexible and general than existing ImageNet models.

The best CLIP model outperforms the best publicly available ImageNet model, the Noisy Student EfficientNet-L2, on 20 out of 26 different transfer datasets we tested.

CLIP models are also more compute efficient than the models from 10 prior approache…

8 месяцев, 2 недели назад @ openai.com
DALL·E: Creating Images from Text
DALL·E: Creating Images from Text DALL·E: Creating Images from Text

Text prompt an illustration of a baby daikon radish in a tutu walking a dog AI-generated images View more images or edit prompt Text prompt a store front that has the word ‘openai’ written on it […] AI-generated images View more images or edit prompt Text prompt an armchair in the shape of an avocado […] AI-generated images View more images or edit prompt Text and image prompt the exact same cat on the top as a sketch on the bottom AI-generated images View more images or edit promptGPT-3 showed that language can be used to instruct a large neural network to perform a variety of text generation tasks.

navigatedownwide navigateupwide Text prompt AI-generatedimages We find that DALL·E is somet…

8 месяцев, 2 недели назад @ openai.com
Organizational Update from OpenAI
Organizational Update from OpenAI Organizational Update from OpenAI

It’s been a year of dramatic change and growth at OpenAI.

Today we’re announcing that Dario Amodei, VP of Research, is leaving OpenAI after nearly five years with the company.

He and a handful of OpenAI colleagues are planning a new project, which they tell us will probably focus less on product development and more on research.

I want to wish everyone the best, and I know that OpenAI will do really great things in the years ahead.

Mira Murati is taking on new responsibilities as senior vice president of Research, Product, and Partnerships, reflecting her strong leadership during our API rollout and across the company.

8 месяцев, 3 недели назад @ openai.com
OpenAI at NeurIPS 2020
OpenAI at NeurIPS 2020 OpenAI at NeurIPS 2020

Live demos and discussions at our virtual booth.

9 месяцев, 2 недели назад @ openai.com
Microsoft Microsoft
последний пост 5 дней, 21 час назад
Micro-climate predictions: Enabling hyper-local decisions for agriculture and renewables
Micro-climate predictions: Enabling hyper-local decisions for agriculture and renewables Micro-climate predictions: Enabling hyper-local decisions for agriculture and renewables

Micro-climate predictions are beneficial in agriculture, forestry, architecture, urban design, ecology conservation, maritime and other domains.

Forecast error computation DeepMC uses weather station forecasts of the predicted variable to learn better models for micro-climate predictions.

Micro-climate wind speed prediction RMSE comparisons over 24-hour predictionsComparison: micro-wind speed predictionsFigure 3 shows the wind speed predictions at the 24th hour over a period of 10 days with one-hour resolution.

RMSE, MAPE and MAE comparison for micro-climate wind speed predictionsFigure 6.

The combination of accuracy, robustness, flexibility and scalability is important to help the renewabl…

5 дней, 21 час назад @ microsoft.com
Video analytics at the edge, an ideal technology for 5G cloud monetization
Video analytics at the edge, an ideal technology for 5G cloud monetization

Creating a programmable software infrastructure for telecommunication operations promises to reduce both the capital expenditure (CAPEX) and the operational expenses (OPEX) of the 5G telecommunications operators. In this blog, we focus on video, the dominant traffic type on the internet since the introduction of 4G networks.

6 дней, 7 часов назад @ azure.microsoft.com
SpaceCows – using AI, space technology and cloud to protect the Top End
SpaceCows – using AI, space technology and cloud to protect the Top End

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

6 дней, 23 часа назад @ news.microsoft.com
4 ways AI, computer vision, and related technologies expand IoT solutions
4 ways AI, computer vision, and related technologies expand IoT solutions

Intel and Microsoft Azure are working together to help enterprises deploy intelligent IoT technologies and services, including AI’s deep learning abilities, computer vision, and audio or speech capabilities. Adding these capabilities enables solutions to solve more business challenges, uniting two or more—adding both computer vision and AI, for example greatly expands the potential uses for IoT solutions.

1 неделя, 6 дней назад @ azure.microsoft.com
DeepSpeed powers 8x larger MoE model training with high performance
DeepSpeed powers 8x larger MoE model training with high performance DeepSpeed powers 8x larger MoE model training with high performance

Today, we are proud to announce DeepSpeed MoE, a high-performance system that supports massive scale mixture of experts (MoE) models as part of the DeepSpeed optimization library.

Besides supporting the most ambitious scale MoE models, DeepSpeed MoE boosts the development productivity and resource efficiency of training modestly sized MoE models in production scenarios, which may be of broader interest to the deep learning (DL) community.

Powered by DeepSpeed MoE, we can now train MoE models that are much larger compared with dense models.

By leveraging DeepSpeed MoE, the Z-code MoE model achieved improved convergence and higher quality with multitask training setting compared with the non-…

1 месяц назад @ microsoft.com
Solve your toughest business problems with AI and machine learning
Solve your toughest business problems with AI and machine learning

Today, AI and machine learning are enabling data-driven organizations to accelerate their journey to insights and decisions. With all the latest advancements, AI is no longer limited to only those with deep expertise or a cache of data scientists, and many organizations can now adopt AI and machine learning for better competitive advantage. Customers with analytics practices looking to adopt machine learning can read this report to get started.

1 месяц назад @ azure.microsoft.com
New Future of Work: How remote and hybrid work will shape workplaces and society with Jaime Teevan and Siddharth Suri
New Future of Work: How remote and hybrid work will shape workplaces and society with Jaime Teevan and Siddharth Suri New Future of Work: How remote and hybrid work will shape workplaces and society with Jaime Teevan and Siddharth Suri

So, you know, Microsoft, for example, Satya Nadella, after COVID hit, I couldn’t have been prouder to work for him.

It was like, you know, “Yeah, okay, productivity hasn’t fallen off a cliff.

This is really, really, really complicated accounting.

SURI: Yeah, yeah.

You know, right now, women caregivers are really, really struggling.

1 месяц, 1 неделя назад @ microsoft.com
Safe program merges at scale: A grand challenge for program repair research
Safe program merges at scale: A grand challenge for program repair research Safe program merges at scale: A grand challenge for program repair research

But with so many people independently altering code, it’s unsurprising that updates don’t always synchronize, resulting in bad merges.

In other cases, bad program merges can be more subtle and costly, introducing semantic merge conflicts that may either fail the compiler, break a test, or—worse—introduce a regression.

And third, there are project-specific patterns in how developers resolve bad merges that can be capitalized on by program synthesis.

Safe program merges—a long-standing research problemThe problem of ensuring safe program merges has been long studied and remains an open challenge in programming languages and software engineering research.

While the approach showed the potentia…

1 месяц, 1 неделя назад @ microsoft.com
Make Every feature Binary: A 135B parameter sparse neural network for massively improved search relevance
Make Every feature Binary: A 135B parameter sparse neural network for massively improved search relevance Make Every feature Binary: A 135B parameter sparse neural network for massively improved search relevance

When compared to Transformer-based deep learning models, the MEB model also demonstrates interesting capabilities to learn beyond semantic relationships.

The input layer contains 9 billion features, generated from 49 feature groups, with each binary feature encoded into a 15-dimension embedding vector.

We define MaxQueryLength buckets for this feature so that query “Microsoft Windows” has the binary feature QueryLength_2 equal to 1.

One-hot encoding of categorical featuresCategorical features can be transformed into binary features through one-hot encoding in a straightforward way.

This is especially true if you have a large historical stream of user interactions and can easily construct si…

1 месяц, 2 недели назад @ microsoft.com
New Future of Work: Redefining workspaces as hybrid and remote work become more prevalent with Jaime Teevan and Ginger Hudson
New Future of Work: Redefining workspaces as hybrid and remote work become more prevalent with Jaime Teevan and Ginger Hudson New Future of Work: Redefining workspaces as hybrid and remote work become more prevalent with Jaime Teevan and Ginger Hudson

HUDSON: Yeah, I think initially it was things that were more grab-and-go, maybe a mouse, a keyboard, a monitor.

So many people in the office, as we well know, right, rely on multi-mon setups in their office, uh, environments to really maintain productivity.

When it’s back in person, I’m kind of scared.

I think a lot of that is to be determined as we really think about how this plays out in the hybrid environment.

And I’m really excited to see what happens.

1 месяц, 2 недели назад @ microsoft.com
The rise of parallel chat in online meetings: how can we make the most of it?
The rise of parallel chat in online meetings: how can we make the most of it? The rise of parallel chat in online meetings: how can we make the most of it?

The many benefits of chatChat has become essential in virtual meetings – many online meetings would be much less efficient, and some would be impossible, without chat.

Chat enables people to organize their collaboration and action around documents and follow-up meetings.

Used well, it can be a powerful and effective tool in online meetings.

RESOURCE Parallel Meeting Chat Guide for Moderators and ParticipantsChat to the futureImproving chat can lead to more effective conversations as well as enhanced meeting tools.

The age of online meetings has just begun.

1 месяц, 2 недели назад @ microsoft.com
Text-to-speech technology helps produce more audiobooks for people who are blind or have low vision
Text-to-speech technology helps produce more audiobooks for people who are blind or have low vision

This website is using a security service to protect itself from online attacks.

The action you just performed triggered the security solution.

There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data.

You can email the site owner to let them know you were blocked.

Please include what you were doing when this page came up and the Cloudflare Ray ID found at the bottom of this page.

1 месяц, 3 недели назад @ news.microsoft.com
New Future of Work: Managing IT and security in remote scenarios with Jaime Teevan and Matt Brodsky
New Future of Work: Managing IT and security in remote scenarios with Jaime Teevan and Matt Brodsky New Future of Work: Managing IT and security in remote scenarios with Jaime Teevan and Matt Brodsky

[MUSIC ENDS]In this episode of the series, we’re exploring the “IT and Security” chapter of The New Future of Work report published by Microsoft.

TEEVAN: You know one of the things that surprised me before I came to Microsoft was how important IT and security are to Microsoft.

So, making sure that Microsoft administrative experiences work is crucial in making sure that those hundreds of millions of people are efficient at work.

[MUSIC BREAK]TEEVAN: I know, uh, security threats have increased a lot this past year, as well.

TEEVAN: Has, um, what you’ve learned through your research and the research that you’ve read changed your own work practices?

1 месяц, 3 недели назад @ microsoft.com
On infinitely wide neural networks that exhibit feature learning
On infinitely wide neural networks that exhibit feature learning On infinitely wide neural networks that exhibit feature learning

The catch, however, is that we need an infinite-width limit that sufficiently captures what makes NNs so successful today.

Unlocking Feature Learning by going beyond model initializationWhy do NNGP and NTK fail to learn features?

To unlock feature learning, we need to see gradient updates for what they really are: a different kind of matrices from their randomly initialized counterparts.

Neither leaves the “comfort zone” of model initialization and thus fails to capture feature learning.

In contrast, cities and states get naturally separated in the embedding space as width increases in the feature learning regime.

1 месяц, 4 недели назад @ microsoft.com
Lecture series aims to help spur dialogue around race and technology
Lecture series aims to help spur dialogue around race and technology Lecture series aims to help spur dialogue around race and technology

Race and Technology: A Research Lecture Series features 14 distinguished scholars and domain experts from a diverse range of research areas and disciplines.

From top left: Dr. Sareeta Amrute, Dr. Kim TallBear, Dr. Charlton McIlwain, Dr. Ruha Benjamin, Dr. Lisa Nakamura, Dr. Simone Browne, and Dr. André Brock.

With an organizing committee that included McIlwain and Baym, Microsoft Research launched Race and Technology: A Research Lecture Series in May 2021.

MEET THE SPEAKERS AND REGISTER Race and Technology: A Research Lecture SeriesHarms at the intersection of race and technologyOne of the goals of the lecture series is to expose more people to the field and its expansive reach.

“I want peo…

2 месяца назад @ microsoft.com
MIT AI MIT AI
последний пост 4 дня, 11 часов назад
Making self-driving cars safer through keener robot perception
Making self-driving cars safer through keener robot perception Making self-driving cars safer through keener robot perception

“But these perception algorithms are designed to be fast, with little guarantee of whether the robot has succeeded in gaining a correct understanding of its surroundings,” says Yang.

From there, lines are drawn that seek to trace the detected keypoints on the 2D car image to the labeled 3D keypoints in a 3D car model.

“We must then solve an optimization problem to rotate and translate the 3D model to align with the key points on the image,” Yang says.

“This 3D model will help the robot understand the real-world environment.”Each traced line must be analyzed to see if it has created a correct match.

The 3D model gets morphed to match the 2D image by undergoing a linear combination of previou…

4 дня, 11 часов назад @ news.mit.edu
Q&A: Dina Katabi on a “smart” home with actual intelligence
Q&A: Dina Katabi on a “smart” home with actual intelligence Q&A: Dina Katabi on a “smart” home with actual intelligence

In this Q&A, Katabi, the Thuan (1990) and Nicole Pham Professor at MIT, discusses some of her recent work.

My lab is working on the next generation of wireless sensors and machine-learning models that can make more personalized predictions.

A: We’re developing “touchless” sensors that can track people’s movements, activities, and vital signs by analyzing radio signals that bounce off their bodies.

Here, the invisibles understand actions and movements.

Current deep-learning models are also limited whether wireless signals are collected from wearable or background sensors.

1 неделя, 5 дней назад @ news.mit.edu
Using adversarial attacks to refine molecular energy predictions
Using adversarial attacks to refine molecular energy predictions Using adversarial attacks to refine molecular energy predictions

For this project, the researchers had multiple neural networks predict the potential energy surface from the same data.

The spread in the predictions of a “committee of neural networks” is the “uncertainty” at that point.

Instead, the new approach only samples data points from regions of low prediction confidence, corresponding to specific geometries of a molecule.

They used more than 15,000 examples to train a neural network to predict the potential energy surfaces for these systems.

However, when the adversarial approach is used to retrain the neural networks, the authors saw a performance jump to 97 percent using only 500 extra points.

2 недели, 4 дня назад @ news.mit.edu
360-degree transparency for construction sites made simple
360-degree transparency for construction sites made simple 360-degree transparency for construction sites made simple

MIT spinoff OpenSpace invented automated 360-degree video jobsite capture and mapping.

Enter OpenSpace, a company that’s propelling the construction of any built environment into the digital age.

It's essentially passive; the OpenSpace Vision System does all the work, mapping site photos to site plans automatically.

Prior to OpenSpace, CTO DeCamp was a computer vision and data visualization research scientist at the Institute.

No more trips to the site required when you can see a past-to-present 360-degree view from anywhere.

2 недели, 5 дней назад @ news.mit.edu
Jordan Harrod: Brain researcher and AI-focused YouTuber
Jordan Harrod: Brain researcher and AI-focused YouTuber Jordan Harrod: Brain researcher and AI-focused YouTuber

Scientist, writer, policy advocate, YouTuber – before Jordan Harrod established her many successful career identities, her first role was as a student athlete.

Mapping the brain to understand consciousnessToday, Harrod collaborates with professors Emery Brown, an anesthesiologist, and Ed Boyden, a neuroscientist, to study how different parts of the brain relate to consciousness and arousal.

Since beginning her neuroscience research, Harrod has been amazed to learn how much about the brain still needs to be uncovered.

She is the chair of the External Affairs Board of the Graduate Student Council, an Early Career Policy Ambassador for the Society for Neuroscience, and the co-founder of the MI…

3 недели, 1 день назад @ news.mit.edu
Smart laser cutter system detects different materials
Smart laser cutter system detects different materials Smart laser cutter system detects different materials

Addressing what might not be totally apparent to the naked eye, scientists from MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) came up with “SensiCut,” a smart material-sensing platform for laser cutters.

In contrast to conventional, camera-based approaches that can easily misidentify materials, SensiCut uses a more nuanced fusion.

With the addition of computers, laser cutters have rapidly become a relatively simple and powerful tool, with software controlling shiny machinery that can chop metals, woods, papers, and plastics.

SensiCut is a smart material sensing platform for laser cutters.

Beyond laser cutters, the team envisions a future where SensiCut’s sensing tech…

1 месяц назад @ news.mit.edu
“AI for Impact” lives up to its name
“AI for Impact” lives up to its name “AI for Impact” lives up to its name

For entrepreneurial MIT students looking to put their skills to work for a greater good, the Media Arts and Sciences class MAS.664 (AI for Impact) has been a destination point.

“It’s clear that the people who are graduating from here want to do something significant with their lives ... they want to have an impact on their world,” Pentland says.

A path toward confronting a pandemicRaskar began co-teaching the class in 2019, and brought a “Big AI” focus to the Development Ventures class, inspired by an AI for Impact team he had set up at his former employer, Facebook.

“This is something we should think about more seriously: how to use AI and data for positive social impact, while protecting …

1 месяц назад @ news.mit.edu
Machine learning discovers new sequences to boost drug delivery
Machine learning discovers new sequences to boost drug delivery Machine learning discovers new sequences to boost drug delivery

“The key innovation is using machine learning to connect the sequence of a peptide, particularly a peptide that includes non-natural amino acids, to experimentally-measured biological activity.”Dream dataCPPs are relatively short chains, made up of between five and 20 amino acids.

Before a model could make any worthwhile predictions, researchers on the experimental side needed to create a robust dataset.

While only 20 amino acids naturally occur in the human body, hundreds more exist elsewhere — like an amino acid expansion pack for drug development.

To represent them in a machine-learning model, researchers typically use one-hot encoding, a method that assigns each component to a series of…

1 месяц, 1 неделя назад @ news.mit.edu
System trains drones to fly around obstacles at high speeds
System trains drones to fly around obstacles at high speeds System trains drones to fly around obstacles at high speeds

Now, aerospace engineers at MIT have devised an algorithm that helps drones find the fastest route around obstacles without crashing.

But the faster drones fly, the more unstable they become, and at high speeds their aerodynamics can be too complicated to predict.

Fast effectsTraining drones to fly around obstacles is relatively straightforward if they are meant to fly slowly.

Overall, the drone trained on the new algorithm “won” every race, completing the course in a shorter time than the conventionally trained drone.

The researchers plan to fly more experiments, at faster speeds, and through more complex environments, to further improve their algorithm.

1 месяц, 1 неделя назад @ news.mit.edu
Exact symbolic artificial intelligence for faster, better assessment of AI fairness
Exact symbolic artificial intelligence for faster, better assessment of AI fairness Exact symbolic artificial intelligence for faster, better assessment of AI fairness

MIT researchers have developed a new artificial intelligence programming language that can assess the fairness of algorithms more exactly, and more quickly, than available alternatives.

Their Sum-Product Probabilistic Language (SPPL) is a probabilistic programming system.

SPPL is different from most probabilistic programming languages, as SPPL only allows users to write probabilistic programs for which it can automatically deliver exact probabilistic inference results.

Error from approximate probabilistic inference is tolerable in many AI applications.

This approach extends prior work on sum-product networks to models and queries expressed via a probabilistic programming language.

1 месяц, 1 неделя назад @ news.mit.edu
3 Questions: David Kaiser and Julie Shah on social and ethical responsibilities of computing
3 Questions: David Kaiser and Julie Shah on social and ethical responsibilities of computing 3 Questions: David Kaiser and Julie Shah on social and ethical responsibilities of computing

David Kaiser and Julie Shah are on a mission to prepare students and facilitate research to address the broad challenges and opportunities associated with computing.

As associate deans of Social and Ethical Responsibilities of Computing (SERC) in the MIT Stephen A. Schwarzman College of Computing, Kaiser and Shah are advancing a number of initiatives they hope will get students and faculty to reflect on the potential social, ethical, and policy implications of new technologies.

Q: Weaving social and ethical aspects of computing into the curricula is a key mandate of SERC.

It’s also part of SERC’s broader mission to incorporate humanist, social science, social considerations, and policy/civi…

1 месяц, 2 недели назад @ news.mit.edu
Lincoln Laboratory convenes top network scientists for Graph Exploitation Symposium
Lincoln Laboratory convenes top network scientists for Graph Exploitation Symposium Lincoln Laboratory convenes top network scientists for Graph Exploitation Symposium

This is a core question of network science, a field of research that models interactions across physical, biological, social, and information systems to solve problems.

The 2021 Graph Exploitation Symposium (GraphEx), hosted by MIT Lincoln Laboratory, brought together top network science researchers to share the latest advances and applications in the field.

To classify IO accounts, Mackin and her team trained an algorithm to detect probable IO accounts in Twitter networks based on a specific hashtag or narrative.

The team has found that their classifier outperforms existing detectors of IO accounts, because it can identify both bot accounts and human-operated ones.

As researchers model the…

2 месяца назад @ news.mit.edu
MIT Schwarzman College of Computing awards named professorships to two faculty members
MIT Schwarzman College of Computing awards named professorships to two faculty members MIT Schwarzman College of Computing awards named professorships to two faculty members

The MIT Stephen A. Schwarzman College of Computing has awarded two inaugural chaired appointments to Dina Katabi and Aleksander Madry in the Department of Electrical Engineering and Computer Science (EECS).

“These distinguished endowed professorships recognize the extraordinary achievements of our faculty and future potential of their academic careers,” says Daniel Huttenlocher, dean of the MIT Schwarzman College of Computing and the Henry Ellis Warren Professor of Electrical Engineering and Computer Science.

Her work spans computer networks, wireless sensing, applied machine learning, and digital health.

Madry’s research spans algorithmic graph theory, optimization, and machine learning.

M…

2 месяца назад @ news.mit.edu
Software to accelerate R&D
Software to accelerate R&D Software to accelerate R&D

The situation often requires scientists to leave the lab bench to spend time gathering and merging data from various experiments.

The company’s platform allows scientists to access data from anywhere, merge data using customized parameters, and create visualizations to share findings with others.

Uncountable’s goal is to accelerate innovation by giving scientists developing new materials and products a better way to use the data that drive decisions.

“Our goal internally is, ‘Can we make R&D more efficient by a factor of 10?’” Hollingsworth explains.

“We get to that point faster, and it speeds up the whole R&D process.”Carbon is one of several 3-D printing companies Uncountable works with.

2 месяца, 1 неделя назад @ news.mit.edu
US Air Force pilots get an artificial intelligence assist with scheduling aircrews
US Air Force pilots get an artificial intelligence assist with scheduling aircrews US Air Force pilots get an artificial intelligence assist with scheduling aircrews

Take it from U.S. Air Force Captain Kyle McAlpin when he says that scheduling C-17 aircraft crews is a headache.

An artificial intelligence research flight commander for the Department of Air Force–MIT AI Accelerator Program, McAlpin is also an experienced C-17 pilot.

Collaborating with their Air Force sponsor organization, Tron, the team has developed an AI-enabled plugin for the existing C-17 scheduling tool to fulfill that vision.

Puckboard is used widely across the Air Force for other scheduling needs, though each optimization problem is unique.

He himself personifies all three institutes, as an MIT student, a Lincoln Laboratory Military Fellow, and a lieutenant in the Air Force.

2 месяца, 1 неделя назад @ news.mit.edu
Berkeley AI
последний пост 1 месяц, 4 недели назад
Universal Weakly Supervised Segmentation by Pixel-to-Segment Contrastive Learning
Universal Weakly Supervised Segmentation by Pixel-to-Segment Contrastive Learning Universal Weakly Supervised Segmentation by Pixel-to-Segment Contrastive Learning

Universal Weakly Supervised Segmentation by Pixel-to-Segment Contrastive LearningWe consider a problem: Can a machine learn from a few labeled pixels to predict every pixel in a new image?

Weak supervision can be roughly categorized into two families: Coarse and Sparse supervision.

Metric Learning and Contrastive Loss FormulationTo solve the semi-supervised learning problem, we take the viewpoint of feature representation learning.

A Solution for Universal Weakly Supervised SegmentationIn this work, we propose a single method to tackle all forms of weak supervision, even if they carry different assumptions.

We thank all co-authors of the paper “Universal Weakly Supervised Segmentation by …

1 месяц, 4 недели назад @ bair.berkeley.edu
The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games
The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games

We refer to PPO with these modifications as Multi-Agent PPO (MAPPO).

Overall, we observe that in the majority of environments, MAPPO achieves results comparable or superior to off-policy methods with comparable sample-efficiency.

Additionally, this suggests that despite a heavy emphasis on developing new off-policy methods for MARL, on-policy methods such as PPO can be a promising direction for future research.

These include:Investigating MAPPO’s performance on a wider range of domains, such as competitive games or multi-agent settings with continuous action spaces.

This post is based on the paper “The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games”.

2 месяца, 1 неделя назад @ bair.berkeley.edu
BASALT: A Benchmark for Learning from Human Feedback
BASALT: A Benchmark for  Learning from Human Feedback BASALT: A Benchmark for Learning from Human Feedback

Despite the plethora of techniques developed to tackle this problem, there have been no popular benchmarks that are specifically intended to evaluate algorithms that learn from human feedback.

In contrast, there is effectively no chance of such an unsupervised method solving BASALT tasks.

Design a “caption prompt” for each BASALT task that induces the policy to solve that task.

We impose limits on the amount of compute and human feedback that submissions can use to prevent this scenario.

ConclusionWe hope that BASALT will be used by anyone who aims to learn from human feedback, whether they are working on imitation learning, learning from comparisons, or some other method.

2 месяца, 2 недели назад @ bair.berkeley.edu
Learning What To Do by Simulating the Past
Learning What To Do by Simulating the Past Learning What To Do by Simulating the Past

Preferences Implicit in the State of the World develops an algorithm, Reward Learning by Simulating the Past (RLSP), that does this sort of reasoning, allowing an agent to infer human preferences without explicit feedback.

In our latest paper presented at ICLR 2021, we introduce Deep Reward Learning by Simulating the Past (Deep RLSP), an extension of the RLSP algorithm that can be scaled up to tasks like the balancing Cheetah task.

To address this, we sample likely past trajectories, instead of enumerating all possible past trajectories.

By alternating between predicting past actions, and predicting past states from which those actions were taken, we can simulate trajectories arbitrarily fa…

4 месяца, 2 недели назад @ bair.berkeley.edu
An EPIC way to evaluate reward functions
An EPIC way to evaluate reward functions An EPIC way to evaluate reward functions

Our method, Equivalent-Policy Invariant Comparison (EPIC), allows one to evaluate a reward function by computing how similar it is to other reward functions.

EPIC can be used to benchmark reward learning algorithms by comparing learned reward functions to a ground-truth reward.

It can also be used to validate learned reward functions prior to deployment, by comparing them against reward functions learned via different techniques or data sources.

EPIC is a new way to evaluate reward functions and reward learning algorithms by comparing how similar reward functions are to one another.

Most significantly, EPIC can only compare reward functions to one another, and cannot tell you what a particu…

5 месяцев назад @ bair.berkeley.edu
The Importance of Hyperparameter Optimization for Model-based Reinforcement Learning
The Importance of Hyperparameter Optimization for Model-based Reinforcement Learning The Importance of Hyperparameter Optimization for Model-based Reinforcement Learning

The Importance of Hyperparameter Optimization for Model-based Reinforcement LearningModel-based reinforcement learning (MBRL) is a variant of the iterative learning framework, reinforcement learning, that includes a structured component of the system that is solely optimized to model the environment dynamics.

MBRLModel-based reinforcement learning (MBRL) is an iterative framework for solving tasks in a partially understood environment.

With that data, the agent creates a structured learning tool – a dynamics model – to reason about the world.

Automated Machine Learning (AutoML) is a field dedicated to the study of using machine learning algorithms to tune our machine learning tools.

Thi…

5 месяцев назад @ bair.berkeley.edu
Pretrained Transformers as Universal Computation Engines
Pretrained Transformers as Universal Computation Engines Pretrained Transformers as Universal Computation Engines

Pretrained Transformers as Universal Computation EnginesTransformers have been successfully applied to a wide variety of modalities: natural language, vision, protein modeling, music, robotics, and more.

This enables the models to utilize generalizable high-level embeddings trained on a large dataset to avoid overfitting to a small task-relevant dataset.

To illustrate this, we take a pretrained transformer language model and finetune it on various classification tasks: numerical computation, vision, and protein fold prediction.

Furthermore, we find the language-pretrained frozen transformers converge faster than the randomly initialized frozen transformers, typically by a factor of 1-4x, in…

6 месяцев назад @ bair.berkeley.edu
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Maximum Entropy RL (Provably) Solves Some Robust RL Problems Maximum Entropy RL (Provably) Solves Some Robust RL Problems

Our analysis provides a theoretically-justified explanation for the empirical robustness of MaxEnt RL, and proves that MaxEnt RL is itself a robust RL algorithm.

In the rest of this post, we’ll provide some intuition into why MaxEnt RL should be robust and what sort of perturbations MaxEnt RL is robust to.

Standard RL MaxEnt RL Trained and evaluated without the obstacle: Trained without the obstacle, but evaluated with the obstacle:TheoryWe now formally describe the technical results from the paper.

Standard RL MaxEnt RL Evaluation on adversarial perturbationsMaxEnt RL is robust to adversarial perturbations of the hole (where the robot inserts the peg).

ConclusionIn summary, our paper sho…

6 месяцев, 2 недели назад @ bair.berkeley.edu
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Maximum Entropy RL (Provably) Solves Some Robust RL Problems Maximum Entropy RL (Provably) Solves Some Robust RL Problems

Maximum Entropy RL (Provably) Solves Some Robust RL ProblemsNearly all real-world applications of reinforcement learning involve some degree of shift between the training environment and the testing environment.

In a recent paper, we prove that every MaxEnt RL problem corresponds to maximizing a lower bound on a robust RL problem.

In the rest of this post, we’ll provide some intuition into why MaxEnt RL should be robust and what sort of perturbations MaxEnt RL is robust to.

ConclusionIn summary, this paper shows that a commonly-used type of RL algorithm, MaxEnt RL, is already solving a robust RL problem.

We do not claim that MaxEnt RL will outperform purpose-designed robust RL algorithms.

6 месяцев, 2 недели назад @ bair.berkeley.edu
Self-Supervised Policy Adaptation during Deployment
Self-Supervised Policy Adaptation during Deployment Self-Supervised Policy Adaptation during Deployment

Self-Supervised Policy Adaptation during DeploymentOur method learns a task in a fixed, simulated environment and quickly adapts to new environments (e.g.

Assuming that gradients of the self-supervised objective are sufficiently correlated with those of the RL objective, any adaptation in the self-supervised task may also influence and correct errors in the perception and decision-making of the policy.

SAC+IDM is a Soft Actor-Critic (SAC) policy trained with an Inverse Dynamics Model (IDM), and SAC+IDM (PAD) is the same policy but with the addition of policy adaptation during deployment on the robot.

Policy adaptation is especially effective when the test environment differs from the traini…

6 месяцев, 3 недели назад @ bair.berkeley.edu
The Successor Representation, $\gamma$-Models, and Infinite-Horizon Prediction
The Successor Representation, $\gamma$-Models, and Infinite-Horizon Prediction The Successor Representation, $\gamma$-Models, and Infinite-Horizon Prediction

The Successor Representation, $\gamma$-Models,and Infinite-Horizon PredictionThe Successor Representation, Gamma-Models, and Infinite-Horizon PredictionStandard single-step models have a horizon of one.

In order to amortize this long-horizon prediction, value functions are trained with either Monte Carlo estimates of expected cumulative reward or with dynamic programming.

In contrast, value functions amortize the work of long-horizon prediction at training, so a single-step prediction (and informally, a shorter "horizon") is sufficient during testing.

As opposed to incrementing one timestep into the future with every prediction, \(\gamma\)-model rollout steps have a negative binomial distri…

8 месяцев, 2 недели назад @ bair.berkeley.edu
Does GPT-2 Know Your Phone Number?
Does GPT-2 Know Your Phone Number? Does GPT-2 Know Your Phone Number?

Does GPT-2 Know Your Phone Number?

Yet, OpenAI’s GPT-2 language model does know how to reach a certain Peter W --- (name redacted for privacy).

Maybe the model memorized credit card numbers, or maybe it memorized entire book passages, or even code snippets.

For example, we retain any sample on which GPT-2 assigns a much higher likelihood than a different language model (e.g., a smaller variant of GPT-2).

Does Training Language Models Infringe on Copyright?

9 месяцев назад @ bair.berkeley.edu
Offline Reinforcement Learning: How Conservative Algorithms Can Enable New Applications
Offline Reinforcement Learning: How Conservative Algorithms Can Enable New Applications Offline Reinforcement Learning: How Conservative Algorithms Can Enable New Applications

Offline Reinforcement Learning: How Conservative Algorithms Can Enable New ApplicationsDeep reinforcement learning has made significant progress in the last few years, with success stories in robotic control, game playing and science problems.

As shown in the figure below, offline RL requires learning skills solely from previously collected datasets, without any active environment interaction.

COG: Learning Skills That Generalize via Offline RLCOG is an algorithmic framework for utilizing large, unlabeled datasets of diverse behavior to learn generalizable policies via offline RL.

Like supervised learning methods, offline RL algorithms can also “overfit” as a result of excessive trainin…

9 месяцев, 2 недели назад @ bair.berkeley.edu
AWS Machine Learning AWS Machine Learning
последний пост 2 дня, 15 часов назад
Arçelik hosts global AWS DeepRacer League using new LIVE feature to educate over 200 employees on machine learning
Arçelik hosts global AWS DeepRacer League using new LIVE feature to educate over 200 employees on machine learning Arçelik hosts global AWS DeepRacer League using new LIVE feature to educate over 200 employees on machine learning

As phase one of Arçelik’s AI2 program, Arçelik hosted their first global AWS DeepRacer League to help upskill their employees on AI and ML.

Arçelik’s AWS DeepRacer journeyTo get started on their event, Arçelik established a steering committee and worked with the AWS customer team and AWS DeepRacer Specialist Solutions Architects (the AWS DeepRacer Pit Crew) to work through the League format, racetrack, and event calendar.

89% of the participants stated they would race in an AWS DeepRacer League again in the future.

Create an AWS DeepRacer LIVE event now and learn more in AWS DeepRacer documentation.

Crafted carefully in collaboration with AWS, the AI2-Immerse step consists of an AWS DeepRac…

2 дня, 15 часов назад @ aws.amazon.com
Train fraudulent payment detection with Amazon SageMaker
Train fraudulent payment detection with Amazon SageMaker Train fraudulent payment detection with Amazon SageMaker

Businesses can use Amazon Fraud Detector to detect online payment fraud.

Fortunately, the emergence of cloud computing makes the training and deployment of ML models easier and more cost-efficient.

The solution uses two foundational AWS services: Amazon S3 and Amazon SageMaker.

Amazon SageMakerAfter labeled card transcation data is uploaded to Amazon S3, SageMaker can run ML algorithms to process the data and train a model for detecting fraudulent transactions.

For a step-by-step tutorial on SageMaker Script Mode, check out Bring your own model with Amazon SageMaker script mode and Bring Your Own Custom ML Models with Amazon SageMaker.

2 дня, 20 часов назад @ aws.amazon.com
Perform interactive data engineering and data science workflows from Amazon SageMaker Studio notebooks
Perform interactive data engineering and data science workflows from Amazon SageMaker Studio notebooks Perform interactive data engineering and data science workflows from Amazon SageMaker Studio notebooks

Amazon SageMaker Studio is the first fully integrated development environment (IDE) for machine learning (ML).

Query data using the PyHive library from the Python3 (Data Science) kernelIn this example, we use the Python 3 (Data Science) kernel.

Preprocess data and feature engineeringWe perform data preprocessing and feature engineering on the data using SageMaker Processing.

Delete Amazon SageMaker Studio AppsNavigate to Amazon SageMaker Studio Console.

He leads SageMaker Studio team to build it into the IDE of choice for interactive data science and data engineering workflows.

2 дня, 21 час назад @ aws.amazon.com
Launch Amazon SageMaker Studio from external applications using presigned URLs
Launch Amazon SageMaker Studio from external applications using presigned URLs Launch Amazon SageMaker Studio from external applications using presigned URLs

Amazon SageMaker Studio provides a single, web-based visual interface where you can perform all ML development steps, improving data science team productivity by up to 10 times.

In this post, we discuss how to launch Studio from external applications using presigned URLs.

Create and deploy a Lambda function to create a presigned URLTo create and deploy your Lambda function, complete the following steps:Create a Python Lambda function named sm-presigned .

For added security, we recommend controlling and managing access to your API using the API Gateway authentication and authorization method.

The following is the sample code to invoke the URL:index.html Choose Go to Amazon SageMaker Stud…

4 дня, 18 часов назад @ aws.amazon.com
Custom document annotation for extracting named entities in documents using Amazon Comprehend
Custom document annotation for extracting named entities in documents using Amazon Comprehend Custom document annotation for extracting named entities in documents using Amazon Comprehend

To train a custom entity recognition model that you can use on your PDF, Word, and plain text documents, you need to first annotate PDF documents using a custom Amazon SageMaker Ground Truth annotation template provided by Amazon Comprehend.

Historical options for processing PDF documents required converting documents to raw text format before processing through Amazon Comprehend custom entity recognition models.

You use this manifest file to create an Amazon Comprehend custom entity recognition training job and train your custom model.

For additional suggestions, see Improving Custom Entity Recognizer PerformanceConclusionWith this new contextual information included within labeled annotat…

4 дня, 21 час назад @ aws.amazon.com
Extract custom entities from documents in their native format with Amazon Comprehend
Extract custom entities from documents in their native format with Amazon Comprehend Extract custom entities from documents in their native format with Amazon Comprehend

Historically, you could only use Amazon Comprehend on plain text documents, which required you to flatten the documents into machine-readable text.

To extract text and spatial locations of text from scanned PDF documents, Amazon Comprehend calls Amazon Textract on your behalf before processing for custom entity recognition.

Train an Amazon Comprehend custom entity recognition model via the consoleIn the previous post, we showed you how to annotate finance documents via Amazon SageMaker Ground Truth using the custom annotation template provided by Amazon Comprehend.

To do so via the Amazon Comprehend console, complete the following steps:On the Amazon Comprehend console, choose Custom entity…

4 дня, 21 час назад @ aws.amazon.com
AWS is redefining how companies process documents in a digital world
AWS is redefining how companies process documents in a digital world AWS is redefining how companies process documents in a digital world

As a result, creating and managing a document processing pipeline remains a challenge for many companies.

AWS launched AI services like Amazon Textract, Amazon Comprehend, and others to help with the automation of extracting insights from documents.

Customer success storiesWith AWS, customers like Black Knight, Liberty Mutual, and Broadridge Financial Solutions have been able to process millions of documents that come through their pipeline using AWS AI.

With AWS AI, they can process insurance claims at a much higher speed and with greater accuracy.

AWS technologies have enabled underwriters to process documents to review results, adjust analyses and request additional documents and informa…

4 дня, 21 час назад @ aws.amazon.com
Introducing PII identification and redaction in streaming transcriptions using Amazon Transcribe
Introducing PII identification and redaction in streaming transcriptions using Amazon Transcribe Introducing PII identification and redaction in streaming transcriptions using Amazon Transcribe

Today, we’re excited to announce a new feature of Amazon Transcribe that can help achieve this: PII identification and redaction in streaming transcriptions.

", "Items":[ { "Confidence":1, "Content":"My", "EndTime":0.67, "StartTime":0.6, "Type":"pronunciation", "VocabularyFilterMatch":false }, { "Confidence":1, "Content":"name", "EndTime":0.95, "StartTime":0.68, "Type":"pronunciation", "VocabularyFilterMatch":false }, { "Confidence":1, "Content":"is", "EndTime":1.14, "StartTime":0.96, "Type":"pronunciation", "VocabularyFilterMatch":false }, { "Confidence":0.96, "Content":"[NAME]", "EndTime":1.71, "StartTime":1.15, "Type":"pronunciation", "VocabularyFilterMatch":false }, { "Content":".

", "I…

5 дней, 17 часов назад @ aws.amazon.com
Perform audio redaction for personally identifiable information with Amazon Transcribe
Perform audio redaction for personally identifiable information with Amazon Transcribe Perform audio redaction for personally identifiable information with Amazon Transcribe

Amazon Transcribe is an automatic speech recognition (ASR) service that makes it easy to add speech-to-text capabilities to your applications.

Automatic content redaction is a feature of Amazon Transcribe that can automatically remove information such as sensitive personally identifiable information (PII) from your transcription results.

Solution overviewThe following figure shows an example architecture for performing PII audio redaction, using Amazon Simple Storage Service (Amazon S3) and AWS Lambda.

For this post, we provide an AWS CloudFormation audio redaction template, which provides the full details of the implementation to enable repeatable deployments.

And I hope that Amazon transc…

5 дней, 21 час назад @ aws.amazon.com
Get value from every customer touchpoint using Amazon Connect as a data gathering mechanism
Get value from every customer touchpoint using Amazon Connect as a data gathering mechanism Get value from every customer touchpoint using Amazon Connect as a data gathering mechanism

Amazon Connect provides a unique opportunity for gathering data from these engagements and customer touchpoints that helps improve your business.

Call note storage and transcription, sentiment analysis, and text search are available out of the box from Connect Lens for Amazon Connect.

Your Amazon Kendra data source is scheduled to update itself every day at 10 AM.

Create an Amazon Connect instanceThe first step is to create an Amazon Connect instance.

You can also integrate these use cases into your CRM (such as Salesforce or Zendesk) by using Amazon Connect integration features.

5 дней, 22 часа назад @ aws.amazon.com
Manage your Amazon Fraud Detector resources in an automated and secure manner using AWS CloudFormation
Manage your Amazon Fraud Detector resources in an automated and secure manner using AWS CloudFormation Manage your Amazon Fraud Detector resources in an automated and secure manner using AWS CloudFormation

Moreover, it allows you to stack your Amazon Fraud Detector resources with other AWS service resources that work with Amazon Fraud Detector, such as AWS Lambda functions that request event fraud predictions.

AWS CloudFormation conceptsBefore we get started, let’s review some core AWS CloudFormation concepts.

Solution overviewWe walk through the following high-level steps to create and update an Amazon Fraud Detector stack in AWS CloudFormation:Download a sample Amazon Fraud Detector template.

Verify that the Amazon Fraud Detector resources were createdAfter you create the stack, you’re redirected to the AWS CloudFormation console, where you can now see your Amazon Fraud Detector stack.

For …

1 неделя, 2 дня назад @ aws.amazon.com
The development of Bundesliga Match Fact Passing Profile, a deep dive into passing in football
The development of Bundesliga Match Fact Passing Profile, a deep dive into passing in football The development of Bundesliga Match Fact Passing Profile, a deep dive into passing in football

The new Bundesliga Match Fact Passing Profile uncovers exactly that by providing real-time insights into the passing capabilities of all players and teams in the Bundesliga.

For example, they can be aggregated for each player to form a passing profile, showing which passing decisions players make.

Passing profile and efficiencyWe can use this passing or xPass (expected passes) model to estimate the passing profile of a player and his passing efficiency.

Given the experimental nature of model training, the actual training pipeline resides on our development environment.

SummaryIn this post, we demonstrated how the Bundesliga Match Fact Passing Profile makes it possible to objectively compare…

1 неделя, 3 дня назад @ aws.amazon.com
Boost transcription accuracy of class lectures with custom language models for Amazon Transcribe
Boost transcription accuracy of class lectures with custom language models for Amazon Transcribe Boost transcription accuracy of class lectures with custom language models for Amazon Transcribe

When transcribing content that is more specialized or domain-specific such as biology, Amazon Transcribe offers custom language models (CLM).

Solution overviewWith the CLM feature in Amazon Transcribe, you can build your own custom model for your class course content and improve the transcription accuracy of your class lectures.

For Training data , enter the S3 folder path for your training data.

Snippet 1 – Standard Amazon Transcribe Snippet 1 – Amazon Transcribe with CLM Another unique feature in some cells is flat Gela .

Standard Amazon Transcribe WER Amazon Transcribe CLM WER Standard Amazon Transcribe Accuracy Amazon Transcribe CLM Accuracy # Words Words Improved by CLM Lecture 1 9.5% …

1 неделя, 5 дней назад @ aws.amazon.com
Fully customizable action space now available on the AWS DeepRacer console
Fully customizable action space now available on the AWS DeepRacer console Fully customizable action space now available on the AWS DeepRacer console

Starting today, the model action space is fully customizable yet simplified with new dynamic graphics so developers have greater control and can easily unlock better model performance.

Advanced AWS DeepRacer developers need the ability to customize the action space to increase model performance.

Until now, you were limited to 20 options for modifying steering angle and speed in your action space on the AWS DeepRacer console.

A continuous action space allows the agent to select an action from a range of values for each state.

You can make discrete action space changes directly on the radial polar graph and by updating the action list.

1 неделя, 5 дней назад @ aws.amazon.com
Announcing the Amazon S3 plugin for PyTorch
Announcing the Amazon S3 plugin for PyTorch Announcing the Amazon S3 plugin for PyTorch

Amazon S3 plugin for PyTorch is an open-source library which is built to be used with the deep learning framework PyTorch for streaming data from Amazon Simple Storage Service (Amazon S3).

The Amazon S3 plugin for PyTorch is designed to be a high-performance PyTorch dataset library to efficiently access data stored in S3 buckets.

Building blocksThe Amazon S3 plugin for PyTorch provides a native experience of using data from Amazon S3 to PyTorch without adding complexity in your code.

The Amazon S3 plugin for PyTorch is available to use through pre-configured PyTorch Docker images, or directly from the GitHub repository.

The Amazon S3 plugin for PyTorch was designed for ease of use and flexi…

1 неделя, 5 дней назад @ aws.amazon.com
NVIDIA
последний пост 2 часа назад
Find the Love We Shared in September: NVIDIA Canvas Update Paints With New Styles
Find the Love We Shared in September: NVIDIA Canvas Update Paints With New Styles Find the Love We Shared in September: NVIDIA Canvas Update Paints With New Styles

Supporting the new Canvas update is the September Studio Driver, ready for download today.

In addition to today’s Canvas update, NVIDIA Omniverse recently expanded the metaverse by millions, while an NVIDIA Broadcast app update earlier this month improved the background noise removal network at the most critical moment in livestreaming.

For creators like Juan, an NVIDIA Studio laptop powered by a GeForce RTX 3060 will help speed his workflow dramatically.

And if her team is also powered by NVIDIA Studio, they can collaborate in real time with NVIDIA Omniverse.

Learn more about NVIDIA Studio systems and check out the compare GPU page for a deeper dive including options for professionals.

2 часа назад @ blogs.nvidia.com
Pushing Forward the Frontiers of Natural Language Processing
Pushing Forward the Frontiers of Natural Language Processing Pushing Forward the Frontiers of Natural Language Processing

They need something general and flexible, and that’s why we build what we build.”Large Language Models Are Changing the WorldOne of the most exciting areas of AI is language modeling, which is enabling groundbreaking applications in natural language understanding and conversational AI.

The complexity of large language models is growing at an incredible rate, with parameter counts doubling every two months.

A well-known example of a large and powerful language model is GPT-3, developed by OpenAI.

During his talk, Catanzaro showed an example of the surprising capabilities of large language models to solve new tasks without being explicitly trained to do so.

“These language models are first st…

3 дня, 14 часов назад @ blogs.nvidia.com
GeForce NOW Members Are Free to Play a Massive Library of Most-Played Games, Included With Membership
GeForce NOW Members Are Free to Play a Massive Library of Most-Played Games, Included With Membership GeForce NOW Members Are Free to Play a Massive Library of Most-Played Games, Included With Membership

Want to play awesome PC games for free without having to buy an expensive gaming rig?

This GFN Thursday takes a look at the 90+ free-to-play PC games — including this week’s Fortnite Season 8 release and the Epic Games Store free game of the week, Speed Brawl, free to claim Sept. 16-23 — all streaming on GeForce NOW.

Members can connect to popular PC game stores, like Steam and Epic Games Store, using their accounts to stream titles like Fortnite and Destiny 2.

GeForce NOW members can build their library with many of the Epic Games Store free games of the week.

Once claimed, Epic Games Store free games stay in your library, whether you’re playing on GeForce NOW or on a local PC.

4 дня, 2 часа назад @ blogs.nvidia.com
New AI Research Ranks Cities Fighting Climate Change with Sustainable Rooftops
New AI Research Ranks Cities Fighting Climate Change with Sustainable Rooftops New AI Research Ranks Cities Fighting Climate Change with Sustainable Rooftops

A new AI-mapping tool is helping scientists assess how cities across the globe are using rooftops to combat climate change.

Named Roofpedia, the research creates an open-source and scalable map of sustainable rooftops—a promising strategy for climate mitigation.

Identifying areas with solar or green installations could help guide urban development, while also boosting community health, prosperity, and the environment.

Studies have shown the benefits of sustainable rooftops—most commonly roofs with solar or green space— are plentiful and wide-ranging.

There are over a million buildings in the current data set and the researchers note that more cities are being added as aerial or satellite im…

4 дня, 22 часа назад @ developer.nvidia.com
NVIDIA to Drive “Advances for Decades to Come,” Time Magazine Writes
NVIDIA to Drive “Advances for Decades to Come,” Time Magazine Writes NVIDIA to Drive “Advances for Decades to Come,” Time Magazine Writes

Highlighting NVIDIA’s fast-growing impact, Time magazine Wednesday named NVIDIA CEO Jensen Huang to its list of most influential people of 2021.

“Artificial intelligence is transforming our world,” writes Ng, who is founder of DeepLearning.AI, founder and CEO of Landing AI, and chairman and co-founder of Coursera.

Others profiled include U.S. President Joe Biden, Tesla CEO Elon Musk, Buccaneers Quarterback Tom Brady and singer Billie Eilish.

“In 2003, amid great skepticism, Huang directed his company Nvidia to adapt chips designed to paint graphics on computer screens, known as graphics processing units or GPUs, to perform other, more general-purpose computing tasks,” Ng explains.

“The resu…

5 дней, 1 час назад @ blogs.nvidia.com
Whale Hello There: NVIDIA Intern Part of Team Working to Understand, Communicate with Whales
Whale Hello There: NVIDIA Intern Part of Team Working to Understand, Communicate with Whales Whale Hello There: NVIDIA Intern Part of Team Working to Understand, Communicate with Whales

Sperm whales, the largest of the toothed whales, boast enormous brains, explains David Gruber, a professor of biology and environmental science at the City University of New York.

Project CETI’s plan: decipher what sperm whales say to each other.

Marine biologists have long known sperm whales are unique.

To that end, Gruber’s effort is working to gather vast quantities of data about what sperm whales say to one another.

That said, putting these kinds of tools to work to understand whales will be a huge challenge.

5 дней, 2 часа назад @ blogs.nvidia.com
Medical AI Needs Federated Learning, So Will Every Industry
Medical AI Needs Federated Learning, So Will Every Industry Medical AI Needs Federated Learning, So Will Every Industry

Other large-scale federated learning projects are already underway in the healthcare industry, including a five-member study for mammogram assessment and pharmaceutical giant Bayer’s work training an AI model for spleen segmentation.

Federated Learning: AI Takes a VillageCompanies and research institutions developing AI models are typically limited by the data available to them.

It’s among the largest, most diverse clinical federated learning studies to date.

Businesses and research institutions getting started with federated learning can use the NVIDIA AI Enterprise software suite of AI tools and frameworks, optimized to run on NVIDIA-Certified Systems.

Learn more about the science behind …

5 дней, 6 часов назад @ blogs.nvidia.com
Get Free Training in Deep learning, Accelerated Computing, and Data Science
Get Free Training in Deep learning, Accelerated Computing, and Data Science Get Free Training in Deep learning, Accelerated Computing, and Data Science

For the first time, the NVIDIA Deep Learning Institute (DLI) is offering free, one-click notebooks for exploratory hands-on experience in deep learning, accelerated computing, and accelerated data science.

This free notebook demonstrates significant speed-up by moving common DataFrame operations to the GPU with minimal changes to existing code.

You will explore common data preparation processes and compare data manipulation performance using GPUs compared to CPUs.

An Even Easier Introduction to CUDALearn the basics of writing parallel CUDA kernels to run on NVIDIA GPUs.

You will have access to GPUs in the cloud and develop skills in AI, accelerated computing, accelerated data science, graph…

5 дней, 21 час назад @ developer.nvidia.com
AI Vision Guides University of Florida’s Rise in College Rankings
AI Vision Guides University of Florida’s Rise in College Rankings AI Vision Guides University of Florida’s Rise in College Rankings

“We’re doing AI in medicine, AI in drugs, AI in agriculture, AI in business … we think that this is where higher education is going to inevitably go,” Glover added in a podcast with John Koetsier of TechFirst.

Plugged Into an AI SupercomputerPowering UF’s vision is HiPerGator AI, the eighth most powerful supercomputer in higher education and 22nd most powerful supercomputer in the world.

“All of these things involve huge amounts of data, and this is where the HiPerGator AI really excels,” Glover said.

An AI Hub for FloridaUF will make the supercomputer available through the state university system, creating an AI hub for others to enhance education and research.

“We’re glad to see the unive…

6 дней назад @ blogs.nvidia.com
Researchers Use GPU to Train Invisible AI Keyboard
Researchers Use GPU to Train Invisible AI Keyboard Researchers Use GPU to Train Invisible AI Keyboard

The Invisible Mobile Keyboard, or IMK, created by researchers at the Korea Advanced Institute of Science and Technology, lets users blast through 51.6 words per minute while reporting lower strain levels.

It then converts it into a character sequence by using the touch locations on the invisible keyboard.

Then, the semantic decoder corrects decoding errors in the character sequence estimated by the geometric decoder by considering semantic meanings.

The researchers trained the semantic decoder on the One Billion Word Benchmark created by Cambridge University and the University of Edinburgh and Google in 2014.

The result: users could type 157.5% faster using the Invisible Mobile Keyboard tha…

6 дней, 2 часа назад @ blogs.nvidia.com
How to Use NVIDIA Highlights, Freestyle and Montage in GeForce NOW
How to Use NVIDIA Highlights, Freestyle and Montage in GeForce NOW How to Use NVIDIA Highlights, Freestyle and Montage in GeForce NOW

Highlights is supported in popular GeForce NOW titles such as Apex Legends, Destiny 2 and Rocket League.

Selecting or deselecting these moments, and toggling Highlights on or off, can also be done from the GeForce NOW in-game overlay.

Nearly all GeForce NOW games support Freestyle and can be enabled in game through the overlay.

Titles that support both NVIDIA Highlights and Freestyle can capture gameplay video with filters turned on, for even more colorful and artistic videos.

The Perfect Highlight ReelUsing NVIDIA Highlights and existing video clips, members can create custom highlight videos to show off their favorite gaming moments with NVIDIA Montage.

1 неделя назад @ blogs.nvidia.com
The Bright Continent: AI Fueling a Technological Revolution in Africa
The Bright Continent: AI Fueling a Technological Revolution in Africa The Bright Continent: AI Fueling a Technological Revolution in Africa

Grassroot communities are essential to driving AI innovation, according to Kate Kallot, head of emerging areas at NVIDIA.

On its opening day, Kallot gave a keynote speech at the largest AI Expo Africa to date, addressing a virtual crowd of 10,000 people.

“I hope to inspire you on ways to fuel your own applications and help advance the African AI revolution,” Kallot said.

To do so, she illustrated how the AI revolution is underway in Africa.

According to Kallot, NVIDIA GPUs are used by many research groups across Africa — including the Kenya Education Network Trust, Makerere University, Stellenbosch University and the University of Cape Town — which enable and nurture the cutting-edge techno…

1 неделя, 2 дня назад @ blogs.nvidia.com
Streamline Your Model Builds with PyCaret + RAPIDS on NVIDIA GPUs
Streamline Your Model Builds with PyCaret + RAPIDS on NVIDIA GPUs Streamline Your Model Builds with PyCaret + RAPIDS on NVIDIA GPUs

The PyCaret team added NVIDIA GPU support in version 2.2, including all the latest and greatest from RAPIDS.

With GPU acceleration, PyCaret modeling times can be between 2 and 200 times faster depending on the workload.

This post will go over how to use PyCaret on GPUs to save both development and computation costs by an order of magnitude.

For simplicity, GPU code was written to run on a single GPU.

The compare_models command trains all the models in PyCaret’s model library using default hyperparameters and evaluates performance metrics using cross-validation.

1 неделя, 2 дня назад @ developer.nvidia.com
Autonomy, Electrification, Sustainability Take Center Stage at Germany’s IAA Auto Show
Autonomy, Electrification, Sustainability Take Center Stage at Germany’s IAA Auto Show Autonomy, Electrification, Sustainability Take Center Stage at Germany’s IAA Auto Show

The NVIDIA DRIVE platform, Kani said, can handle all of the critical layers necessary for AVs to safely reach our streets.

Cars, Concepts and CollaborationSeveral NVIDIA DRIVE partners demonstrated their latest green tech, concepts and innovations.

It’s a single widescreen panel that runs on the high-performance, energy-efficient NVIDIA DRIVE platform for instantaneous AI processing and dynamic graphics capabilities.

Powered by NVIDIA DRIVE, ZF ProAI is available in scalable performance levels up to 1,000 TOPS.

Learn more about the NVIDIA DRIVE platform, and don’t forget to register for GTC, which is coming up Nov. 8-11.

1 неделя, 2 дня назад @ blogs.nvidia.com
Top 3 Pillars of AI Enabled Edge Computing in Retail
Top 3 Pillars of AI Enabled Edge Computing in Retail Top 3 Pillars of AI Enabled Edge Computing in Retail

Global retailers and suppliers are faced with navigating rapidly changing consumer demand, behavior, and expectations.

For online shoppers, AI is helping create personalized shopping journeys and product recommendations.

From customer engagement, to operational agility, to seamless omnichannel management, AI at the edge is transforming retail.

Retailers Adopt AI for Improved In-Store Experiences, Less ShrinkageRetailers are leveraging AI at the edge to analyze data from in-store cameras and sensors, and create intelligent stores.

Future of Edge Computing in Retail and BeyondBillions of connected sensors are coming online in retail stores, city streets, hospitals, warehouses, and more.

1 неделя, 3 дня назад @ developer.nvidia.com
Facebook
последний пост 2 месяца назад
Fully Sharded Data Parallel: faster AI training with fewer GPUs
Fully Sharded Data Parallel: faster AI training with fewer GPUs Fully Sharded Data Parallel: faster AI training with fewer GPUs

It shards an AI model’s parameters across data parallel workers and can optionally offload part of the training computation to the CPUs.

For example, typical data parallel training requires maintaining redundant copies of the model on each GPU, and model parallel training introduces additional communication costs to move activations between workers (GPUs).

Using FSDP in computer vision modelsFor computer vision models, FSDP is supported in VISSL and tested on RegNets architectures.

Users may need to carefully tune the activation checkpointing strategy to fit a large model within limited GPU memory space.

We look forward to developing algorithms for auto-tuning both GPU memory usage and trai…

2 месяца назад @ engineering.fb.com
Asicmon: A platform agnostic observability system for AI accelerators
Asicmon: A platform agnostic observability system for AI accelerators Asicmon: A platform agnostic observability system for AI accelerators

We will be hosting a talk about our work on, “A Platform Agnostic Observability System for AI Accelerators” during our virtual Systems @Scale event at 10:20 a.m. PT on Wednesday, June 30, followed by a live Q&A session.

To meet these challenges, we’ve introduced three new tools:ASIC Monitoring (Asicmon) , a scalable observability framework.

However, with an accelerator system, we can imagine the CPU now has a complicated and brawnier sibling!

Since implementing Asicmon we’ve been able to increase our AI accelerator metrics support from ~30 percent to ~75 percentAtrace: Accelerator tracing at scaleWhy tracing?

This would allow us to debug the end-to-end latency of microservices that use AI a…

2 месяца, 3 недели назад @ engineering.fb.com
How Facebook encodes your videos
How Facebook encodes your videos How Facebook encodes your videos

People upload hundreds of millions of videos to Facebook every day.

From a pure computing perspective, applying the most advanced codecs to every video uploaded to Facebook would be prohibitively inefficient.

A relatively small percentage (roughly one-third) of all videos on Facebook generate the majority of overall watch time.

The impact of the new video encoding modelIn addition to improving viewer experience with newly uploaded videos, the new model can identify older videos on Facebook that should have been encoded with more advanced encodings and route more computing resources to them.

The improved compression has also allowed people on Facebook with limited data plans, such as those i…

5 месяцев, 2 недели назад @ engineering.fb.com
How machine learning powers Facebook’s News Feed ranking algorithm
How machine learning powers Facebook’s News Feed ranking algorithm How machine learning powers Facebook’s News Feed ranking algorithm

Models for meaningful interactions and quality content are powered by state-of-the-art ML, such as multitask learning on neural networks, embeddings, and offline learning systems.

We are sharing new details of how we designed an ML-powered News Feed ranking system.

Building a ranking algorithmTo understand how this works, let’s start with a hypothetical person logging in to Facebook: We’ll call him Juan.

On the other hand, perhaps Juan has previously engaged more with video content than photos, so the like prediction for Wei’s cocker spaniel photo might be lower.

Approximating the ideal ranking function in a scalable ranking systemNow that we know the theory behind ranking (as exemplified t…

7 месяцев, 3 недели назад @ engineering.fb.com
How Facebook keeps its large-scale infrastructure hardware up and running
How Facebook keeps its large-scale infrastructure hardware up and running How Facebook keeps its large-scale infrastructure hardware up and running

This is why we need to make sure our server hardware is reliable and that we can manage server hardware failures at our scale with as little disruption to our services as possible.

And we automate root cause analysis for hardware and system failures at scale to get to the bottom of issues quickly.

How we handle hardware remediationWe periodically run a tool called MachineChecker on each server to detect hardware and connectivity failures.

If the issue requires manual repair from a technician, the system creates a ticket in our repair ticketing system.

We have deployed this analyzer widely inside Facebook for the RCA on hardware component failure rate, unexpected server reboots, and software…

9 месяцев, 2 недели назад @ engineering.fb.com
neptune.ai neptune.ai
последний пост 2 часа назад
Depth Estimation Models with Fully Convolutional Residual Networks (FCRN)
Depth Estimation Models with Fully Convolutional Residual Networks (FCRN) Depth Estimation Models with Fully Convolutional Residual Networks (FCRN)

Depth estimation rendering for a video | Source: Deep Learning approach to Depth prediction, Google AIDifferent approaches for the same objectiveRecently, several approaches were engineered for depth estimation.

The proposed architecture includes fully convolutional layers, transpose-convolutions, and efficient residual up-sampling blocks that help keep track of high-dimensional regression problems.

Unsupervised Monocular Depth Estimation with Left-Right ConsistencyThis specific architecture is end-to-end and performs unsupervised monocular depth estimation without ground-truth data.

Originally, U-Net was built with two convolutional layers in each block and the number of filters for all co…

2 часа назад @ neptune.ai
Exploring Clustering Algorithms: Explanation and Use Cases
Exploring Clustering Algorithms: Explanation and Use Cases Exploring Clustering Algorithms: Explanation and Use Cases

Different cluster models are employed, and for each of these cluster models, different algorithms can be given.

Centroid-based clustering algorithms / Partitioning clustering algorithmsIn centroid/partitioning clustering, clusters are represented by a central vector, which may not necessarily be a member of the dataset.

Note: An example of K-Means clustering is explained with customer segmentation examples in the use cases section below.

Mini-Batch K-Means clustering algorithmK-Means is one of the popular clustering algorithms, mainly because of its good time performance.

SummaryThis blog covered the most critical aspects of clustering, image compression, digit classification, customer segm…

3 дня, 4 часа назад @ neptune.ai
MLOps Model Stores: Definition, Functionality, Tools Review
MLOps Model Stores: Definition, Functionality, Tools Review MLOps Model Stores: Definition, Functionality, Tools Review

Let’s explore what model stores are, how they help, and how to pick the right model store for your project.

In a model store, you have logging, discovery, examples, metadata, all you need, and the model store contains a model registry.

In terms of artifact management, model stores manage the model lifecycle, including packaging the model for staging or release.

Model metadataThe model metadata that you can find within the model store include:Model name set by the user.

ClearML Open Architecture Stack | SourceWhile a model store is not part of the ClearML application stack, you can build a custom model store on the open-source MLOps engine that pretty much provides the core functionalities o…

4 дня, 4 часа назад @ neptune.ai
Visualizing Machine Learning Models: Guide and Tools
Visualizing Machine Learning Models: Guide and Tools Visualizing Machine Learning Models: Guide and Tools

Read also How to Compare Machine Learning Models and AlgorithmsTeaching conceptsPerhaps teaching is where visualization is most useful, for educating novice users about fundamental concepts of machine learning.

Such visualization tools are really useful for those who only wish to use pretrained models to get predictions for their own tasks.

As described by its creators, Netron is a viewer tool for deep learning and machine learning models which can generate pretty descriptive visualization for the model’s architecture.

We’ve explored tools and frameworks for visualizing our model architecture, now let’s move on to the next part – training visualization.

We’ve covered a considerable number o…

4 дня, 22 часа назад @ neptune.ai
Version Control for ML Models: Why You Need It, What It Is, How To Implement It
Version Control for ML Models: Why You Need It, What It Is, How To Implement It Version Control for ML Models: Why You Need It, What It Is, How To Implement It

Alternative title suggestions:Guide: Version Control for ML ModelsWhy Do You Need Version Control In Machine Learning, and How To Implement It?

Why do we need version control in ML?

Machine learning version control typesThere are two types of ML version control:Centralized Version Control System.

A Distributed Version Control System (DVCS) is a version control system where the full codebase is available locally on the developer’s computer, including the history.

Distributed Version control [Source]A Centralized Version Control System (CVCS) is a version control where the developer has to check out the repository from a single centralized server containing all the files and file history.

1 неделя назад @ neptune.ai
How to Compare Machine Learning Models and Algorithms
How to Compare Machine Learning Models and Algorithms How to Compare Machine Learning Models and Algorithms

We need to narrow down on techniques by comparing machine learning models thoroughly with parallel experiments.

The goal of comparing machine learning algorithmsComparing machine learning algorithms is important in itself, but there are some not-so-obvious benefits of comparing various experiments effectively.

Parameters of machine learning algorithms and how to compare themLet’s dive right into analyzing and understanding how to compare the different characteristics of algorithms that can be used to sort and choose the best machine learning models.

Bias is the assumption used by machine learning models to make the learning process easier.

Final noteThere’s no scarcity of comparable techniq…

1 неделя, 4 дня назад @ neptune.ai
Should You Use Jupyter Notebooks in Production?
Should You Use Jupyter Notebooks in Production? Should You Use Jupyter Notebooks in Production?

In this article, we’re going to discuss Jupyter Notebooks and the use of Notebooks in production environments.

It gets easier to use Jupyter Notebooks as templates to generate reports by automating the process of notebook execution.

Jupyter notebooks can be scheduled as jobs over the cloud.

What to consider when choosing your production workflowReliability: Jupyter notebooks are more stable than they were years ago.

Many people believe and take it as undeniable truth saying Jupyter notebooks are just for experimenting and prototyping, but I don’t completely agree with them.

1 неделя, 5 дней назад @ neptune.ai
DVC Alternatives For Experiment Tracking
DVC Alternatives For Experiment Tracking DVC Alternatives For Experiment Tracking

So, in this article, we’re going to explore:Data Version Control (DVC),Best Alternatives to Data Version Control (DVC),Comparison of Experiment Tracking ToolsTo learn more about Experiment Tracking, check – ML Experiment Tracking: What It Is, Why It Matters, and How to Implement ItData Version Control (DVC)DVC is an open-source platform for machine learning projects.

Researchers and engineers use Neptune for experiment tracking and model registry to control their experimentation and model development.

Choosing the right ML experiment tracking tool for your workflowChoosing the right ML experiment tracking tool for your team can be hard.

First, let’s take a look at the things you need to con…

1 неделя, 6 дней назад @ neptune.ai
Object Detection with YOLO: Hands-on Tutorial
Object Detection with YOLO: Hands-on Tutorial Object Detection with YOLO: Hands-on Tutorial

Classification is a time-consuming operation, which is why the two-stage object detection approach performs slower compared to one-stage detection.

SSD and YOLO are one stage object detectors whereas Faster-RCNNand R-FCN are two-stage object detectors.

To know what object types a pre-trained YOLO model is able to detect, check out the coco_classes.txt file available in …/yolo-v4-tf.kers/class_names/.

How to Train Your Custom YOLO Object Detection ModelTask StatementTo design an object detection model, you need to know what object types you want to detect.

Model Object InitializationTo get ready for a training job, initialize the YOLOv4 model object.

2 недели, 3 дня назад @ neptune.ai
Training and Debugging Deep Convolutional Generative Adversarial Networks
Training and Debugging Deep Convolutional Generative Adversarial Networks Training and Debugging Deep Convolutional Generative Adversarial Networks

Adversarial networks (Deep Convolutional Generative Adversarial Networks) have been a very active playground lately for Deep Learning practitioners.

These architectures were first introduced in the paper Unsupervised Representational Learning With Deep Convolutional Generative Adversarial Networks.

import torchvision.datasets as datasets def data_preprocessing (root_dir, batch_size= 128 , image-size= 64 , num_workers= 2 ) : data = datasets.ImageFolder(root=root_dir, transform=transforms.Compose([ transforms.Resize(image_size), transforms.CenterCrop(image_size), transforms.ToTensor(), transforms.Normalize(( 0.5 , 0.5 , 0.5 ), ( 0.5 , 0.5 , 0.5 )) ])) dataloader = torch.utils.data.DataLoader(…

3 недели, 3 дня назад @ neptune.ai
Best Data Lineage Tools
Best Data Lineage Tools Best Data Lineage Tools

In this article, we’re going to explore what data lineage means in machine learning, and see several paid and open-source data lineage tools.

Learn more Data Lineage in Machine Learning: Methods and Best PracticesCriteria for choosing Data Lineage toolsData lineage tools help you visualize and manage the whole journey of your data.

Open-source: open source data lineage tools have the advantage of being free to use and are constantly being improved on.

Big Data Handling: many machine learning models require big data, so data lineage tools should be able to handle and process big data efficiently.

SPLINESpline (SPark LINEage) is a free, open-source tool for automated data lineage tracking and…

3 недели, 4 дня назад @ neptune.ai
Data Lineage in Machine Learning: Methods and Best Practices
Data Lineage in Machine Learning: Methods and Best Practices Data Lineage in Machine Learning: Methods and Best Practices

Data lineage vs. data provenanceData lineage is often confused with data provenance, as the difference is quite subtle and easy to miss.

So, Data Lineage takes care of data about the data (metadata), while Data Provenance takes care of the information about the processes that influence the data.

Data Lineage can help in combining new data with older relevant data such that the data consumers like developers, business teams, and stakeholders can derive the maximum value from the data assets.

Since several departments in an organization rely on Data Lineage, let’s get a closer view of the dependents to understand the need for Data Lineage further.

Data lineage across the pipelineTo capture en…

3 недели, 4 дня назад @ neptune.ai
Doing ML Model Performance Monitoring The Right Way
Doing ML Model Performance Monitoring The Right Way Doing ML Model Performance Monitoring The Right Way

These tools provide useful statistics and model performance details that provide deep insights and help you improve model performance.

If we know the weak points of our model, we can plan a course of action that won’t hurt model performance.

Set realistic goalsOne of the key steps when designing a machine learning model is choosing appropriate metrics for evaluating model performance.

In particular, does a decrease in model performance necessarily mean that the model is performing worse?

These tools have excellent visualization capabilities and track different model performance statistics (check this blog post on model monitoring tools for more details).

3 недели, 5 дней назад @ neptune.ai
Balanced Accuracy: When Should You Use It?
Balanced Accuracy: When Should You Use It? Balanced Accuracy: When Should You Use It?

Balanced Accuracy = (Sensitivity + Specificity) / 2 = 40 + 98.92 / 2 = 69.46 %Balanced Accuracy does a great job because we want to identify the positives present in our classifier.

Balanced Accuracy Multiclass ClassificationAs it goes for binary, Balanced Accuracy is also useful for multiclass classification.

F1-Score and Balanced Accuracy will be:Precision = 5 / 15 = 0.33 Sensitivity = 5 / 10 = 0.5 Specificity = 990 / 1000 = 0.99 F1-score = 2 * ( 0.5 * 0.33 ) / ( 0.5 + 0.33 ) = 0.4 Balanced Accuracy = ( 0.5 + 0.99 ) / 2 = 0.745You can see that balanced accuracy still cares more about the negative in the data than F1.

Balanced Accuracy vs ROC_AUCHow is Balanced Accuracy different from roc_…

1 месяц назад @ neptune.ai
Object Detection Algorithms and Libraries
Object Detection Algorithms and Libraries Object Detection Algorithms and Libraries

Face detection and recognition – as previously discussed, one of the major applications of object detection is face detection and recognition.

Object detection algorithmsSince the popularization of deep learning in the early 2010s, there’s been a continuous progression and improvement in the quality of algorithms used to solve object detection.

→ Working process of YOLOYOLO – Object Detection Algorithm | SourceThe YOLO architecture utilizes three primary terminologies to achieve its goal of object detection.

RetinaNet→ IntroductionThe RetinaNet model introduced in 2017 became one of the best models with single-shot object detection capabilities that could surpass other popular object detect…

1 месяц назад @ neptune.ai
▶️ YouTube
Yannic Kilcher Yannic Kilcher
последний пост 3 дня, 23 часа назад
[ML News] Roomba Avoids Poop | Textless NLP | TikTok Algorithm Secrets | New Schmidhuber Blog
[ML News] Roomba Avoids Poop | Textless NLP | TikTok Algorithm Secrets | New Schmidhuber Blog [ML News] Roomba Avoids Poop | Textless NLP | TikTok Algorithm Secrets | New Schmidhuber Blog

#schmidhuber #tiktok #roomba Your regularly irregluar update on what's happening in the world of Machine Learning. OUTLINE:

0:00 - Intro

0:15 - Sponsor: Weights & Biases

1:55 - ML YouTuber reaches 100k subscribers

2:40 - Facebook AI pushes Textless NLP

5:30 - Schmidhuber blog post: I invented everything

7:55 - TikTok algorithm rabbitholes users

10:45 - Roomba learns to avoid poop

11:50 - AI can spot art forgeries

14:55 - Deepmind's plans to separate from Google

16:15 - Cohere raises 40M

16:55 - US Judge rejects AI inventor on patent

17:55 - Altman: GPT-4 not much bigger than GPT-3

18:45 - Salesforce CodeT5

19:45 - DeepMind Reinforcement Learning Lecture Series

20:15 - WikiGraphs Dataset

20:…

3 дня, 23 часа назад @ youtube.com
Celebrating 100k Subscribers! (w/ Channel Statistics)
Celebrating 100k Subscribers! (w/ Channel Statistics) Celebrating 100k Subscribers! (w/ Channel Statistics)

#yannickilcher #machinelearning #100k OUTLINE:

0:00 - 100k!

1:00 - Announcements & Thanks

3:55 - Channel Statistics Links:

TabNine Code Completion (Referral): http://bit.ly/tabnine-yannick

YouTube: https://www.youtube.com/c/yannickilcher

Twitter: https://twitter.com/ykilcher

Discord: https://discord.gg/4H8xxDF

BitChute: https://www.bitchute.com/channel/yannic-kilcher

Minds: https://www.minds.com/ykilcher

Parler: https://parler.com/profile/YannicKilcher

LinkedIn: https://www.linkedin.com/in/yannic-kilcher-488534136/

BiliBili: https://space.bilibili.com/1824646584 If you want to support me, the best thing to do is to share out the content :) If you want to support me financially (completely o…

6 дней, 7 часов назад @ youtube.com
[ML News] AI predicts race from X-Ray | Google kills HealthStreams | Boosting Search with MuZero
[ML News] AI predicts race from X-Ray | Google kills HealthStreams | Boosting Search with MuZero [ML News] AI predicts race from X-Ray | Google kills HealthStreams | Boosting Search with MuZero

#mlnews #schmidhuber #muzero Your regular updates on what's happening in the ML world! OUTLINE:

0:00 - Intro

0:15 - Sponsor: Weights & Biases

1:45 - Google shuts down health streams

4:25 - AI predicts race from blurry X-Rays

7:35 - Facebook labels black men as primates

11:05 - Distill papers on Graph Neural Networks

11:50 - Jürgen Schmidhuber to lead KAUST AI Initiative

12:35 - GitHub brief on DMCA notices for source code

14:55 - Helpful Reddit Threads

19:40 - Simple Tricks to improve Transformers

20:40 - Apple's Unconstrained Scene Generation

21:40 - Common Objects in 3D dataset

22:20 - WarpDrive Multi-Agent RL framework

23:10 - My new paper: Boosting Search Agents & MuZero

25:15 - Can AI …

1 неделя, 2 дня назад @ youtube.com
∞-former: Infinite Memory Transformer (aka Infty-Former / Infinity-Former, Research Paper Explained)
∞-former: Infinite Memory Transformer (aka Infty-Former / Infinity-Former, Research Paper Explained) ∞-former: Infinite Memory Transformer (aka Infty-Former / Infinity-Former, Research Paper Explained)

#inftyformer #infinityformer #transformer Vanilla Transformers are excellent sequence models, but suffer from very harsch constraints on the length of the sequences they can process. Several attempts have been made to extend the Transformer's sequence length, but few have successfully gone beyond a constant factor improvement. This paper presents a method, based on continuous attention mechanisms, to attend to an unbounded past sequence by representing the past as a continuous signal, rather than a sequence. This enables the Infty-Former to effectively enrich the current context with global information, which increases performance on long-range dependencies in sequence tasks. Further, the p…

2 недели назад @ youtube.com
[ML News] Blind Chess AI Competition | Graph NNs for traffic | AI gift suggestions
[ML News] Blind Chess AI Competition | Graph NNs for traffic | AI gift suggestions [ML News] Blind Chess AI Competition | Graph NNs for traffic | AI gift suggestions

#mlnews #chess #neurips OUTLINE:

0:00 - Intro

0:30 - Reconnaissance Blind Chess NeurIPS 2021 Competition

3:40 - Colab Pro no longer top priority for GPUs

4:45 - DeepMind uses Graph NNs to do traffic prediction

6:00 - Helpful Libraries: Isaac Gym, Differentiable Human, LVIS, BEHAVIOR

10:25 - Cerebras Wafer Scale Engine Cluster

12:15 - AI Voice Synthesis for Val Kilmer

14:20 - Can AI give thoughtful gifts? References:

Reconnaissance Blind Chess NeurIPS 2021 Competition

https://rbc.jhuapl.edu/

https://rbc.jhuapl.edu/gameRules Colab Pro no longer top priority

https://www.reddit.com/r/MachineLearning/comments/pdwxxz/d_colab_pro_no_longer_gives_you_a_v100_not_even_a/ Google Maps ETA prediction us…

2 недели, 2 дня назад @ youtube.com
ALiBi - Train Short, Test Long: Attention with linear biases enables input length extrapolation
ALiBi - Train Short, Test Long: Attention with linear biases enables input length extrapolation ALiBi - Train Short, Test Long: Attention with linear biases enables input length extrapolation

#alibi #transformers #attention Transformers are essentially set models that need additional inputs to make sense of sequence data. The most widespread additional inputs are position encodings or position embeddings, which add sequence index information in various forms. However, this has put a limit on the resulting model, which cannot run inference on sequences longer than it has been trained on, as it would encounter unfamiliar position encodings. ALiBi solves this by proposing simple linear fixed biases as position information, adding negligible overhead in time and memory, but surprisingly, the resulting model is able to handle inference on sequences many times as long as its training …

2 недели, 3 дня назад @ youtube.com
[ML News] Stanford HAI coins Foundation Models & High-profile case of plagiarism uncovered
[ML News] Stanford HAI coins Foundation Models & High-profile case of plagiarism uncovered [ML News] Stanford HAI coins Foundation Models & High-profile case of plagiarism uncovered

#plagiarism #foundationmodels #tesla The best place to keep up to date with the latest and greatest from the ML world! OUTLINE:

0:00 - Intro & Sponsor

3:15 - A high-profile case of plagiarism shocks the ML world

11:55 - Stanford AI releases paper on "Foundation Models"

19:45 - Updates on Apple's NeuralHash

20:45 - RL control for two-player splorts

21:45 - Tesla's AI Day

23:55 - COMMA THREE announced

24:40 - Intel winding down RealSense cameras

25:20 - IBM unveils Telum Processor

25:50 - Lux AI Challenge & Neural MMO Challenge

26:50 - Dribnet's CLIP PixelArt

27:40 - Multi-Agent RL papers are mostly fake

28:50 - I can't even come up with a segment title

29:25 - AI News Questions

31:20 - Frame…

3 недели, 3 дня назад @ youtube.com
Fastformer: Additive Attention Can Be All You Need (Machine Learning Research Paper Explained)
Fastformer: Additive Attention Can Be All You Need (Machine Learning Research Paper Explained) Fastformer: Additive Attention Can Be All You Need (Machine Learning Research Paper Explained)

#attention #transformer #fastformer Transformers have become the dominant model class in the last few years for large data, but their quadratic complexity in terms of sequence length has plagued them until now. Fastformer claims to be the fastest and most performant linear attention variant, able to consume long contexts at once. This is achieved by a combination of additive attention and elementwise products. While initial results look promising, I have my reservations... OUTLINE:

0:00 - Intro & Outline

2:15 - Fastformer description

5:20 - Baseline: Classic Attention

10:00 - Fastformer architecture

12:50 - Additive Attention

18:05 - Query-Key element-wise multiplication

21:35 - Redundant m…

3 недели, 4 дня назад @ youtube.com
PonderNet: Learning to Ponder (Machine Learning Research Paper Explained)
PonderNet: Learning to Ponder (Machine Learning Research Paper Explained) PonderNet: Learning to Ponder (Machine Learning Research Paper Explained)

#pondernet #deepmind #machinelearning Humans don't spend the same amount of mental effort on all problems equally. Instead, we respond quickly to easy tasks, and we take our time to deliberate hard tasks. DeepMind's PonderNet attempts to achieve the same by dynamically deciding how many computation steps to allocate to any single input sample. This is done via a recurrent architecture and a trainable function that computes a halting probability. The resulting model performs well in dynamic computation tasks and is surprisingly robust to different hyperparameter settings. OUTLINE:

0:00 - Intro & Overview

2:30 - Problem Statement

8:00 - Probabilistic formulation of dynamic halting

14:40 - Tra…

4 недели назад @ youtube.com
NeuralHash is BROKEN | How to evade Apple's detection and forge hash collisions (w/ Code)
NeuralHash is BROKEN | How to evade Apple's detection and forge hash collisions (w/ Code) NeuralHash is BROKEN | How to evade Apple's detection and forge hash collisions (w/ Code)

#apple #icloud #neuralhash Send your Apple fanboy friends to prison with this one simple trick ;) We break Apple's NeuralHash algorithm used to detect CSAM for iCloud photos. I show how it's possible to craft arbitrary hash collisions from any source / target image pair using an adversarial example attack. This can be used for many purposes, such as evading detection, or forging false positives, triggering manual reviews. OUTLINE:

0:00 - Intro

1:30 - Forced Hash Collisions via Adversarial Attacks

2:30 - My Successful Attack

5:40 - Results

7:15 - Discussion DISCLAIMER: This is for demonstration and educational purposes only. This is not an endorsement of illegal activity or circumvention of …

1 месяц назад @ youtube.com
[ML News] Nvidia renders CEO | Jurassic-1 larger than GPT-3 | Tortured Phrases reveal Plagiarism
[ML News] Nvidia renders CEO | Jurassic-1 larger than GPT-3 | Tortured Phrases reveal Plagiarism [ML News] Nvidia renders CEO | Jurassic-1 larger than GPT-3 | Tortured Phrases reveal Plagiarism

#mlnews #nvidia #openai An in-depth look over what's going on in the world of Machine Learning and Artificial intelligence. Subscribe now and make Monday the best day of the week! OUTLINE:

0:00 - Intro

0:20 - Sponsor: Weights & Biases

3:00 - Nvidia's CEO was rendered during Keynote

5:00 - AI21 Labs releases Jurassic-1 language model

7:00 - Tortured Phrases reveal plagiarism

10:05 - Cortical neurons are computationally complex

11:55 - OpenAI Codex Update & Challenge

13:30 - Automated drug abuse prevention gone wrong

17:55 - Rapid News Questions

18:40 - SoundStream learned neural audio codec

19:40 - RoboMimic framework for robotics research

20:05 - Droidlet framework for agent training

20:40 …

1 месяц назад @ youtube.com
How Apple scans your phone (and how to evade it) - NeuralHash CSAM Detection Algorithm Explained
How Apple scans your phone (and how to evade it) - NeuralHash CSAM Detection Algorithm Explained How Apple scans your phone (and how to evade it) - NeuralHash CSAM Detection Algorithm Explained

#apple #icloud #privacy Apple recently announced scanning all images uploaded to iCloud for CSAM (child abuse material), and that this scan would happen locally on users' phones. We take a look at the technical report and explore how the system works in detail, how it is designed to preserve user privacy, and what weak points it still has. OUTLINE:

0:00 - Introduction

3:05 - System Requirements

9:15 - System Overview

14:00 - NeuralHash

20:45 - Private Set Intersection

31:15 - Threshold Secret Sharing

35:25 - Synthetic Match Vouchers

38:20 - Problem 1: Who controls the database?

42:40 - Problem 2: Adversarial Attacks

49:40 - Comments & Conclusion Paper: https://www.apple.com/child-safety/pdf…

1 месяц назад @ youtube.com
[ML NEWS] Apple scans your phone | Master Faces beat face recognition | WALL-E is real
[ML NEWS] Apple scans your phone | Master Faces beat face recognition | WALL-E is real [ML NEWS] Apple scans your phone | Master Faces beat face recognition | WALL-E is real

#mlnews #apple #nolamarck Your update on the latest news in the AI and Machine Learning world. OUTLINE:

0:00 - Intro

0:15 - Sponsor: Weights & Biases

3:30 - Apple to scan iDevices for illegal content

14:10 - EU approves chatcontrol

15:20 - Machine Learning FAQ book

17:40 - TimeDial & Disfl-QA Conversation Datasets

20:30 - VoxPopuli Speech Dataset

21:00 - Google Tensor chip coming to Pixel 6

21:30 - Pentagon uses AI to predict events

23:10 - Sketch your own GAN

24:45 - Can a Fruit Fly learn Word Embeddings?

26:00 - Master Faces beat facial recognition system

27:25 - PyTorch profiler 1.9

27:55 - 0 A.D. gets reinforcement learning interface

28:40 - BeatBot cleans up cigarette butts on the beac…

1 месяц, 1 неделя назад @ youtube.com
[Live] OpenAI Codex Challenge
[Live] OpenAI Codex Challenge [Live] OpenAI Codex Challenge

We solve the OpenAI Codex Challenge

https://challenge.openai.com/ Links:

TabNine Code Completion (Referral): http://bit.ly/tabnine-yannick

YouTube: https://www.youtube.com/c/yannickilcher

Twitter: https://twitter.com/ykilcher

Discord: https://discord.gg/4H8xxDF

BitChute: https://www.bitchute.com/channel/yannic-kilcher

Minds: https://www.minds.com/ykilcher

Parler: https://parler.com/profile/YannicKilcher

LinkedIn: https://www.linkedin.com/in/yannic-kilcher-488534136/

BiliBili: https://space.bilibili.com/1824646584 If you want to support me, the best thing to do is to share out the content :) If you want to support me financially (completely optional and voluntary, but a lot of people have as…

1 месяц, 1 неделя назад @ youtube.com
[ML News] AI-generated patent approved | Germany gets an analog to OpenAI | ML cheats video games
[ML News] AI-generated patent approved | Germany gets an analog to OpenAI | ML cheats video games [ML News] AI-generated patent approved | Germany gets an analog to OpenAI | ML cheats video games

#mlnews #dabus #alephalpha OUTLINE:

0:00 - Intro

0:20 - Sponsor: Weights & Biases

3:45 - AI legally recognized as patent inventor

8:35 - Alpeh Alpha raises USD 27Mio to build European OpenAI

10:20 - AMP advances AI aided recycling

11:20 - DeepMind builds XLand RL environment

13:15 - Cognitive Behavioral Therapy as an app

16:15 - Wordcraft interactive AI text editor

17:05 - ML used to cheat in console games

18:10 - Google's OpenBuildings Dataset

20:00 - Most ML COVID tools are flawed

21:10 - DALL-E mini released

21:55 - Helpful Libraries

25:20 - FSF funds papers discussing CoPilot SPONSOR: Weights & Biases

https://wandb.ai References:

AI legally recognized as patent inventor

https://www.glob…

1 месяц, 2 недели назад @ youtube.com
Henry AI Labs Henry AI Labs
последний пост 4 дня, 2 часа назад
Robust Fine-Tuning of Zero-Shot Models
Robust Fine-Tuning of Zero-Shot Models Robust Fine-Tuning of Zero-Shot Models

Researchers from the University of Washington, Columbia University, Open AI, the Allen Institute of Artificial Intelligence, and Toyota Research have teamed up to present a new method for fine-tuning these pre-trained models such as GPT-3, BERT, DALL-E, EfficientNet, or CLIP for application specific datasets. The key insight is that as you fine-tune these models, you gain in-distribution accuracy, but sacrifice the zero-shot flexibility, or out-of-distribution generalization, of these pre-trained “foundation” models. The authors present Weight-Space Ensembling, where you take a linear interpolation between the weights of the zero-shot and fine-tuned model to make new inference. This achieve…

4 дня, 2 часа назад @ youtube.com
Robust fine-tuning of zero-shot models
Robust fine-tuning of zero-shot models Robust fine-tuning of zero-shot models

Researchers from the University of Washington, Columbia University, Open AI, the Allen Institute of Artificial Intelligence, and Toyota Research have teamed up to present a new method for fine-tuning these pre-trained models such as GPT-3, BERT, DALL-E, EfficientNet, or CLIP for application specific datasets. The key insight is that as you fine-tune these models, you gain in-distribution accuracy, but sacrifice the zero-shot flexibility, or out-of-distribution generalization, of these pre-trained “foundation” models. The authors present Weight-Space Ensembling, where you take a linear interpolation between the weights of the zero-shot and fine-tuned model to make new inference. This achieve…

5 дней назад @ youtube.com
Generalization in Open-Domain Question Answering
Generalization in Open-Domain Question Answering Generalization in Open-Domain Question Answering

AI Weekly Update Notion Page: https://ebony-scissor-725.notion.site/AI-Weekly-Update-September-8th-2021-a2119851b5b74470b4971d064665e77e

New AI Weekly Update GitHub Repo: https://github.com/CShorten/AIWeeklyUpdates I am looking for collaborators for two survey papers on Text-to-Image Generation and Data Augmentation Controllers. If you are interested in helping out and being a co-author of either paper, please send me a quick overview of what you think about the topic to cshorten2015@fau.edu! Content Links in the Video:

Challenges in Generalization in Open Domain Question Answering: https://arxiv.org/pdf/2109.01156.pdf

Hurdles to Progress in Long-Form Question Answering: https://arxiv.org/p…

1 неделя, 4 дня назад @ youtube.com
AI Weekly Update - August 7th, 2021 (#40)
AI Weekly Update - August 7th, 2021 (#40) AI Weekly Update - August 7th, 2021 (#40)

Notion Link: https://ebony-scissor-725.notion.site/AI-Weekly-Update-August-7th-2021-3b21331c5c6a45e1a5638955dda7923c Chapters:

0:00 Introduction

0:06 Open-Ended Learning

2:38 Persistent Reinforcement Learning

3:34 Domain-Matched Pre-training for Retrieval

4:22 Growing Knowledge Culturally

5:06 Language Grounding with 3D Objects

6:03 Pre-train, Prompt, and Predict

6:32 QA Dataset Explosion

7:32 ProtoTransformer

9:32 Don’t sweep your Learning Rate under the Rug

10:34 AAVAE

11:40 Domain-Agnostic Contrastive Learning

12:43 Dataset Distillation

14:04 Pointer Value Retrieval

16:01 Go Wider Instead of Deeper

16:52 Geometric Deep Learning on Molecules

18:37 Simulation Framework for Label Noise

19:5…

1 месяц, 2 недели назад @ youtube.com
Reasoning with Language Models - Turning Tables
Reasoning with Language Models - Turning Tables Reasoning with Language Models - Turning Tables

Notion Link: https://ebony-scissor-725.notion.site/Henry-AI-Labs-Weekly-Update-July-22nd-2021-0c43042b93a3459c901f7f5973b949bf Thanks for watching! Please Subscribe!

1 месяц, 4 недели назад @ youtube.com
Deduplicating Training Data makes Language Models Better
Deduplicating Training Data makes Language Models Better Deduplicating Training Data makes Language Models Better

Notion Link: https://ebony-scissor-725.notion.site/Henry-AI-Labs-Weekly-Update-July-22nd-2021-0c43042b93a3459c901f7f5973b949bf Follow Katherine Lee on Twitter @katherine1ee

Twitter Thread Link: https://twitter.com/katherine1ee/status/1415496898241339400 Thanks for watching! Please Subscribe!

1 месяц, 4 недели назад @ youtube.com
Using HTML for Language Modeling
Using HTML for Language Modeling Using HTML for Language Modeling

Notion Link: https://ebony-scissor-725.notion.site/Henry-AI-Labs-Weekly-Update-July-22nd-2021-0c43042b93a3459c901f7f5973b949bf Thanks for watching! Please Subscribe!

1 месяц, 4 недели назад @ youtube.com
Writing with AI - Wordcraft Text Editor
Writing with AI - Wordcraft Text Editor Writing with AI - Wordcraft Text Editor

Notion Link: https://ebony-scissor-725.notion.site/Henry-AI-Labs-Weekly-Update-July-22nd-2021-0c43042b93a3459c901f7f5973b949bf Demo Video: https://www.youtube.com/watch?v=9p4mfA0Fyd8 Thanks for watching! Please Subscribe!

1 месяц, 4 недели назад @ youtube.com
Internet-Augmented Dialogue Generation
Internet-Augmented Dialogue Generation Internet-Augmented Dialogue Generation

Notion Link: https://ebony-scissor-725.notion.site/Henry-AI-Labs-Weekly-Update-July-22nd-2021-0c43042b93a3459c901f7f5973b949bf Thank you for watching! Please Subscribe!

1 месяц, 4 недели назад @ youtube.com
Beyond Goldfish Memory!
Beyond Goldfish Memory! Beyond Goldfish Memory!

Notion Link: https://ebony-scissor-725.notion.site/Henry-AI-Labs-Weekly-Update-July-22nd-2021-0c43042b93a3459c901f7f5973b949bf Thumbnail Goldfish Image Credit - Photo by zhengtao tang on Unsplash Thanks for watching! Please Subscribe! Chapters:

0:00 Introduction

1:54 Multi-Session Chat

4:12 Models

6:40 Drawbacks of Info Retrieval

9:40 Memory Augmented Models

10:09 Comparison with Language Models

1 месяц, 4 недели назад @ youtube.com
Blender Bot 2.0
Blender Bot 2.0 Blender Bot 2.0

Dienste anbieten und betreiben, z.

Personalisierte Inhalte und Werbeanzeigen können ebenfalls darauf basieren, darüber hinaus aber auch auf Aktivitäten wie Suchanfragen bei Google und Videos, die Sie sich bei YouTube ansehen.

Zu personalisierten Inhalten und Werbeanzeigen gehören beispielsweise Dinge wie relevantere Ergebnisse und Empfehlungen, eine individuelle YouTube-Startseite und Werbung, die auf Ihre Interessen zugeschnitten ist.

Klicken Sie auf „Anpassen“, um sich Ihre Möglichkeiten anzusehen.

Zu diesen gehören zum Beispiel Steuerelemente, um Cookies für die Personalisierung zu deaktivieren, oder Informationen zu Steuerelementen auf Browserebene, mit denen einige oder alle Cookies fü…

1 месяц, 4 недели назад @ youtube.com
AI Weekly Update Overview - July 22nd, 2021 (#39)
AI Weekly Update Overview - July 22nd, 2021 (#39) AI Weekly Update Overview - July 22nd, 2021 (#39)

Notion Link: https://ebony-scissor-725.notion.site/Henry-AI-Labs-Weekly-Update-July-22nd-2021-0c43042b93a3459c901f7f5973b949bf Thank you for watching! Please Subscribe! Chapters

0:00 Introduction

0:13 BlenderBot 2.0

2:28 Wordcraft

3:20 Hyper-Text Pre-Training and Prompting

4:18 Deduplicating Training Data

5:00 Turning Tables

6:26 Reasoning with LMs and KGs

7:06 FLEX: Few-Shot NLP

8:33 AlphaFold2

9:27 Semi-Supervised Learning in Action

10:18 AI-Generating Algorithms

10:52 Adaptable Agent Populations

11:36 Recurrent Parameter Generators

12:17 Conservative Objective Models

13:20 Representation Learning for OOD Robots

13:54 MultiBench

14:40 Align before Fuse

15:06 CLIP Benefit in V-L tasks

15:4…

2 месяца назад @ youtube.com
Determined AI + HuggingFace!
Determined AI + HuggingFace! Determined AI + HuggingFace!

This video will present the Determined AI Model Hub for using HuggingFace transformers, tokenizers, and datasets with the Determined training platform! I hope you find this video useful! Overview of Determined on Henry AI Labs:

https://www.youtube.com/watch?v=9wbn2ikhbpg

Challenges of Advanced Hyperparameter Search on Henry AI Labs:

https://www.youtube.com/watch?v=5F5LlmO10AM

Walkthrough of the Determined CIFAR-10 Example on Henry AI Labs:

https://www.youtube.com/watch?v=WEYu8DI4LOU Determined Transformers Examples: https://docs.determined.ai/latest/model-hub/transformers/examples.html

HuggingFace Models: https://huggingface.co/models

HuggingFace Datasets: https://huggingface.co/datasets/sw…

2 месяца назад @ youtube.com
MultiCite - New Research in Scientific Literature Mining!
MultiCite - New Research in Scientific Literature Mining! MultiCite - New Research in Scientific Literature Mining!

Notion Link: https://ebony-scissor-725.notion.site/Henry-AI-Labs-Weekly-Update-July-15th-2021-a68f599395e3428c878dc74c5f0e1124 Thanks for watching! Please Subscribe!

2 месяца назад @ youtube.com
Collaboration of Experts
Collaboration of Experts Collaboration of Experts

Notion Link: https://ebony-scissor-725.notion.site/Henry-AI-Labs-Weekly-Update-July-15th-2021-a68f599395e3428c878dc74c5f0e1124 Thanks for watching! Please Subscribe!

2 месяца назад @ youtube.com
3blue1brown 3blue1brown
последний пост 2 месяца назад
Why aren't you making math videos? (Also, now there's a 3b1b podcast)
Why aren't you making math videos?  (Also, now there's a 3b1b podcast) Why aren't you making math videos? (Also, now there's a 3b1b podcast)

Learn more and submit: https://3b1b.co/SoME1

Podcast/New channel: https://youtu.be/C-i4q-Xlnis

↓↓Things referenced through the video↓↓ Join the discord channel:

https://discord.gg/SRTErdZ9 James Schloss:

https://www.youtube.com/user/LeiosOS Free will theorem:

https://www.ams.org/notices/200902/rtx090200226p.pdf Kolmogorov complexity and primes:

https://people.cs.uchicago.edu/~fortnow/papers/kaikoura.pdf Tadashi Tokieda talk:

https://youtu.be/tQQ3oiB32GI Boarbarktree:

https://www.youtube.com/channel/UCFeIEAkqvS4fJMTwUtF4OFw Mathologer:

https://youtu.be/N-KXStupwsc Manim:

https://github.com/3b1b/manim Manim Community edition:

https://github.com/ManimCommunity/manim/ Reanimate:

https://github.…

2 месяца назад @ youtube.com
A quick trick for computing eigenvalues | Essence of linear algebra, chapter 15
A quick trick for computing eigenvalues | Essence of linear algebra, chapter 15 A quick trick for computing eigenvalues | Essence of linear algebra, chapter 15

How to write the eigenvalues of a 2x2 matrix just by looking at it.

Thanks to Tim for the jingle: https://www.youtube.com/acapellascience

Help fund future projects: https://www.patreon.com/3blue1brown​

An equally valuable form of support is to simply share the videos.

Special thanks to these supporters: https://3b1b.co/quick-eigen-thanks Introduction to eigenvectors and eigenvalues:

https://youtu.be/PFDu9oVAE-g Lockdown math lecture talking about the mean product formula:

https://youtu.be/MHXO86wKeDY Timestamps:

0:00 - Background

4:53 - Examples

10:24 - Relation to the characteristic polynomial

12:00 - Last thoughts ------------------ These animations are largely made using a custom python …

4 месяца, 2 недели назад @ youtube.com
How (and why) to raise e to the power of a matrix | DE6
How (and why) to raise e to the power of a matrix | DE6 How (and why) to raise e to the power of a matrix | DE6

General exponentials, Love, Schrödinger, and more.

Home page: https://www.3blue1brown.com

Brought to you by you: https://3b1b.co/thanks ------------------

The Romeo-Juliet example is based on this essay by Steven Strogatz:

http://www.stevenstrogatz.com/essays/loves-me-loves-me-not-do-the-math The book shown at the start is Vladimir Arnold's (excellent) textbook on ordinary differential equations.

https://amzn.to/3dtXSwj Need a review of ordinary powers of e?

https://youtu.be/m2MIpDrF7Es Or of linear algebra?

https://youtu.be/kYB8IZa5AuE Timetable

0:00 - Definition

6:40 - Dynamics of love

13:17 - General equation

20:03 - On general rotations

22:11 - Visualizing with flow ------------------

C…

5 месяцев, 3 недели назад @ youtube.com
The medical test paradox: Can redesigning Bayes rule help?
The medical test paradox: Can redesigning Bayes rule help? The medical test paradox: Can redesigning Bayes rule help?

Bayes factors, aka Likelihood Ratios*, offer a very clear view of how medical test probabilities work.

Home page: https://www.3blue1brown.com

Brought to you by you: https://3b1b.co/bayes-factor-thanks The book by my friend Matt Cook about paradoxes mentioned at then end:

https://amzn.to/3aBrEzg On the topic, I can't help also mentioning another paradox book I'm rather fond of by Bunch:

https://amzn.to/3mBDSKE *As mentioned in the on-screen note at the end, while the terms "Bayes Factor" and "Likelihood Ratio" refer to the same term in this setting, where Bayes rule is used on the probability of an event with only two possible outcomes (you either have the disease or you don't), they do take…

9 месяцев назад @ youtube.com
Two Minute Papers Two Minute Papers
последний пост 1 день, 23 часа назад
This AI Makes Digital Copies of Humans! 👤
This AI Makes Digital Copies of Humans! 👤 This AI Makes Digital Copies of Humans! 👤

❤️ Check out the Gradient Dissent podcast by Weights & Biases: http://wandb.me/gd 📝 The paper "The Relightables: Volumetric Performance Capture of Humans with Realistic Relighting" is available here:

https://augmentedperception.github.io/therelightables/ 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Aleksandr Mashrabov, Alex Haro, Andrew Melnychuk, Angelos Evripiotis, Benji Rabhan, Bryan Learn, Christian Ahlin, Eric Haddad, Eric Martel, Gordon Child, Ivo Galic, Jace O'Brien, Javier Bustamante, John Le, Jonas, Kenneth Davis, Klaus Busse, Lorin Atzberger, Lukas Biewald, Matthew Allen Fisher, Mark Oates, Michael Albrecht, Nikhil Velpanur, Owen Ca…

1 день, 23 часа назад @ youtube.com
Meet Your Virtual Level Designer! 🎮
Meet Your Virtual Level Designer! 🎮 Meet Your Virtual Level Designer! 🎮

❤️ Check out Weights & Biases and sign up for a free demo here: https://wandb.com/papers ❤️ Their mentioned post is available here: https://wandb.ai/ayush-thakur/interpretability/reports/Interpretability-in-Deep-Learning-With-W-B-CAM-and-GradCAM--Vmlldzo5MTIyNw 📝 The paper "Adversarial Reinforcement Learning for Procedural Content Generation" is available here:

https://www.ea.com/seed/news/cog2021-adversarial-rl-content-generation 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Aleksandr Mashrabov, Alex Haro, Andrew Melnychuk, Angelos Evripiotis, Benji Rabhan, Bryan Learn, Christian Ahlin, Eric Haddad, Eric Martel, Gordon Child, Ivo Galic, Jace …

4 дня, 23 часа назад @ youtube.com
OpenAI Codex: Just Say What You Want! 🤖
OpenAI Codex: Just Say What You Want! 🤖 OpenAI Codex: Just Say What You Want! 🤖

❤️ Check out Perceptilabs and sign up for a free demo here: https://www.perceptilabs.com/papers 📝 The paper "Evaluating Large Language Models Trained on Code" is available here:

https://openai.com/blog/openai-codex/ Codex tweet/application links:

https://twitter.com/CristiVlad25/status/1432017112885833734

https://twitter.com/slava__bobrov/status/1425904829013102602

https://www.youtube.com/watch?v=MvHbrVfEuyk GPT-3 tweet/application links:

Website layout: https://twitter.com/sharifshameem/status/1283322990625607681

Plots: https://twitter.com/aquariusacquah/status/1285415144017797126?s=12

Typesetting math: https://twitter.com/sh_reya/status/1284746918959239168

Population data: https://twitter…

1 неделя, 2 дня назад @ youtube.com
Watch Tesla’s Self-Driving Car Learn In a Simulation! 🚘
Watch Tesla’s Self-Driving Car Learn In a Simulation! 🚘 Watch Tesla’s Self-Driving Car Learn In a Simulation! 🚘

❤️ Check out Fully Connected by Weights & Biases: https://wandb.me/papers 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Aleksandr Mashrabov, Alex Haro, Andrew Melnychuk, Angelos Evripiotis, Benji Rabhan, Bryan Learn, Christian Ahlin, Eric Haddad, Eric Martel, Gordon Child, Ivo Galic, Jace O'Brien, Javier Bustamante, John Le, Jonas, Kenneth Davis, Klaus Busse, Lorin Atzberger, Lukas Biewald, Matthew Allen Fisher, Mark Oates, Michael Albrecht, Nikhil Velpanur, Owen Campbell-Moore, Owen Skarpness, Ramsey Elbasheer, Steef, Taras Bobrovytsky, Thomas Krcmar, Timothy Sum Hon Mun, Torsten Reil, Tybie Fitzhugh, Ueli Gallizzi.

If you wish to appear here…

1 неделя, 4 дня назад @ youtube.com
This AI Creates Virtual Fingers! 🤝
This AI Creates Virtual Fingers! 🤝 This AI Creates Virtual Fingers! 🤝

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambdalabs.com/papers 📝 The paper "ManipNet: Neural Manipulation Synthesis with a Hand-Object Spatial Representation" is available here:

https://github.com/cghezhang/ManipNet ❤️ Watch these videos in early access on our Patreon page or join us here on YouTube: - https://www.patreon.com/TwoMinutePapers

- https://www.youtube.com/channel/UCbfYPyITQ-7l4upoX8nvctg/join 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Aleksandr Mashrabov, Alex Haro, Andrew Melnychuk, Angelos Evripiotis, Benji Rabhan, Bryan Learn, Christian Ahlin, Eric Haddad, Eric Martel, Gordon Child, Ivo Galic, Jace O'…

2 недели, 1 день назад @ youtube.com
This AI Helps Making A Music Video! 💃
This AI Helps Making A Music Video! 💃 This AI Helps Making A Music Video! 💃

❤️ Train a neural network and track your experiments with Weights & Biases here: http://wandb.me/paperintro 📝 The paper Editable Free-Viewpoint Video using a Layered Neural Representation"" is available here:

https://jiakai-zhang.github.io/st-nerf/

https://github.com/DarlingHang/st-nerf 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Aleksandr Mashrabov, Alex Haro, Andrew Melnychuk, Angelos Evripiotis, Benji Rabhan, Bryan Learn, Christian Ahlin, Eric Haddad, Eric Martel, Gordon Child, Ivo Galic, Jace O'Brien, Javier Bustamante, John Le, Jonas, Kenneth Davis, Klaus Busse, Lorin Atzberger, Lukas Biewald, Matthew Allen Fisher, Mark Oates, Michael A…

2 недели, 3 дня назад @ youtube.com
This AI Learned Boxing…With Serious Knockout Power! 🥊
This AI Learned Boxing…With Serious Knockout Power! 🥊 This AI Learned Boxing…With Serious Knockout Power! 🥊

❤️ Check out Perceptilabs and sign up for a free demo here: https://www.perceptilabs.com/papers 📝 The paper "Control Strategies for Physically Simulated Characters Performing Two-player Competitive Sports" is available here:

https://research.fb.com/publications/control-strategies-for-physically-simulated-characters-performing-two-player-competitive-sports/ ❤️ Watch these videos in early access on our Patreon page or join us here on YouTube: - https://www.patreon.com/TwoMinutePapers

- https://www.youtube.com/channel/UCbfYPyITQ-7l4upoX8nvctg/join 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Aleksandr Mashrabov, Alex Haro, Andrew Melnychuk, Ange…

3 недели назад @ youtube.com
This Magical AI Cuts People Out Of Your Videos! ✂️
This Magical AI Cuts People Out Of Your Videos! ✂️ This Magical AI Cuts People Out Of Your Videos! ✂️

❤️ Check out Weights & Biases and sign up for a free demo here: https://wandb.com/papers ❤️ Their mentioned report is available here: https://wandb.ai/_scott/omnimatte/reports/Omnimatte-Associating-Objects-and-Their-Effects--Vmlldzo5MDQxNTc 📝 The paper "Omnimatte: Associating Objects and Their Effects in Video" is available here:

https://omnimatte.github.io/ Meet and discuss your ideas with other Fellow Scholars on the Two Minute Papers Discord: https://discordapp.com/invite/hbcTJu2 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Aleksandr Mashrabov, Alex Haro, Andrew Melnychuk, Angelos Evripiotis, Benji Rabhan, Bryan Learn, Christian Ahlin, Eri…

3 недели, 4 дня назад @ youtube.com
DeepMind’s AI Plays Catch…And So Much More! 🤖
DeepMind’s AI Plays Catch…And So Much More! 🤖 DeepMind’s AI Plays Catch…And So Much More! 🤖

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambdalabs.com/papers 📝 The paper "Open-Ended Learning Leads to Generally Capable Agents" is available here:

https://deepmind.com/blog/article/generally-capable-agents-emerge-from-open-ended-play

https://deepmind.com/research/publications/open-ended-learning-leads-to-generally-capable-agents ❤️ Watch these videos in early access on our Patreon page or join us here on YouTube: - https://www.patreon.com/TwoMinutePapers

- https://www.youtube.com/channel/UCbfYPyITQ-7l4upoX8nvctg/join 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Aleksandr Mashrabov, Alex Haro, Andrew Melnychuk, Ange…

4 недели, 1 день назад @ youtube.com
Virtual Bones Make Everything Better! 💪
Virtual Bones Make Everything Better! 💪 Virtual Bones Make Everything Better! 💪

❤️ Check out Weights & Biases and sign up for a free demo here: https://wandb.com/papers ❤️ Their mentioned post is available here: https://wandb.ai/jxmorris12/huggingface-demo/reports/A-Step-by-Step-Guide-to-Tracking-Hugging-Face-Model-Performance--VmlldzoxMDE2MTU 📝 The paper "Direct Delta Mush Skinning Compression with Continuous Examples" is available here:

https://binh.graphics/papers/2021s-DDMC/

https://media.contentapi.ea.com/content/dam/ea/seed/presentations/ddm-compression-with-continuous-examples.pdf 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Aleksandr Mashrabov, Alex Haro, Andrew Melnychuk, Angelos Evripiotis, Benji Rabhan, Bryan …

1 месяц, 1 неделя назад @ youtube.com
Beautiful Thin Film Simulations Are Now Possible! 🤯
Beautiful Thin Film Simulations Are Now Possible! 🤯 Beautiful Thin Film Simulations Are Now Possible! 🤯

❤️ Check out Perceptilabs and sign up for a free demo here: https://www.perceptilabs.com/papers 📝 The paper "Thin-Film Smoothed Particle Hydrodynamics Fluid" is available here:

https://arxiv.org/abs/2105.07656 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Aleksandr Mashrabov, Alex Haro, Andrew Melnychuk, Angelos Evripiotis, Benji Rabhan, Bryan Learn, Christian Ahlin, Eric Haddad, Eric Martel, Gordon Child, Ivo Galic, Jace O'Brien, Javier Bustamante, John Le, Jonas, Kenneth Davis, Klaus Busse, Lorin Atzberger, Lukas Biewald, Matthew Allen Fisher, Mark Oates, Michael Albrecht, Nikhil Velpanur, Owen Campbell-Moore, Owen Skarpness, Ramsey Elbashee…

1 месяц, 1 неделя назад @ youtube.com
New AI Research Work Fixes Your Choppy Videos! 🎬
New AI Research Work Fixes Your Choppy Videos! 🎬 New AI Research Work Fixes Your Choppy Videos! 🎬

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambdalabs.com/papers 📝 The paper "Time Lens: Event-based Video Frame Interpolation" is available here:

http://rpg.ifi.uzh.ch/TimeLens.html ❤️ Watch these videos in early access on our Patreon page or join us here on YouTube: - https://www.patreon.com/TwoMinutePapers

- https://www.youtube.com/channel/UCbfYPyITQ-7l4upoX8nvctg/join 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Aleksandr Mashrabov, Alex Haro, Andrew Melnychuk, Angelos Evripiotis, Benji Rabhan, Bryan Learn, Christian Ahlin, Eric Haddad, Eric Martel, Gordon Child, Ivo Galic, Jace O'Brien, Javier Bustamante, John Le, …

1 месяц, 2 недели назад @ youtube.com
Rendering Shiny Things: Finally, A Problem No More! 🍰
Rendering Shiny Things: Finally, A Problem No More! 🍰 Rendering Shiny Things: Finally, A Problem No More! 🍰

❤️ Check out Fully Connected by Weights & Biases: https://wandb.me/papers 📝 The paper "NeX: Real-time View Synthesis with Neural Basis Expansion " is available here:

https://nex-mpi.github.io/ 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Aleksandr Mashrabov, Alex Haro, Andrew Melnychuk, Angelos Evripiotis, Benji Rabhan, Bryan Learn, Christian Ahlin, Eric Haddad, Eric Martel, Gordon Child, Ivo Galic, Jace O'Brien, Javier Bustamante, John Le, Jonas, Kenneth Davis, Klaus Busse, Lorin Atzberger, Lukas Biewald, Matthew Allen Fisher, Mark Oates, Michael Albrecht, Nikhil Velpanur, Owen Campbell-Moore, Owen Skarpness, Ramsey Elbasheer, Steef, Taras B…

1 месяц, 2 недели назад @ youtube.com
Busting Failed Simulations Since 2021! 👕
Busting Failed Simulations Since 2021! 👕 Busting Failed Simulations Since 2021! 👕

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambdalabs.com/papers 📝 The paper "Fast Linking Numbers for Topology Verification of Loopy Structures " is available here:

https://graphics.stanford.edu/papers/fastlinkingnumbers/ ❤️ Watch these videos in early access on our Patreon page or join us here on YouTube: - https://www.patreon.com/TwoMinutePapers

- https://www.youtube.com/channel/UCbfYPyITQ-7l4upoX8nvctg/join 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Aleksandr Mashrabov, Alex Haro, Andrew Melnychuk, Angelos Evripiotis, Benji Rabhan, Bryan Learn, Christian Ahlin, Eric Haddad, Eric Martel, Gordon Child, Ivo Galic, Ja…

1 месяц, 3 недели назад @ youtube.com
DeepMind’s Robot Inserts A USB Stick! 🤖
DeepMind’s Robot Inserts A USB Stick! 🤖 DeepMind’s Robot Inserts A USB Stick! 🤖

❤️ Check out Weights & Biases and sign up for a free demo here: https://wandb.com/papers ❤️ Their mentioned post is available here: https://wandb.ai/stacey/yolo-drive/reports/Bounding-Boxes-for-Object-Detection--Vmlldzo4Nzg4MQ 📝 The paper "Scaling data-driven robotics with reward sketching and batch reinforcement learning" is available here:

https://sites.google.com/view/data-driven-robotics/ 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:

Aleksandr Mashrabov, Alex Haro, Andrew Melnychuk, Angelos Evripiotis, Benji Rabhan, Bryan Learn, Christian Ahlin, Eric Haddad, Eric Martel, Gordon Child, Ivo Galic, Jace O'Brien, Javier Bustamante, John Le, Jo…

1 месяц, 3 недели назад @ youtube.com
DataFest Video DataFest Video
последний пост 1 месяц, 3 недели назад
Gene Kogan | Machine learning for creativity
Gene Kogan | Machine learning for creativity Gene Kogan | Machine learning for creativity

Data Fest Online 2021 https://fest.ai/2021/

ML Art track https://ods.ai/tracks/ml-art-df2021 Speaker introduces himself in Russian, and then presents the material in English.

1 месяц, 3 недели назад @ youtube.com
Alex Farseev: Under the Boot of Google and Facebook and How to Crack it for better Performance
Alex Farseev: Under the Boot of Google and Facebook and How to Crack it for better Performance Alex Farseev: Under the Boot of Google and Facebook and How to Crack it for better Performance

Data Fest Online 2021 https://fest.ai/2021/

ML in Marketing track https://ods.ai/tracks/ml-in-marketing-df2021 Modern Digital Advertising Platforms Leverage Machine Learning and AI to help Advertisers to achieve their goals. Being managed by humans, Advertising technological potential is often remains under-utilised as Humans tend to follow stereotypes and rely on “gut feeling” when making decisions. Understanding of the underlying principles behind “Googles and Facebook’s of our world” therefore becomes a crucial skill a modern marketer needs to acquire to stay relevant. In this talk, we will shed the light into the complex Digital Advertising ecosystem and will show you techniques, such a…

3 месяца назад @ youtube.com
Artem Koval: Cloud-Native MLOps Framework
Artem Koval: Cloud-Native MLOps Framework Artem Koval: Cloud-Native MLOps Framework

Data Fest Online 2021 https://fest.ai/2021/

ML REPA track https://ods.ai/tracks/ml-repa-df2021 Presentation: https://yadi.sk/i/a25573AB8IZUyw In this video we will analyse the requirements for modern MLOps and the main trends: Human-Centered AI, Fairness, Explainability, Model Monitoring, Human Augmented AI

3 месяца, 1 неделя назад @ youtube.com
Data Fest Online 2021: IGLU Competition @ NeurIPS 2021
Data Fest Online 2021: IGLU Competition @ NeurIPS 2021 Data Fest Online 2021: IGLU Competition @ NeurIPS 2021

Data Fest Online 2021 https://fest.ai/2021/

RL + Catalyst track https://ods.ai/tracks/catalyst-and-rl-df2021

3 месяца, 2 недели назад @ youtube.com
Prince Canuma: Catalyst integration with Neptune
Prince Canuma: Catalyst integration with Neptune Prince Canuma: Catalyst integration with Neptune

Data Fest Online 2021 https://fest.ai/2021/

RL + Catalyst track https://ods.ai/tracks/catalyst-and-rl-df2021

3 месяца, 3 недели назад @ youtube.com
Catalyst integration with Wandb
Catalyst integration with Wandb Catalyst integration with Wandb

Data Fest Online 2021 https://fest.ai/2021/

RL + Catalyst track https://ods.ai/tracks/catalyst-and-rl-df2021

3 месяца, 3 недели назад @ youtube.com
Bag of tricks for image classification — Artur Kuzin
Bag of tricks for image classification — Artur Kuzin Bag of tricks for image classification — Artur Kuzin

ML Training 2019 Artur Kuzin tells about his participation in the competition Driven Data Hakuna Ma-data: Identify Wildlife on the Serengeti with AI for Earth. He took second place. In this video, you will find out: - Overview of a training procedure on Imagenet1k from scratch

- Implementation Details of Hacks & Tricks

- The specialty of working with JPEG pictures and resize in different frameworks Presentation - https://gh.mltrainings.ru/presentations/Kuzin_DrivenDataHakuna.pdf

7 месяцев назад @ youtube.com
Segmentation without pain — Yury Bolkonsky, Andrei Dukhounik
Segmentation without pain — Yury Bolkonsky, Andrei Dukhounik Segmentation without pain — Yury Bolkonsky, Andrei Dukhounik

ML Training 2019 Yury Bolkonsky and Andrei Dukhounik tell about their participation in Kaggle Understanding Clouds from Satellite Images. The team got a silver medal. In this video you will find out:

- Thresholding is an evil, believe in your classification models

- Why you should always use modern best practices

- Why it is not recommended to use postprocessing without local validation Presentation - https://gh.mltrainings.ru/presentations/Bolkonsky_KaggleUnderstandingClouds.pdf

7 месяцев, 1 неделя назад @ youtube.com
Use leaks for validation Kaggle ASHRAE Great Energy Predictor III — Yury Bolkonsky
Use leaks for validation Kaggle ASHRAE   Great Energy Predictor III — Yury Bolkonsky Use leaks for validation Kaggle ASHRAE Great Energy Predictor III — Yury Bolkonsky

ML Training 2019 Yury Bolkonsky tells about his participation in Kaggle ASHRAE - Great Energy Predictor III. His team won a gold medal. In this video you will find out:

- How to create timestamp features

- Do you need to use a leak if it is noisy?

- Leak validation for the best solution

7 месяцев, 1 неделя назад @ youtube.com
Time series met AutoML Codalab Automated Time Series Regression — Denis Vorotyntsev
Time series met AutoML Codalab Automated Time Series Regression —  Denis Vorotyntsev Time series met AutoML Codalab Automated Time Series Regression — Denis Vorotyntsev

ML Training 2019 Denis Vorotyntsev won AutoSeries - AutoML competition on time-series regression. In his presentation, he talks about the competition organization, his final solution, and solutions of other top placed participants. In this video, you will find out:

- How AutoML competition differs from most common Kaggle-alike and why you should try them

- Features engineering approach for time-series tasks when you have no idea about domain

- Why validation split should emulate train-test split

- Why you should always check the code of top participants and how small bugs might drop your score Presentation - https://gh.mltrainings.ru/presentations/Vorotyntsev_CodalabAutoML.pdf

7 месяцев, 2 недели назад @ youtube.com
DL for 6D Pose Estimation for Self Driving Cars — Adel Valiullin
DL for 6D Pose Estimation for Self Driving Cars — Adel Valiullin DL for 6D Pose Estimation for Self Driving Cars — Adel Valiullin

ML Training 2019 Adel Valiullin tells about his participation in the competition Kaggle Peking University/Baidu - Autonomous Driving. He won a silver medal. In this video, you will find out: - Overview of the Autonomous Vehicles problem

- Dataset description and exploration: images with 6D pose information, taken from the roof of a car, 3D models of cars and input data analysis - Problems with mAP metric and dataset in this challenge

- The implementation of CenterNet Neural Network for 6D car pose estimation

- Score boosters and other better and high scored approaches

7 месяцев, 2 недели назад @ youtube.com
2 Competitions 1 Unet SpaceNet 5 Challenge & The 3rd Tellus Satellite Challenge — Ilya Kibardin
2 Competitions 1 Unet SpaceNet 5 Challenge & The 3rd Tellus Satellite Challenge — Ilya Kibardin 2 Competitions 1 Unet SpaceNet 5 Challenge & The 3rd Tellus Satellite Challenge — Ilya Kibardin

ML Training 2019 Ilya Kibardin tells about his participation in 2 competitions: Topcoder SpaceNet 5 Challenge & Signate The 3rd Tellus Satellite Challenge. He took fourth and second places. In this video you will find out:

- Spacenet 5 challenge at Topcoder, dataset and metric description

- Overview of a UNet pipeline for road graph extraction from satellite images

- The same pipeline applied to ice segmentation at Signate

- Hacks & Tricks for better performance Presentation - https://gh.mltrainings.ru/presentations/Kibardin_Spacenet5Tellus_v2.pdf

7 месяцев, 3 недели назад @ youtube.com
Bruno Mlodozeniec: Ensemble Distribution Distillation - Classification
Bruno Mlodozeniec: Ensemble Distribution Distillation - Classification Bruno Mlodozeniec: Ensemble Distribution Distillation - Classification

Data Fest Online 2020

Uncertainty Estimation in ML track https://ods.ai/tracks/uncertainty-estimation-in-ml-df2020 Speaker: Bruno Mlodozeniec, University of Cambridge In this video we discuss how ensembles of models can be effectively emulated using a single “Prior Network” model via a technique called Ensemble Distribution Detection. This enables a single model to efficiently retain both the ensemble’s predictive performance and uncertainty measures at low computational and memory cost. Register and get access to the tracks: https://ods.ai/events/datafest2020

Join the community: https://ods.ai/

9 месяцев, 1 неделя назад @ youtube.com
Dmitry Khizbullin: Overview of DaVinci compute architecture for Deep Learning training and inference
Dmitry Khizbullin: Overview of DaVinci compute architecture for Deep Learning training and inference Dmitry Khizbullin: Overview of DaVinci compute architecture for Deep Learning training and inference

DataFest Online 2020

AI Hardware track https://ods.ai/tracks/ai-hardware-df2020 Huawei's DaVinci AI compute architecture. Dmitrii Khizbullin, Overview of DaVinci compute architecture for Deep Learning training and inference, design choices for hardware and software layers. Register and get access to the tracks: https://ods.ai/events/datafest2020

Join the community: https://ods.ai/

9 месяцев, 1 неделя назад @ youtube.com
Evgenii Zheltonozhskii: Entropy Encoding for CNN Inference
Evgenii Zheltonozhskii: Entropy Encoding for CNN Inference Evgenii Zheltonozhskii: Entropy Encoding for CNN Inference

DataFest Online 2020

AI Hardware track https://ods.ai/tracks/ai-hardware-df2020 Speaker: Evgenii Zheltonozhskii, Technion, Israel Institute of Technology Register and get access to the tracks: https://ods.ai/events/datafest2020

Join the community: https://ods.ai/

9 месяцев, 1 неделя назад @ youtube.com
Семинары JetBrains Research Семинары JetBrains Research
последний пост 3 часа назад
Re-splitting Jupyter notebook cells
Re-splitting Jupyter notebook cells Re-splitting Jupyter notebook cells

Вычислительные ноутбуки стали популярными относительно недавно, но из уже полученных данных ясно, что код, написанный в ноутбуках, отличается как по структуре, так и по качеству. Мы предполагаем, что одна из ключевых причин этого — возможность разбивать ноутбук на ячейки. Наша работа нацелена на то, чтобы помочь пользователю найти более эффективные разделения кода на ячейки и сделать ноутбук более читабельным.

На семинаре мы подробно рассмотрим разработанный алгоритм для переразбиения ячеек: зачем он нужен, как работает, и как мы оцениваем качество полученных ячеек. Алгоритм работает на эвристиках, и поэтому много внимания будет посвящено оценке связности кода в ноутбуке и другим метрикам с…

3 часа назад @ youtube.com
Collecting a dataset of bug fixing commits
Collecting a dataset of bug fixing commits Collecting a dataset of bug fixing commits

Идея применения машинного обучения для выявления/исправления ошибок в коде волнует умы исследователей по всему миру. Важным шагом на пути к этой светлой цели является получение обширной, репрезентативной и незашумлённой выборки bug-fix коммитов. На данный момент подавляющее число подобных датасетов собирается либо с помощью баг-трекеров, либо с помощью фильтрации коммитов по ключевым словам. Однако, точность и полнота подобных подходов оставляет желать лучшего. Мы решили исследовать применимость методов машинного обучения для майнинга bug-fix коммитов. На семинаре мы рассмотрим существующие исследования в области классификации коммитов, уделив отдельное внимание проблемам использовавшихся д…

3 дня, 1 час назад @ youtube.com
Aggregation model for stack trace grouping / Deep assignee prediction for bug stacktraces
Aggregation model for stack trace grouping / Deep assignee prediction for bug stacktraces Aggregation model for stack trace grouping / Deep assignee prediction for bug stacktraces

"Aggregation model for stack trace grouping" Разработка программного обеспечения – сложный итеративный процесс. Программы поддерживаются, реорганизуются, исправляются и обновляются на постоянной основе. При этом очень часто существуют ошибки, которые не обнаружили на этапе тестирования. В большинстве современных программ при возникновении ошибки автоматически формируется отчет об ошибке, содержащий в себе стек вызовов, информацию о системе, состояние окружения, версию аварийного приложения, установленные плагины, библиотеки и их версии.

Отчеты об ошибках приходят постоянно, и их становится очень много. Одна из проблем, с которой сталкиваются разработчики, — одинаковые ошибки могут дать немн…

5 дней, 1 час назад @ youtube.com
Reassessing Automatic Evaluation Metrics for Code Summarization Tasks
Reassessing Automatic Evaluation Metrics for Code Summarization Tasks Reassessing Automatic Evaluation Metrics for Code Summarization Tasks

Статья (https://sarahfakhoury.com/2021-FSE-Summarization-Metrics.pdf) посвящена сравнению эффективности метрик для автоматической оценки качества моделей, которые суммаризуют код в документацию. Помимо стандартных BLEU, METEOR и ROUGE авторы также рассматривают BERTScore (метрику, основанную на претренированных эмбеддингах BERT'а) и chrF (метрику, которая считает F-score на n-граммах символов, а не токенов). Авторы изучили, насколько значительной должна быть разница в оценках моделей на уровне корпуса, чтобы мнение асессоров о сравнительном качестве моделей совпадало с мнением метрик. Также авторы изучили, насколько можно доверять оценкам метрик для отдельных примеров. В рамках семинара мы …

5 дней, 20 часов назад @ youtube.com
Stack trace Similarity
Stack trace Similarity Stack trace Similarity

Программные продукты часто имеют ошибки, которые не были обнаружены при тестировании и в результате попали к пользователям. Тогда при возникновении такой ошибки разработчикам отправляется автоматически сгенерированный отчет о ней, состоящий преимущественно из стека вызовов. Таких отчетов приходит очень много и чтобы справиться с их анализом, часто их группируют по исходным причинам ошибки. Для группировки надо понять, какие стеки вызовов похожи друг на друга и относятся к одной ошибке, а какие нет. На семинаре будет рассказано, какие существуют подходы к этой задаче. Также будут рассмотрены предложенные нами модели и возможные их улучшения. В конце будут описаны другие задачи, связанные с о…

6 дней, 23 часа назад @ youtube.com
Инструменты оптимизации для задач SE
Инструменты оптимизации для задач SE Инструменты оптимизации для задач SE

На практике нам хотелось бы иметь предиктивные способности во многих ситуациях: классифицировать объекты, генерировать недостающую информацию, определять наилучшее поведение, и всё это в автоматизированном режиме. Для этого обычно формулируется некоторая параметрическая модель явления или зависимости, после чего параметры подгоняются под наблюдаемую реальность в надежде на то, что зависимость легко обобщаема. В этом сущность машинного обучения. Оптимизация возникает на этапе подгона параметров — оказывается, однако, что минимизировать ошибку модели можно существенно различными методами. А знание закулисной математики при этом помогает как объяснять поведение методов, так и модифицировать по…

1 неделя, 4 дня назад @ youtube.com
A Differential Testing Approach for Evaluating Abstract Syntax Tree Mapping Algorithms
A Differential Testing Approach for Evaluating Abstract Syntax Tree Mapping Algorithms A Differential Testing Approach for Evaluating Abstract Syntax Tree Mapping Algorithms

Инструменты выделения сценариев редактирования AST используются в самых разных областях анализа изменений кода. В их работе присутствует важный шаг, которому уделяется не так много внимания — процесс сопоставления вершин абстрактных синтаксических деревьев кода “до” и “после” изменения. Авторы статьи “A Differential Testing Approach for Evaluating Abstract Syntax Tree Mapping Algorithms” предлагают новый подход к автоматическому сравнению алгоритмов сопоставления вершин AST, который достигает 98-100% точности в соответствии с оценками сторонних экспертов. Более того, в ходе описываемого эксперимента с изменениями в коде на Java авторы выяснили, что около 25% всех считающихся “соответствующи…

1 неделя, 6 дней назад @ youtube.com
Expertise Embeddings and Assignee Prediction Task
Expertise Embeddings and Assignee Prediction Task Expertise Embeddings and Assignee Prediction Task

При работе над крупными (и не очень) проектами активно используются issue-трекеры. Если проект большой и разработчиков достаточно много, то возникает задача выбора программиста, который будет работать над конкретным сообщением о проблеме. Этот процесс занимает время, и в нём также могут возникать ошибки: если issue начинают “передавать” между разработчиками, то время решения проблемы продолжает расти. В рамках данного семинара будут разобраны несколько подходов, которые предложили исследователи для автоматизации процесса рекомендации программистов. Также будет рассмотрен разработанный нами подход для построения эмбеддингов экспертизы программистов, его технические особенности, и то, как с е…

1 месяц, 1 неделя назад @ youtube.com
Fast Reinforcement Learning with Generalized Updates & Evolving Reinforcement Learning Algorithms
Fast Reinforcement Learning with Generalized Updates & Evolving Reinforcement Learning Algorithms Fast Reinforcement Learning with Generalized Updates & Evolving Reinforcement Learning Algorithms

Видео включает два доклада: "Fast reinforcement learning with generalized policy updates" Одной из важных проблем в обучении с подкреплением является то, что зачастую, чтобы выучить удовлетворительную политику, нужно провести достаточно большое число взаимодействий агента с окружением. Авторами рассматриваемой статьи предлагается решать эту проблему с помощью подхода «разделяй и властвуй». Часто, сложную задачу можно представить в виде последовательно или параллельно выполняемых простых задач. Для такого перехода авторами вводятся обобщенные версии policy evaluation и improvement. Таким образом, можно использовать решение одной задачи для решения других. Решение авторов помогает добиться си…

1 месяц, 2 недели назад @ youtube.com
Quality Metrics for Code Generation Models
Quality Metrics for Code Generation Models Quality Metrics for Code Generation Models

The code generation task can be formulated as a translation from a natural language to a specific programming language. To evaluate the output of different code generation models (against each other and during validation), generated code is compared to reference snippets. Unfortunately, it is impossible to evaluate the output of every slightly different model by an experienced human programmer. Therefore, it is necessary to apply some kind of metric that would score generated code snippets, and the desired metric should be strongly correlated with human judgment. The metrics currently used to evaluate code generation models (BLEU, ROUGE, METEOR) have originated from the neural machine trans…

1 месяц, 3 недели назад @ youtube.com
«Запахи в тестах» в Python: определение, нахождение, анализ
«Запахи в тестах» в Python: определение, нахождение, анализ «Запахи в тестах» в Python: определение, нахождение, анализ

Одной из важных областей исследования в программной инженерии в последние пару десятилетий являются так называемые «запахи кода» (code smells), то есть определенные архитектурные и программные решения, которые, не являясь сами по себе ошибкой, тем не менее могут усложнить восприятие кода или привести к ошибке в дальнейшем. Учёные уже собрали большие списки таких запахов, разработали инструменты для их нахождения и проанализировали их частоту в реальном коде. Отдельным подвидом запахов кода являются «запахи в тестах» (test smells). По сути дела, это тоже запахи кода, но специфичные для тестирования и тестировочного кода. Их выделение в отдельный класс и термин связано с тем, что тестировочны…

1 месяц, 3 недели назад @ youtube.com
Применение методов deep learning к задаче эпитопного картирования
Применение методов deep learning к задаче эпитопного картирования Применение методов deep learning к задаче эпитопного картирования

При разработке лекарственных препаратов на основе терапевтических антител возникают различные задачи, связанные со структурами антитела и/или антигена. Например, к ним относятся фолдинг (определение структуры по последовательности), докинг (определение комплекса антитело-антиген), эпитопное картирование (определение эпитопа — аминокислот, участвующих в связывании). Эти задачи можно решать экспериментальным путем, но это долго и дорого. Поэтому на помощь приходят вычислительные методы. При этом в последнее время в научном сообществе возникает все больше и больше решений, основанных на методах глубокого обучения. Один из примеров хорошо известен — Alphafold для задачи фолдинга. На семинаре бу…

1 месяц, 4 недели назад @ youtube.com
Предсказание структуры CDR-H3 с помощью методов глубокого обучения
Предсказание структуры CDR-H3 с помощью методов глубокого обучения Предсказание структуры CDR-H3 с помощью методов глубокого обучения

Dienste anbieten und betreiben, z.

Personalisierte Inhalte und Werbeanzeigen können ebenfalls darauf basieren, darüber hinaus aber auch auf Aktivitäten wie Suchanfragen bei Google und Videos, die Sie sich bei YouTube ansehen.

Zu personalisierten Inhalten und Werbeanzeigen gehören beispielsweise Dinge wie relevantere Ergebnisse und Empfehlungen, eine individuelle YouTube-Startseite und Werbung, die auf Ihre Interessen zugeschnitten ist.

Klicken Sie auf „Anpassen“, um sich Ihre Möglichkeiten anzusehen.

Zu diesen gehören zum Beispiel Steuerelemente, um Cookies für die Personalisierung zu deaktivieren, oder Informationen zu Steuerelementen auf Browserebene, mit denen einige oder alle Cookies fü…

2 месяца назад @ youtube.com
Medical Transformer: Gated Axial-Attention for Medical Image Segmentation
Medical Transformer: Gated Axial-Attention for Medical Image Segmentation Medical Transformer: Gated Axial-Attention for Medical Image Segmentation

В задаче сегментации медицинских изображений наилучших результатов достигают модификации архитектуры UNet. Однако, полагаясь исключительно на свертки, подобные сети принимают решение для каждого пикселя основываясь лишь на небольшой его окрестности. Данное ограничение авторы предлагают обойти с помощью механизма self-attention, как части encoder'a модели. Представленная модель(MedT) учитывает ограничение на небольшой размер датасета, типичный для возможных приложений. Для учета отношений между различными участками изображения вводится новая стратегия обучения(LoGo) — совместное использование двух похожих по архитектуре частей сети: локальной(для небольших областей) и глобальной(для всего из…

2 месяца назад @ youtube.com
Применение различных методов оптимизации для моделей суммаризации кода
Применение различных методов оптимизации для моделей суммаризации кода Применение различных методов оптимизации для моделей суммаризации кода

В стандартном процессе машинного обучения ставится задача поиска глобального минимума функции потерь. При этом ландшафт функции потерь для задач глубокого обучения обычно чрезвычайно сложен, до сих пор неизвестна форма локальных минимумов, их устройство и взаимное расположение. Это приводит к тому, что наиболее популярные на данный момент методы оптимизации (SGD, Adam) могут сойтись в локальный минимум, не являющийся глобальным. К счастью, в последние несколько лет появилось множество подходов, которые модифицируют стандартные SGD и Adam для более качественного обучения моделей и показывают значимое улучшение результатов для исследуемых моделей. Однако, исследователи обычно изучают эффектив…

2 месяца, 1 неделя назад @ youtube.com
Яндекс. Компьютерные науки Яндекс. Компьютерные науки
последний пост 5 месяцев, 1 неделя назад
Разбор письменного экзамена ШАД. Задача 9. Индекс ближайшего превосходящего элемента
Разбор письменного экзамена ШАД. Задача 9. Индекс ближайшего превосходящего элемента Разбор письменного экзамена ШАД. Задача 9. Индекс ближайшего превосходящего элемента

В этом году мы решили помочь тем, кто готовится к поступлению в Школу анализа данных, и поделиться решениями нескольких заданий из вариантов письменного экзамена, демонстрирующими полезные приёмы. Каждую неделю мы будем публиковать здесь разбор одной из задач, которые были на письменном экзамене в ШАД в 2019 году. Условия задач и текстовые разборы вы найдёте на сайте: https://yandexdataschool.ru/stepbystep

5 месяцев, 1 неделя назад @ youtube.com
Письменный разбор экзамена ШАД. Задача 8. Рёбра в графе
Письменный разбор экзамена ШАД. Задача 8. Рёбра в графе Письменный разбор экзамена ШАД. Задача 8. Рёбра в графе

В этом году мы решили помочь тем, кто готовится к поступлению в Школу анализа данных, и поделиться решениями нескольких заданий из вариантов письменного экзамена, демонстрирующими полезные приёмы. Каждую неделю мы будем публиковать здесь разбор одной из задач, которые были на письменном экзамене в ШАД в 2019 году. Условия задач и текстовые разборы вы найдёте на сайте: https://yandexdataschool.ru/stepbystep

5 месяцев, 3 недели назад @ youtube.com
Разбор письменного экзамена ШАД. Задача 7. Неравенство для производной
Разбор письменного экзамена ШАД. Задача 7. Неравенство для производной Разбор письменного экзамена ШАД. Задача 7. Неравенство для производной

В этом году мы решили помочь тем, кто готовится к поступлению в Школу анализа данных, и поделиться решениями нескольких заданий из вариантов письменного экзамена, демонстрирующими полезные приёмы. Каждую неделю мы будем публиковать здесь разбор одной из задач, которые были на письменном экзамене в ШАД в 2019 году. Условия задач и текстовые разборы вы найдёте на сайте: https://yandexdataschool.ru/stepbystep

5 месяцев, 4 недели назад @ youtube.com
Разбор письменного экзамена ШАД. Задача 6. Размерности
Разбор письменного экзамена ШАД. Задача 6. Размерности Разбор письменного экзамена ШАД. Задача 6. Размерности

В этом году мы решили помочь тем, кто готовится к поступлению в Школу анализа данных, и поделиться решениями нескольких заданий из вариантов письменного экзамена, демонстрирующими полезные приёмы. Каждую неделю мы будем публиковать здесь разбор одной из задач, которые были на письменном экзамене в ШАД в 2019 году. Условия задач и текстовые разборы вы найдёте на сайте: https://yandexdataschool.ru/stepbystep

6 месяцев назад @ youtube.com
Разбор письменного экзамена ШАД. Задача 5. Предел и вероятности
Разбор письменного экзамена ШАД. Задача 5. Предел и вероятности Разбор письменного экзамена ШАД. Задача 5. Предел и вероятности

В этом году мы решили помочь тем, кто готовится к поступлению в Школу анализа данных, и поделиться решениями нескольких заданий из вариантов письменного экзамена, демонстрирующими полезные приёмы. Каждую неделю мы будем публиковать здесь разбор одной из задач, которые были на письменном экзамене в ШАД в 2019 году. Условия задач и текстовые разборы вы найдёте на сайте: https://yandexdataschool.ru/stepbystep

6 месяцев, 2 недели назад @ youtube.com
Разбор письменного экзамена ШАД. Задача 4. Геометрическая вероятность
Разбор письменного экзамена ШАД. Задача 4. Геометрическая вероятность Разбор письменного экзамена ШАД. Задача 4. Геометрическая вероятность

В этом году мы решили помочь тем, кто готовится к поступлению в Школу анализа данных, и поделиться решениями нескольких заданий из вариантов письменного экзамена, демонстрирующими полезные приёмы. Каждую неделю мы будем публиковать здесь разбор одной из задач, которые были на письменном экзамене в ШАД в 2019 году. Условия задач и текстовые разборы вы найдёте на сайте: https://yandexdataschool.ru/stepbystep

6 месяцев, 2 недели назад @ youtube.com
Разбор письменного экзамена ШАД. Задача 3. Математическое ожидание числа шаров
Разбор письменного экзамена ШАД. Задача 3. Математическое ожидание числа шаров Разбор письменного экзамена ШАД. Задача 3. Математическое ожидание числа шаров

В этом году мы решили помочь тем, кто готовится к поступлению в Школу анализа данных, и поделиться решениями нескольких заданий из вариантов письменного экзамена, демонстрирующими полезные приёмы. Каждую неделю мы будем публиковать здесь разбор одной из задач, которые были на письменном экзамене в ШАД в 2019 году. Условия задач и текстовые разборы вы найдёте на сайте: https://yandexdataschool.ru/stepbystep

6 месяцев, 3 недели назад @ youtube.com
Разбор письменного экзамена ШАД. Задача 2. Матрица проекции
Разбор письменного экзамена ШАД. Задача 2. Матрица проекции Разбор письменного экзамена ШАД. Задача 2. Матрица проекции

В этом году мы решили помочь тем, кто готовится к поступлению в Школу анализа данных, и поделиться решениями нескольких заданий из вариантов письменного экзамена, демонстрирующими полезные приёмы. Каждую неделю мы будем публиковать здесь разбор одной из задач, которые были на письменном экзамене в ШАД в 2019 году. Условия задач и текстовые разборы вы найдёте на сайте: https://yandexdataschool.ru/stepbystep

7 месяцев назад @ youtube.com
Разбор письменного экзамена ШАД. Задача 1. Предел отношения
Разбор письменного экзамена ШАД. Задача 1. Предел отношения Разбор письменного экзамена ШАД. Задача 1. Предел отношения

В этом году мы решили помочь тем, кто готовится к поступлению в Школу анализа данных, и поделиться решениями нескольких заданий из вариантов письменного экзамена, демонстрирующими полезные приёмы. Каждую неделю мы будем публиковать здесь разбор одной из задач, которые были на письменном экзамене в ШАД в 2019 году. Условия задач и текстовые разборы вы найдёте на сайте: https://yandexdataschool.ru/stepbystep

7 месяцев, 2 недели назад @ youtube.com
Научный митап Yandex Research
Научный митап Yandex Research Научный митап Yandex Research

Yandex Research — это исследовательская группа внутри Яндекса, которая занимается фундаментальными проблемами в важнейших областях computer science и искусственного интеллекта, таких как компьютерное зрение, Natural Language Processing, речевые технологии, краудсорсинг, поиск и рекомендации. В рамках митапа исследователи из Yandex Research и научной лаборатории Яндекса на Факультете Компьютерных Наук НИУ ВШЭ рассказали об интересных задачах, которыми они занимаются, а ещё о том, как стать частью команды. Участники митапа и программа: • Андрей Малинин. Неопределенность в структурных предсказаниях; • Станислав Морозов. Big GANs Are Watching You: о сегментации объектов без учителя с помощью го…

9 месяцев, 1 неделя назад @ youtube.com
ML Trainings ML Trainings
последний пост 36 минут назад
Инесса Трегубова | Пространственная аналитика в задачах геомаркетинга
Инесса Трегубова | Пространственная аналитика в задачах геомаркетинга Инесса Трегубова | Пространственная аналитика в задачах геомаркетинга

ML in Marketing track https://ods.ai/tracks/ml-in-marketing-df2021

Телеграм-канал https://t.me/mlinmarketing Спикер: Инесса Трегубова, Geodata analyst at Яндекс Лавка В докладе расскажу, как и зачем используют геоданные в маркетинге. Обсудим специфику сбора и процессинга пространственных данных, а также несколько ML моделей, с помощью которых можно делать выводы о том, насколько тот или иной участок подходит для целей бизнеса. Покажем, какие задачи можно решать в бизнесе. 00:00 начало видео

01:09 задачи гео-маркетинга

03:18 типы данных

04:05 меры сходства между объектами

07:31 системы агреггации данных в пространстве

10:14 проекции представления данных (crs)

11:11 форматы датасетов

12:57 пр…

36 минут назад @ youtube.com
Данил Гиздатуллин | Рекомендации похожего контента
Данил Гиздатуллин | Рекомендации похожего контента Данил Гиздатуллин | Рекомендации похожего контента

ML in Marketing track https://ods.ai/tracks/ml-in-marketing-df2021

Телеграм-канал https://t.me/mlinmarketing Спикер: Данил Гиздатуллин, Data Scientist В рекомендательных системах есть понятия пользователя и айтема. Типичная задача рекомендаций - рекомендации айтемов пользователям, но также одной из важных задач является задача рекомендации айтемов к айтемам. В докладе рассмотрим несколько вариантов её решения: от использования коллаборативной фильтрации до модели ранжирования. Презентация доклада: https://drive.google.com/file/d/1F0mmU0W4mbOPKRpfYr0JaP6I82L7PS8z/view?usp=sharing Соцсети Data Fest:

https://t.me/datafest

https://vk.com/datafest

5 дней, 2 часа назад @ youtube.com
Андрей Ануфриев | Одноголосный синтез речи на основе нейронных сетей
Андрей Ануфриев | Одноголосный синтез речи на основе нейронных сетей Андрей Ануфриев | Одноголосный синтез речи на основе нейронных сетей

ODS Summer of Code 2021 | Intel & SberCloud track https://ods.ai/tracks/cloudcity2021 Спикер: Андрей Ануфриев, Старший инженер-исследователь в области глубокого обучения, Intel Вступить в сообщество: https://ods.ai/events/datafest2021/join Соцсети Data Fest & ODS Summer of Code:

https://t.me/datafest

https://vk.com/datafest

1 неделя, 5 дней назад @ youtube.com
ODS Course Fest #1
ODS Course Fest #1 ODS Course Fest #1

ODS Course Fest #1 https://datafest.ru/course/ Зарегистрироваться и получить доступ ко всем курсам и мероприятиям сезона: https://ods.ai/events/course_fest_1 Вступить в сообщество: https://ods.ai/ Соцсети Data Fest & Course Fest: https://t.me/datafest

https://vk.com/datafest

2 недели, 3 дня назад @ youtube.com
Антон Киселев | Динамическое ценообразование в топливном ритейле
Антон Киселев | Динамическое ценообразование в топливном ритейле Антон Киселев | Динамическое ценообразование в топливном ритейле

ML in Marketing track https://ods.ai/tracks/ml-in-marketing-df2021

Телеграм-канал https://t.me/mlinmarketing Спикер: Антон Киселев, Marketing Analytics Producer at Playrix

* 10 лет в data (ритейл, игры, телеком)

* 15 DS проектов в проде

* 40 DS специалистов принял на работу

* 3 DS отдела построил с нуля Антон расскажет, как оптимизировали и автоматизировали ценообразование моторного топлива в одной российской топливной компании. Как учились менять цены на лету, как Марков, RL и многорукие бандиты помогли вывести argmax в продакшн и что из всего этого вышло. 00:00 introduction

01:15 динамическое ценообразование в бизнесе

04:00 цели и задачи использования

05:25 ценообразование в топливном рит…

3 недели, 3 дня назад @ youtube.com
Артём Агафонов | Машинное обучение в гео-аналитике для бизнеса
Артём Агафонов | Машинное обучение в гео-аналитике для бизнеса Артём Агафонов | Машинное обучение в гео-аналитике для бизнеса

ML in Marketing track https://ods.ai/tracks/ml-in-marketing-df2021

Телеграм-канал https://t.me/mlinmarketing Спикер: Артем Агафонов, Data Scientist at Mail Ru Group Выбор удачной геопозиции является одной из важных стратегических задач для бизнеса, будь это строительная компания или компания розничной торговли (магазины, рестораны, банки). Популярность локации, доступность с точки зрения пешеходного или транспортного потока, наличие конкурентов в геоточке определяют важнейшие характеристики успешности бизнеса. В докладе рассмотрены примеры того, как большие данные помогают автоматизировать процесс отбора географического положения для нового объекта. Презентация доклада: https://drive.google…

3 недели, 5 дней назад @ youtube.com
Даниил Чесаков | One-shot FaceSwap
Даниил Чесаков | One-shot FaceSwap Даниил Чесаков | One-shot FaceSwap

ODS Summer of Code 2021 | Intel & SberCloud track https://ods.ai/tracks/cloudcity2021 В пятницу 27.08.2021 в 17:00 в нашем ODS спейсе live.ods.ai (пароль odssummerofcodeison) состоится митап, на котором Даниил Чесаков, Data Scientist в SberAI, расскажет о современных методах переноса лица и детально рассмотрит методы переноса лица по одной source фотографии на target фотографию или видео. Даниил остановится на двух моделях FaceShifter, одна из которых делает непосредственно перенос, а вторая добавляет детали, которые были изначально на target. Также он расскажет про то, как удалось улучшить качество переноса и сделать свой сервис FaceSwap Зарегистрироваться на ODS Summer of Code и получить …

3 недели, 6 дней назад @ youtube.com
Виталий Поздняков | Поиск сообществ в социальных сетях
Виталий Поздняков | Поиск сообществ в социальных сетях Виталий Поздняков | Поиск сообществ в социальных сетях

Data Fest Online 2021

ML in Marketing track https://ods.ai/tracks/ml-in-marketing...

Телеграм-канал https://t.me/mlinmarketing Спикер: Виталий Поздняков, Преподаватель, исследователь в ВШЭ, MADE Разберемся, что такое сообщество в социальной сети и как его определить. Рассмотрим современные научные подходы к обнаружению сообществ в социальных сетях: методы на основе случайных блужданий, спектральной кластеризации и графовых нейронных сетей. Обсудим библиотеки Python, которые можно использовать для поиска сообществ, посмотрим примеры визуализации найденных сообществ. Дополнительные материалы по графам: Ссылка на гитхаб курса http://github.com/netspractice/network-science

там же в описании ссы…

4 недели назад @ youtube.com
Евгений Изутов | Распознавание языка жестов
Евгений Изутов | Распознавание языка жестов Евгений Изутов | Распознавание языка жестов

ODS Summer of Code 2021 | Intel & SberCloud track https://ods.ai/tracks/cloudcity2021 Спикер: Евгений Изутов, Deep Learning R&D Engineer, Intel В последнее время мы видим все больше примеров успешного использования методов глубокого обучения для решения практических задач. Проделан большой путь от решения базовой задачи классификации, сделавшей имя глубокому обучению, до более полезных решений, таких как система помощи водителю или автоматический перевод иностранной речи. Последний пример включает расширенное определение языка – использование жестов и мимики вместо традиционных звуков. Жестовые языки долгое время оставались незамеченными, и машинный перевод на них не распространялся. Это бы…

1 месяц назад @ youtube.com
Профилировка приложения, использующего oneDNN с помощью инструментария Intel® vTune™ Profiler
Профилировка приложения, использующего oneDNN с помощью инструментария Intel® vTune™ Profiler Профилировка приложения, использующего oneDNN с помощью инструментария Intel® vTune™ Profiler

ODS Summer of Code 2021 | Intel & SberCloud track https://ods.ai/tracks/cloudcity2021 Зарегистрироваться на ODS Summer of Code и получить доступ к проектам и трекам: https://ods.ai/events/datafest2021 Вступить в сообщество: https://ods.ai/ Соцсети Data Fest & ODS Summer of Code:

https://t.me/datafest

https://vk.com/datafest

1 месяц, 1 неделя назад @ youtube.com
Ускорение Scikit-learn на процессорах Intel® с помощью Intel® oneDAL
Ускорение Scikit-learn на процессорах Intel® с помощью Intel® oneDAL Ускорение Scikit-learn на процессорах Intel® с помощью Intel® oneDAL

ODS Summer of Code 2021 | Intel & SberCloud track https://ods.ai/tracks/cloudcity2021 Зарегистрироваться на ODS Summer of Code и получить доступ к проектам и трекам: https://ods.ai/events/datafest2021 Вступить в сообщество: https://ods.ai/ Соцсети Data Fest & ODS Summer of Code:

https://t.me/datafest

https://vk.com/datafest

1 месяц, 1 неделя назад @ youtube.com
LightAutoML | ODS Summer of Code 2021
LightAutoML  | ODS Summer of Code 2021 LightAutoML | ODS Summer of Code 2021

ODS Summer of Code 2021 https://ods.ai/tracks/summer-of-code-2021

LightAutoML Hackathon https://ods.ai/competitions/LAMA-hack-soc2021 Приветствуем участников Summer School Of Code! В этом видео вы узнаете, на какие темы мы, разработчики LightAutoML, предлагаем вам подумать, чтобы совместными усилиями сделать наше решение чуть лучше. Вступить в сообщество: https://ods.ai/ Соцсети Data Fest:

https://t.me/datafest

https://vk.com/datafest

1 месяц, 1 неделя назад @ youtube.com
Ускорение инференса модели XGBoost с помощью Intel® oneDAL на процессорах Intel® Xeon®
Ускорение инференса модели XGBoost с помощью Intel® oneDAL на процессорах Intel® Xeon® Ускорение инференса модели XGBoost с помощью Intel® oneDAL на процессорах Intel® Xeon®

ODS Summer of Code 2021 | Intel & SberCloud track https://ods.ai/tracks/cloudcity2021 Зарегистрироваться на ODS Summer of Code и получить доступ к проектам и трекам: https://ods.ai/events/datafest2021 Вступить в сообщество: https://ods.ai/ Соцсети Data Fest & ODS Summer of Code:

https://t.me/datafest

https://vk.com/datafest

1 месяц, 1 неделя назад @ youtube.com
Татьяна Шаврина | Прикладные применения ruGPT-3, GPT-Neo, GPT-3 OpenAI: что можно сделать уже сейчас
Татьяна Шаврина | Прикладные применения ruGPT-3, GPT-Neo, GPT-3 OpenAI: что можно сделать уже сейчас Татьяна Шаврина | Прикладные применения ruGPT-3, GPT-Neo, GPT-3 OpenAI: что можно сделать уже сейчас

ODS Summer of Code 2021 | Intel & SberCloud track https://ods.ai/tracks/cloudcity2021 Прикладные применения ruGPT-3, GPT-Neo, GPT-3 OpenAI: что можно и нужно сделать уже сейчас Спикер: Татьяна Шаврина, Главный эксперт по технологиям, SberDevices В субботу 14.08.21 в 13:00 в нашем ODS спейсе https://live.ods.ai/ (пароль odssummerofcodeison) вместе с Татьяной Шавриной посмотрим на возможности моделей, их актуальные применения и продукты после года выхода модели. Сравним возможности моделей разных размеров, успешность прототипов, продуктов для русского, английского и других языков. Ждём вас! Зарегистрироваться на ODS Summer of Code и получить доступ к проектам и трекам: https://ods.ai/events/d…

1 месяц, 1 неделя назад @ youtube.com
Дмитрий Темнов | Ускорение исполнения нейронных сетей с Intel® OneVINO™ toolkit
Дмитрий Темнов | Ускорение исполнения нейронных сетей с Intel® OneVINO™ toolkit Дмитрий Темнов | Ускорение исполнения нейронных сетей с Intel® OneVINO™ toolkit

ODS Summer of Code 2021 | Intel & SberCloud track https://ods.ai/tracks/cloudcity2021 Зарегистрироваться на ODS Summer of Code и получить доступ к проектам и трекам: https://ods.ai/events/datafest2021 Вступить в сообщество: https://ods.ai/ Соцсети Data Fest & ODS Summer of Code:

https://t.me/datafest

https://vk.com/datafest

1 месяц, 1 неделя назад @ youtube.com
Primer Primer
последний пост 3 недели, 1 день назад
Simulating the Evolution of Sacrificing for Family
Simulating the Evolution of Sacrificing for Family Simulating the Evolution of Sacrificing for Family

Try NordVPN free for 30 days at https://nordvpn.com/Primer More than you ever wanted to know about Hamilton's rule:

https://users.ox.ac.uk/~grafen/cv/oseb.pdf For discussion and updates

- Twitter: @primerlearning

- Discord: https://discord.gg/NbruaNW

- Reddit: r/primerlearning Plush blobs and other merch: https://store.dftba.com/collections/primer

Support these videos on Patreon: https://www.patreon.com/primerlearning Made with Unity and Manim

https://github.com/Helpsypoo/PrimerUnity

https://www.manim.community Music by Mathieu Keith. For business inquiries: mathieu.keith@gmail.com Several other inputs into the graphics are from public domain contributions to blendswap.com Made possible by …

3 недели, 1 день назад @ youtube.com
Simulating Green Beard Altruism
Simulating Green Beard Altruism Simulating Green Beard Altruism

Brilliant: http://www.brilliant.org/primer Papers:

- https://www.researchgate.net/publication/41910312_Altruism_Spite_and_Greenbeards

- https://www.reed.edu/biology/professors/srenn/pages/teaching/2007_syllabus/2007_readings/a13_Keller_1998.pdf For discussion and updates

- Discord: https://discord.gg/NbruaNW

- Reddit: r/primerlearning

- Twitter: @primerlearning Sometimes streaming myself working on these monstrosities:

- Twitch: https://www.twitch.tv/primerjustin Made with Unity

https://github.com/Helpsypoo/PrimerUnity Music by Mathieu Keith. For business inquiries: mathieu.keith@gmail.com Several other inputs into the graphics are from public domain contributions to blendswap.com Plush blo…

5 месяцев, 3 недели назад @ youtube.com
Hamilton's rule is a lie is a lie
Hamilton's rule is a lie is a lie Hamilton's rule is a lie is a lie

Plush blobs: https://store.dftba.com/collections/primer

Support these videos on Patreon: https://www.patreon.com/primerlearning A good place for learning more about how to be less wrong:

https://www.lesswrong.com/ For discussion and updates

- Discord: https://discord.gg/NbruaNW

- Reddit: r/primerlearning

- Twitter: @primerlearning

- Facebook: facebook.com/primerlearning Streaming myself working on these monstrosities:

- Twitch: https://www.twitch.tv/primerjustin Made possible by support through Patreon:

Christian Gruber

Matthijs Ruijgrok

Christopher

Anthony Eufemio

José Hamilton

Zachariah Richard Fournier

Vladimir Duchenchuk

Noah Healy

JMakes

Mike Schmidt

PeepPhysics

Anders Fjeldvær

Ghost G…

9 месяцев, 2 недели назад @ youtube.com
🎧 Podcasts
Lex Fridman AI Podcast Lex Fridman AI Podcast
последний пост 9 часов назад
#222 – Jay McClelland: Neural Networks and the Emergence of Cognition
#222 – Jay McClelland: Neural Networks and the Emergence of Cognition #222 – Jay McClelland: Neural Networks and the Emergence of Cognition

Jay McClelland is a cognitive scientist at Stanford.

Please support this podcast by checking out our sponsors:– Paperspace: https://gradient.run/lex to get $15 credit– Skiff: https://skiff.org/lex to get early access– Uprising Food: https://uprisingfood.com/lex to get $10 off 1st starter bundle– Four Sigmatic: https://foursigmatic.com/lex and use code LexPod to get up to 60% off– Onnit: https://lexfridman.com/onnit to get up to 10% offEPISODE LINKS:Jay’s Website: https://stanford.edu/~jlmcc/PODCAST INFO:Podcast website: https://lexfridman.com/podcastApple Podcasts: https://apple.co/2lwqZIrSpotify: https://spoti.fi/2nEwCF8RSS: https://lexfridman.com/feed/podcast/YouTube Full Episodes: https:…

9 часов назад @ lexfridman.com
#221 – Douglas Lenat: Cyc and the Quest to Solve Common Sense Reasoning in AI
#221 – Douglas Lenat: Cyc and the Quest to Solve Common Sense Reasoning in AI #221 – Douglas Lenat: Cyc and the Quest to Solve Common Sense Reasoning in AI

Douglas Lenat is the founder of Cyc, a 37 year project aiming to solve common-sense knowledge and reasoning in AI.

On some podcast players you should be able to click the timestamp to jump to that time.

(00:00) – Introduction(07:39) – What is Cyc?

(2:01:21) – The open source community and OpenCyc(2:11:48) – The inference problem(2:13:31) – Cyc’s programming language(2:21:05) – Ontological engineering(2:28:30) – Do machines think?

(2:37:15) – Death and consciousness(2:47:16) – What would you say to AI?

4 дня, 22 часа назад @ lexfridman.com
#220 – Niels Jorgensen: New York Firefighters and the Heroes of 9/11
#220 – Niels Jorgensen: New York Firefighters and the Heroes of 9/11 #220 – Niels Jorgensen: New York Firefighters and the Heroes of 9/11

Niels Jorgensen is a former New York firefighter for over 21 years, who was there at Ground Zero on September 11th, 2001.

Please support this podcast by checking out our sponsors:– ROKA: https://roka.com/ and use code LEX to get 20% off your first order– MUD\WTR: https://mudwtr.com/lex and use code LEX to get 5% off– Magic Spoon: https://magicspoon.com/lex and use code LEX to get $5 off– Blinkist: https://blinkist.com/lex and use code LEX to get 25% off premiumEPISODE LINKS:Niels’s 20 for 20 Podcast: https://ironlightlabs.org/20-for-20/PODCAST INFO:Podcast website: https://lexfridman.com/podcastApple Podcasts: https://apple.co/2lwqZIrSpotify: https://spoti.fi/2nEwCF8RSS: https://lexfridman.…

1 неделя, 1 день назад @ lexfridman.com
#219 – Donald Knuth: Programming, Algorithms, Hard Problems & the Game of Life
#219 – Donald Knuth: Programming, Algorithms, Hard Problems & the Game of Life #219 – Donald Knuth: Programming, Algorithms, Hard Problems & the Game of Life

Donald Knuth is a computer scientist, Turing Award winner, father of algorithm analysis, author of The Art of Computer Programming, and creator of TeX.

Please support this podcast by checking out our sponsors:– Coinbase: https://coinbase.com/lex to get $5 in free Bitcoin– InsideTracker: https://insidetracker.com/lex and use code Lex25 to get 25% off– NetSuite: http://netsuite.com/lex to get free product tour– ExpressVPN: https://expressvpn.com/lexpod and use code LexPod to get 3 months free– BetterHelp: https://betterhelp.com/lex to get 10% offEPISODE LINKS:Donald’s Stanford Page: https://profiles.stanford.edu/donald-knuthDonald’s Books: https://amzn.to/3heyBsCPODCAST INFO:Podcast website: …

1 неделя, 3 дня назад @ lexfridman.com
#218 – Jaron Lanier: Virtual Reality, Social Media & the Future of Humans and AI
#218 – Jaron Lanier: Virtual Reality, Social Media & the Future of Humans and AI #218 – Jaron Lanier: Virtual Reality, Social Media & the Future of Humans and AI

Jaron Lanier is a computer scientist, composer, artist, author, and founder of the field of virtual reality.

Please support this podcast by checking out our sponsors:– Skiff: https://skiff.org/lex to get early access– Novo: https://banknovo.com/lex– Onnit: https://lexfridman.com/onnit to get up to 10% off– Indeed: https://indeed.com/lex to get $75 credit– Eight Sleep: https://www.eightsleep.com/lex and use code LEX to get special savingsEPISODE LINKS:Jaron’s Website: http://www.jaronlanier.com/Jaron’s Books: https://amzn.to/3tlhl9TPODCAST INFO:Podcast website: https://lexfridman.com/podcastApple Podcasts: https://apple.co/2lwqZIrSpotify: https://spoti.fi/2nEwCF8RSS: https://lexfridman.com/f…

1 неделя, 6 дней назад @ lexfridman.com
#217 – Rodney Brooks: Robotics
#217 – Rodney Brooks: Robotics #217 – Rodney Brooks: Robotics

Rodney Brooks is a roboticist, former head of CSAIL at MIT, and co-founder of iRobot, Rethink Robotics, and Robust.AI.

Please support this podcast by checking out our sponsors:– Paperspace: https://gradient.run/lex to get $15 credit– GiveDirectly: https://givedirectly.org/lex to get gift matched up to $300– BiOptimizers: http://www.magbreakthrough.com/lex to get 10% off– Four Sigmatic: https://foursigmatic.com/lex and use code LexPod to get up to 60% off– SimpliSafe: https://simplisafe.com/lex and use code LEX to get a free security cameraEPISODE LINKS:Rodney’s Twitter: https://twitter.com/rodneyabrooksRodney’s Blog: http://rodneybrooks.com/blog/PODCAST INFO:Podcast website: https://lexfrid…

2 недели, 2 дня назад @ lexfridman.com
#216 – Vincent Racaniello: Viruses and Vaccines
#216 – Vincent Racaniello: Viruses and Vaccines #216 – Vincent Racaniello: Viruses and Vaccines

Vincent Racaniello is a virologist, immunologist, and microbiologist at Columbia.

He is a co-author of the textbook Principles of Virology and co-host of This Week in Virology podcast.

On some podcast players you should be able to click the timestamp to jump to that time.

(1:35:45) – Vaccines(1:41:43) – Lex on his reaction to the COVID-19 vaccine shot(1:47:39) – Modern vaccines(1:52:39) – How does mRNA vaccine work?

(3:06:37) – Masks(3:15:05) – Bret Weinstein vs Sam Harris(3:18:39) – This Week in Virology(3:28:19) – Advice for young people(3:30:42) – Meaning of life

2 недели, 4 дня назад @ lexfridman.com
#215 – Wojciech Zaremba: OpenAI Codex, GPT-3, Robotics, and the Future of AI
#215 – Wojciech Zaremba: OpenAI Codex, GPT-3, Robotics, and the Future of AI #215 – Wojciech Zaremba: OpenAI Codex, GPT-3, Robotics, and the Future of AI

Wojciech Zaremba is a co-founder of OpenAI.

On some podcast players you should be able to click the timestamp to jump to that time.

(00:00) – Introduction(07:57) – The Fermi paradox(14:59) – Systems of government(17:36) – Life, intelligence, and consciousness(24:49) – GPT language model(27:02) – Engineering consciousness(31:19) – Is there an algorithm for intelligence?

(1:31:35) – AI safety(1:38:21) – OpenAI Codex(1:51:54) – Robotics(2:01:11) – Developing self driving cars and robots(2:12:02) – What is the benchmark for intelligence?

(2:17:18) – AI Friendships(2:26:48) – Sleep(2:29:22) – Generating good ideas(2:35:47) – Advice for young people(2:40:31) – Getting started with machine learnin…

3 недели назад @ lexfridman.com
#214 – Jed Buchwald: Isaac Newton and the Philosophy of Science
#214 – Jed Buchwald: Isaac Newton and the Philosophy of Science #214 – Jed Buchwald: Isaac Newton and the Philosophy of Science

Jed Buchwald is a historian and philosopher of science at Caltech.

Please support this podcast by checking out our sponsors:– GiveWell: https://www.givewell.org/ and use code LEX to get donation matched up to $1k– Theragun: https://therabody.com/lex to get 30 day trial– LMNT: https://drinkLMNT.com/lex to get free sample pack– Fundrise: https://fundrise.com/lex– BetterHelp: https://betterhelp.com/lex to get 10% offEPISODE LINKS:Jed’s Caltech page: https://bit.ly/38eLLRFJed’s Books: https://amzn.to/2WoxGPiPODCAST INFO:Podcast website: https://lexfridman.com/podcastApple Podcasts: https://apple.co/2lwqZIrSpotify: https://spoti.fi/2nEwCF8RSS: https://lexfridman.com/feed/podcast/YouTube Full Epi…

3 недели, 2 дня назад @ lexfridman.com
#213 – Barry Barish: Gravitational Waves and the Most Precise Device Ever Built
#213 – Barry Barish: Gravitational Waves and the Most Precise Device Ever Built #213 – Barry Barish: Gravitational Waves and the Most Precise Device Ever Built

Barry Barish is a theoretical physicist at Caltech and the winner of the Nobel Prize in Physics.

Please support this podcast by checking out our sponsors:– MUD\WTR: https://mudwtr.com/lex and use code LEX to get 5% off– GiveDirectly: https://givedirectly.org/lex to get gift matched up to $300– BiOptimizers: http://www.magbreakthrough.com/lex to get 10% off– Four Sigmatic: https://foursigmatic.com/lex and use code LexPod to get up to 60% off– Magic Spoon: https://magicspoon.com/lex and use code LEX to get $5 offEPISODE LINKS:Barry’s Nobel Prize entry: https://www.nobelprize.org/prizes/physics/2017/barish/facts/Barry’s Caltech profile: https://pma.caltech.edu/people/barry-c-barishLIGO’s Websi…

3 недели, 6 дней назад @ lexfridman.com
#212 – Joscha Bach: Nature of Reality, Dreams, and Consciousness
#212 – Joscha Bach: Nature of Reality, Dreams, and Consciousness #212 – Joscha Bach: Nature of Reality, Dreams, and Consciousness

Joscha Bach is a cognitive scientist, AI researcher, and philosopher.

On some podcast players you should be able to click the timestamp to jump to that time.

(00:00) – Introduction(07:15) – Life is hard(09:38) – Consciousness(16:24) – What is life?

(26:33) – Free will(40:38) – Simulation(42:49) – Base layer of reality(58:24) – Boston Dynamics(1:06:43) – Engineering consciousness(1:17:12) – Suffering(1:26:06) – Postmodernism(1:30:25) – Psychedelics(1:43:40) – GPT-3(1:52:22) – GPT-4(1:58:47) – OpenAI Codex(2:01:02) – Humans vs AI: Who is more dangerous?

(2:17:47) – Hitler(2:22:44) – Autonomous weapon systems(2:30:11) – Mark Zuckerberg(2:35:47) – Love(2:50:00) – Michael Malice and anarchism(3:…

4 недели, 1 день назад @ lexfridman.com
#211 – Brian Muraresku: The Secret History of Psychedelics
#211 – Brian Muraresku: The Secret History of Psychedelics #211 – Brian Muraresku: The Secret History of Psychedelics

Brian Muraresku is the author of The Immortality Key.

Please support this podcast by checking out our sponsors:– InsideTracker: https://insidetracker.com/lex and use code Lex25 to get 25% off– GiveWell: https://www.givewell.org/ and use code LEX to get donation matched up to $1k– NI: https://www.ni.com/perspectives– Indeed: https://indeed.com/lex to get $75 credit– MasterClass: https://masterclass.com/lex to get 15% offEPISODE LINKS:Brian’s Twitter: https://twitter.com/brianmurareskuBrian’s website: https://www.brianmuraresku.comImmortality Key (book):PODCAST INFO:Podcast website: https://lexfridman.com/podcastApple Podcasts: https://apple.co/2lwqZIrSpotify: https://spoti.fi/2nEwCF8RSS: htt…

1 месяц назад @ lexfridman.com
#210 – Matt Walker: Sleep
#210 – Matt Walker: Sleep #210 – Matt Walker: Sleep

Matt Walker is a sleep scientist at Berkeley, author of Why We Sleep, and the host of a new podcast called The Matt Walker Podcast.

On some podcast players you should be able to click the timestamp to jump to that time.

(00:00) – Introduction(09:56) – Putin moment: Lex takes Matt’s sunglasses(10:17) – Fascination with sleep(14:26) – Why do we sleep?

(22:58) – Computer vision for driver assistance(32:19) – Consciousness is fundamental(40:25) – Lex on human to robot connection(42:52) – Scent of a Woman is better than “John Wick”(54:33) – Distinction between coffee and caffeine(1:20:17) – The science of ‘sleeping on it’(1:34:10) – Lex on his sleeping schedule(1:59:14) – Chronotypes(2:06:44) – …

1 месяц, 1 неделя назад @ lexfridman.com
#209 – Luís and João Batalha: Fermat’s Library and the Art of Studying Papers
#209 – Luís and João Batalha: Fermat’s Library and the Art of Studying Papers #209 – Luís and João Batalha: Fermat’s Library and the Art of Studying Papers

Luis and Joao Batalha are co-founders of Fermat’s Library.

Please support this podcast by checking out our sponsors:– Skiff: https://skiff.org/lex to get early access– SimpliSafe: https://simplisafe.com/lex and use code LEX to get a free security camera– Indeed: https://indeed.com/lex to get $75 credit– NetSuite: http://netsuite.com/lex to get free product tour– Four Sigmatic: https://foursigmatic.com/lex and use code LexPod to get up to 60% offEPISODE LINKS:Fermat’s Library Twitter: https://twitter.com/fermatslibraryLuis’s Twitter: https://twitter.com/luismbatJoao’s Twitter: https://twitter.com/joao_batalhaFermat’s Library Website: https://fermatslibrary.comPODCAST INFO:Podcast website: ht…

1 месяц, 1 неделя назад @ lexfridman.com
#208 – Jeff Hawkins: The Thousand Brains Theory of Intelligence
#208 – Jeff Hawkins: The Thousand Brains Theory of Intelligence #208 – Jeff Hawkins: The Thousand Brains Theory of Intelligence

Jeff Hawkins is a neuroscientist and cofounder of Numenta, a neuroscience research company.

Please support this podcast by checking out our sponsors:– Codecademy: https://codecademy.com and use code LEX to get 15% off– BiOptimizers: http://www.magbreakthrough.com/lex to get 10% off– ExpressVPN: https://expressvpn.com/lexpod and use code LexPod to get 3 months free– Eight Sleep: https://www.eightsleep.com/lex and use code LEX to get special savings– Blinkist: https://blinkist.com/lex and use code LEX to get 25% off premiumEPISODE LINKS:A Thousand Brain (book): https://amzn.to/3AmxJt7Numenta’s Twitter: https://twitter.com/NumentaNumenta’s Website: https://numenta.com/PODCAST INFO:Podcast webs…

1 месяц, 1 неделя назад @ lexfridman.com
Microsoft Research Podcast Microsoft Research Podcast
последний пост 1 месяц, 1 неделя назад
132 - New Future of Work: How remote and hybrid work will shape workplaces and society with Jaime Teevan and Sid Suri
132 - New Future of Work: How remote and hybrid work will shape workplaces and society with Jaime Teevan and Sid Suri 132 - New Future of Work: How remote and hybrid work will shape workplaces and society with Jaime Teevan and Sid Suri

For Microsoft researchers, COVID-19 was a call to action.

Teams from across the Microsoft organizational chart pooled their unique expertise together under The New Future of Work initiative.

The results have informed product features designed to better support remote work and are now being used to help companies, including Microsoft, usher their workforces into a future of hybrid work.

In this episode of The New Future of Work series, Chief Scientist Jaime Teevan and Senior Principal Researcher Siddharth Suri explore the many ways people were impacted by work shifts during the COVID-19 pandemic.

The research that Siddharth Suri describes in this podcast was jointly done with Hana Wolf of Li…

1 месяц, 1 неделя назад @ blubrry.com
131 - New Future of Work: Redefining workspaces as hybrid and remote work become more prevalent with Jaime Teevan and Ginger Hudson
131 - New Future of Work: Redefining workspaces as hybrid and remote work become more prevalent with Jaime Teevan and Ginger Hudson 131 - New Future of Work: Redefining workspaces as hybrid and remote work become more prevalent with Jaime Teevan and Ginger Hudson

For Microsoft researchers, COVID-19 was a call to action.

The reimagining of work practices had long been an area of study, but existing and new questions that needed immediate answers surfaced as companies and their employees quickly adjusted to significantly different working conditions.

Teams from across the Microsoft organizational chart pooled their unique expertise together under The New Future of Work initiative.

The results have informed product features designed to better support remote work and are now being used to help companies, including Microsoft, usher their workforces into a future of hybrid work.

They also talk about what an “anatomy of hybrid work” might look like and som…

1 месяц, 2 недели назад @ blubrry.com
130 - New Future of Work: Managing IT and security in remote scenarios with Jaime Teevan and Matt Brodsky
130 - New Future of Work: Managing IT and security in remote scenarios with Jaime Teevan and Matt Brodsky 130 - New Future of Work: Managing IT and security in remote scenarios with Jaime Teevan and Matt Brodsky

For Microsoft researchers, COVID-19 was a call to action.

The reimagining of work practices had long been an area of study, but existing and new questions that needed immediate answers surfaced as companies and their employees quickly adjusted to significantly different working conditions.

Teams from across the Microsoft organizational chart pooled their unique expertise together under The New Future of Work initiative.

The results have informed product features designed to better support remote work and are now being used to help companies, including Microsoft, usher their workforces into a future of hybrid work.

They also explore why remote work came with a spike in phishing threats, what…

1 месяц, 3 недели назад @ blubrry.com
129 - Machine learning, molecular simulation, and the opportunity for societal good with Chris Bishop and Max Welling
129 - Machine learning, molecular simulation, and the opportunity for societal good with Chris Bishop and Max Welling 129 - Machine learning, molecular simulation, and the opportunity for societal good with Chris Bishop and Max Welling

Unlocking the challenge of molecular simulation has the potential to yield significant breakthroughs in how we tackle such societal issues as climate change, drug discovery, and the treatment of disease, and Microsoft is ramping up its efforts in the space.

In this episode, Chris Bishop, Lab Director of Microsoft Research Cambridge, welcomes renowned machine learning researcher Max Welling to the Microsoft Research team as head of the new Amsterdam lab.

Connecting over their shared physics background and vision for molecular simulation, Bishop and Welling explore several fascinating topics, including a future in which machine learning and quantum computing will be used in tandem to model mo…

2 месяца назад @ blubrry.com
128 - New Future of Work: How developer collaboration and productivity are changing in a hybrid work model
128 - New Future of Work: How developer collaboration and productivity are changing in a hybrid work model 128 - New Future of Work: How developer collaboration and productivity are changing in a hybrid work model

Teams from across the Microsoft organizational chart pooled their unique expertise together under The New Future of Work initiative.

The results have informed product features designed to better support remote work and are now being used to help companies, including Microsoft, usher their workforces into a future of hybrid work.

In this episode of The New Future of Work series, Chief Scientist Jaime Teevan and Principal Productivity Engineer Brian Houck discuss what the massive shift to remote work meant for developers—both employees of Microsoft and customers using Microsoft developer platforms to support their work.

They’ll talk about how taking a holistic approach to developer productivi…

2 месяца, 1 неделя назад @ blubrry.com
127 - New Future of Work: Staying productive and happy when our office is our home with Jaime Teevan and Sonia Jaffe
127 - New Future of Work: Staying productive and happy when our office is our home with Jaime Teevan and Sonia Jaffe 127 - New Future of Work: Staying productive and happy when our office is our home with Jaime Teevan and Sonia Jaffe

For Microsoft researchers, COVID-19 was a call to action.

The reimagining of work practices had long been an area of study, but existing and new questions that needed immediate answers surfaced as companies and their employees quickly adjusted to significantly different working conditions.

Teams from across the Microsoft organizational chart pooled their unique expertise together under The New Future of Work initiative.

The results have informed product features designed to better support remote work and are now being used to help companies, including Microsoft, usher their workforces into a future of hybrid work.

They also explore how people already working from home helped them better und…

2 месяца, 2 недели назад @ blubrry.com
126 - New Future of Work: Meeting and collaborating in a remote and hybrid world with Jaime Teevan and Abigail Sellen
126 - New Future of Work: Meeting and collaborating in a remote and hybrid world with Jaime Teevan and Abigail Sellen 126 - New Future of Work: Meeting and collaborating in a remote and hybrid world with Jaime Teevan and Abigail Sellen

Teams from across the Microsoft organizational chart pooled their unique expertise together under The New Future of Work initiative.

The results have informed product features designed to better support remote work and are now being used to help companies, including Microsoft, usher their workforces into a future of hybrid work.

In this episode of The New Future of Work series of the podcast, Chief Scientist Jaime Teevan and Abigail Sellen, Deputy Lab Director at Microsoft Research Cambridge in the United Kingdom, explore the dynamics of meetings and collaborations in the context of remote work.

They specifically address the difference between weak and strong ties in our professional networ…

2 месяца, 3 недели назад @ blubrry.com
125 - New Future of Work: Driving innovation via cross-company research with Jaime Teevan and Brent Hecht
125 - New Future of Work: Driving innovation via cross-company research with Jaime Teevan and Brent Hecht 125 - New Future of Work: Driving innovation via cross-company research with Jaime Teevan and Brent Hecht

For Microsoft researchers, COVID-19 was a call to action.

The reimagining of work practices had long been an area of study, but existing and new questions that needed immediate answers surfaced as companies and their employees quickly adjusted to significantly different working conditions.

Teams from across the Microsoft organizational chart pooled their unique expertise together under The New Future of Work initiative.

The results have informed product features designed to better support remote work and are now being used to help companies, including Microsoft, usher their workforces into a future of hybrid work.

They’ll discuss the role of research during times of disruption, the widening…

2 месяца, 3 недели назад @ blubrry.com
124 - Econ4: Uncovering how decision-making shapes individuals and society through behavioral public economics featuring Evan Rose and Hunt Allcott
124 - Econ4: Uncovering how decision-making shapes individuals and society through behavioral public economics featuring Evan Rose and Hunt Allcott 124 - Econ4: Uncovering how decision-making shapes individuals and society through behavioral public economics featuring Evan Rose and Hunt Allcott

In the world of economics, researchers at Microsoft are examining a range of complex systems—from those that impact the technologies we use to those that inform the laws and policies we create—through the lens of a social science that goes beyond the numbers to better understand people and society.

In this episode, Senior Principal Researcher Hunt Allcott talks with Postdoctoral Researcher Evan Rose about Allcott’s work exploring the everyday decisions people face, like buying fuel-efficient cars or taking out payday loans, and how a clearer understanding of these decisions can shape meaningful public policy.

Allcott shares how his and others’ research shows that policy can often have compl…

3 месяца назад @ blubrry.com
123 - Econ3: Understanding the media ecosystem and how it informs public opinion in the internet age featuring Hunt Allcott and David Rothschild
123 - Econ3: Understanding the media ecosystem and how it informs public opinion in the internet age featuring Hunt Allcott and David Rothschild 123 - Econ3: Understanding the media ecosystem and how it informs public opinion in the internet age featuring Hunt Allcott and David Rothschild

Interviewed by Senior Principal Researcher Hunt Allcott, Economist David Rothschild discusses how the news media has evolved alongside social media and the internet, from story development to distribution of news via aggregators and wire services.

Rothschild illuminates how and where people are consuming news and shares some of the strategies he’s seeing news outlets use to appeal to their audiences.

He also covers research insights into media bias, misinformation, and how this knowledge could inform the future of news for the better.

In addition, the researchers talk about Rothschild’s work with Project Ratio, which looks at how the news ecosystem impacts public opinion and political polar…

3 месяца, 1 неделя назад @ blubrry.com
122 - Econ2: Causal machine learning, data interpretability, and online platform markets featuring Hunt Allcott and Greg Lewis
122 - Econ2: Causal machine learning, data interpretability, and online platform markets featuring Hunt Allcott and Greg Lewis 122 - Econ2: Causal machine learning, data interpretability, and online platform markets featuring Hunt Allcott and Greg Lewis

In the world of economics, researchers at Microsoft are examining a range of complex systems—from those that impact the technologies we use to those that inform the laws and policies we create—through the lens of a social science that goes beyond the numbers to better understand people and society.

In this episode, Senior Principal Researcher Dr. Hunt Allcott speaks with Microsoft Research New England office mate and Senior Principal Researcher Dr. Greg Lewis.

Together, they cover the connection between causal machine learning and economics research, the motivations of buyers and sellers on e-commerce platforms, and how ad targeting and data practices could evolve to foster a more symbiotic…

3 месяца, 2 недели назад @ blubrry.com
121 - Econ1: Using microeconomics to solve mass incarceration featuring Hunt Allcott and Evan Rose
121 - Econ1: Using microeconomics to solve mass incarceration featuring Hunt Allcott and Evan Rose 121 - Econ1: Using microeconomics to solve mass incarceration featuring Hunt Allcott and Evan Rose

In the world of economics, researchers at Microsoft are examining a range of complex systems—from those that impact the technologies we use to those that inform the laws and policies we create—through the lens of a social science that goes beyond the numbers to better understand people and society.

In this episode, Dr. Hunt Allcott, Senior Principal Researcher at Microsoft Research New England, talks with Dr. Evan Rose, Postdoctoral Researcher, whom Allcott describes as “one of the most engaging and talented researchers in applied microeconomics today.” They’ll discuss how Rose’s experience teaching adult learners at San Quentin State Prison has resonated throughout his research, and they’l…

4 месяца назад @ blubrry.com
120 - Advancing Excel as a programming language with Andy Gordon and Simon Peyton Jones
120 - Advancing Excel as a programming language with Andy Gordon and Simon Peyton Jones 120 - Advancing Excel as a programming language with Andy Gordon and Simon Peyton Jones

Today, people around the globe—from teachers to small-business owners to finance executives—use Microsoft Excel to make sense of the information that occupies their respective worlds, and whether they realize it or not, in doing so, they’re taking on the role of programmer.

In this episode, Senior Principal Research Manager Andy Gordon, who leads the Calc Intelligence team at Microsoft Research, and Senior Principal Researcher Simon Peyton Jones provide an inside account of the journey Excel has taken as a programming language, including the expansion of data types that has unlocked greater functionality and the release of the LAMBDA function, which makes the Excel formula language Turing-c…

4 месяца, 2 недели назад @ blubrry.com
NLP Highlights NLP Highlights
последний пост 1 месяц назад
131 - Opportunities and Barriers between HCI and NLP, with Nanna Inie and Leon Derczynski
131 - Opportunities and Barriers between HCI and NLP, with Nanna Inie and Leon Derczynski 131 - Opportunities and Barriers between HCI and NLP, with Nanna Inie and Leon Derczynski

What can NLP researchers learn from Human Computer Interaction (HCI) research?

We chatted with Nanna Inie and Leon Derczynski to find out.

We discussed HCI's research processes including methods of inquiry, the data anno…

1 месяц назад @ soundcloud.com
130 - Linking human cognitive patterns to NLP Models, with Lisa Bienborn
130 - Linking human cognitive patterns to NLP Models, with Lisa Bienborn 130 - Linking human cognitive patterns to NLP Models, with Lisa Bienborn

In this episode, we talk with Lisa Beinborn, an assistant professor at Vrije Universiteit Amsterdam, about how to use human cognitive signals to improve and analyze NLP models.

We start by discussing different kinds of c…

1 месяц, 1 неделя назад @ soundcloud.com
129 - Transformers and Hierarchical Structure, with Shunyu Yao
129 - Transformers and Hierarchical Structure, with Shunyu Yao 129 - Transformers and Hierarchical Structure, with Shunyu Yao

In this episode, we talk to Shunyu Yao about recent insights into how transformers can represent hierarchical structure in language.

Bounded-depth hierarchical structure is thought to be a key feature of natural language…

2 месяца, 2 недели назад @ soundcloud.com
128 - Dynamic Benchmarking, with Douwe Kiela
128 - Dynamic Benchmarking, with Douwe Kiela 128 - Dynamic Benchmarking, with Douwe Kiela

We discussed adversarial dataset construction and dynamic benchmarking in this episode with Douwe Kiela, a research scientist at Facebook AI Research who has been working on a dynamic benchmarking platform called Dynaben…

3 месяца назад @ soundcloud.com
127 - Masakhane and Participatory Research for African Languages, with Tosin Adewumi and Perez Ogayo
127 - Masakhane and Participatory Research for African Languages, with Tosin Adewumi and Perez Ogayo 127 - Masakhane and Participatory Research for African Languages, with Tosin Adewumi and Perez Ogayo

We invited members of Masakhane, Tosin Adewumi and Perez Ogayo, to talk about their EMNLP Findings paper that discusses why typical research is limited for low-resourced NLP and how participatory research can help.

3 месяца, 1 неделя назад @ soundcloud.com
126 - Optimizing Continuous Prompts for Generation, with Lisa Li
126 - Optimizing Continuous Prompts for Generation, with Lisa Li 126 - Optimizing Continuous Prompts for Generation, with Lisa Li

We invited Lisa Li to talk about her recent work, Prefix-Tuning: Optimizing Continuous Prompts for Generation.

Prefix tuning is a lightweight alternative to finetuning, and the idea is to tune only a fixed-length task-sp…

3 месяца, 4 недели назад @ soundcloud.com
125 - VQA for Real Users, with Danna Gurari
125 - VQA for Real Users, with Danna Gurari 125 - VQA for Real Users, with Danna Gurari

How can we build Visual Question Answering systems for real users?

For this episode, we chatted with Danna Gurari, about her work in building datasets and models towards VQA for people who are blind.

We talked about the …

4 месяца, 2 недели назад @ soundcloud.com
124 - Semantic Machines and Task-Oriented Dialog, with Jayant Krishnamurthy and Hao Fang
124 - Semantic Machines and Task-Oriented Dialog, with Jayant Krishnamurthy and Hao Fang 124 - Semantic Machines and Task-Oriented Dialog, with Jayant Krishnamurthy and Hao Fang

We use cookies for various purposes including analytics and personalized marketing.

By continuing to use the service, you agree to our use of cookies as described in the Cookie Policy

5 месяцев, 1 неделя назад @ soundcloud.com
123 - Robust NLP, with Robin Jia
123 - Robust NLP, with Robin Jia 123 - Robust NLP, with Robin Jia

We use cookies for various purposes including analytics and personalized marketing.

By continuing to use the service, you agree to our use of cookies as described in the Cookie Policy

5 месяцев, 2 недели назад @ soundcloud.com
Data Skeptic Data Skeptic
последний пост 2 часа назад
Applying k-Nearest Neighbors to Time Series
Applying k-Nearest Neighbors to Time Series Applying k-Nearest Neighbors to Time Series

Samya Tajmouati, a PhD student in Data Science at the University of Science of Kenitra, Morocco, joins us today to discuss her work Applying K-Nearest Neighbors to Time Series Forecasting: Two New Approaches.

2 часа назад @ dataskeptic.com
Ultra Long Time Series
Ultra Long Time Series Ultra Long Time Series

Dr. Feng Li, (@f3ngli) is an Associate Professor of Statistics in the School of Statistics and Mathematics at Central University of Finance and Economics in Beijing, China. He joins us today to discuss his work Distributed ARIMA Models for Ultra-long Time Series.

1 неделя назад @ dataskeptic.com
MiniRocket
MiniRocket MiniRocket

Angus Dempster, PhD Student at Monash University in Australia, comes on today to talk about MINIROCKET: A Very Fast (Almost) Deterministic Transform for Time Series Classification, a fast deterministic transform for time series classification. MINIROCKET reformulates ROCKET, gaining a 75x improvement on larger datasets with essentially the same performance. In this episode, we talk about the insights that realized this speedup as well as use cases.

2 недели назад @ dataskeptic.com
ARiMA is not Sufficient
ARiMA is not Sufficient ARiMA is not Sufficient

Chongshou Li, Associate Professor at Southwest Jiaotong University in China, joins us today to talk about his work Why are the ARIMA and SARIMA not Sufficient.

3 недели назад @ dataskeptic.com
Comp Engine
Comp Engine Comp Engine

Ben Fulcher, Senior Lecturer at the School of Physics at the University of Sydney in Australia, comes on today to talk about his project Comp Engine.

4 недели назад @ dataskeptic.com
Detecting Ransomware
Detecting Ransomware Detecting Ransomware

Nitin Pundir, PhD candidate at University Florida and works at the Florida Institute for Cybersecurity Research, comes on today to talk about his work “RanStop: A Hardware-assisted Runtime Crypto-Ransomware Detection Technique.”

1 месяц назад @ dataskeptic.com
GANs in Finance
GANs in Finance GANs in Finance

Florian Eckerli, a recent graduate of Zurich University of Applied Sciences, comes on the show today to discuss his work Generative Adversarial Networks in Finance: An Overview.

1 месяц, 1 неделя назад @ dataskeptic.com
Predicting Urban Land Use
Predicting Urban Land Use Predicting Urban Land Use

Today on the show we have Daniel Omeiza, a doctoral student in the computer science department of the University of Oxford, who joins us to talk about his work Efficient Machine Learning for Large-Scale Urban Land-Use Forecasting in Sub-Saharan Africa.

1 месяц, 2 недели назад @ dataskeptic.com
Opportunities for Skillful Weather Prediction
Opportunities for Skillful Weather Prediction Opportunities for Skillful Weather Prediction

Today on the show we have Elizabeth Barnes, Associate Professor in the department of Atmospheric Science at Colorado State University, who joins us to talk about her work Identifying Opportunities for Skillful Weather Prediction with Interpretable Neural Networks. Find more from the Barnes Research Group on their site. Weather is notoriously difficult to predict. Complex systems are demanding of computational power. Further, the chaotic nature of, well, nature, makes accurate forecasting especially difficult the longer into the future one wants to look. Yet all is not lost! In this interview, we explore the use of machine learning to help identify certain conditions under which the weather …

1 месяц, 3 недели назад @ dataskeptic.com
Predicting Stock Prices
Predicting Stock Prices Predicting Stock Prices

Today on the show we have Andrea Fronzetti Colladon (@iandreafc), currently working at the University of Perugia and inventor of the Semantic Brand Score, joins us to talk about his work studying human communication and social interaction. We discuss the paper Look inside. Predicting Stock Prices by Analyzing an Enterprise Intranet Social Network and Using Word Co-Occurrence Networks.

2 месяца назад @ dataskeptic.com
N-Beats
N-Beats N-Beats

Today on the show we have Boris Oreshkin @boreshkin, a Senior Research Scientist at Unity Technologies, who joins us today to talk about his work N-BEATS: Neural Basis Expansion Analysis for Interpretable Time Series Forecasting.

2 месяца, 1 неделя назад @ dataskeptic.com
Translation Automation
Translation Automation Translation Automation

Today we are back with another episode discussing AI in the work field. AI has, is, and will continue to facilitate the automation of work done by humans. Sometimes this may be an entire role. Other times it may automate a particular part of their role, scaling their effectiveness. Carl Stimson, a Freelance Japanese to English translator, comes on the show to talk about his work in translation and his perspective about how AI will change translation in the future.

2 месяца, 2 недели назад @ dataskeptic.com
Time Series at the Beach
Time Series at the Beach Time Series at the Beach

Shane Ross, Professor of Aerospace and Ocean Engineering at Virginia Tech University, comes on today to talk about his work “Beach-level 24-hour forecasts of Florida red tide-induced respiratory irritation.”

2 месяца, 3 недели назад @ dataskeptic.com
Automatic Identification of Outlier Galaxy Images
Automatic Identification of Outlier Galaxy Images Automatic Identification of Outlier Galaxy Images

Lior Shamir, Associate Professor of Computer Science at Kansas University, joins us today to talk about the recent paper Automatic Identification of Outliers in Hubble Space Telescope Galaxy Images. Follow Lio on Twitter @shamir_lior

3 месяца назад @ dataskeptic.com
Do We Need Deep Learning in Time Series
Do We Need Deep Learning in Time Series Do We Need Deep Learning in Time Series

Shereen Elsayed and Daniela Thyssens, both are PhD Student at Hildesheim University in Germany, come on today to talk about the work “Do We Really Need Deep Learning Models for Time Series Forecasting?”

3 месяца, 1 неделя назад @ dataskeptic.com
Linear Digressions Linear Digressions
последний пост None
SuperDataScience SuperDataScience
последний пост 3 дня, 4 часа назад
SDS 506: Supervised vs Unsupervised Learning
SDS 506: Supervised vs Unsupervised Learning SDS 506: Supervised vs Unsupervised Learning

In this episode, I continue with last week’s theme and discuss the differences between supervised and unsupervised learning.

Additional materials: www.superdatascience.com/506

3 дня, 4 часа назад @ soundcloud.com
SDS 505: From Data Science to Cinema
SDS 505: From Data Science to Cinema SDS 505: From Data Science to Cinema

Hadelin de Ponteves joins us to discuss his latest educational work and how his skills as a data science educator helped him start his career in acting.

In this episode you will learn:• What has Hadelin been up to?

6 дней, 4 часа назад @ soundcloud.com
SDS 504: Classification vs Regression
SDS 504: Classification vs Regression SDS 504: Classification vs Regression

In this episode, I give a quick introduction to subcategories of supervised learning problems.

Additional materials: www.superdatascience.com/504

1 неделя, 3 дня назад @ soundcloud.com
SDS 503: Deep Reinforcement Learning for Robotics
SDS 503: Deep Reinforcement Learning for Robotics SDS 503: Deep Reinforcement Learning for Robotics

Pieter Abbeel joins us to discuss his work as an academic and entrepreneur in the field of AI robotics and what the future of the industry holds.

In this episode you will learn:• How does Pieter do it all?

[5:45]• Pie…

1 неделя, 6 дней назад @ soundcloud.com
SDS 502: Managing Imposter Syndrome
SDS 502: Managing Imposter Syndrome SDS 502: Managing Imposter Syndrome

In this episode, I explore a common issue plaguing people across fields: imposter syndrome.

Additional materials: www.superdatascience.com/502

2 недели, 3 дня назад @ soundcloud.com
SDS 501: Statistical Programming with Friends
SDS 501: Statistical Programming with Friends SDS 501: Statistical Programming with Friends

Jared Lander joins us to discuss his work as an R meetup organizer, the upcoming virtual R Conference, and his work as a consultant for a variety of companies from metal workers to professional football teams.

2 недели, 6 дней назад @ soundcloud.com
SDS 499: Data Meshes and Data Reliability
SDS 499: Data Meshes and Data Reliability SDS 499: Data Meshes and Data Reliability

Barr Moses joins us to discuss the importance of data reliability for pipelines and how companies can achieve data mesh.

In this episode you will learn:• Data meshes [4:25]• Self-serve data reliability [15:36]• How …

3 недели, 6 дней назад @ soundcloud.com
SDS 500: Yoda Nidra with Jes Allen
SDS 500: Yoda Nidra with Jes Allen SDS 500: Yoda Nidra with Jes Allen

In this very special episode, we delve into a live yoga Nidra practice with Jes Allen and go over how you can open up to consciousness through yoga practice.

In this episode you will learn:• [3:40] What Yoga means• [1…

3 недели, 6 дней назад @ soundcloud.com
SDS 498: How Only Beginners Know Everything
SDS 498: How Only Beginners Know Everything SDS 498: How Only Beginners Know Everything

In this episode, I dive into a reoccurring pattern I’ve noticed where beginners, myself included, think they’re more skilled and experienced than they really are.

Additional materials: www.superdatascience.com/498

1 месяц назад @ soundcloud.com
SDS 497: Maximizing the Global Impact of Your Career
SDS 497: Maximizing the Global Impact of Your Career SDS 497: Maximizing the Global Impact of Your Career

Benjamin Todd joins us to discuss his work helping professionals maximize their career capital, the top skills to learn across professions, and more.

In this episode you will learn:• How Benjamin helped me become a dat…

1 месяц назад @ soundcloud.com
SDS 496: 2040: A Brain-Computer Interface Story
SDS 496: 2040: A Brain-Computer Interface Story SDS 496: 2040: A Brain-Computer Interface Story

In this episode, you’ll enjoy a fictional narrative I’ve titled “2040: A Brain-Computer Interface Story”. Additional materials: www.superdatascience.com/496

1 месяц, 1 неделя назад @ soundcloud.com
SDS 495: Successful AI Projects and AI Startups
SDS 495: Successful AI Projects and AI Startups SDS 495: Successful AI Projects and AI Startups

Greg Coquillo joins us to discuss his work on ROI for startups and the best ways to make the most of your company’s AI investment.

In this episode you will learn:• Our connection through Harpreet’s happy hours and DSGO…

1 месяц, 1 неделя назад @ soundcloud.com
SDS 494: How to Instantly Appreciate Being Alive
SDS 494: How to Instantly Appreciate Being Alive SDS 494: How to Instantly Appreciate Being Alive

In this episode, I talk about an interesting thought experiment that helps you appreciate your existence.

Additional materials: www.superdatascience.com/494

1 месяц, 2 недели назад @ soundcloud.com
SDS 493: Bringing Data to the People
SDS 493: Bringing Data to the People SDS 493: Bringing Data to the People

Anjali Shrivastava joins us to discuss her data science degree and her content creation efforts to bring data science to the people.

In this episode you will learn:• Anjali’s studies [2:00]• Anjali’s YouTube channel […

1 месяц, 2 недели назад @ soundcloud.com
SDS 492: The World is Awful (and it’'s Never Been Better)
SDS 492: The World is Awful (and it’'s Never Been Better) SDS 492: The World is Awful (and it’'s Never Been Better)

In this episode, I discuss the changing child mortality rate as evidence of how much better the world is and how much better it could be.

Additional materials: www.superdatascience.com/492

1 месяц, 3 недели назад @ soundcloud.com
Data Science at Home Data Science at Home
последний пост 1 месяц назад
Reinforcement Learning is all you need. Or is it? (Ep. 165)
Reinforcement Learning is all you need. Or is it? (Ep. 165) Reinforcement Learning is all you need. Or is it? (Ep. 165)

August 18, 2021 podcastIs reinforcement learning sufficient to build truly intelligent machines?

Our SponsorsQuantum MetricStay off the naughty list this holiday season by reducing customer friction, increasing conversions, and personalizing the shopping experience.

Visit us at quantummetric.com/podoffer and see if you qualify to receive our “12 Days of Insights” offer with code DATASCIENCE.

Amethix TechnologiesAmethix use advanced Artificial Intelligence and Machine Learning to build data platforms and predictive engines in domain like finance, healthcare, pharmaceuticals, logistics, energy.

Amethix provide solutions to collect and secure data with higher transparency and disintermediation…

1 месяц назад @ datascienceathome.com
What’s happening with AI today? (Ep. 164)
What’s happening with AI today? (Ep. 164) What’s happening with AI today? (Ep. 164)

August 18, 2021 podcastIn this episode I have a wonderful chat with Ronald Schmelzer and Kathleen Walch, authors of “AI Today” the top podcast for those wanting a no-hype, practical, real-world insight into what enterprises, public sector agencies, thought leaders, leading technology companies, pundits, and experts are doing with AI today.

Sponsored byQuantum Metric Did you know that 2021 holiday ecommerce sales are expected to exceed 2020 benchmarks?Are you prepared to capture every customer revenue opportunity?

Visit their website at quantummetric.com/podoffer and see if you qualify to receive their “12 Days of Insights” offer with code DATASCIENCE.

Sponsored by Amethix TechnologiesAmethi…

1 месяц назад @ datascienceathome.com
2 effective ways to explain your predictions (Ep. 163)
2 effective ways to explain your predictions (Ep. 163) 2 effective ways to explain your predictions (Ep. 163)

We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits.

By clicking “Accept”, you consent to the use of ALL the cookies.

1 месяц, 2 недели назад @ datascienceathome.com
The Netflix challenge. Fair or what? (Ep. 162)
The Netflix challenge. Fair or what? (Ep. 162)

Remember the Netflix challenge? It was a ton of money for the one who would have cracked the problem of recommending the best possible movie. Was it a fair challenge? […]

The post The Netflix challenge. Fair or what? (Ep. 162) appeared first on Podcast Data science at home.

1 месяц, 2 недели назад @ datascienceathome.com
Artificial Intelligence for Blockchains with Jonathan Ward CTO of Fetch AI (Ep. 161)
Artificial Intelligence for Blockchains with Jonathan Ward CTO of Fetch AI (Ep. 161)

In this episode Fetch AI CTO Jonathan Ward speaks about decentralization, AI, blockchain for smart cities and the enterprise.Below some great links about collective learning, smart contracts in Rust and […]

The post Artificial Intelligence for Blockchains with Jonathan Ward CTO of Fetch AI (Ep. 161) appeared first on Podcast Data science at home.

1 месяц, 2 недели назад @ datascienceathome.com
Apache Arrow, Ballista and Big Data in Rust with Andy Grove RB (Ep. 160)
Apache Arrow, Ballista and Big Data in Rust with Andy Grove RB (Ep. 160) Apache Arrow, Ballista and Big Data in Rust with Andy Grove RB (Ep. 160)

August 4, 2021 podcastDo you want to know the latest in big data analytics frameworks?

Have you ever heard of Apache Arrow?

In this episode I speak with Andy Grove one of the main authors of Apache Arrow and Ballista compute engine.

Andy explains some challenges while he was designing the Arrow and Ballista memory models and he describes some amazing solutions.

Amethix use advanced Artificial Intelligence and Machine Learning to build data platforms and predictive engines in domain like finance, healthcare, pharmaceuticals, logistics, energy.

1 месяц, 2 недели назад @ datascienceathome.com
GitHub Copilot: yay or nay? (Ep. 159)
GitHub Copilot: yay or nay? (Ep. 159) GitHub Copilot: yay or nay? (Ep. 159)

July 6, 2021 podcastIt made already quite some noise in the news, GitHub copilot promises to be your pair programmer for life.

In this episode I explain how and what GitHub copilot does.

Should developers be happy, scared or just keep coding the traditional way?

SponsorsGet one of the best VPN at a massive discount with coupon code DATASCIENCE.

It provides you with an 83% discount which unlocks the best price in the market plus 3 extra months for free.

2 месяца, 2 недели назад @ datascienceathome.com
A simple trick for very unbalanced data (Ep. 157)
A simple trick for very unbalanced data (Ep. 157) A simple trick for very unbalanced data (Ep. 157)

June 22, 2021 podcastData from the real world are never perfectly balanced.

In this episode I explain a simple yet effective trick to train models with very unbalanced data.

SponsorsGet one of the best VPN at a massive discount with coupon code DATASCIENCE.

It provides you with an 83% discount which unlocks the best price in the market plus 3 extra months for free.

Here is the link https://surfshark.deals/DATASCIENCEReferences

3 месяца назад @ datascienceathome.com
Time to take your data back with Tapmydata (Ep. 156)
Time to take your data back with Tapmydata (Ep. 156) Time to take your data back with Tapmydata (Ep. 156)

We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits.

By clicking “Accept”, you consent to the use of ALL the cookies.

3 месяца назад @ datascienceathome.com
True Machine Intelligence just like the human brain (Ep. 155)
True Machine Intelligence just like the human brain (Ep. 155) True Machine Intelligence just like the human brain (Ep. 155)

June 10, 2021 podcastIn this episode I have a really interesting conversation with Karan Grewal, member of the research staff at Numenta where he investigates how biological principles of intelligence can be translated into silicon.

We speak about the thousand brains theory and why neural networks forget.

3 месяца, 1 неделя назад @ datascienceathome.com
Delivering unstoppable data with Streamr (Ep. 154)
Delivering unstoppable data with Streamr (Ep. 154) Delivering unstoppable data with Streamr (Ep. 154)

May 26, 2021 podcastDelivering unstoppable data to unstoppable apps is now possible with Streamr NetworkStreamr is a layer zero protocol for real-time data which powers the decentralized Streamr pub/sub network.

The technology works in tandem with companion blockchains – currently Ethereum and xDai chain – which are used for identity, security and payments.

On top is the application layer, including the Data Union framework, Marketplace and Core, and all third party applications.

In this episode I have a very interesting conversation with Streamr founder and CEO Henri PihkalaReferences

3 месяца, 3 недели назад @ datascienceathome.com
MLOps: the good, the bad and the ugly (Ep. 153)
MLOps: the good, the bad and the ugly (Ep. 153) MLOps: the good, the bad and the ugly (Ep. 153)

We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits.

By clicking “Accept”, you consent to the use of ALL the cookies.

3 месяца, 3 недели назад @ datascienceathome.com
MLOps: what is and why it is important Part 2 (Ep. 152)
MLOps: what is and why it is important Part 2 (Ep. 152) MLOps: what is and why it is important Part 2 (Ep. 152)

We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits.

By clicking “Accept”, you consent to the use of ALL the cookies.

3 месяца, 4 недели назад @ datascienceathome.com
MLOps: what is and why it is important (Ep. 151)
MLOps: what is and why it is important (Ep. 151) MLOps: what is and why it is important (Ep. 151)

May 11, 2021 podcastIf you think that knowing Tensorflow and Scikit-learn is enough, think again.

What is MLOps and why is it important?

It’s a podcast for techies by techies.

Amethix use advanced Artificial Intelligence and Machine Learning to build data platforms and predictive engines in domain like finance, healthcare, pharmaceuticals, logistics, energy.

Amethix provide solutions to collect and secure data with higher transparency and disintermediation, and build the statistical models that will support your business.

4 месяца, 1 неделя назад @ datascienceathome.com
Can I get paid for my data? With Mike Andi from Mytiki (Ep. 150)
Can I get paid for my data? With Mike Andi from Mytiki (Ep. 150) Can I get paid for my data? With Mike Andi from Mytiki (Ep. 150)

April 28, 2021 podcastYour data is worth thousands a year.

Why aren’t you getting your fair share?

There is a company that has a mission: they want you to take back control and get paid for your data.

In this episode I speak about knowledge graphs, data confidentiality and privacy with Mike Audi, CEO of MyTiki.

You can reach them on their website https://mytiki.com/Discord official channelhttps://discord.com/invite/evjYQq48BeTelegramhttps://t.me/mytikiappSignalhttps://signal.group/#CjQKIA66Eq2VHecpcCd-cu-dziozMRSH3EuQdcZJNyMOYNi5EhC0coWtjWzKQ1dDKEjMqhkP

4 месяца, 3 недели назад @ datascienceathome.com