10/04 às 04:00
06/04 às 04:00
We’ve developed an unsupervised system which learns an excellent representation of sentiment, despite being trained only to predict the next character in the text of Amazon reviews.
01/04 às 04:00
We’ve created the world’s first Spam-detecting AI trained entirely in simulation and deployed on a physical robot.
24/03 às 04:00
We’ve discovered that evolution strategies (ES), an optimization technique that’s been known for decades, rivals the performance of standard reinforcement learning (RL) techniques on modern RL benchmarks (e.g. Atari/MuJoCo), while overcoming many of RL’s inconveniences.
21/03 às 04:00
20/03 às 04:00
We’re excited to support today’s launch of Distill, a new kind of journal aimed at excellent communication of machine learning results (novel or existing).
16/03 às 04:00
In this post we’ll outline new OpenAI research in which agents develop their own language.
15/03 às 04:00
12/03 às 05:00
06/03 às 05:00
24/02 às 05:00
Adversarial examples are inputs to machine learning models that an attacker has intentionally designed to cause the model to make a mistake; they’re like optical illusions for machines. In this post we’ll show how adversarial examples work across different mediums, and will discuss why securing systems against them can be difficult.
08/02 às 06:00
30/01 às 06:00
The OpenAI team is now 45 people. Together, we’re pushing the frontier of AI capabilities—whether by validating novel ideas, creating new software systems, or deploying machine learning on robots.
19/01 às 06:00
21/12 às 06:00
Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode, which is where you misspecify your reward function.
05/12 às 06:00
We’re releasing Universe, a software platform for measuring and training an AI’s general intelligence across the world’s supply of games, websites and other applications.
15/11 às 06:00
We’re working with Microsoft to start running most of our large-scale experiments on Azure.
15/11 às 06:00
14/11 às 06:00
11/11 às 06:00