John Schulman

43
John Schulman
Professor OpenAI | UC Berkeley
36 YEARS OLD
John Schulman is a prominent research scientist and co-founder of OpenAI. He currently leads the reinforcement learning (RL) team, which focuses on enhancing language models like GPT using RL algorithms, which involve learning through trial and error. John has significantly contributed to machine learning and AI, particularly in developing reinforcement learning algorithms and their applications in various domains.

John received his Ph.D. in Computer Science from the University of California, Berkeley, where he was advised by the renowned roboticist Pieter Abbeel. At UC Berkeley, he worked on robotics and developed innovative techniques for enabling robots to tie knots and plan movement using trajectory optimization. His research in robotics has been widely recognized and has opened up new avenues for machine learning applications in robotics.

Before delving into reinforcement learning, John briefly worked in neuroscience at Berkeley, exploring the neural mechanisms underlying perception and cognition. However, he soon shifted his focus to machine learning, where he saw the potential to develop algorithms that could significantly impact the world. John's earlier academic training was in physics, and he completed his undergraduate studies at the California Institute of Technology.

John has published numerous research papers and articles in top-tier conferences and journals, and he is a sought-after speaker and collaborator in the machine learning community. His work has been widely recognized and received several prestigious awards, including the ACM Dissertation Award and the Sloan Research Fellowship. John also promotes diversity and inclusion in AI and machine learning.
Memorable Quotations2
Certain software skills are exceptionally useful for machine learning. In a previous era, it was GPU programming. Now in the era of pretrained models, it's front-end development -- to quickly whip up a UI to collect a fine-tuning or eval dataset.
Handy trick: if you say something dumb, follow with that was just a temperature=1 sample, don't take it seriously
Notable Awards
MIT Technology Review's 35 Innovators Under 35. – 2018
Summary of recent tweets

John Schulman has been tweeting about a variety of topics lately. In one tweet, he shares a short story by Isaac Asimov called "Someday" which features a language model called Bard. He also mentions fine-tuning the language model on recent data. Another tweet discusses the useful software skills for machine learning, highlighting the importance of front-end development in the era of pretrained models.

John Schulman also shares his excitement about his work on overoptimization of reward models and provides a link to a paper on this topic. He mentions getting access to @Cruise driverless ride service and being impressed with its performance. Additionally, he retweets an episode featuring OpenAI co-founder and inventor of PPO/TRPO, where they discuss RL from human feedback and AI alignment.

In terms of new trends in AI, John Schulman's tweets mention pretrained models, fine-tuning language models, RL from human feedback, and AGI timelines. These topics suggest that he is interested in advancements related to language models, reinforcement learning, and artificial general intelligence.

Overall, based on sentiment analysis of John Schulman's recent tweets, it is difficult to determine whether he is positive or negative about the way AI is going. His tweets cover various topics without expressing explicit positive or negative sentiments towards AI as a whole.

SOME AI BOOK RECOMMENDATIONS

John Schulman hasn't written a book yet or we didn't find any ISBN number for their book(s). However, here are some popular books in AI:

Videos Featuring Professor John Schulman
John Schulman - Proxy Objectives in Reinforcement Learning from Human Feedback | ICML 2023

John Schulman - Proxy Objectives in Reinforcement Learning from Human Feedback | ICML 2023

S3 E18 John Schulman of OpenAI on ChatGPT: invention, capabilities and limitations

S3 E18 John Schulman of OpenAI on ChatGPT: invention, capabilities and limitations

Carl Shulman (Pt 1) - Intelligence Explosion, Primate Evolution, Robot Doublings, & Alignment

Carl Shulman (Pt 1) - Intelligence Explosion, Primate Evolution, Robot Doublings, & Alignment

John Schulman - Reinforcement Learning from Human Feedback: Progress and Challenges

John Schulman - Reinforcement Learning from Human Feedback: Progress and Challenges

ChatGPT, LLMs, and AI — #29

ChatGPT, LLMs, and AI — #29

John Schulman: OpenAI and recent advances in Artificial Intelligence - #16

John Schulman: OpenAI and recent advances in Artificial Intelligence - #16

7. Deep Reinforcement Learning John Schulman, OpenAI

7. Deep Reinforcement Learning John Schulman, OpenAI

Deep RL Bootcamp  Lecture 6: Nuts and Bolts of Deep RL Experimentation

Deep RL Bootcamp Lecture 6: Nuts and Bolts of Deep RL Experimentation

Deep Reinforcement Learning (John Schulman, OpenAI)

Deep Reinforcement Learning (John Schulman, OpenAI)

John Schulman 1: Deep Reinforcement Learning

John Schulman 1: Deep Reinforcement Learning

Twitter Timeline of Professor John Schulman