Skip to content
View huangshiyu13's full-sized avatar
:octocat:
Coding
:octocat:
Coding

Organizations

@THUDM @TARTRL @OpenRL-Lab

Block or report huangshiyu13

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
huangshiyu13/README.md

OpenRL | 知乎 | Google Scholar | Linkedin | Personal Website

  • Hi, I am a researcher in Zhipu AI. Before that, I was a research scientist in 4Paradigm Inc. and the leader of OpenRL Lab. I received my B.E. and Ph. D. degrees (co-advised by Prof. Jun Zhu and Prof. Ting Chen) from the Department of Computer Science and Technology, Tsinghua University in July, 2017 and June, 2022. My researches focus on deep reinforcement learning, multi-agent reinforcement learning, distributed reinforcement learning, RL for robotics, LLM as agent, artificial general intelligence (AGI) and generative artificial intelligence (GAI). I have also spent time working at RealAI Inc. , Huawei Noah's Ark Lab, Tencent AI Lab, Carnegie Mellon University and Sensetime Inc. . And I am also the founder of the OpenRL Lab and TARTRL group.
  • We are looking for self-motivated interns and full-timers who have a strong background in mathematics/computer science and are eager to get involved in cutting-edge, fundamental AI research. Please feel free to drop me an email if you are interested in collaborating with me.
  • 📫 Email: [email protected]

Pinned Loading

  1. THUDM/CogVideo THUDM/CogVideo Public

    text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

    Python 9.5k 891

  2. THUDM/CogVLM2 THUDM/CogVLM2 Public

    GPT4V-level open-source multi-modal model based on Llama3-8B

    Python 2.1k 145

  3. OpenRL-Lab/openrl OpenRL-Lab/openrl Public

    Unified Reinforcement Learning Framework

    Python 644 62

  4. OpenRL-Lab/Wandb_Tutorial OpenRL-Lab/Wandb_Tutorial Public

    How to use wandb?

    Python 597 49

  5. RPNplus RPNplus Public

    RPN+(Tensorflow) for people detection

    Python 181 85

  6. THUDM/LVBench THUDM/LVBench Public

    LVBench: An Extreme Long Video Understanding Benchmark

    Python 62 1