Skip to content
Change the repository type filter

All

    Repositories list

    • This project provides a set of translators to convert OpenAI Gym environments into text-based environments. It is designed to investigate the capabilities of large language models in decision-making tasks within these text-based environments.
      Python
      Apache License 2.0
      21530Updated May 29, 2024May 29, 2024
    • JavaScript
      13000Updated May 6, 2024May 6, 2024
    • Simple working implementation for google-deepmind FunSearch algorithm
      Python
      Apache License 2.0
      131000Updated Apr 18, 2024Apr 18, 2024
    • Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
      JavaScript
      MIT License
      44k100Updated Jan 9, 2024Jan 9, 2024
    • ChatDev

      Public
      Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
      Python
      Apache License 2.0
      3.2k000Updated Oct 15, 2023Oct 15, 2023
    • VMAgent

      Public
      Our VMAgent is a platform for exploiting Reinforcement Learning (RL) on Virtual Machine (VM) scheduling tasks.
      Python
      MIT License
      128521Updated Apr 27, 2023Apr 27, 2023
    • Baseline model for "GraspNet-1Billion: A Large-Scale Benchmark for General Object Grasping" (CVPR 2020)
      Python
      Other
      152100Updated May 13, 2022May 13, 2022
    • MAMT

      Public
      Code for "Dealing with Non-Stationarity in MARL via Trust Region Decomposition" ICLR 2022
      Python
      MIT License
      173200Updated Feb 13, 2022Feb 13, 2022
    • Obtaining Dyadic Fairness by Optimal Transport
      Python
      MIT License
      0300Updated Feb 9, 2022Feb 9, 2022
    • PICO

      Public
      An algorithm for exploiting Reinforcement Learning (RL) on Multi-agent Path Finding tasks.
      Python
      MIT License
      145440Updated Feb 9, 2022Feb 9, 2022
    • Repo for reproduction of sequential social dilemmas
      Python
      MIT License
      132000Updated Nov 25, 2021Nov 25, 2021
    • Python
      4204Updated Jul 6, 2021Jul 6, 2021
    • MAS

      Public
      Jupyter Notebook
      31000Updated May 12, 2021May 12, 2021
    • chemopt

      Public
      Optimizing Chemical Reactions with Deep Reinforcement Learning
      Python
      MIT License
      43000Updated Mar 3, 2021Mar 3, 2021
    • Shell
      0000Updated Jan 8, 2021Jan 8, 2021
    • Shell
      0010Updated Jan 6, 2021Jan 6, 2021
    • BCQ

      Public
      Author's PyTorch implementation of BCQ for continuous and discrete actions
      Python
      MIT License
      139000Updated Dec 9, 2020Dec 9, 2020
    • Minimalistic gridworld package for OpenAI Gym
      Python
      Apache License 2.0
      611000Updated Dec 8, 2020Dec 8, 2020
    • slices in group meetings
      Python
      51300Updated Nov 29, 2020Nov 29, 2020
    • Python
      Other
      790000Updated Nov 19, 2020Nov 19, 2020
    • mailrl

      Public
      0000Updated Sep 25, 2020Sep 25, 2020
    • 你渴望力量吗年轻人
      11000Updated Sep 8, 2020Sep 8, 2020
    • kddcup

      Public
      Python
      0000Updated Jul 8, 2020Jul 8, 2020
    • use Mechanical arm and JAI Camera to auto jump at wechat
      Python
      4000Updated Jun 6, 2020Jun 6, 2020
    • 0000Updated May 8, 2020May 8, 2020
    • cs231n

      Public
      0000Updated May 20, 2019May 20, 2019
    • Paper-collections-of-Deep-Multi-Agent-Reinforcement-Learning
      2600Updated Apr 3, 2019Apr 3, 2019
    • The Reinforcement-Learning-Related Papers of ICLR 2019
      12000Updated Mar 20, 2019Mar 20, 2019
    • 0000Updated Mar 18, 2019Mar 18, 2019
    • 256610Updated Mar 4, 2019Mar 4, 2019