mail-ecnu

All

32 repositories

Text-Gym-Agents
Public
This project provides a set of translators to convert OpenAI Gym environments into text-based environments. It is designed to investigate the capabilities of large language models in decision-making tasks within these text-based environments.
Python
•
Apache License 2.0
•2•15•3•0•Updated May 29, 2024May 29, 2024
AAMAS2024-RL4OR.github.io
Public
http://mail-ecnu.cn/AAMAS2024-RL4OR.github.io/
JavaScript
•13•0•0•0•Updated May 6, 2024May 6, 2024
funsearch-L2O
Public
Simple working implementation for google-deepmind FunSearch algorithm
Python
•
Apache License 2.0
•131•0•0•0•Updated Apr 18, 2024Apr 18, 2024
mail-ecnu.github.io
Public
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
JavaScript
•
MIT License
•44k•1•0•0•Updated Jan 9, 2024Jan 9, 2024
ChatDev
Public
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Python
•
Apache License 2.0
•3.2k•0•0•0•Updated Oct 15, 2023Oct 15, 2023
VMAgent
Public
Our VMAgent is a platform for exploiting Reinforcement Learning (RL) on Virtual Machine (VM) scheduling tasks.
scheduling scheduling-simulator rl-algorithms reinforcement-learning
Python
•
MIT License
•12•85•2•1•Updated Apr 27, 2023Apr 27, 2023
graspnet-aroundview
Public
Baseline model for "GraspNet-1Billion: A Large-Scale Benchmark for General Object Grasping" (CVPR 2020)
Python
•
Other
•152•1•0•0•Updated May 13, 2022May 13, 2022
MAMT
Public
Code for "Dealing with Non-Stationarity in MARL via Trust Region Decomposition" ICLR 2022
Python
•
MIT License
•173•2•0•0•Updated Feb 13, 2022Feb 13, 2022
OTDyadicFair
Public
Obtaining Dyadic Fairness by Optimal Transport
Python
•
MIT License
•0•3•0•0•Updated Feb 9, 2022Feb 9, 2022
PICO
Public
An algorithm for exploiting Reinforcement Learning (RL) on Multi-agent Path Finding tasks.
Python
•
MIT License
•14•54•4•0•Updated Feb 9, 2022Feb 9, 2022
sequential_social_dilemma_games
Public
Repo for reproduction of sequential social dilemmas
Python
•
MIT License
•132•0•0•0•Updated Nov 25, 2021Nov 25, 2021
example_playground
Public
Python
•4•2•0•4•Updated Jul 6, 2021Jul 6, 2021
MAS
Public
Jupyter Notebook
•3•10•0•0•Updated May 12, 2021May 12, 2021
chemopt
Public
Optimizing Chemical Reactions with Deep Reinforcement Learning
Python
•
MIT License
•43•0•0•0•Updated Mar 3, 2021Mar 3, 2021
server-scripts
Public
Shell
•0•0•0•0•Updated Jan 8, 2021Jan 8, 2021
init_ubuntu18.04
Public
Shell
•0•0•1•0•Updated Jan 6, 2021Jan 6, 2021
BCQ
Public
Author's PyTorch implementation of BCQ for continuous and discrete actions
Python
•
MIT License
•139•0•0•0•Updated Dec 9, 2020Dec 9, 2020
gym-minigrid
Public
Minimalistic gridworld package for OpenAI Gym
Python
•
Apache License 2.0
•611•0•0•0•Updated Dec 8, 2020Dec 8, 2020
GroupMeeting
Public
slices in group meetings
Python
•5•13•0•0•Updated Nov 29, 2020Nov 29, 2020
multiagent-particle-envs
Public
Python
•
Other
•790•0•0•0•Updated Nov 19, 2020Nov 19, 2020
mailrl
Public
0•0•0•0•Updated Sep 25, 2020Sep 25, 2020
YouWannaPower
Public
你渴望力量吗年轻人
1•10•0•0•Updated Sep 8, 2020Sep 8, 2020
kddcup
Public
Python
•0•0•0•0•Updated Jul 8, 2020Jul 8, 2020
wechat_jump
Public
use Mechanical arm and JAI Camera to auto jump at wechat
Python
•4•0•0•0•Updated Jun 6, 2020Jun 6, 2020
KDDCUP2020
Public
0•0•0•0•Updated May 8, 2020May 8, 2020
cs231n
Public
0•0•0•0•Updated May 20, 2019May 20, 2019
Paper-collections-of-Deep-Multi-Agent-Reinforcement-Learning
Public
Paper-collections-of-Deep-Multi-Agent-Reinforcement-Learning
2•6•0•0•Updated Apr 3, 2019Apr 3, 2019
ICLR2019-RL-Papers
Public
The Reinforcement-Learning-Related Papers of ICLR 2019
12•0•0•0•Updated Mar 20, 2019Mar 20, 2019
RL-Resources
Public
0•0•0•0•Updated Mar 18, 2019Mar 18, 2019
Reinforcement-Learning-and-Optimal-Control
Public
25•66•1•0•Updated Mar 4, 2019Mar 4, 2019