RedCode

We are still packaging the code so that it can be deployed off the shelf.

Repository Structure

dataset

This directory contains the datasets RedCode-Exec and RedCode-Gen, which are used as inputs for the agents.

environment

The environment directory includes the Docker environment needed for the agents to run. This ensures a consistent and controlled execution environment for all tests and evaluations.

evaluation

The evaluation directory contains subdirectories for the evaluation of three types of agents:

CA-evaluation: Evaluation scripts and resources for CodeAct agents.
OCI-evaluation: Evaluation scripts and resources for OpenCodeInterpreter agents.
RA-evaluation: Evaluation scripts and resources for ReAct agents.

Additionally, the directory contains evaluation.py, which serves as the evaluation script for each risky scenario.

result

The result directory stores the results of the evaluations.

scripts

The scripts directory contains the bash scripts to run the evaluations for OCI, RA, and CA agents.
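As a quick sanity check after cloning, the layout described above can be verified with a short script. This is only a sketch: the helper name missing_entries is hypothetical and not part of the repository, and it assumes that the three agent-specific folders sit directly under evaluation/ with exactly the names listed above.

```python
from pathlib import Path

# Top-level directories named in this README.
EXPECTED_DIRS = ["dataset", "environment", "evaluation", "result", "scripts"]

# Agent-specific subdirectories of evaluation/ and the agent each one targets.
# Assumption: they live directly under evaluation/ with these exact names.
EVALUATION_SUBDIRS = {
    "CA-evaluation": "CodeAct",
    "OCI-evaluation": "OpenCodeInterpreter",
    "RA-evaluation": "ReAct",
}


def missing_entries(repo_root):
    """Return the expected directories that are absent under repo_root."""
    root = Path(repo_root)
    expected = list(EXPECTED_DIRS)
    expected += [f"evaluation/{name}" for name in EVALUATION_SUBDIRS]
    return [rel for rel in expected if not (root / rel).is_dir()]


if __name__ == "__main__":
    # Assumes the script is run from the repository root.
    missing = missing_entries(".")
    print("missing:", missing if missing else "none")
```

An empty result means every directory mentioned in this section was found. It says nothing about whether the datasets or the Docker environment inside them are complete.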

Environment Setup

conda env create -f environment.yml
conda activate redcode

Usage

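Run the script for the agent type you want to evaluate from the repository root, for example for the OCI agents:

./scripts/OCI_eval.sh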
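If the evaluation needs to be launched from Python rather than from a shell, a thin wrapper along these lines may be enough. It is a sketch under assumptions: run_eval is a hypothetical helper, not something the repository provides, and only the OCI script path is taken from this README. The file names of the RA and CA scripts are not stated here, so they would have to be passed in explicitly.

```python
import subprocess
from pathlib import Path


def run_eval(script="scripts/OCI_eval.sh", repo_root="."):
    """Run an evaluation script from repo_root and return its exit code."""
    root = Path(repo_root).resolve()
    path = root / script
    if not path.is_file():
        raise FileNotFoundError(f"evaluation script not found: {path}")
    # Invoke through bash so the script need not be marked executable.
    # The working directory is the repository root, matching the usage above.
    completed = subprocess.run(["bash", str(path)], cwd=root)
    return completed.returncode


if __name__ == "__main__":
    # Assumes the redcode conda environment is already active.
    raise SystemExit(run_eval())
```

Returning the exit code instead of raising on failure leaves it to the caller to decide whether a non-zero status should stop a larger batch of runs.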

AI-secure/RedCode
