Skip to content
View quanshr's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report quanshr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. QwenLM/Self-Lengthen QwenLM/Self-Lengthen Public

    Python 63 5

  2. AugCon AugCon Public

    Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity

    Python 15

  3. DMoERM DMoERM Public

    [ACL2024 Findings]DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling

    Python 15

  4. DPOOJ/dpooj DPOOJ/dpooj Public

    Data Points Oriented Online Judge system for OO course

    Python 36

  5. KbsdJames/Awesome-LLM-Preference-Learning KbsdJames/Awesome-LLM-Preference-Learning Public

    The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"

    161 2