    Repositories list

    • ambignlg

      Public
      🐶 Data for AmbigNLG: Addressing Task Ambiguity in Instruction for NLG (Ayana Niwa and Hayate Iso; EMNLP 2024)
      Python
      Other
      0300 Updated Oct 23, 2024
    • 2410 Updated Oct 17, 2024
    • holobench

      Public
      🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.; Oct 2024)
      Python
      BSD 3-Clause "New" or "Revised" License
      0400 Updated Oct 17, 2024
    • Ongoing research training transformer models at scale
      Python
      Other
      2.4k000 Updated Sep 27, 2024
    • TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
      C++
      Apache License 2.0
      1k000 Updated Sep 1, 2024
    • starmie

      Public
      Resources for PVLDB 2023 submission
      Python
      62330 Updated Aug 28, 2024
    • OpinionDigest: A Simple Framework for Opinion Summarization (ACL 2020)
      Python
      Apache License 2.0
      95633 Updated Aug 20, 2024
    • napa

      Public
      🍷 Code for Noisy Pairing and Partial Supervision for Stylized Opinion Summarization (Iso et al; INLG 2024)
      Python
      0100 Updated Aug 12, 2024
    • 🧩 Code for AutoTemplate: A Simple Recipe for Lexically Constrained Text Generation (Iso; INLG 2024)
      Python
      BSD 3-Clause "New" or "Revised" License
      0000 Updated Aug 12, 2024
    • Python
      BSD 3-Clause "New" or "Revised" License
      0720 Updated Aug 2, 2024
    • witqa

      Public
      Other
      0500 Updated Jul 22, 2024
    • CMDBench

      Public
      Data and Code for CMDBench experiments
      Python
      Apache License 2.0
      0400 Updated Jun 19, 2024
    • watchog

      Public
      The code for SIGMOD 2024 paper titled "Watchog: A Light-weight Contrastive Learning based Framework for Column Annotation"
      Python
      1310 Updated Jun 17, 2024
    • Python
      BSD 3-Clause "New" or "Revised" License
      1000 Updated May 22, 2024
    • JavaScript
      BSD 3-Clause "New" or "Revised" License
      0000 Updated Apr 18, 2024
    • ditto

      Public
      Code for the paper "Deep Entity Matching with Pre-trained Language Models"
      Python
      Apache License 2.0
      89262181 Updated Apr 17, 2024
    • ginza

      Public
      A Japanese NLP Library using spaCy as framework based on Universal Dependencies
      Python
      MIT License
      57757102 Updated Mar 30, 2024
    • MCR

      Public
      0100 Updated Mar 30, 2024
    • bunkai

      Public
      Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)
      Python
      Apache License 2.0
      11186216 Updated Mar 26, 2024
    • sato

      Public
      Code and data for Sato https://arxiv.org/abs/1911.06311.
      Python
      Apache License 2.0
      401081111 Updated Feb 23, 2024
    • 💵 Code for Less is More for Long Document Summary Evaluation by LLMs (Wu, Iso et al; EACL 2024)
      Python
      BSD 3-Clause "New" or "Revised" License
      0620 Updated Feb 22, 2024
    • xatu

      Public
      🕊️ Code and Data for XATU: A Fine-grained Instruction-based Benchmark for Explainable Text Updates (Zhang et al; LREC-COLING 2024)
      Python
      BSD 3-Clause "New" or "Revised" License
      0600 Updated Feb 20, 2024
    • zett

      Public
      🙈 Code for Zero-shot Triplet Extraction by Template Infilling (Kim et al; IJCNLP-AACL 2023)
      Python
      BSD 3-Clause "New" or "Revised" License
      01721 Updated Feb 17, 2024
    • asdc

      Public
      Accommodation Search Dialog Corpus (宿泊施設探索対話コーパス)
      Python
      Creative Commons Attribution 4.0 International
      223012 Updated Jan 19, 2024
    • SubjQA

      Public
      A question-answering dataset with a focus on subjective information
      144332 Updated Jan 8, 2024
    • magneton

      Public
      Repository of the Magneton framework for authoring interaction-aware and customizable widgets.
      TypeScript
      Apache License 2.0
      0400 Updated Jan 4, 2024
    • pilota

      Public
      ✈ SCUD generator (解釈文生成器)
      Python
      Apache License 2.0
      01016 Updated Nov 6, 2023
    • rjdb

      Public
      0000 Updated Nov 6, 2023
    • vecscan

      Public
      Python
      MIT License
      25001 Updated Sep 11, 2023
    • cocosum

      Public
      🥥 Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)
      Python
      Apache License 2.0
      22120 Updated Jul 25, 2023