Skip to content
@stanford-futuredata

Future Data Systems

We are a CS research group building data-intensive systems

Popular repositories Loading

  1. ColBERT ColBERT Public

    ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

    Python 3.1k 389

  2. macrobase macrobase Public

    MacroBase: A Search Engine for Fast Data

    Java 661 126

  3. ARES ARES Public

    Automated Evaluation of RAG Systems

    Python 487 53

  4. noscope noscope Public

    Accelerating network inference over video

    Python 437 122

  5. sparser sparser Public

    Sparser: Raw Filtering for Faster Analytics over Raw Data

    C 432 55

  6. dawn-bench-entries dawn-bench-entries Public

    DAWNBench: An End-to-End Deep Learning Benchmark and Competition

    Python 262 74

Repositories

Showing 10 of 69 repositories
  • ColBERT Public

    ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

    stanford-futuredata/ColBERT’s past year of commit activity
    Python 3,085 MIT 389 77 18 Updated Nov 18, 2024
  • ARES Public

    Automated Evaluation of RAG Systems

    stanford-futuredata/ARES’s past year of commit activity
    Python 487 Apache-2.0 53 10 2 Updated Nov 4, 2024
  • FrugalGPT Public

    FrugalGPT: better quality and lower cost for LLM applications

    stanford-futuredata/FrugalGPT’s past year of commit activity
    Jupyter Notebook 187 Apache-2.0 21 3 0 Updated Sep 19, 2024
  • stk Public
    stanford-futuredata/stk’s past year of commit activity
    Python 90 Apache-2.0 20 2 1 Updated Aug 26, 2024
  • gavel Public

    Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020

    stanford-futuredata/gavel’s past year of commit activity
    Jupyter Notebook 125 MIT 31 8 2 Updated Jul 25, 2024
  • InQuest Public

    Accelerating Aggregation Queries on Unstructured Streams of Data

    stanford-futuredata/InQuest’s past year of commit activity
    Python 7 2 1 0 Updated Apr 18, 2024
  • Megatron-LM Public Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    stanford-futuredata/Megatron-LM’s past year of commit activity
    Python 33 2,448 0 2 Updated Jan 19, 2024
  • tasti Public

    Semantic Indexes for Machine Learning-based Queries over Unstructured Data (SIGMOD 2022)

    stanford-futuredata/tasti’s past year of commit activity
    Python 15 5 0 0 Updated Jan 17, 2024
  • omg Public
    stanford-futuredata/omg’s past year of commit activity
    Python 20 Apache-2.0 3 0 0 Updated Sep 20, 2023
  • abae Public

    Accelerating Approximate Aggregation Queries with Expensive Predicates (VLDB 21)

    stanford-futuredata/abae’s past year of commit activity
    Python 3 1 0 0 Updated Sep 20, 2023