Skip to content
@stanford-futuredata

Future Data Systems

We are a CS research group building data-intensive systems

Popular repositories Loading

  1. ColBERT ColBERT Public

    ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

    Python 3.8k 464

  2. ARES ARES Public

    Automated Evaluation of RAG Systems

    Python 683 67

  3. macrobase macrobase Public

    MacroBase: A Search Engine for Fast Data

    Java 671 126

  4. noscope noscope Public

    Accelerating network inference over video

    Python 436 121

  5. sparser sparser Public

    Sparser: Raw Filtering for Faster Analytics over Raw Data

    C 434 54

  6. dawn-bench-entries dawn-bench-entries Public

    DAWNBench: An End-to-End Deep Learning Benchmark and Competition

    Python 263 72

Repositories

Showing 10 of 70 repositories
  • ColBERT Public

    ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

    stanford-futuredata/ColBERT’s past year of commit activity
    Python 3,754 MIT 464 85 20 Updated Oct 14, 2025
  • colbert-serve Public
    stanford-futuredata/colbert-serve’s past year of commit activity
    Python 23 0 1 0 Updated May 30, 2025
  • ARES Public

    Automated Evaluation of RAG Systems

    stanford-futuredata/ARES’s past year of commit activity
    Python 683 Apache-2.0 67 19 1 Updated Mar 28, 2025
  • FrugalGPT Public

    FrugalGPT: better quality and lower cost for LLM applications

    stanford-futuredata/FrugalGPT’s past year of commit activity
    Jupyter Notebook 246 Apache-2.0 31 3 0 Updated Feb 10, 2025
  • stk Public
    stanford-futuredata/stk’s past year of commit activity
    Python 115 Apache-2.0 22 3 1 Updated Aug 26, 2024
  • gavel Public

    Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020

    stanford-futuredata/gavel’s past year of commit activity
    Jupyter Notebook 136 MIT 34 8 2 Updated Jul 25, 2024
  • InQuest Public

    Accelerating Aggregation Queries on Unstructured Streams of Data

    stanford-futuredata/InQuest’s past year of commit activity
    Python 9 2 1 0 Updated Apr 18, 2024
  • Megatron-LM Public Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    stanford-futuredata/Megatron-LM’s past year of commit activity
    Python 38 3,569 0 2 Updated Jan 19, 2024
  • tasti Public

    Semantic Indexes for Machine Learning-based Queries over Unstructured Data (SIGMOD 2022)

    stanford-futuredata/tasti’s past year of commit activity
    Python 17 5 0 0 Updated Jan 17, 2024
  • omg Public
    stanford-futuredata/omg’s past year of commit activity
    Python 22 Apache-2.0 3 0 0 Updated Sep 20, 2023

Most used topics

Loading…