FlashInfer

FlashInfer is our kernel library that implements high-performance operators for LLM Serving on GPUs.

People

Tianqi Chen
Assistant Professor - CMU
Luis Ceze
Professor