Publications

SYMI: Efficient Mixture-of-Experts Training via Model and Optimizer State Decoupling

Athinagoras Skiadopoulos, Mark Zhao, Swapnil Gandhi, Thomas Norrie, Shrijeet Mukherjee, Christos Kozyrakis

USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2026

Sparse Checkpointing for Fast and Reliable MoE Training

Swapnil Gandhi, Christos Kozyrakis

USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2026

FailSafe: High-performance Resilient Serving

Ziyi Xu, Zhiqiang Xie, Swapnil Gandhi, Christos Kozyrakis

Conference on Machine Learning and Systems (MLSys), 2026

Wave: A Split OS Architecture for Application Engines

Jack Humphries, Neel Natu, Kostis Kaffes, Stanko Novakovic, Paul Turner, Hank Levy, David E Culler, Christos Kozyrakis

ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2025

Strata: Hierarchical Context Caching for Long Context Language Model Serving

Zhiqiang Xie, Ziyi Xu, Mark Zhao, Yuwei An, Vikram Mailthody, Scott Mahlke, Michael Garland, Christos Kozyrakis

Preprint, 2025

AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution

Zhiqiang Xie, Hao Kang, Ying Sheng, Tushar Krishna, Kayvon Fatahalian, Christos Kozyrakis

Conference on Machine Learning and Systems (MLSys), 2025

DBOS: three years later

Qian Li, Peter Kraft, Christos Kozyrakis, Matei Zaharia, Michael Stonebraker

The International Journal on Very Large Data Bases (VLDB), 2025

Teaching Cloud Infrastructure and Scalable Application Deployment in an Undergraduate Computer Science Program

Aditya Saligrama, Cody Ho, Benjamin Tripp, Michael Abbott, Christos Kozyrakis

ACM Technical Symposium on Computer Science Education (SIGCSE TS), 2025

SGLang: Efficient execution of structured language model programs

Lianmin Zheng, Liangsheng Yin, Zhiqiang Xie, Chuyue Livia Sun, Jeff Huang, Cody Hao Yu, Shiyi Cao, Christos Kozyrakis, Ion Stoica, Joseph E Gonzalez, Clark Barrett, Ying Sheng

Conference on Neural Information Processing Systems (NeurIPS), 2024

ReCycle: Resilient Training of Large DNNs using Pipeline Adaptation

Swapnil Gandhi, Mark Zhao, Athinagoras Skiadopoulos, Christos Kozyrakis

ACM SIGOPS Symposium on Operating Systems Principles (SOSP), 2024

cedar: Optimized and Unified Machine Learning Input Data Pipelines

Mark Zhao, Emanuel Adamiak, Christos Kozyrakis

The International Journal on Very Large Data Bases (VLDB), 2024

High-throughput and Flexible Host Networking for Accelerated Computing

Athinagoras Skiadopoulos, Zhiqiang Xie, Mark Zhao, Qizhe Cai, Saksham Agarwal, Jacob Adelmann, David Ahern, Carlo Contavalli, Michael Goldflam, Vitaly Mayatskikh, Raghu Raja, Daniel Walton, Rachit Agarwal, Shrijeet Mukherjee, Christos Kozyrakis

USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2024

Cloud Atlas: Efficient fault localization for cloud systems using language models and causal insight

Zhiqiang Xie, Yujia Zheng, Lizi Ottens, Kun Zhang, Christos Kozyrakis, Jonathan Mace

Preprint, 2024

Tectonic-Shift: A Composite Storage Fabric for Large-Scale ML Training

Mark Zhao, Satadru Pan, Niket Agarwal, Zhaoduo Wen, David Xu, Anand Natarajan, Pavan Kumar, Shiva Shankar P, Ritesh Tijoriwala, Karan Asher, Hao Wu, Aarti Basant, Daniel Ford, Delia David, Nezih Yigitbasi, Pratap Singh, Carole-Jean Wu, Christos Kozyrakis

USENIX Annual Technical Conference (USENIX ATC), 2023

Honeycomb: Secure and Efficient GPU Executions via Static Validation

Haohui Mai, Jiacheng Zhao, Hongren Zheng, Yiyang Zhao, Zibin Liu, Mingyu Gao, Cong Wang, Xiaobing Feng, Christos Kozyrakis

USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2023

R3: Record-Replay-Retroaction for Database-Backed Applications

Qian Li, Peter Kraft, Michael Cafarella, Çağatay Demiralp, Goetz Graefe, Christos Kozyrakis, Michael Stonebraker, Lalith Suresh, Xiangyao Yu, Matei Zaharia

The International Journal on Very Large Data Bases (VLDB), 2023

Zelda: Video analytics using vision-language models

Francisco Romero*, Caleb Winston*, Johann Hauswald, Matei Zaharia, Christos Kozyrakis

Preprint, 2023

RecD: Deduplication for end-to-end deep learning recommendation model training infrastructure

Mark Zhao, Dhruv Choudhary, Devashish Tyagi, Ajay Somani, Max Kaplan, Sung-Han Lin, Sarunya Pumma, Jongsoo Park, Aarti Basant, Niket Agarwal, Carole-Jean Wu, Christos Kozyrakis

Conference on Machine Learning and Systems (MLSys), 2023

FlexShard: Flexible sharding for industry-scale sequence recommendation models

Geet Sethi, Pallab Bhattacharya, Dhruv Choudhary, Carole-Jean Wu, Christos Kozyrakis

Preprint, 2023

Transactions Make Debugging Easy

Qian Li, Peter Kraft, Michael Cafarella, Çağatay Demiralp, Goetz Graefe, Christos Kozyrakis, Michael Stonebraker, Lalith Suresh, Matei Zaharia

Conference on Innovative Data Systems Research (CIDR), 2022

Optimizing video analytics with declarative model relationships

Francisco Romero, Johann Hauswald, Aditi Partap, Daniel Kang, Matei Zaharia, Christos Kozyrakis

The International Journal on Very Large Data Bases (VLDB), 2022

Hermod: principled and practical scheduling for serverless functions

Kostis Kaffes, Neeraja Yadwadkar, Christos Kozyrakis

ACM Symposium on Cloud Computing (SoCC), 2022

Towards μs tail latency and terabit Ethernet: disaggregating the host network stack

Qizhe Cai, Midhul Vuppalapati, Jaehyun Hwang, Christos Kozyrakis, Rachit Agarwal

ACM Special Interest Group on Data Communication (SIGCOMM), 2022

Apiary: A DBMS-Integrated Transactional Function-as-a-Service Framework

Peter Kraft, Qian Li, Kostis Kaffes, Athinagoras Skiadopoulos, Deeptaanshu Kumar, Danny Cho, Jason Li, Robert Redmond, Nathan Weckwerth, Brian Xia, Peter Bailis, Michael Cafarella, Goetz Graefe, Jeremy Kepner, Christos Kozyrakis, Michael Stonebraker, Lalith Suresh, Xiangyao Yu, Matei Zaharia

Preprint, 2022

Understanding data storage and ingestion for large-scale deep recommendation model training: Industrial product

Mark Zhao, Niket Agarwal, Aarti Basant, Buğra Gedik, Satadru Pan, Mustafa Ozdal, Rakesh Komuravelli, Jerry Pan, Tianshu Bao, Haowei Lu, Sundaram Narayanan, Jack Langman, Kevin Wilfong, Harsha Rastogi, Carole-Jean Wu, Christos Kozyrakis, Parik Pol

International Symposium on Computer Architecture (ISCA), 2022

SOL: Safe on-node learning in cloud platforms

Yawen Wang, Daniel Crankshaw, Neeraja Yadwadkar, Daniel Berger, Christos Kozyrakis, Ricardo Bianchini

ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022

ShEF: Shielded enclaves for cloud FPGAs

Mark Zhao, Mingyu Gao, Christos Kozyrakis

ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022

RecShard: statistical feature-based memory optimization for industry-scale neural recommendation

Geet Sethi, Bilge Acun, Niket Agarwal, Christos Kozyrakis, Caroline Trippel, Carole-Jean Wu

ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022

RAIL: Predictable, low tail latency for NVMe flash

Heiner Litz, Javier Gonzalez, Ana Klimovic, Christos Kozyrakis

ACM Transactions on Storage (ToS), 2022

Llama: A heterogeneous & serverless framework for auto-tuning video analytics pipelines

Francisco Romero, Mark Zhao, Neeraja Yadwadkar, Christos Kozyrakis

ACM Symposium on Cloud Computing (SoCC), 2021

Faa$t: A Transparent Auto-Scaling Cache for Serverless Applications

Francisco Romero, Gohar Irfan Chaudhry, Íñigo Goiri, Pragna Gopa, Paul Batum, Neeraja Yadwadkar, Rodrigo Fonseca, Christos Kozyrakis, Ricardo Bianchini

ACM Symposium on Cloud Computing (SoCC), 2021

Syrup: User-defined scheduling across the stack

Kostis Kaffes, Jack Humphries, David Mazières, Christos Kozyrakis

ACM SIGOPS Symposium on Operating Systems Principles (SOSP), 2021

ghOSt: Fast & Flexible User-Space Delegation of Linux Scheduling

Jack Humphries, Neel Natu, Ashwin Chaugule, Ofir Weisse, Barret Rhoden, Josh Don, Luigi Rizzo, Oleg Rombakh, Paul Turner, Christos Kozyrakis

ACM SIGOPS Symposium on Operating Systems Principles (SOSP), 2021

A case against (most) context switches

Jack Humphries*, Kostis Kaffes*, David Mazières, Christos Kozyrakis

USENIX Workshop on Hot Topics in Operating Systems (HotOS), 2021

SmartHarvest: Harvesting idle CPUs safely and efficiently in the cloud

Yawen Wang, Kapil Arya, Marios Kogias, Manohar Vanga, Aditya Bhandari, Neeraja Yadwadkar, Siddhartha Sen, Sameh Elnikety, Christos Kozyrakis, Ricardo Bianchini

European Conference on Computer Systems (EuroSys), 2021

Interference-aware scheduling for inference serving

Daniel Mendoza, Francisco Romero, Qian Li, Neeraja Yadwadkar, Christos Kozyrakis

European Conference on Machine Learning Systems (EuroMLSys), 2021

RAMBO: Resource allocation for microservices using Bayesian optimization

Qian Li, Bin Li, Pietro Mercati, Ramesh Illikkal, Charlie Tai, Michael Kishinevsky, Christos Kozyrakis

IEEE Computer Architecture Letters, 2021

RackSched: A microsecond-scale scheduler for rack-scale computers

Hang Zhu, Kostis Kaffes, Zixu Chen, Zhenming Liu, Christos Kozyrakis, Ion Stoica, Xin Jin

USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2020

Leveraging application classes to save power in highly-utilized data centers

Kostis Kaffes, Dragos Sbirlea, Yiyan Lin, David Lo, Christos Kozyrakis

ACM Symposium on Cloud Computing (SoCC), 2020

A polystore based database operating system (DBOS)

Michael Cafarella, David DeWitt, Vijay Gadepally, Jeremy Kepner, Christos Kozyrakis, Tim Kraska, Michael Stonebraker, Matei Zaharia

International Conference on Very Large Data Bases (VLDB) Workshop, 2020

DBOS: A proposal for a data-centric operating system

Michael Cafarella, David DeWitt, Vijay Gadepally, Jeremy Kepner, Christos Kozyrakis, Tim Kraska, Michael Stonebraker, Matei Zaharia

The International Journal on Very Large Data Bases (VLDB), 2020

AsmDB: Understanding and mitigating front-end stalls in warehouse-scale computers

Nayana Prasad Nagendra, Grant Ayers, David I August, Hyoun Kyu Cho, Svilen Kanev, Christos Kozyrakis, Trivikram Krishnamurthy, Heiner Litz, Tipp Moseley, Parthasarathy Ranganathan

International Symposium on Computer Architecture (ISCA), 2020

Interstellar: Using Halide's Scheduling Language to Analyze DNN Accelerators

Xuan Yang, Mingyu Gao, Qiaoyi Liu, Jeff Setter, Jing Pu, Ankita Nayak, Steven Bell, Kaidi Cao, Heonjae Ha, Priyanka Raina, Christos Kozyrakis, Mark Horowitz

ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2020

Classifying memory access patterns for prefetching

Grant Ayers, Heiner Litz, Christos Kozyrakis, Parthasarathy Ranganathan

ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2020

From laptop to lambda: outsourcing everyday jobs to thousands of transient functional containers

Sadjad Fouladi, Francisco Romero, Dan Iter, Qian Li, Shuvo Chatterjee, Christos Kozyrakis, Matei Zaharia, Keith Winstein

USENIX Annual Technical Conference (USENIX ATC), 2020

Mind the gap: A case for informed request scheduling at the NIC

Jack Tigar Humphries, Kostis Kaffes, David Mazières, Christos Kozyrakis

ACM Workshop on Hot Topics in Networks (HotNets), 2019

Centralized core-granular scheduling for serverless functions

Kostis Kaffes, Neeraja Yadwadkar, Christos Kozyrakis

ACM Symposium on Cloud Computing (SoCC), 2019

INFaaS: A model-less and managed inference serving system

Francisco Romero, Qian Li, Neeraja J Yadwadkar, Christos Kozyrakis

USENIX Annual Technical Conference (USENIX ATC), 2019

A case for managed and model-less inference serving

Neeraja Yadwadkar, Francisco Romero, Qian Li, Christos Kozyrakis

USENIX Workshop on Hot Topics in Operating Systems (HotOS), 2019

Tangram: Optimized coarse-grained dataflow for scalable NN accelerators

Mingyu Gao, Xuan Yang, Jing Pu, Mark Horowitz, Christos Kozyrakis

ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2019

A new frontier for pull-based graph processing

Samuel Grossman, Christos Kozyrakis

Preprint, 2019

Pocket: Elastic Ephemeral Storage for Serverless Analytics

Ana Klimovic, Yawen Wang, Patrick Stuedi, Animesh Trivedi, Jonas Pfefferle, Christos Kozyrakis

USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2018

Trevor: Automatic configuration and scaling of stream processing pipelines

Manu Bansal, Eyal Cidon, Arjun Balasingam, Aditya Gudipati, Christos Kozyrakis, Sachin Katti

Preprint, 2018

QuMan: Profile-based Improvement of Cluster Utilization

Yannis Sfakianakis, Christos Kozanitis, Christos Kozyrakis, Angelos Bilas

ACM Transactions on Architecture and Code Optimization (TACO), 2018

Spatial: A language and compiler for application accelerators

David Koeplinger, Matthew Feldman, Raghu Prabhakar, Yaqi Zhang, Stefan Hadjis, Ruben Fiszel, Tian Zhao, Luigi Nardi, Ardavan Pedram, Christos Kozyrakis, Kunle Olukotun

ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI), 2018

Plasticine: A reconfigurable accelerator for parallel patterns

Raghu Prabhakar, Yaqi Zhang, David Koeplinger, Matt Feldman, Tian Zhao, Stefan Hadjis, Ardavan Pedram, Christos Kozyrakis, Kunle Olukotun

International Symposium on Computer Architecture (ISCA), 2018

Making pull-based graph processing performant

Samuel Grossman, Heiner Litz, Christos Kozyrakis

ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2018

AppSwitch: Resolving the application identity crisis

Dinesh Subhraveti, Sri Goli, Serge Hallyn, Ravi Chamarthy, Christos Kozyrakis

Preprint, 2017

Persona: A High-Performance Bioinformatics Framework

Stuart Byma, Sam Whitlock, Laura Flueratoru, Ethan Tseng, Christos Kozyrakis, Edouard Bugnion, James Larus

USENIX Annual Technical Conference (USENIX ATC), 2017

Tetris: Scalable and efficient neural network acceleration with 3D memory

Mingyu Gao, Jing Pu, Xuan Yang, Mark Horowitz, Christos Kozyrakis

ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2017

ReFlex: Remote flash ≈ local flash

Ana Klimovic, Heiner Litz, Christos Kozyrakis

ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2017

Bolt: I know what you did last summer... in the cloud

Christina Delimitrou, Christos Kozyrakis

ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2017

The IX operating system: Combining low latency, high throughput, and efficiency in a protected dataplane

Adam Belay, George Prekas, Mia Primorac, Ana Klimovic, Samuel Grossman, Christos Kozyrakis, Edouard Bugnion

ACM Transactions on Computer Systems (TOCS), 2016

DRAF: A low-power DRAM-based reconfigurable acceleration fabric

Mingyu Gao, Christina Delimitrou, Dimin Niu, Krishna T Malladi, Hongzhong Zheng, Bob Brennan, Christos Kozyrakis

International Symposium on Computer Architecture (ISCA), 2016

Automatic generation of efficient accelerators for reconfigurable hardware

David Koeplinger, Raghu Prabhakar, Yaqi Zhang, Christina Delimitrou, Christos Kozyrakis, Kunle Olukotun

International Symposium on Computer Architecture (ISCA), 2016

Flash storage disaggregation

Ana Klimovic, Christos Kozyrakis, Eno Thereska, Binu John, Sanjeev Kumar

European Conference on Computer Systems (EuroSys), 2016

HCloud: Resource-efficient provisioning in shared cloud systems

Christina Delimitrou, Christos Kozyrakis

ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2016

Generating configurable hardware from parallel patterns

Raghu Prabhakar, David Koeplinger, Kevin J Brown, HyoukJoong Lee, Christopher De Sa, Christos Kozyrakis, Kunle Olukotun

ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2016

Energy-Efficient Abundant-Data Computing: The N3XT 1,000x

Mohamed M Sabry Aly, Mingyu Gao, Gage Hills, Chi-Shuen Lee, Greg Pitner, Max M Shulaker, Tony F Wu, Mehdi Asheghi, Jeff Bokor, Franz Franchetti, Kenneth E Goodson, Christos Kozyrakis, Igor Markov, Kunle Olukotun, Larry Pileggi, Eric Pop, Jan Rabaey, Christopher Ré, H-S Philip Wong, Subhasish Mitra

IEEE Computer, 2015