Data-driven statistical sharding for industry-scale neural recommendation

Stanford University, 2023


Abstract

This dissertation uses workload statistics to partition and place the large embedding tables used by industry-scale neural recommendation models across tiered memory systems, improving training efficiency and scalability.