Data-driven statistical sharding for industry-scale neural recommendation
Stanford University, 2023
Abstract
This dissertation uses workload statistics to partition and place the large embedding tables used by industry-scale neural recommendation models across tiered memory systems, improving training efficiency and scalability.