ByteShape

AI models are outpacing hardware capabilities, particularly in memory bandwidth and capacity. ByteShape automates the process of selecting optimal datatypes—e.g. int8, fp8, fp5, and int2—for neural network models, a task currently impossible to manage manually given the complexity of 200+ tensors. ByteShape’s software tools and services for compression and quantization automatically learn and select optimal…