Description:
Network traffic collection (PCAP) of three widely-used state-of-the-art Distributed Machine Learning (DML) frameworks (Tensorflow, Horovod, KungFu). The collection contains distributed training runs of four models (MobileNetV2, ResNet50, Resnet101, DenseNet201) with varying configurations of the frameworks. Varied parameters are the communication topology and backend, the distributed optimizer, the batch size and the packet loss in the network.