测试说明¶
测试(在 fbgemm_gpu/test/
目录中)和基准测试(在 fbgemm_gpu/bench/
目录中)为如何使用 FBGEMM_GPU 算子提供了很好的示例。
设置 FBGEMM_GPU 测试环境¶
通过构建/安装 FBGEMM_GPU 包获得可用环境后,需要安装额外的包才能正确运行测试。
# !! Run inside the Conda environment !!
# From the /fbgemm_gpu/ directory
python -m pip install -r requirements.txt
运行 FBGEMM_GPU 测试¶
在构建/安装 FBGEMM_GPU 包后运行测试
# !! Run inside the Conda environment !!
# From the /fbgemm_gpu/test/ directory
cd test
python -m pytest -v -rsx -s -W ignore::pytest.PytestCollectionWarning split_table_batched_embeddings_test.py
python -m pytest -v -rsx -s -W ignore::pytest.PytestCollectionWarning quantize_ops_test.py
python -m pytest -v -rsx -s -W ignore::pytest.PytestCollectionWarning sparse_ops_test.py
python -m pytest -v -rsx -s -W ignore::pytest.PytestCollectionWarning split_embedding_inference_converter_test.py
使用 CUDA 变体进行测试¶
对于 FBGEMM_GPU CUDA 包,GPU 将被自动检测并用于测试。要在支持 GPU 的机器上以仅 CPU 模式运行测试和基准测试,必须在环境中设置 CUDA_VISIBLE_DEVICES=-1
。
# !! Run inside the Conda environment !!
# Specify the specific CUDA devices to run the tests on
# Alternatively, set to -1 for running in CPU-only mode (when on a GPU-capable machine)
export CUDA_VISIBLE_DEVICES=-1
# Enable for debugging failed kernel executions
export CUDA_LAUNCH_BLOCKING=1
# For operators involving NCCL, if the rpath is not set up correctly for
# libnccl.so.2, LD_LIBRARY_PATH will need to be updated.
export LD_LIBRARY_PATH="/path/to/nccl/lib:${LD_LIBRARY_PATH}"
python -m pytest -v -rsx -s -W ignore::pytest.PytestCollectionWarning split_table_batched_embeddings_test.py
使用 ROCm 变体进行测试¶
对于 ROCm 机器,需要通过在环境中设置 FBGEMM_TEST_WITH_ROCM=1
来启用对 ROCm GPU 的测试。
# !! Run inside the Conda environment !!
# From the fbgemm_gpu/test/ directory
cd test
export FBGEMM_TEST_WITH_ROCM=1
# Specify the specific HIP devices to run the tests on
#
# NOTE: This is necessary if PyTorch is unable to see the devices that
# `rocm-smi --showproductname` can see
export HIP_VISIBLE_DEVICES=0,1,2,3
# Enable for debugging kernel executions
export HIP_LAUNCH_BLOCKING=1
python -m pytest -v -rsx -s -W ignore::pytest.PytestCollectionWarning split_table_batched_embeddings_test.py
运行 FBGEMM_GPU 基准测试¶
运行基准测试
# !! Run inside the Conda environment !!
# From the fbgemm_gpu/bench/ directory
cd bench
python tbe_training_benchmark.py device