gaspersic/triton
phil/mma-v1-is-row-debug
triton/python/src/main.cc

12 lines
310 B
C++

[PYTHON] Some cleaning of the PyBind11 wrappers (#62)
2021-02-06 17:10:44 -08:00
#include <pybind11/pybind11.h>
void init_superblocking(pybind11::module &m);
void init_torch_utils(pybind11::module &m);
void init_triton(pybind11::module &m);
[PYTHON] CUTLASS wrapper for fair benchmarks (#75) Before this commit, the benchmarking infrastructure used different protocols for different libraries: CUTLASS was run through a C++ binary that reports mean TFLOPS, while torch and triton were called from Python and reported the 10th, 50th and 90th quantiles. For uniformity and fair benchmarking, this PR adds a Python wrapper for auto-tuned CUTLASS matrix multiplication. Benchmarks have been rewritten to use this wrapper with `triton.testing.do_bench` rather than system calls to the CUTLASS profiler. Importantly, this also ensures that all matmuls run on the *same* input data, which should stabilize clocks across providers.
2021-03-09 16:32:44 -05:00
void init_cutlass(pybind11::module &m);
PYBIND11_MODULE(libtriton, m) {
  m.doc() = "Python bindings to the C++ Triton API";
  init_triton(m);
}
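
The commit message for #75 describes a uniform benchmarking protocol in which every provider is timed the same way and reports the 10th, 50th and 90th quantiles. The following is a minimal host-side sketch of that idea in C++, not the implementation of `triton.testing.do_bench`. The names `quantile`, `BenchResult` and `bench` are hypothetical, as are the warm-up and repetition defaults. It measures wall-clock time only, so timing a GPU kernel this way would additionally need a device synchronization inside the callable.

```cpp
#include <algorithm>
#include <cassert>
#include <chrono>
#include <cmath>
#include <cstddef>
#include <vector>

// Linearly interpolated quantile of `samples` for q in [0, 1].
// The vector is taken by value because the function sorts its own copy.
double quantile(std::vector<double> samples, double q) {
  assert(!samples.empty() && q >= 0.0 && q <= 1.0);
  std::sort(samples.begin(), samples.end());
  const double pos = q * static_cast<double>(samples.size() - 1);
  const std::size_t lo = static_cast<std::size_t>(std::floor(pos));
  const std::size_t hi = std::min(lo + 1, samples.size() - 1);
  const double frac = pos - static_cast<double>(lo);
  return samples[lo] + frac * (samples[hi] - samples[lo]);
}

// Hypothetical result type: the three quantiles named in the commit message.
struct BenchResult {
  double p10_ms;
  double p50_ms;
  double p90_ms;
};

// Runs `fn` a few times untimed (warm-up), then times `reps` calls and
// summarizes the per-call wall-clock durations in milliseconds.
template <typename Fn>
BenchResult bench(Fn &&fn, int warmup = 5, int reps = 50) {
  assert(reps > 0);
  for (int i = 0; i < warmup; ++i) fn();
  std::vector<double> ms;
  ms.reserve(static_cast<std::size_t>(reps));
  for (int i = 0; i < reps; ++i) {
    const auto t0 = std::chrono::steady_clock::now();
    fn();
    const auto t1 = std::chrono::steady_clock::now();
    ms.push_back(std::chrono::duration<double, std::milli>(t1 - t0).count());
  }
  return {quantile(ms, 0.1), quantile(ms, 0.5), quantile(ms, 0.9)};
}
```

Passing every provider's callable through the same `bench` function, on the same input data, is what makes the reported numbers comparable. Quantiles are used instead of a mean so that a few outlier runs do not dominate the summary.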