[general] a bunch of fixes in anticipation of proper triton vs cudnn
benchmarks * DNN: Added partial auto-tuning mode and skeleton for heuristics * Examples: Moduralized benchmarking and now evaluating ResNet-18 shapes
This commit is contained in:
@@ -66,8 +66,9 @@ void base::enqueue(driver::stream *stream, std::vector<driver::buffer *> args, a
|
||||
clone->init_impl(stream, (triton::driver::cu_module*)kernel->module());
|
||||
}
|
||||
/* retrieved compiled template */
|
||||
else
|
||||
else{
|
||||
jit = m_jit.at(this).get();
|
||||
}
|
||||
|
||||
/* get launch parameters */
|
||||
driver::kernel* kernel = jit->get_function(name_.c_str());
|
||||
|
Reference in New Issue
Block a user