triton/tune/model.py

from sklearn import ensemble
import numpy as np

def gmean(a, axis=0, dtype=None):
    """Geometric mean of `a` along `axis`, computed as exp(mean(log(a)))."""
    if not isinstance(a, np.ndarray):  # if not an ndarray object attempt to convert it
        log_a = np.log(np.array(a, dtype=dtype))
    elif dtype:  # Must change the default dtype allowing array type
        if isinstance(a,np.ma.MaskedArray):
            log_a = np.log(np.ma.asarray(a, dtype=dtype))
        else:
            log_a = np.log(np.asarray(a, dtype=dtype))
    else:
        log_a = np.log(a)
    return np.exp(log_a.mean(axis=axis))

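def nrmse(y_ground, y):
    """Root-mean-square deviation of y from y_ground, normalized by the range of y_ground."""
    N = y.size
    rmsd = np.sqrt(np.sum((y_ground - y)**2)/N)
    if len(y_ground) > 1:
        span = np.max(y_ground) - np.min(y_ground)
        # A constant ground truth has zero range; return the raw RMSD rather than dividing by zero
        if span > 0:
            return rmsd/span
    return rmsd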

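def train(X, Y, profiles):
    X = np.array(X)
    Y = np.array(Y)
    M = X.shape[0]

    # Shuffle the samples so that the train/validation split is random
    p = np.random.permutation(X.shape[0])
    X = X[p,:]
    Y = Y[p,:]

    # Hold out the last 10% of the shuffled samples for cross-validation
    cut = int(0.9*M)
    XTr, YTr = X[:cut,:], Y[:cut,:]
    XCv, YCv = X[cut:,:], Y[cut:,:]

    # Grid search over the number of trees (N) and the maximum tree depth
    nrmses = {}
    for N in range(1,min(M+1,20)):
        for depth in range(1,min(M+1,20)):
            clf = ensemble.RandomForestRegressor(n_estimators=N, max_depth=depth).fit(XTr, YTr)
            # Column with the highest predicted value for each validation sample
            t = np.argmax(clf.predict(XCv), axis = 1)
            # Measured value at the predicted column vs. the best measured value
            y = np.array([YCv[i,t[i]] for i in range(t.size)])
            ground = np.max(YCv[:,:], axis=1)
            nrmses[clf] = nrmse(ground, y)

    # Keep the model with the lowest validation error
    clf = min(nrmses, key=nrmses.get)
    return clf, nrmses[clf]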
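
The validation score used inside `train` can be illustrated without NumPy or scikit-learn. The sketch below is a pure-Python restatement, not part of the module: `nrmse_plain` and `selection_error` are hypothetical names, the small `measured` and `predicted` tables are made-up illustration data, and the zero-range fallback is an assumption added to avoid dividing by zero. It shows the idea that a model is scored by the measured value of the column it picks, compared against the best measured value in each row.

```python
import math

def nrmse_plain(y_ground, y):
    """Range-normalized RMSE over two equal-length sequences."""
    n = len(y)
    rmsd = math.sqrt(sum((g - v) ** 2 for g, v in zip(y_ground, y)) / n)
    span = max(y_ground) - min(y_ground)
    # Assumption: fall back to the raw RMSD when the range is zero
    return rmsd / span if span > 0 else rmsd

def selection_error(predicted, measured):
    """Score predictions by the measured value of the column they select."""
    # Index of the highest predicted value in each row (first one on ties)
    picks = [max(range(len(row)), key=row.__getitem__) for row in predicted]
    # Measured value actually obtained with each pick
    achieved = [measured[i][t] for i, t in enumerate(picks)]
    # Best measured value available in each row
    best = [max(row) for row in measured]
    return nrmse_plain(best, achieved)

# Made-up data: two samples, three candidate columns each
measured = [[10.0, 30.0, 20.0],
            [50.0, 40.0, 45.0]]
predicted = [[12.0, 25.0, 28.0],   # picks column 2 (measured 20, best is 30)
             [55.0, 41.0, 44.0]]   # picks column 0 (measured 50, best is 50)

err = selection_error(predicted, measured)
print(err)
```

A predictor that ranks every row correctly scores 0, so lower is better, which matches `train` keeping the model returned by `min(nrmses, key=nrmses.get)`.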