# triton/python/autotune/pysrc/model.py

from sklearn import tree
from sklearn import ensemble
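import numpy as np  # gmean() below refers to the `np` namespace
# NOTE: the `max` and `min` imported here shadow the Python builtins in this module.
from numpy import array, bincount, mean, std, max, argmax, min, argmin, median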


def gmean(a, axis=0, dtype=None):
    """Return the geometric mean of `a` along `axis`, computed as exp(mean(log(a)))."""
    if not isinstance(a, np.ndarray):  # not an ndarray: attempt to convert it
        log_a = np.log(np.array(a, dtype=dtype))
    elif dtype:  # an explicit dtype was requested: cast, preserving masked arrays
        if isinstance(a, np.ma.MaskedArray):
            log_a = np.log(np.ma.asarray(a, dtype=dtype))
        else:
            log_a = np.log(np.asarray(a, dtype=dtype))
    else:
        log_a = np.log(a)
    return np.exp(log_a.mean(axis=axis))
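

# Illustration only, not part of the original module: a minimal pure-Python
# sketch of the same exp(mean(log(x))) idea, showing why a geometric mean suits
# ratios such as speedups. The helper name `geometric_mean` and the sample
# values are assumptions made for this example.

```python
import math


def geometric_mean(values):
    """Geometric mean of strictly positive numbers via exp(mean(log(x)))."""
    logs = [math.log(v) for v in values]
    return math.exp(sum(logs) / len(logs))


# A 2x slowdown and a 2x speedup cancel out under the geometric mean,
# whereas the arithmetic mean would report a net gain.
speedups = [0.5, 2.0]
geo = geometric_mean(speedups)
arith = sum(speedups) / len(speedups)
print("geometric = %.3f, arithmetic = %.3f" % (geo, arith))
```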
    
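def train_model(X, Y, profiles, metric):
    """Fit a random forest on (X, Y) and print held-out speedup statistics.

    Each row of Y holds one value per profile; lower is treated as better,
    since the best profile is selected with argmin.
    `profiles` and `metric` are currently unused.
    """
    print("Building the model...")

    # Standardize X with the mean and standard deviation taken over all entries
    # (a single global scale, not one per feature column).
    Xmean = mean(X)
    Xstd = std(X)
    X = (X - Xmean)/Xstd

    # Scale Y by its global maximum.
    Y = Y[:, :]  # full slice; for an ndarray this is a view, no data is copied
    Ymax = max(Y)
    Y = Y/Ymax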

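    # Reference profile: the one that is most often the best (lowest Y) over all rows.
    ref = argmax(bincount(argmin(Y, axis=1)))
    # Roughly the first 80% of the rows are used for training, the rest for testing.
    cut = int(0.800*X.shape[0]+1)

    # Train the model
    clf = ensemble.RandomForestRegressor(n_estimators=10, max_depth=10).fit(X[:cut,:], Y[:cut,:])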

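    # s: speedup over `ref` on the test rows when using the profile the model predicts as best.
    t = argmin(clf.predict(X[cut:,:]), axis = 1)
    s = array([y[ref]/y[k] for y,k in zip(Y[cut:,:], t)])
    # ss: speedup over `ref` when using the truly best profile (an upper bound on s).
    tt = argmin(Y[cut:,:], axis = 1)
    ss = array([y[ref]/y[k] for y,k in zip(Y[cut:,:], tt)])
    # The "mean" reported below is the geometric mean returned by gmean().
    print("Testing speedup : mean = %.3f, median = %.3f, min = %.3f, max = %.3f"%(gmean(s), median(s), min(s), max(s)))
    print("Optimal speedup : mean = %.3f, median = %.3f, min = %.3f, max = %.3f"%(gmean(ss), median(ss), min(ss), max(ss)))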
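

# Illustration only, not part of the original module: a small pure-Python sketch
# of the speedup metric printed by train_model. The helper names, the timing
# table and the `predicted` choices are assumptions made for this example; the
# sketch also picks the reference profile from the same rows it scores, whereas
# train_model picks it from all rows and scores only the held-out ones.

```python
from collections import Counter


def best_profile(row):
    """Index of the smallest value in `row` (first index on ties, like argmin)."""
    return min(range(len(row)), key=row.__getitem__)


def speedups_vs_reference(timings, chosen):
    """Return (ref, speedups) for the profile index chosen on each row.

    `ref` is the profile that is most often the fastest; each speedup is
    row[ref] / row[chosen_profile].
    """
    counts = Counter(best_profile(row) for row in timings)
    # Most frequent winner; the lowest index wins ties, like argmax(bincount(...)).
    ref = min(counts, key=lambda k: (-counts[k], k))
    return ref, [row[ref] / row[k] for row, k in zip(timings, chosen)]


# Rows are inputs, columns are profiles, entries are hypothetical timings.
timings = [
    [1.0, 2.0, 4.0],
    [1.0, 3.0, 2.0],
    [4.0, 2.0, 1.0],
]

oracle = [best_profile(row) for row in timings]   # truly best profile per row
ref, optimal = speedups_vs_reference(timings, oracle)

predicted = [0, 2, 2]                             # hypothetical model choices
_, achieved = speedups_vs_reference(timings, predicted)

print("reference profile:", ref)
print("optimal speedups :", optimal)
print("achieved speedups:", achieved)
```

# A speedup below 1.0 means the chosen profile was slower than the reference
# profile on that row; the optimal speedups are never below 1.0.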

# Commits listed in the blame view of this file:
#   2014-09-28 19:37:56 -04:00  Simple linear model
#   2014-09-29 03:01:33 +02:00  Fixed indentation
#   2014-10-01 04:44:16 +02:00  nn?
#   2014-10-03 09:29:45 +02:00  Porting GA for all the operations
#   2014-10-04 08:58:11 +02:00  Input-dependent models now activated for all the operations
#   2014-10-13 03:38:19 +02:00  Cleaned model building ; added some informative commented code
#   2014-10-14 23:49:18 -04:00  Now compiling ATIDLAS
#   2014-10-16 17:49:17 -04:00  Replaced cxfreeze with pyinstaller. Works better.