Sitemap
TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Follow publication

Optimizing Neural Networks — Where to Start?

7 min readJan 10, 2019

--

Setting up the Environment

Getting the Data

from google.colab import files
files.upload()
!pip install -U -q kaggle
!mkdir -p ~/.kaggle
!cp kaggle.json ~/.kaggle/
!kaggle datasets download -d uciml/pima-indians-diabetes-database
from google.colab import drive
drive.mount('/content/gdrive')

A Baseline Model with XGBoost

t1 = time()
clf = xgb.XGBClassifier()
cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=seed)
scores = cross_val_score(clf, X, y, cv=cv)
t2 = time()
t = t2 - t1
print("Mean Accuracy: {:.2%}, Standard Deviation: {:.2%}".format(scores.mean(), scores.std()))
print("Time taken: {:.2f} seconds".format(t))

Creating the Model

Automating the Tests

Baseline Neural Network Model

param_dict_defaults, param_dict = get_defaults(), get_defaults()
accuracy_baseline = run_test(X=X, y=y, param_dict=param_dict_defaults)
Finished cross-valiation. Took 1.5 mintues. Mean Accuracy: 71.61%, Standard Deviation: 2.92%

Importance of Different Parameters

Finished cross-valiation. Took 49.3 mintues. Mean Accuracy: 78.00%, Standard Deviation: 4.59%

Parameter Tuning

bases = np.repeat(10, 3)
exponents_1 = -(np.random.rand(3) + 3)
exponents_2 = -(np.random.rand(3) + 2)
learning_rate = np.power(bases, exponents_1).tolist() + np.power(bases, exponents_2).tolist()
batch_size = [2 ** e for e in range(6)]
run_test(X=X_std, y=y, param_dict=param_dict)
Finished cross-valiation. Took 8.3 mintues.
Mean Accuracy: 78.53%, Standard Deviation: 3.64%
Finished cross-valiation. Took 1.7 mintues. Mean Accuracy: 76.95%, Standard Deviation: 2.88%

Summary

--

--

TDS Archive
TDS Archive

Published in TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

George Liu
George Liu

Written by George Liu

Business Savvy Data Scientist Specializing in Analytics & Machine Learning 👨‍🎓 Lifelong Learner Fascinated by Tech 👉 linkedin.com/in/georgeliu2

Responses (2)