# PMLPP

A Machine Learning module for the Pandemonium Engine. Based on: https://github.com/novak-99/MLPP

## Contents of the Library

1. ***Regression***
    1. Linear Regression
    2. Logistic Regression
    3. Softmax Regression
    4. Exponential Regression
    5. Probit Regression
    6. CLogLog Regression
    7. Tanh Regression
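    For reference, the model forms behind the first three variants, in standard textbook notation (not module-specific notation):

    ```latex
    \hat{y} = w^\top x + b                                                % Linear
    \hat{y} = \sigma(w^\top x + b) = \frac{1}{1 + e^{-(w^\top x + b)}}    % Logistic
    \hat{y}_k = \frac{e^{z_k}}{\sum_j e^{z_j}}, \quad z = W x + b         % Softmax
    ```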
2. ***Deep, Dynamically Sized Neural Networks***
    1. Possible Activation Functions
        - Linear
        - Sigmoid
        - Softmax
        - Swish
        - Mish
        - SinC
        - Softplus
        - Softsign
        - CLogLog
        - Logit
        - Gaussian CDF
        - RELU
        - GELU
        - Sign
        - Unit Step
        - Sinh
        - Cosh
        - Tanh
        - Csch
        - Sech
        - Coth
        - Arsinh
        - Arcosh
        - Artanh
        - Arcsch
        - Arsech
        - Arcoth
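        A few of these in their standard form, for reference (definitions are module-independent):

        ```latex
        \sigma(x) = \frac{1}{1 + e^{-x}}                    % Sigmoid
        \operatorname{ReLU}(x) = \max(0, x)
        \operatorname{softplus}(x) = \ln(1 + e^{x})
        \operatorname{swish}(x) = x \cdot \sigma(x)
        \tanh(x) = \frac{e^{x} - e^{-x}}{e^{x} + e^{-x}}
        ```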
    2. Possible Optimization Algorithms
        - Batch Gradient Descent
        - Mini-Batch Gradient Descent
        - Stochastic Gradient Descent
        - Gradient Descent with Momentum
        - Nesterov Accelerated Gradient
        - Adagrad Optimizer
        - Adadelta Optimizer
        - Adam Optimizer
        - Adamax Optimizer
        - Nadam Optimizer
        - AMSGrad Optimizer
        - 2nd Order Newton-Raphson Optimizer*
        - Normal Equation*

    <p></p>

    *Only available for linear regression
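    For orientation, here is a minimal, standalone sketch of the simplest entry above, batch gradient descent, fitting a simple linear regression under an MSE cost. It is illustrative only and does not use the module's own classes or API:

    ```cpp
    // Standalone illustration of batch gradient descent (not PMLPP API):
    // fit y ≈ w * x + b by repeatedly stepping against the MSE gradient.
    #include <cstdio>
    #include <vector>

    int main() {
        // Toy data roughly following y = 2x + 1.
        std::vector<double> x = {1.0, 2.0, 3.0, 4.0, 5.0};
        std::vector<double> y = {3.1, 4.9, 7.2, 9.0, 10.8};

        double w = 0.0, b = 0.0;
        const double learning_rate = 0.01;
        const double n = static_cast<double>(x.size());

        for (int epoch = 0; epoch < 2000; ++epoch) {
            // Accumulate the MSE gradient over the full batch.
            double grad_w = 0.0, grad_b = 0.0;
            for (size_t i = 0; i < x.size(); ++i) {
                const double error = (w * x[i] + b) - y[i];
                grad_w += error * x[i];
                grad_b += error;
            }
            // Average the gradient and take one step.
            w -= learning_rate * (2.0 / n) * grad_w;
            b -= learning_rate * (2.0 / n) * grad_b;
        }

        std::printf("w = %f, b = %f\n", w, b);
        return 0;
    }
    ```

    The mini-batch, stochastic, momentum, and adaptive variants listed above differ mainly in how this gradient step is sampled and scaled.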
    3. Possible Loss Functions
        - MSE
        - RMSE
        - MAE
        - MBE
        - Log Loss
        - Cross Entropy
        - Hinge Loss
        - Wasserstein Loss
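        Standard definitions of a few of these, with predictions \hat{y}_i and targets y_i:

        ```latex
        \mathrm{MSE}  = \frac{1}{n} \sum_{i=1}^{n} (\hat{y}_i - y_i)^2
        \mathrm{RMSE} = \sqrt{\mathrm{MSE}}
        \mathrm{MAE}  = \frac{1}{n} \sum_{i=1}^{n} \lvert \hat{y}_i - y_i \rvert
        \text{Log Loss} = -\frac{1}{n} \sum_{i=1}^{n} \left[ y_i \ln \hat{y}_i + (1 - y_i) \ln (1 - \hat{y}_i) \right]
        \text{Hinge}  = \frac{1}{n} \sum_{i=1}^{n} \max(0,\, 1 - y_i \hat{y}_i), \quad y_i \in \{-1, +1\}
        ```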
    4. Possible Regularization Methods
        - Lasso
        - Ridge
        - ElasticNet
        - Weight Clipping
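        In one common parameterization, these add the following penalty terms to the cost (the module's exact form may differ):

        ```latex
        \text{Lasso (L1)}: \quad \lambda \sum_j \lvert w_j \rvert
        \text{Ridge (L2)}: \quad \lambda \sum_j w_j^2
        \text{ElasticNet}: \quad \lambda \left( \alpha \sum_j \lvert w_j \rvert + (1 - \alpha) \sum_j w_j^2 \right)
        ```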
    5. Possible Weight Initialization Methods
        - Uniform
        - Xavier Normal
        - Xavier Uniform
        - He Normal
        - He Uniform
        - LeCun Normal
        - LeCun Uniform
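        The usual variances behind the normal and uniform variants, stated in terms of a layer's fan-in and fan-out (standard formulations; the module's constants may differ slightly):

        ```latex
        \text{Xavier:} \quad \mathrm{Var}(w) = \frac{2}{\mathrm{fan\_in} + \mathrm{fan\_out}}
        \text{He:}     \quad \mathrm{Var}(w) = \frac{2}{\mathrm{fan\_in}}
        \text{LeCun:}  \quad \mathrm{Var}(w) = \frac{1}{\mathrm{fan\_in}}
        \text{Uniform variants sample from } U(-\ell, \ell) \text{ with } \ell = \sqrt{3 \, \mathrm{Var}(w)}
        ```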
    6. Possible Learning Rate Schedulers
        - Time Based
        - Epoch Based
        - Step Based
        - Exponential
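        Common textbook forms of these schedules, with initial rate \eta_0, decay constant k, drop factor d, drop interval r, and epoch/iteration t (the module's exact parameterization may differ):

        ```latex
        \text{Time based:}  \quad \eta_t = \frac{\eta_0}{1 + k t}
        \text{Step based:}  \quad \eta_t = \eta_0 \, d^{\lfloor t / r \rfloor}
        \text{Exponential:} \quad \eta_t = \eta_0 \, e^{-k t}
        ```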
3. ***Prebuilt Neural Networks***
    1. Multilayer Perceptron
    2. Autoencoder
    3. Softmax Network
4. ***Generative Modeling***
    1. Tabular Generative Adversarial Networks
    2. Tabular Wasserstein Generative Adversarial Networks
5. ***Natural Language Processing***
    1. Word2Vec (Continuous Bag of Words, Skip-Gram)
    2. Stemming
    3. Bag of Words
    4. TFIDF
    5. Tokenization
    6. Auxiliary Text Processing Functions
6. ***Computer Vision***
    1. The Convolution Operation
    2. Max, Min, Average Pooling
    3. Global Max, Min, Average Pooling
    4. Prebuilt Feature Detectors
        - Horizontal/Vertical Prewitt Filter
        - Horizontal/Vertical Sobel Filter
        - Horizontal/Vertical Scharr Filter
        - Horizontal/Vertical Roberts Filter
        - Gaussian Filter
        - Harris Corner Detector
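    The convolution operation underlying these filters is, in its discrete 2D form (cross-correlation convention, as commonly used in ML), where I is the image and K a kernel such as a 3×3 Sobel or Prewitt mask:

    ```latex
    (I \star K)(i, j) = \sum_{m} \sum_{n} I(i + m,\, j + n) \, K(m, n)
    ```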
7. ***Principal Component Analysis***
8. ***Naive Bayes Classifiers***
    1. Multinomial Naive Bayes
    2. Bernoulli Naive Bayes
    3. Gaussian Naive Bayes
9. ***Support Vector Classification***
    1. Primal Formulation (Hinge Loss Objective)
    2. Dual Formulation (Via Lagrangian Multipliers)
10. ***K-Means***
11. ***k-Nearest Neighbors***
12. ***Outlier Finder (Using z-scores)***
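    Outliers are flagged from standardized scores; a point is commonly treated as an outlier when its absolute z-score exceeds a chosen threshold such as 3:

    ```latex
    z_i = \frac{x_i - \mu}{\sigma}
    ```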
13. ***Matrix Decompositions***
    1. SVD Decomposition
    2. Cholesky Decomposition
        - Positive Definiteness Checker
    3. QR Decomposition
14. ***Numerical Analysis***
    1. Numerical Differentiation
        - Univariate Functions
        - Multivariate Functions
    2. Jacobian Vector Calculator
    3. Hessian Matrix Calculator
    4. Function Approximator
        - Constant Approximation
        - Linear Approximation
        - Quadratic Approximation
        - Cubic Approximation
    5. Differential Equations Solvers
        - Euler's Method
        - Growth Method
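    For reference, the standard central-difference approximation for numerical derivatives and the Euler update for an ODE y' = f(x, y) are shown below; the module may use a different finite-difference scheme internally:

    ```latex
    f'(x) \approx \frac{f(x + h) - f(x - h)}{2h}
    y_{n+1} = y_n + h \, f(x_n, y_n)
    ```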
15. ***Mathematical Transforms***
    1. Discrete Cosine Transform
16. ***Linear Algebra Module***
17. ***Statistics Module***
18. ***Data Processing Module***
    1. Setting and Printing Datasets
    2. Available Datasets
        1. Wisconsin Breast Cancer Dataset
            - Binary
            - SVM
        2. MNIST Dataset
            - Train
            - Test
        3. Iris Flower Dataset
        4. Wine Dataset
        5. California Housing Dataset
        6. Fires and Crime Dataset (Chicago)
    3. Feature Scaling
    4. Mean Normalization
    5. One Hot Representation
    6. Reverse One Hot Representation
    7. Supported Color Space Conversions
        - RGB to Grayscale
        - RGB to HSV
        - RGB to YCbCr
        - RGB to XYZ
        - XYZ to RGB
19. ***Utilities***
    1. TP, FP, TN, FN function
    2. Precision
    3. Recall
    4. Accuracy
    5. F1 score
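    These metrics are computed from the TP/FP/TN/FN counts in the usual way:

    ```latex
    \text{Precision} = \frac{TP}{TP + FP} \qquad
    \text{Recall}    = \frac{TP}{TP + FN}
    \text{Accuracy}  = \frac{TP + TN}{TP + TN + FP + FN} \qquad
    F_1 = \frac{2 \cdot \text{Precision} \cdot \text{Recall}}{\text{Precision} + \text{Recall}}
    ```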

## Todos

### Saves

Reimplement saving.

### Bind remaining methods

Go through and bind all methods. Also add properties as needed.
### Add an initialization API to all classes that need it

The original library used constructors to initialize everything, but engine scripts can't rely on that. Make sure all classes have an initialization API and bail out when they are in an uninitialized state; a sketch of the pattern is shown below.
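A hypothetical sketch of that pattern (class and member names are illustrative, not the module's actual API); in engine code the bail-out would typically go through the engine's error macros rather than a plain early return:

```cpp
// Illustrative only: explicit initialize() plus an initialized flag,
// so script-created instances can't run math on unset state.
#include <vector>

class ExampleModel {
public:
    void initialize(int input_size) {
        _input_size = input_size;
        _weights.assign(input_size, 0.0);
        _initialized = true;
    }

    double predict(const std::vector<double> &input) {
        // Bail out if a script calls this before initialize().
        if (!_initialized || (int)input.size() != _input_size) {
            return 0.0;
        }
        double result = 0.0;
        for (int i = 0; i < _input_size; ++i) {
            result += _weights[i] * input[i];
        }
        return result;
    }

private:
    bool _initialized = false;
    int _input_size = 0;
    std::vector<double> _weights;
};
```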
### Rework remaining APIs

Rework and bind the remaining APIs so they can be used from scripts.

### Error handling

Make error macro usage consistent. A command-line option should also be available that disables these checks for math operations.

### Crashes

There are likely still lots of crashes; find and fix them.

### Unit tests

- Add more unit tests.
- Also use the engine's own unit test module. It still needs to be finished; doing it alongside this module's tests would be a good idea.
- Tests should only be built when wanted. Command line option: `mlpp_tests=yes`

### std::random

Replace remaining std::random usage with engine internals.

### Tensor

Add an N-dimensional tensor class.

### More algos

Add more machine learning algorithms.

## Citations

Originally created by Marc Melikyan: https://github.com/novak-99/MLPP