BigML Certifications

The breadth of intelligent applications the BigML platform can support spawn many new opportunities for BigML partners to get involved in delivering Machine Learning-based solutions. Our certifications are perfect for software developers, system integrators, technology consulting, and strategic consulting firms to rapidly get up to speed with Machine Learning and the BigML platform as they acquire and grow their customer base.

In order to be eligible to enroll into the BigML Certified courses you must show certain level of proficiency in Machine Learning, BigML Dashboard, BigML API, and WhizzML. The following getting started assets will get you up and running in no time: ML 101, Tutorials, API documentation, and WhizzML.

Certified Engineer

This certification course prepares analysts, scientists, and software developers to become BigML Certified Engineers. It consists of 8 online classes of 1.5 hours each. Evaluation will be based on solving a set of theoretical questions and exercises presented during the course. The modules listed below will consist of 2 sessions each to complete the 8 online classes.



1 Advanced Modeling


  • Understand how to parameterize supervised and unsupervised methods to achieve better performance.
  • Learn how to compose multiple methods together to better solve modeling problems.



  • Modeling vs. Prediction
  • Supervised Learning

    Decision Trees: Node threshold, Weights, Statistical Pruning, Modeling Missing Values.

    Ensemble Classifiers: Bagging (Sample Rates, Number of Models), Random Decision Forests (Random Candidates), Boosting.

    Linear Regression: Field Encodings.

    Logistic Regression: L1 Normalization, L2 Normalization, Field Encodings, Scales.

    Deepnets: Topologies, Gradient Descent Algorithms, Automatic Network Discovery.

    Time Series: Error, Trend, Damped, Seasonality.

    Evaluation: How to Properly Evaluate a Predictive Model, Cross-Validation, ROC Spaces and Curves.

    OptiML: How to optimize the process for model selection and parametrization to automatically find the best model for a given dataset.

    Fusion: Combination of models, ensembles, linear regressions, logistic regressions, and deepnets to balance out the individual weaknesses of single models.

  • Unsupervised Learning

    Clustering: Number of Clusters, Dealing with Missing Values, Modeling Clusters, Scaling Fields, Weights, Summary Fields, K-means vs. G-means.

    Association Discovery: Measures (Support, Confidence, Leverage, Significance Level, Lift), Search Strategies (Confidence, Coverage, Leverage, Lift, Support), Missing Items, Discretization.

    Topic Modeling: Topics, Terms, Text analysis.

    Anomaly Detection: Forest Size, Constraints, ID Fields.

  • Combination and Automation



  • The lecturer will be available between 08:00 AM and 08:00 PM PT. Please send an email to for other time ranges.
2 Advanced API


  • Proficiency in using BigML's API and client-side tools to create ML resources.
  • Integration and automation of the workflows needed put a ML solution in production.


  • Basic knowledge of BigML and its resources (UI-level familiarity is enough).
  • Basic programming skills (some examples are in Python, so knowledge of the language will be a plus).
  • Familiarity with REST APIs.


  • API description

    Domains ( vs. Private Deployments).


    Inputs and outputs.

    Resources: Common information, Specifics, Listing and filtering.

  • First level wrappers


    Methods mapping.

    Field management.

    Local resources.

  • Second level wrappers


    Resource management.

    Field management.

    Workflow automation.

    Automated feature engineering.

  • Modeling strategies
  • Predicting strategies


  • The lecturer will be available between 01:00 AM and 01:00 PM PT. Please send an email to for other time ranges.
3 Advanced Data Transformations


  • Data is typically: scattered, unclean, and imperfect. How to make it ML-Ready.
  • Once data is ML-Ready, why/how to make better features.
  • Not all features are good. How to choose and what to watch out for.


  • Advanced Modeling Class.
  • Familiarity with: SQL, Python / Pandas, CSV formatting.


  • ML-Ready Data

    What is it?


    Structures for ML tasks.

    Automating Labeling.

  • Data Transformations

    Cleansing Missing Data, Cleaning Data, Better Data.

    Transformations outside and inside BigML: SQL-style queries, Denormalizing, Aggregating, Pivoting, Time windows, Updates, Streaming Data, Images.

    Principal Component Analysis (PCA): Dataset transformation and dimensionality reduction.

  • Feature Engineering

    Auto Transformations: Date-time parsing, LR/cluster missing, LR/cluster auto-scaling, Bag-of-words (Language, Tokenization, etc).

    Manual - Flatline: DSL for feature engineering, Basics (s-expressions/formulas, Literals, Counters, Field Values / Properties, Strings, Regex, Operators), Limitations.

    Numerics: Discretization, Normalization, Z-score, Built-in math functions, Type-casting, Random, Shocks, Moving averages.

    Date-times: UI timestamp, Epoch, Moon phase.

    Text: JSON key/val, Topic distributions.

  • Feature Selection



    Field Importance (ensembles).

    Advanced Selection: Best-First, Boruta.


  • The lecturer will be available between 10:00 AM and 10:00 PM PT. Please send an email to for other time ranges.
4 Advanced WhizzML


  • Proficiency in using BigML's DSL language, WhizzML, as a server-side tool to automate ML-workflows in a scalable, replicable and shareable way.


  • Basic knowledge of BigML and its resources (UI-level familiarity is enough).
  • Familiarity with ML-workflows.
  • Basic programming skills (knowledge of some language of the LISP-family and/or WhizzML will be a plus).


  • WhizzML directives
  • Directives mappings
  • Simple workflows in WhizzML

    Batch Anomaly Score.


    Clustered dataset generation.

  • Advanced workflows in WhizzML


    Covariate shift.

    Stacked generalization.


  • The lecturer will be available between 03:00 PM and 09:00 PM PT. Please send an email to for other time ranges.
Certifications calendar
Registered by Starts Certification by
37th Registered by September 24, 2021 Starts September 27, 2021 Certification by November 5, 2021
38th Registered by November 5, 2021 Starts November 8, 2021 Certification by December 17, 2021
39th Registered by December 17, 2021 Starts December 20, 2021 Certification by January 28, 2022
40th Registered by January 28, 2022 Starts January 31, 2022 Certification by March 11, 2022
41st Registered by March 11, 2022 Starts March 14, 2022 Certification by April 22, 2022
42nd Registered by April 22, 2022 Starts April 25, 2022 Certification by June 3, 2022
43rd Registered by June 3, 2022 Starts June 6, 2022 Certification by July 15, 2022
44th Registered by July 15, 2022 Starts July 18, 2022 Certification by August 26, 2022
45th Registered by August 26, 2022 Starts August 29, 2022 Certification by October 7, 2022
46th Registered by October 7, 2022 Starts October 10, 2022 Certification by November 18, 2022
47th Registered by November 18, 2022 Starts November 21, 2022 Certification by December 30, 2022

Certified Architect

This certification course prepares BigML Certified Engineers to become BigML Certified Architects. Once you have successfully become a BigML Certified Engineer, you are eligible to enroll into the BigML Certified Architect course. The certification process consists of 8 online classes of 1.5 hours each. Evaluation will be based on solving a set of theoretical questions and exercises presented during the course.



  • Get ready to design and build robust Machine Learning-based applications that operate in real-world environments.


  • BigML Certified Engineer.
1 Machine Learning Engineering


  • Real-world Machine Learning
  • Building end-to-end Machine Learning applications
  • How to size and address your project

    Premature optimization is the root of all evil in Machine Learning as well.

    Automating the automatable.

2 BigML Predictions


  • How to generate thousands of predictions per second
  • How to store predictions for further analyses
  • How to implement robust predictions
3 Model Risk Management


  • Local models vs. remote models
  • How to use and operate models
  • How to monitor your models
4 Machine Learning Models: How to Automatically Create Models


  • Automated model and parameter selection
  • When good is "good enough"
  • What your actual test set tells you about your model
5 Model Retraining: When and How to Retrain Models


  • Tracking models over time. You can learn from everything
  • Automating covariate shift detection
  • Active Learning
6 Building Datasets for Machine Learning


  • Diversity vs. volume
  • Detecting biases
  • Detecting blind spots
7 Automatically Preparing Your Data for Machine Learning


  • Choice of data engineering tools
  • Automating feature selection
  • Automating feature generation
8 Putting It All Together


  • Anatomy of a robust Machine Learning application
  • Lessons learned and best practices
  • Design patterns: beyond lessons learned and best practices