Lecciones

Mapa de módulos

Data Science and Machine Learning - 16 wks

Las siguientes lecciones explican diferentes conceptos de programación y han sido publicadas por miembros de 4Geeks

Prework Review

4 Lecciones

The Pre-work was about Python, Numpy, Pandas and Matplotlib but we understand that was a lot, let's go over some of the concepts with the rest of your cohort and mentors.

Data Science con Python Introduccion a Numpy Introduccion a Pandas en Python Introduccion a Matplotlib

Calculus and L. Algebra

4 Lecciones

Linear Algebra's Matrix and Vectors are heavily used in Machine Learning, these structures are mainly used to store manipulate high amounts of information. Also, L. algebra functions "f(x)" help understand the relationship between target variables "y" and their predictors "x".

Álgebra Lineal Cálculo Derivadas con Python Integrales y Derivadas Graficar funciones con Python

Probability

1 Lección

A machine learning algorithm is basically all about probability, you have to predict how probable an event or data is to happen, that is why we need to go over the basic probability functions to better create our future models.

Probabilidad

Descriptive Statistics

2 Lecciones

When you have a big dataset, you cannot calculate very specific values because there is too much data, what you can do is get central tendency values like the mean, median and standard deviation and others.

Media y desviacion estandar con python Estadísticas Descriptivas

Random Variables

5 Lecciones

In order to predict better, we first need to understand how our dataset values are distributed, based on that we can better identify outliers, fill missing values and do better data mining in general.

Probabilidad Binomial con Python Plotting Distributions with Python Plotting Binomial Distribution with Python Probabilidad Bernoulli con Python Variables Aleatorias

Hypothesis Testing

1 Lección

Before creating an algorithm/model with your assumptions, it's recommended to use Hypothesis Testing (defining a null and alternative hypothesis) and test agains it.

Evaluación de hipótesis

Algorithm Optimization

1 Lección

Algorithms need time and space to run, as a machine learning engineer you should learn how to optimize your code to take the lowest amount of space without taking too much time to execute.

Optimizacion de Algoritmos y Estructuras de Datos

M.L. Ops

2 Lecciones

If you are going to be deploying your machine learning models you need to know the command line, python package manager (PIP) and the cookie cutter boilerplate (at least). Today you will get the first Machine Learning Operations lesson.

Como utilizar Gitpod La Línea de Comando (conocida como: La Terminal)

Intro to SQL

3 Lecciones

SQL is the language for data, all the famous database engines use it to query or manipulate the data inside the database, lets get familiar with the most basic concepts, instructions and connect to our first real database.

Todo lo que necesitas saber sobre SQLAlchemy Conexión a Bases de Datos SQL Dominando Bases de Datos: ¿Qué es la Base de Datos SQL?

Web Scraping

3 Lecciones

Scraping is one of the most used sources of gathering data in the world of machine learning. This technology allows you to download almost anything publicly available on the internet, even without an API! Let's do our first scraping and learn the basics of it.

Web Scraping con Beautiful Soap ¿Què es HTTP?Importación de Archivos Estáticos con Pandas

API Requests

1 Lección

The last wait to fetch or retrieve data that we will be learning during the courses is API integrations, that are millions of public API's on the internet with very valuable information. As a Machine Learning Engineer sometimes you will need to use the python request package to connect to an API and get the extra data you need.

Entendiendo las Rest Apis

Exploratory Data Analysis

2 Lecciones

Find patterns in your data in order to get insights and valuable information. Use that information to make decisions and generate better predictions. If your data is garbage, the output will be garbage: Clean your data to avoid poor quality outputs.

Análisis y Limpieza de Datos Exploratorios con Titanic Cómo tomar el Módulo de Análisis Exploratorio de Datos

Your first ML Algorithm

4 Lecciones

During this module you will learn the basics of machine learning, the evaluation metrics and how to optimize your ML algo. We will start our journey with logistic regressions.

Regresión Logística Optimización de Hiperparámetros del modelo Métricas de Evaluación Básicos de Machine Learning

Linear Regression

2 Lecciones

Read the linear regression theory and run the code in the exploring linear regression notebook to practice. Then go to your project and predict the cost of a medical insurance using Linear Regression

Explorando la Regresión lineal Que es una regresion lineal y para que utilizarla

Regularized Linear. Reg.

1 Lección

It is very important to avoid overfitting, so in this lesson you will learn about regularized linear regression models, which are a common way to avoid it.

Modelos Lineales Regularizados

Decision Tree Algo.

2 Lecciones

This is one of the most used algorithms in the industry. Decision Tree's are used for both classification and regression problems. This algorithm makes decisions by building trees with nodes, leaves and branches to make decisions.

Explorando Árboles de Decisión Árboles de Decisión

Random Forest Algo.

1 Lección

In this module we will add some randomness to our trees and build machine learning models using Random Forest.

Bosque Aleatorio

Boosting Algorithms

1 Lección

In this lesson, we will learn about boosting techniques, specifically about gradient descent algorithm and XGBoost (extreme gradient descent).

Boosting de Algoritmos

Naive Bayes Algorithm

2 Lecciones

Were you wondering when are you going to apply Bayes Theorem? Now it's the time. The Naive Bayes algorithm is one of the fastest algorithm and its based in the bayes theorem. We will use it for classification and also as a brief and simple introduction to NLP, which we'll learn deeper in another module.

Explorando Naive Bayes Algoritmo Naive Bayes

Intro to NLP

3 Lecciones

In this module we will learn the basics of a new algorithm: Support Vector machine and we will also have an intro to Natural Language Processing. We will combine both by practicing with an email spam classifier on the exploring NLP notebook, and then you will work on a URL spam classifier in your project.

Explorando el Procesamiento del Lenguaje Natural Procesamiento Natural del Lenguaje Máquinas de Vectores de Soporte

K-nearest neighbors

1 Lección

In this module we will learn about the k-nearest neighbors algorithm and we will dive into a very simple recommender system built with k-nearest neighbors.

K Vecinos más Cercanos

Unsupervised learning

1 Lección

In this module we will learn about a couple of unsupervised algorithms but we will focus on k-means for clustering with a very simple project to help you understand how to group data in clusters.

Aprendizaje no Supervisado

Time Series forecasting

2 Lecciones

In this lesson, we will learn how to recognize and deal with time series when they are present in our datasets. This lesson's project will be a real time competition so get all your skills ready!

Explorando Series de Tiempo Series de Tiempo

Intro to Deep Learning

2 Lecciones

This is our last module on algorithms. We will close this part with a brief introduction to deep learning and neural networks so that we can learn a more complex but efficient model.

Explorando Redes Neuronales Introducción a Deep Learning

ML web app using Flask

1 Lección

In this lesson, we will learn how to build a machine learning web application using one of your best models and the Flask tool. Then we will deploy it to Heroku so that our model can be showed to the world.

Implementar Modelo Usando Flask y Heroku

ML webapp using Streamlit

1 Lección

In this lesson, we will learn how to build a machine learning web application using one of your best models and the Streamlit tool. Then we will deploy it to Heroku so that our model can be showed to the world.

Implementar Modelo Usando Streamlit y Heroku

Cloud Computing for ML

2 Lecciones

This is our last lesson, a brief introduction to cloud computing resources for machine learning. When using large datasets, it is important to know what resources are available for us in the cloud. They can help us reducing training times. Read this lesson at home and discuss it in class. Feel free to dive deeper into your preferred platform with the learning links mentioned in the lesson.

Introducción a AWS SageMaker Computación en la Nube