4Geeks logo
4Geeks logo
About us

Learning library

For all the self-taught geeks out there, here our content library with most of the learning materials we have produces throughout the years.

It makes sense to start learning by reading and watching videos about fundamentals and how things work.

Full-Stack Software Developer

Data Science and Machine Learning - 16 wks

Search from all Lessons

Social & live learning

The most efficient way to learn: Join a cohort with classmates just like you, live streams, impromptu coding sessions, live tutorials with real experts, and stay motivated.

← Back to Projects

Continue learning for free about:

Boosting Algorithms Project Tutorial

Goal

4Geeks Coding Projects tutorials and exercises for people learning to code or improving their coding skills

Difficulty

beginner

Repository

Click to open

Video

Not available

Live demo

Not available

Average duration

2 hrs

Technologies

  • Bike sharing system can be a virtual sensor network that can be used for sensing mobility in a city. Hence, it is expected that most of important events in the city could be detected via monitoring these data.

  • In this project, practice your new Boosting Algorithm skills trying to predict the number of bike rentals in a specific city.

🌱 How to start this project

You will not be forking this time, please take some time to read this instructions:

  1. Create a new repository based on machine learning project by clicking here.
  2. Open the recently created repository on Gitpod by using the Gitpod button extension.
  3. Once Gitpod VSCode has finished opening you start your project following the Instructions below.

πŸš› How to deliver this project

Once you are finished creating your model, make sure to commit your changes, push to your repository and go to 4Geeks.com to upload the repository link.

πŸ“ Instructions

Predicting bike rental using Boosting Algorithm

Step 1:

The dataset can be found in this project folder as 'bike_sharing_dataset.csv' file. The core data set is related to the two-year historical log corresponding to years 2011 and 2012 from Capital Bikeshare system, Washington D.C., USA which is publicly available in http://capitalbikeshare.com/system-data. You are also welcome to load it directly from the following link (https://raw.githubusercontent.com/4GeeksAcademy/random-forest-project-tutorial/main/impressions.csv), or to download it and add it to your data/raw folder. In that case, don't forget to add the data folder to the .gitignore file.

You can find the description of each feature in the 'data-dictionary.md' file on this folder.

Time to work on it!

Step 2:

Explore and clean the data.

Step 3:

Build a first baseline model using Linear Regression. Chose an evaluation metric. Then, use a boosting algorithm and evaluate the performance of both models.

Step 4:

Use the app.py to create your pipeline.

Step 5:

To save your model and be able to use it later use the following code:

1 2import pickle 3 4filename = 'finalized_model.sav' 5pickle.dump(model, open(filename, 'wb'))

In your README file write a short summary.

Goal

4Geeks Coding Projects tutorials and exercises for people learning to code or improving their coding skills

Difficulty

beginner

Repository

Click to open

Video

Not available

Live demo

Not available

Average duration

2 hrs