subject

Suppose you are a data scientist at Machine Learning Pythons LCC. You would like to create and evaluate a few models to predict bike rental count for certain day. Then you need to select the model that performs the best on your dataset. To get credit for this question, you must use Python with Pandas, numpy, and Sklearn to implement/code the solution. The Bike sharing dataset is in the attached files of this assignment (bike_sharing. csv). For this question, you need to do the following: 1. Write a Python code to load the dataset into Pandas dataframe and then save it in an Excel file fi. e., bike_sharing. xlsx). The file must be saved in the same working directory (the same folder your code is saved in.)
2. Explore the data by printing information about the columns/variables of the dataset. Then, Plot the relationship between every two variables.
3. Split your dataset into two partitions so the training is 75% of the given dataset and 25% for the testing
4. Fit a linear regression model on the training split. The model must predict the bike rental count at certain day. You should include the following in your code:
a. Check for missing values (i. e., null and nan values) and show how many missing values in each column and how many missing values in each row of the dataset (Remember this step needs to be applied before splitting the dataset.)
b. Remove the rows with missing values from the dataset before using it as input to your model dataset (Remember this step needs to be applied before splitting the dataset.)
c. Scale your training dataset. (Remember: do not scale the target (label) variable. Moreover, this will be two sub-steps that are applied after you split the data to train and test. Stepi, Calculate the scale parameters using the train dataset. Step ii, Use the parameters you calculated at step i to scale both the training and the testing dataset.) After that, you must save your training and testing dataset to CSV files, each split in a file (es, bike_test. csv, bike_train. csv.)
d. Show the training mean squared error (MSE). i. e., the error of labeling the training dataset using the model.

ansver
Answers: 1

Another question on Computers and Technology

question
Computers and Technology, 21.06.2019 14:00
Var add = function( x, y ) { return ( x + y ); } alert( add (5, 3) ); 11. (refer to code example 2.) the function a. accepts 2 parameters and returns 2 values. b. accepts 2 parameters and returns 1 value. c. accepts 2 parameters and does not return a value. d. does not accept a parameter and returns 1 value.
Answers: 1
question
Computers and Technology, 22.06.2019 11:10
Look at the far left lane in the picture. explain what the red car is doing and what it needs to do to travel safely.
Answers: 2
question
Computers and Technology, 23.06.2019 09:30
Light travels at a speed of 186,000 miles a second. the distance light travels in a year is 5,865,690,000,000 miles/year 5,865,695,000,000 miles/year 58,656,950,000,000 miles/year 6,789,000,0000 miles/year
Answers: 1
question
Computers and Technology, 25.06.2019 00:40
Roger ginde is developing a program in supply chain management certification for managers. ginde has listed a number of activities that must be completed before a training program of this nature could be conducted. the activities, immediate predecessors, and times appear in the accompanying table: activity immediate predecessor(s) time (days) a -− 44b -− 55 c -− 11 d b 1010 e a, d 55 f c 55 g e, f 1010 calculate the slack time. show the details.
Answers: 2
You know the right answer?
Suppose you are a data scientist at Machine Learning Pythons LCC. You would like to create and evalu...
Questions
question
Mathematics, 11.07.2019 19:30
question
Mathematics, 11.07.2019 19:30
Questions on the website: 13722363