2.2.2 汽车油耗预测

请根据题目要求，在下方空白处填入正确的代码（点击 💡 按钮查看提示）

数据集说明

文件名：auto-mpg.csv

mpg	cylinders	displacement	horsepower	weight	acceleration
18.0	8	307.0	130	3504	12.0
15.0	8	350.0	165	3693	11.5
18.0	8	318.0	150	3436	11.0
16.0	8	304.0	150	3433	12.0
17.0	8	302.0	140	3449	10.5
15.0	8	429.0	198	4341	10.0
14.0	8	454.0	220	4354	9.0
14.0	8	440.0	215	4312	8.5
14.0	8	455.0	225	4425	10.0
15.0	8	390.0	190	3850	8.5

共 398 条数据，仅展示前 10 条

代码填空

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import Pipeline
import pickle
from sklearn.ensemble import RandomForestRegressor
# 加载数据集
df = 
# 显示前五行数据
print()
# 处理缺失值
# 将 'horsepower' 列中的所有值转换为数值类型
df['horsepower'] = (, errors='coerce')
# 删除包含缺失值的行
df = 
# 选择相关特征进行建模（定义自变量（返回一个DataFrame）和因变量）
X = 
y = 
# 将数据集划分为训练集和测试集（测试集占比20%）
X_train, X_test, y_train, y_test = (, random_state=42)
# 创建包含标准化和线性回归的管道
pipeline = ([('scaler', ),('linreg', )])
# 训练模型
# 保存训练好的模型
with open('2.2.2_model.pkl', 'wb') as model_file:
    pickle.
# 预测并保存结果
y_pred = 
results_df = pd.DataFrame(y_pred, columns=['预测结果'])
('2.2.2_results.txt', index=False)
# 测试模型
with open('2.2.2_report.txt', 'w') as results_file:
    results_file.write(f'训练集得分: {pipeline.score(X_train, y_train)}\n')
    results_file.write(f'测试集得分: {pipeline.score(X_test, y_test)}\n')
# 创建随机森林回归模型实例（创建的决策树的数量为100）
rf_model = (, random_state=42)
# 训练随机森林回归模型
# 使用随机森林模型进行预测
y_pred_rf = 
# 保存新的结果
results_rf_df = pd.DataFrame(y_pred_rf, columns=['预测结果'])
('2.2.2_results_rf.txt', index=False)
# 测试模型并保存得分
with open('2.2.2_report_rf.txt', 'w') as results_rf_file:
    results_rf_file.write(f'训练集得分: {rf_model.score(X_train, y_train)}\n')
    results_rf_file.write(f'测试集得分: {rf_model.score(X_test, y_test)}\n')