ex1
GitHub: https://github.com/DLW3D/coursera-machine-learning-ex
Exercise files: https://s3.amazonaws.com/spark-public/ml/exercises/on-demand/machine-learning-ex1.zip
Linear Regression
Feature Normalization
featureNormalize.m
function [X_norm, mu, sigma] = featureNormalize(X)
%FEATURENORMALIZE Normalizes the features in X
%   Returns a version of X where each column has zero mean and unit
%   standard deviation; mu and sigma are returned so that new examples
%   can be normalized the same way later.
mu = mean(X);
sigma = std(X);
% (X - mu) relies on implicit broadcasting (Octave / MATLAB R2016b+)
X_norm = (X - mu) ./ sigma;
end
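As a quick sanity check (my own example, not part of the exercise files), the snippet below normalizes a small matrix; every column of the result should come out with zero mean and unit standard deviation:

% Sketch with made-up data: house size and number of bedrooms
A = [2104 3; 1600 3; 2400 3; 1416 2];
[A_norm, mu, sigma] = featureNormalize(A);
mean(A_norm)   % approximately [0 0]
std(A_norm)    % approximately [1 1]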
Cost Function
computeCostMulti.m
function J = computeCostMulti(X, y, theta)
%COMPUTECOSTMULTI Compute cost for linear regression with multiple variables
m = length(y);  % number of training examples
% Vectorized squared-error cost for this choice of theta
J = (X * theta - y)' * (X * theta - y) / (2 * m);
end
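The single line above is the vectorized form of the squared-error cost that the rest of the exercise minimizes:

J(\theta) = \frac{1}{2m}\sum_{i=1}^{m}\bigl(h_\theta(x^{(i)}) - y^{(i)}\bigr)^2 = \frac{1}{2m}(X\theta - y)^{\top}(X\theta - y)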
Gradient Descent
gradientDescentMulti.m
function [theta, J_history] = gradientDescentMulti(X, y, theta, alpha, num_iters)
%GRADIENTDESCENTMULTI Performs gradient descent to learn theta
%   theta = GRADIENTDESCENTMULTI(X, y, theta, alpha, num_iters) updates theta by
%   taking num_iters gradient steps with learning rate alpha
m = length(y);
J_history = zeros(num_iters, 1);
for iter = 1:num_iters
    % Perform a single gradient step on the parameter vector theta
    n = size(X, 2);
    h = X * theta;
    dtheta = zeros(n, 1);
    % Accumulate the partial derivative for each parameter j
    for j = 1:n
        gradient = 0;
        for i = 1:m
            gradient = gradient + (h(i) - y(i)) * X(i, j);
        end
        dtheta(j) = gradient * alpha / m;
    end
    theta = theta - dtheta;
    % Save the cost J in every iteration
    J_history(iter) = computeCostMulti(X, y, theta);
end
end
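The nested j/i loops spell out the partial derivative for each parameter. For reference, the same batch update can be written as one matrix expression; the function below is only a sketch (the name gradientDescentVec is my own, not a file from the exercise) and produces the same result as the loop version above:

function [theta, J_history] = gradientDescentVec(X, y, theta, alpha, num_iters)
%GRADIENTDESCENTVEC Vectorized sketch of the same batch update
m = length(y);
J_history = zeros(num_iters, 1);
for iter = 1:num_iters
    % theta := theta - (alpha/m) * X' * (X*theta - y)
    theta = theta - (alpha / m) * (X' * (X * theta - y));
    J_history(iter) = computeCostMulti(X, y, theta);
end
end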
Normal Equation
normalEqn.m
function [theta] = normalEqn(X, y)
%NORMALEQN Computes the closed-form solution to linear regression
%   NORMALEQN(X, y) computes the closed-form solution to linear
%   regression using the normal equation.
% pinv is preferred over inv here: it stays well-behaved even when
% X'*X is singular or nearly singular (e.g. redundant features).
theta = pinv(X' * X) * X' * y;
end
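The normal equation needs neither feature scaling nor a learning rate, so it makes a convenient cross-check for gradient descent. A sketch of running it directly on the raw data (variable names here are only for illustration):

% Sketch: closed-form solution on the raw (unscaled) data
data = load('ex1data2.txt');
m = size(data, 1);
X_raw = [ones(m, 1), data(:, 1:2)];   % add intercept term
y = data(:, 3);
theta_ne = normalEqn(X_raw, y);
fprintf('Theta computed from the normal equation: \n');
fprintf(' %f \n', theta_ne);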
Running Linear Regression
data = load('ex1data2.txt');
X = data(:, 1:2);
y = data(:, 3);
m = length(y);
% Scale features and set them to zero mean
fprintf('Normalizing Features ...\n');
[X, mu, sigma] = featureNormalize(X);
% Add intercept term to X
X = [ones(m, 1) X];
fprintf('Running gradient descent ...\n');
% Choose some alpha value
alpha = 0.01;
num_iters = 400;
% Init Theta and Run Gradient Descent
theta = zeros(3, 1);
[theta, J_history] = gradientDescentMulti(X, y, theta, alpha, num_iters);
% Plot the convergence graph
figure;
plot(1:numel(J_history), J_history, '-b', 'LineWidth', 2);
xlabel('Number of iterations');
ylabel('Cost J');
% Display gradient descent's result
fprintf('Theta computed from gradient descent: \n');
fprintf(' %f \n', theta);
fprintf('\n');
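With theta, mu and sigma in hand, predicting the price of a new house only requires normalizing its features with the same mu and sigma before adding the intercept term. The 1650 sq-ft, 3-bedroom example below is just an illustration:

% Sketch: predict the price of a 1650 sq-ft, 3-bedroom house
x_new = ([1650 3] - mu) ./ sigma;   % normalize with the training-set mu/sigma
price = [1, x_new] * theta;         % add intercept term, then apply theta
fprintf('Predicted price of a 1650 sq-ft, 3 br house: $%f\n', price);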