当前位置：移动技术网 > IT编程>脚本编程>Python > python实现逻辑回归

python实现逻辑回归

2020年07月22日 | 移动技术网IT编程 | 我要评论

1.自定义代码实现

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split


def sigmoid(z):
    s = 1 / (1 + np.exp(-z))
    s = s.reshape(s.shape[0], 1)  # s.shape[0]表示求数组的长度
    return s


def draw_sigmoid():
    x = np.arange(-6, 6, .01)  # 返回一个有起点有终点且固定步长的排列，左闭右开
    y = sigmoid(x)

    plt.plot(x, y, color='red', lw=2)
    plt.show()


def model(theta, X):
    z = np.sum(theta.T * X, axis=1)  # 压缩列
    return sigmoid(z)


# 定义损失函数
# h(x)
def cross_entropy(y, y_hat):
    n_samples = y.shape[0]
    return sum(-y * np.log(y_hat) - (1 - y) * np.log(1 - y_hat)) / n_samples


def cost_function(theta, X, y):
    y_hat = model(theta, X)
    return cross_entropy(y, y_hat)


# 梯度下降
def optimize(theta, X, y):
    n = X.shape[0]
    alpha = 1e-1
    y_hat = model(theta, X)
    dtheta = (1.0 / n) * ((y_hat - y) * X)
    dtheta = np.sum(dtheta, axis=0)  # 压缩行
    dtheta = dtheta.reshape((31, 1))
    theta = theta - alpha * dtheta
    return theta


# 对数据进行迭代
def iterate(theta, X, y, times):
    costs = []
    accs = []
    for i in range(times):
        theta = optimize(theta, X, y)
        costs.append(cost_function(theta, X, y))
        accs.append(accuracy(theta, X, y))

    return theta, costs, accs


# 对数据进行评估
def predict_proba(theta, X):
    y_hat = model(theta, X)
    return y_hat


def predict(X, theta):
    y_hat = predict_proba(theta, X)
    y_hard = (y_hat > 0.5) * 1
    return y_hard


def accuracy(theta, X, y):
    y_hard = predict(X, theta)
    count_right = sum(y_hard == y)
    return count_right * 1.0 / len(y)


# 载入数据
dataset = load_breast_cancer()
data = pd.DataFrame(data=dataset.data, columns=dataset.feature_names)
data['cancer'] = [dataset.target_names[t] for t in dataset.target]

# 赋值数据  shape[0] shape[1]代表数据的维度
X = dataset.data
y = dataset.target
n_features = X.shape[1]

std = X.std(axis=0)  # 按照行 竖直方向计算标准差
mean = X.mean(axis=0)  # 按照行 竖直方向计算均值
X_norm = (X - mean) / std  # 标准差标准化，经过处理的数据符合标准正态分布


def add_ones(X):
    ones = np.ones((X.shape[0], 1))
    X_with_ones = np.hstack((ones, X))
    return X_with_ones


X_with_ones = add_ones(X_norm)

X_train, X_test, y_train, y_test = train_test_split(X_with_ones, y, test_size=0.3, random_state=12345)
y_train = y_train.reshape((y_train.shape[0], 1))
y_test = y_test.reshape((y_test.shape[0], 1))

# 应用算法
theta = np.ones((n_features+1,1))
theta, costs, accs = iterate(theta, X_train, y_train, 1500)
plt.plot(costs)    # 画出代价函数
plt.plot(accs)     # 画出准确率变化
plt.show()
print(accuracy(theta, X_test, y_test))

2.库函数调用

from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
dataset = load_breast_cancer()
X = dataset.data
y = dataset.target
X_train, X_test, y_train, y_test = train_test_split(X,y,test_size=0.3,random_state=12345)

lr=LogisticRegression()
lr.fit(X_train,y_train)
print(lr.score(X_train,y_train))
print(lr.score(X_test,y_test))

本文地址：https://blog.csdn.net/qq_40690199/article/details/107466555

您可能感兴趣的文章:

如对本文有疑问，点击进行留言回复！！

Python如何合并多个字典或映射

问题现在有多个字典或者映射，你想将它们从逻辑上合并为一个单一的映射后执行某些操作，比如查找值或者检查某些键是否存在。解决方案加入你有如下两个字典:a = {'x... [阅读全文]
Python图像处理二值化方法实例汇总

在用python进行图像处理时，二值化是非常重要的一步，现总结了自己遇到过的6种图像二值化的方法（当然这个绝对不是全部的二值化方法，若发现新的方法会继续新增）... [阅读全文]
浅析Python 多行匹配模式

问题你正在试着使用正则表达式去匹配一大块的文本，而你需要跨越多行去匹配。解决方案这个问题很典型的出现在当你用点(.)去匹配任意字符的时候，忘记了点(.)不能匹配... [阅读全文]
python实现学生管理系统开发

使用python完成超级基础的学生管理系统，供大家参考，具体内容如下说明：1、本学生管理系统非常非常简易，只有增，显，查，删，改功能，对于python新手容易看... [阅读全文]
深入了解NumPy 高级索引

numpy 比一般的 python 序列提供更多的索引方式。除了之前看到的用整数和切片的索引外，数组可以由整数数组索引、布尔索引及花式索引。整数数组索引以下实例... [阅读全文]
Python 解析简单的XML数据

问题你想从一个简单的xml文档中提取数据。解决方案可以使用 xml.etree.elementtree 模块从简单的xml文档中提取数据。为了演示，假设你想解析... [阅读全文]
用python实现学生管理系统

学生管理系统相信大家学各种语言的时候，练习总是会写各种管理系统吧，管理系统主要有对数据的增删查改操作，原理不难，适合作为练手的小程序数据的结构要保存数据就需要数... [阅读全文]
Python按照先后顺序，对列表进行多条件自定义排序

需求：对指定的列表，按照以下顺序排序：①先按照【编号】从小到大进行排序②再按照列表中包含【方案、扩初、施工图、后... [阅读全文]
Python经典入门100题 (21-30题)

Python入门练手，有这100题就够了！ [阅读全文]
python实现LRU算法

LRU算法python实现学习mysql数据库时，了解了一下ib_buffer_pool的存储机制，使用LRU... [阅读全文]

网友评论


验证码：

python实现逻辑回归

2020年07月22日 | 移动技术网IT编程 | 我要评论

1.自定义代码实现

2.库函数调用

您可能感兴趣的文章:

相关文章:

网友评论