
Question

I may only use the NumPy library for this project.

I am not allowed to use any external library except pandas and NumPy, and the work should be done in a Jupyter notebook.

This project asks you to implement a logistic regression classifier and apply it to a real
data set.
We use the Breast Cancer Wisconsin dataset from UCI machine learning repository:
http://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Diagnostic%29
Data File: breast-cancer-wisconsin.data (class: 2 for benign, 4 for malignant)
Data Metafile: breast-cancer-wisconsin.names
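
A minimal loading sketch, assuming the data file is the comma-separated layout with no header row: a sample id, nine integer-valued features, and the class label, with missing entries written as "?". The column labels, the function name `load_breast_cancer`, and the decision to drop incomplete rows are assumptions made for illustration; check them against breast-cancer-wisconsin.names before relying on them.

```python
import io
import numpy as np
import pandas as pd

# Hypothetical column labels; the .data file itself carries no header row.
COLUMNS = ["id", "clump_thickness", "cell_size", "cell_shape", "adhesion",
           "epithelial_size", "bare_nuclei", "bland_chromatin",
           "normal_nucleoli", "mitoses", "class"]

def load_breast_cancer(source):
    """Return (X, y) from a path or file-like object in the assumed layout."""
    df = pd.read_csv(source, header=None, names=COLUMNS, na_values="?")
    df = df.dropna()                              # assumption: drop incomplete rows
    X = df[COLUMNS[1:-1]].to_numpy(dtype=float)   # nine features, id excluded
    y = (df["class"] == 4).to_numpy(dtype=float)  # 1 = malignant (4), 0 = benign (2)
    return X, y

# Illustrative rows in the assumed layout; the third has a missing value.
sample = io.StringIO(
    "1000025,5,1,1,1,2,1,3,1,1,2\n"
    "1017122,8,10,10,8,7,10,9,7,1,4\n"
    "1057013,8,4,5,1,2,?,7,3,1,4\n"
)
X_demo, y_demo = load_breast_cancer(sample)
print(X_demo.shape, y_demo)
```

In the notebook, the same function could be called with the file path, for example `load_breast_cancer("breast-cancer-wisconsin.data")`.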

We have seen that logistic regression is a convex problem, and that gradient
descent gives the optimal parameters. However, its efficiency is highly dependent on
the step length, which is left for users to tune. In this assignment, we look at a faster
solution called Newton’s method (a.k.a. the Newton-Raphson method), which avoids the
use of a step length.
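
For reference, the standard Newton update for the cross-entropy loss can be written as follows. The notation is an assumption, not taken from the class notes: X is the n-by-d design matrix (including a column of ones for the intercept), y holds 0/1 labels, and σ is the logistic sigmoid.

```latex
\begin{aligned}
L(w) &= -\sum_{i=1}^{n}\bigl[\,y_i \log p_i + (1-y_i)\log(1-p_i)\bigr],
  \qquad p_i = \sigma(x_i^\top w),\\
\nabla L(w) &= X^\top (p - y),\\
\nabla^2 L(w) &= X^\top S X,
  \qquad S = \operatorname{diag}\bigl(p_i(1-p_i)\bigr),\\
w &\leftarrow w - \bigl(X^\top S X\bigr)^{-1} X^\top (p - y).
\end{aligned}
```

The inverse Hessian takes the place of the step length: it rescales the gradient by the local curvature, which is why no learning rate has to be tuned.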

Please implement the Newton-Raphson algorithm for logistic regression (i.e., to minimize the
cross-entropy loss as discussed in class), and run it over the Breast Cancer Wisconsin dataset.
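
One possible NumPy-only sketch of such a trainer is shown here. The function names (`sigmoid`, `add_intercept`, `fit_newton`, `predict`), the clipping of the sigmoid input, the tiny `ridge` term added to the Hessian so it stays invertible, and the stopping rule are all illustrative assumptions, not the method from the class. The demo runs on synthetic data rather than the real dataset.

```python
import numpy as np

def sigmoid(z):
    # Clip the input so np.exp cannot overflow for large |z|.
    return 1.0 / (1.0 + np.exp(-np.clip(z, -30.0, 30.0)))

def add_intercept(X):
    # Prepend a column of ones so w[0] acts as the bias term.
    return np.hstack([np.ones((X.shape[0], 1)), X])

def fit_newton(X, y, max_iter=25, tol=1e-8, ridge=1e-8):
    """Newton-Raphson on the cross-entropy loss; X must already include the intercept."""
    w = np.zeros(X.shape[1])
    for _ in range(max_iter):
        p = sigmoid(X @ w)
        grad = X.T @ (p - y)                       # gradient of the loss
        s = p * (1.0 - p)                          # diagonal of the weight matrix S
        hess = X.T @ (X * s[:, None])              # Hessian X^T S X
        hess += ridge * np.eye(X.shape[1])         # assumption: small damping term
        step = np.linalg.solve(hess, grad)         # solve H step = grad, no explicit inverse
        w -= step
        if np.max(np.abs(step)) < tol:             # stop once the update is negligible
            break
    return w

def predict(X, w):
    # Classify as 1 when the predicted probability is at least 0.5.
    return (sigmoid(X @ w) >= 0.5).astype(float)

# Synthetic, non-separable demo data (not the Wisconsin dataset).
rng = np.random.default_rng(0)
X_syn = add_intercept(rng.normal(size=(200, 2)))
w_true = np.array([0.5, 2.0, -1.0])
y_syn = (rng.random(200) < sigmoid(X_syn @ w_true)).astype(float)

w_hat = fit_newton(X_syn, y_syn)
grad_at_fit = X_syn.T @ (sigmoid(X_syn @ w_hat) - y_syn)
print(w_hat, np.mean(predict(X_syn, w_hat) == y_syn))
```

Solving the linear system with `np.linalg.solve` is generally preferable to forming the inverse Hessian explicitly. If the training data happen to be linearly separable, the unregularized weights grow without bound, so a larger `ridge` or a lower `max_iter` may be needed.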

Please randomly sample 80% of the instances to train a classifier and then
test it on the remaining 20%. Ten such random data splits should be performed, and
the average over these 10 trials is used to estimate the generalization performance.
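
The evaluation protocol above could be sketched as follows. The helper `evaluate_splits` and its parameters are hypothetical names; `fit` and `predict` are placeholders for whatever trainer is used, and a majority-class baseline stands in for the real classifier so the block runs on its own.

```python
import numpy as np

def evaluate_splits(X, y, fit, predict, n_trials=10, train_frac=0.8, seed=0):
    """Return the test accuracy of each random train/test split."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    n_train = int(round(train_frac * n))
    accuracies = []
    for _ in range(n_trials):
        perm = rng.permutation(n)                  # fresh shuffle for every trial
        train_idx, test_idx = perm[:n_train], perm[n_train:]
        model = fit(X[train_idx], y[train_idx])
        y_hat = predict(X[test_idx], model)
        accuracies.append(np.mean(y_hat == y[test_idx]))
    return np.array(accuracies)

# Stand-in classifier for the demo: always predicts the majority training label.
def fit_majority(X, y):
    return float(np.mean(y) >= 0.5)

def predict_majority(X, label):
    return np.full(X.shape[0], label)

# Toy data, only to show the call pattern.
X_toy = np.arange(40, dtype=float).reshape(20, 2)
y_toy = np.array([0.0] * 15 + [1.0] * 5)
acc = evaluate_splits(X_toy, y_toy, fit_majority, predict_majority)
print("mean accuracy:", acc.mean(), "std:", acc.std())
```

Reporting the standard deviation alongside the mean gives a rough sense of how much the estimate varies from split to split.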

Please submit: (1) your source code (or Jupyter notebook file) that the TA should be able to (compile and) run, and the pre-processed dataset if any; (2) a report on a program checklist, how you accomplished the project, and the results of your classification. Hint: you can use sklearn’s LogisticRegression to verify whether you get the same accuracy.

This is the code I have so far:

https://cdn.livechat-files.com/api/file/lc/att/3070322/f6d1df317f0555dac2d59345f2ea971b/Mini_project3_wisconsin_breast_cancer_datasets_now%20%281%29%20%281%29.ipynb

Please write new code or fix this one for me. I need an original solution.
