An edition of Learning data mining with Python (2015)

Learning data mining with Python

harness the power of Python to analyze data and create insightful predictive models

Last edited by MARC Bot
December 20, 2022

If you are a programmer who wants to get started with data mining, then this book is for you.

Publish Date
2015
Publisher
Packt Publishing
Language
English
Pages
317

Book Details


Table of Contents

Cover
Copyright
Credits
About the Author
About the Reviewers
www.PacktPub.com
Table of Contents
Preface
Chapter 1: Getting Started with Data Mining
Introducing data mining
Using Python and the IPython notebook
Installing Python
Installing IPython
Installing scikit-learn
A simple affinity analysis example
What is affinity analysis?
Product recommendations
Loading the dataset with NumPy
Implementing a simple ranking of rules
Ranking to find the best rules
A simple classification example
What is classification?
Loading and preparing the dataset
Implementing the OneR algorithm
Testing the algorithm
Summary
Chapter 2: Classifying with scikit-learn
scikit-learn estimators
Nearest neighbors
Distance metrics
Loading the dataset
Moving towards a standard workflow
Running the algorithm
Setting parameters
Preprocessing using pipelines
An example
Standard preprocessing
Putting it all together
Pipelines
Summary
Chapter 3: Predicting Sports Winners with Decision Trees
Loading the dataset
Collecting the data
Using pandas to load the dataset
Cleaning up the dataset
Extracting new features
Decision trees
Parameters in decision trees
Using decision trees
Sports outcome prediction
Putting it all together
Random forests
How do ensembles work?
Parameters in Random forests
Applying Random forests
Engineering new features
Summary
Chapter 4: Recommending Movies Using Affinity Analysis
Affinity analysis
Algorithms for affinity analysis
Choosing parameters
The movie recommendation problem
Obtaining the dataset
Loading with pandas
Sparse data formats
The Apriori implementation
The Apriori algorithm
Implementation
Extracting association rules
Evaluation
Summary
Chapter 5: Extracting Features with Transformers
Feature extraction
Representing reality in models
Common feature patterns
Creating good features
Feature selection
Selecting the best individual features
Feature creation
Principal Component Analysis
Creating your own transformer
The transformer API
Implementation details
Unit testing
Putting it all together
Summary
Chapter 6: Social Media Insight Using Naive Bayes
Disambiguation
Downloading data from a social network
Loading and classifying the dataset
Creating a replicable dataset from Twitter
Text transformers
Bag-of-words
N-grams
Other features
Naive Bayes
Bayes' theorem
Naive Bayes algorithm
How it works
Application
Extracting word counts
Converting dictionaries to a matrix

Edition Notes

Includes index.

Published in
Birmingham, UK
Series
Community experience distilled
Other Titles
Harness the power of Python to analyze data and create insightful predictive models
Copyright Date
2015

Classifications

Dewey Decimal Class
005.13
Library of Congress
QA76.73.P98, T55.4-60.8, QA76.9.D343

The Physical Object

Pagination
1 online resource (xiv, 317 pages)
Number of pages
317

Edition Identifiers

Open Library
OL35344305M
ISBN 10
1784391204, 1784396052
ISBN 13
9781784391201, 9781784396053
OCLC/WorldCat
916530911, 918259466

Work Identifiers

Work ID
OL26193647W
