
Scala:Applied Machine Learning
Description
Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.
More details
Content
- Cover
- Copyright
- Credits
- Preface
- Table of Contents
- Module 1: Scala for Data Science
- Chapter 1: Scala and Data Science
- Data science
- Programming in data science
- Why Scala?
- When not to use Scala
- Summary
- References
- Chapter 2: Manipulating Data with Breeze
- Code examples
- Installing Breeze
- Getting help on Breeze
- Basic Breeze data types
- An example - logistic regression
- Towards re-usable code
- Alternatives to Breeze
- Summary
- References
- Chapter 3: Plotting with breeze-viz
- Diving into Breeze
- Customizing plots
- Customizing the line type
- More advanced scatter plots
- Multi-plot example - scatterplot matrix plots
- Managing without documentation
- Breeze-viz reference
- Data visualization beyond breeze-viz
- Summary
- Chapter 4: Parallel Collections and Futures
- Parallel collections
- Futures
- Summary
- References
- Chapter 5: Scala and SQL through JDBC
- Interacting with JDBC
- First steps with JDBC
- JDBC summary
- Functional wrappers for JDBC
- Safer JDBC connections with the loan pattern
- Enriching JDBC statements with the "pimp my library" pattern
- Wrapping result sets in a stream
- Looser coupling with type classes
- Creating a data access layer
- Summary
- References
- Chapter 6: Slick - A Functional Interface for SQL
- FEC data
- Invokers
- Operations on columns
- Aggregations with "Group by
- Accessing database metadata
- Slick versus JDBC
- Summary
- References
- Chapter 7: Web APIs
- A whirlwind tour of JSON
- Querying web APIs
- JSON in Scala - an exercise in pattern matching
- Extraction using case classes
- Concurrency and exception handling with futures
- Authentication - adding HTTP headers
- Summary
- References
- Chapter 8: Scala and MongoDB
- MongoDB
- Connecting to MongoDB with Casbah
- Inserting documents
- Extracting objects from the database
- Complex queries
- Casbah query DSL
- Custom type serialization
- Beyond Casbah
- Summary
- References
- Chapter 9: Concurrency with Akka
- GitHub follower graph
- Actors as people
- Hello world with Akka
- Case classes as messages
- Actor construction
- Anatomy of an actor
- Follower network crawler
- Fetcher actors
- Routing
- Message passing between actors
- Queue control and the pull pattern
- Accessing the sender of a message
- Stateful actors
- Follower network crawler
- Fault tolerance
- Custom supervisor strategies
- Life-cycle hooks
- What we have not talked about
- Summary
- References
- Chapter 10: Distributed Batch Processing with Spark
- Installing Spark
- Acquiring the example data
- Resilient distributed datasets
- Building and running standalone programs
- Spam filtering
- Lifting the hood
- Data shuffling and partitions
- Summary
- Reference
- Chapter 11: Spark SQL and DataFrames
- DataFrames - a whirlwind introduction
- Aggregation operations
- Joining DataFrames together
- Custom functions on DataFrames
- DataFrame immutability and persistence
- SQL statements on DataFrames
- Complex data types - arrays, maps, and structs
- Interacting with data sources
- Standalone programs
- Summary
- References
- Chapter 12: Distributed Machine Learning with MLlib
- Introducing MLlib - Spam classification
- Pipeline components
- Evaluation
- Regularization in logistic regression
- Cross-validation and model selection
- Beyond logistic regression
- Summary
- References
- Chapter 13: Web APIs with Play
- Client-server applications
- Introduction to web frameworks
- Model-View-Controller architecture
- Single page applications
- Building an application
- The Play framework
- Dynamic routing
- Actions
- Interacting with JSON
- Querying external APIs and consuming JSON
- Creating APIs with Play: a summary
- Rest APIs: best practice
- Summary
- References
- Chapter 14: Visualization with D3 and the Play Framework
- GitHub user data
- Do I need a backend?
- JavaScript dependencies through web-jars
- Towards a web application: HTML templates
- Modular JavaScript through RequireJS
- Bootstrapping the applications
- Client-side program architecture
- Drawing plots with NVD3
- Summary
- References
- Chapter 15: Pattern Matching and Extractors
- Pattern matching in for comprehensions
- Pattern matching internals
- Extracting sequences
- Summary
- Reference
- Module 2: Scala for Machine Learning
- Chapter 1: Getting Started
- Mathematical notation for the curious
- Why machine learning?
- Why Scala?
- Model categorization
- Taxonomy of machine learning algorithms
- Don't reinvent the wheel!
- Tools and frameworks
- Source code
- Let's kick the tires
- Summary
- Chapter 2: Hello World!
- Modeling
- Defining a methodology
- Monadic data transformation
- A workflow computational model
- Profiling data
- Assessing a model
- Summary
- Chapter 3: Data Preprocessing
- Time series in Scala
- Moving averages
- Fourier analysis
- The discrete Kalman filter
- Alternative preprocessing techniques
- Summary
- Chapter 4: Unsupervised Learning
- Clustering
- Dimension reduction
- Performance considerations
- Summary
- Chapter 5: Naïve Bayes Classifiers
- Probabilistic graphical models
- Naïve Bayes classifiers
- The Multivariate Bernoulli classification
- Naïve Bayes and text mining
- Pros and cons
- Summary
- Chapter 6: Regression and Regularization
- Linear regression
- Regularization
- Numerical optimization
- Logistic regression
- Summary
- Chapter 7: Sequential Data Models
- Markov decision processes
- The hidden Markov model
- Conditional random fields
- Regularized CRFs and text analytics
- Comparing CRF and HMM
- Performance consideration
- Summary
- Chapter 8: Kernel Models and Support Vector Machines
- Kernel functions
- Support vector machines
- Support vector classifiers - SVC
- Anomaly detection with one-class SVC
- Support vector regression
- Performance considerations
- Summary
- Chapter 9: Artificial Neural Networks
- Feed-forward neural networks
- The multilayer perceptron
- Evaluation
- Convolution neural networks
- Benefits and limitations
- Summary
- Chapter 10: Genetic Algorithms
- Evolution
- Genetic algorithms and machine learning
- Genetic algorithm components
- Implementation
- GA for trading strategies
- Advantages and risks of genetic algorithms
- Summary
- Chapter 11: Reinforcement Learning
- Reinforcement learning
- Learning classifier systems
- Summary
- Chapter 12: Scalable Frameworks
- An overview
- Scala
- Scalability with Actors
- Akka
- Apache Spark
- Summary
- Chapter 13: Basic Concepts
- Scala programming
- Mathematics
- Finances 101
- Suggested online courses
- References
- Module 3: Mastering Scala Machine Learning
- Chapter 1: Exploratory Data Analysis
- Getting started with Scala
- Distinct values of a categorical field
- Summarization of a numeric field
- Basic, stratified, and consistent sampling
- Working with Scala and Spark Notebooks
- Basic correlations
- Summary
- Chapter 2: Data Pipelines and Modeling
- Influence diagrams
- Sequential trials and dealing with risk
- Exploration and exploitation
- Unknown unknowns
- Basic components of a data-driven system
- Optimization and interactivity
- Summary
- Chapter 3: Working with Spark and MLlib
- Setting up Spark
- Understanding Spark architecture
- Applications
- ML libraries
- Spark performance tuning
- Running Hadoop HDFS
- Summary
- Chapter 4: Supervised and Unsupervised Learning
- Records and supervised learning
- Unsupervised learning
- Problem dimensionality
- Summary
- Chapter 5: Regression and Classification
- What regression stands for?
- Continuous space and metrics
- Linear regression
- Logistic regression
- Regularization
- Multivariate regression
- Heteroscedasticity
- Regression trees
- Classification metrics
- Multiclass problems
- Perceptron
- Generalization error and overfitting
- Summary
- Chapter 6: Working with Unstructured Data
- Nested data
- Other serialization formats
- Hive and Impala
- Sessionization
- Working with traits
- Working with pattern matching
- Other uses of unstructured data
- Probabilistic structures
- Projections
- Summary
- Chapter 7: Working with Graph Algorithms
- A quick introduction to graphs
- SBT
- Graph for Scala
- GraphX
- Summary
- Chapter 8: Integrating Scala with R and Python
- Integrating with R
- Integrating with Python
- Summary
- Chapter 9: NLP in Scala
- Text analysis pipeline
- MLlib algorithms in Spark
- Segmentation, annotation, and chunking
- POS tagging
- Using word2vec to find word relationships
- Summary
- Chapter 10: Advanced Model Monitoring
- System monitoring
- Process monitoring
- Model monitoring
- Summary
- Biblography
System requirements
File format: PDF
Copy-Protection: Adobe-DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Install the free reader Adobe Digital Editions prior to download (see eBook Help).
- Tablet/smartphone (Android; iOS): Install the free app Adobe Digital Editions or the app PocketBook before downloading (see eBook Help).
- E-reader: Bookeen, Kobo, Pocketbook, Sony, Tolino and many more (only limited: Kindle).
The file format PDF always displays a book page identically on any hardware. This makes PDF suitable for complex layouts such as those used in textbooks and reference books (images, tables, columns, footnotes). Unfortunately, on the small screens of e-readers or smartphones, PDFs are rather annoying, requiring too much scrolling.
This eBook uses Adobe-DRM, a „hard” copy protection. If the necessary requirements are not met, unfortunately you will not be able to open the eBook. You will therefore need to prepare your reading hardware before downloading.
Please note: We strongly recommend that you authorise using your personal Adobe ID after installation of any reading software.
For more information, see our eBook Help page.