Scala:Applied Machine Learning

Name: Scala:Applied Machine Learning
Brand: Packt Publishing
Price: 85.49 EUR
Availability: OnlineOnly

Pascal Bugnion(Author)

Packt Publishing

Published on 23. February 2017

1265 pages

E-Book

PDF with Adobe-DRM

System requirements

978-1-78712-455-4 (ISBN)

€85.49incl. 7% vat

System requirements

for PDF with Adobe-DRM

E-Book Single Licence

Available for download

Description

Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.

Alles über E-Books, Kopierschutz & Dateiformate finden Sie in unserem Info- & Hilfebereich.

Leverage the power of Scala and master the art of building, improving, and validating scalable machine learning and AI applications using Scala's most advanced and finest featuresAbout This BookBuild functional, type-safe routines to interact with relational and NoSQL databases with the help of the tutorials and examples providedLeverage your expertise in Scala programming to create and customize your own scalable machine learning algorithmsExperiment with different techniques; evaluate their benefits and limitations using real-world financial applicationsGet to know the best practices to incorporate new Big Data machine learning in your data-driven enterprise and gain future scalability and maintainabilityWho This Book Is ForThis Learning Path is for engineers and scientists who are familiar with Scala and want to learn how to create, validate, and apply machine learning algorithms. It will also benefit software developers with a background in Scala programming who want to apply machine learning.What You Will LearnCreate Scala web applications that couple with JavaScript libraries such as D3 to create compelling interactive visualizationsDeploy scalable parallel applications using Apache Spark, loading data from HDFS or HiveSolve big data problems with Scala parallel collections, Akka actors, and Apache Spark clustersApply key learning strategies to perform technical analysis of financial marketsUnderstand the principles of supervised and unsupervised learning in machine learningWork with unstructured data and serialize it using Kryo, Protobuf, Avro, and AvroParquetConstruct reliable and robust data pipelines and manage data in a data-driven enterpriseImplement scalable model monitoring and alerts with ScalaIn DetailThis Learning Path aims to put the entire world of machine learning with Scala in front of you. Scala for Data Science, the first module in this course, is a tutorial guide that provides tutorials on some of the most common Scala libraries for data science, allowing you to quickly get up to speed building data science and data engineering solutions.The second course, Scala for Machine Learning guides you through the process of building AI applications with diagrams, formal mathematical notation, source code snippets, and useful tips. A review of the Akka framework and Apache Spark clusters concludes the tutorial.The next module, Mastering Scala Machine Learning, is the final step in this course. It will take your knowledge to next level and help you use the knowledge to build advanced applications such as social media mining, intelligent news portals, and more. After a quick refresher on functional programming concepts using REPL, you will see some practical examples of setting up the development environment and tinkering with data. We will then explore working with Spark and MLlib using k-means and decision trees.By the end of this course, you will be a master at Scala machine learning and have enough expertise to be able to build complex machine learning projects using Scala.This Learning Path combines some of the best that Packt has to offer in one complete, curated package. It includes content from the following Packt products:Scala for Data Science, Pascal BugnionScala for Machine Learning, Patrick NicolasMastering Scala Machine Learning, Alex KozlovStyle and approachA tutorial with complete examples, this course will give you the tools to start building useful data engineering and data science solutions straightaway. This course provides practical examples from the field on how to correctly tackle data analysis problems, particularly for modern Big Data datasets.

More details

Content

Cover
Copyright
Credits
Preface
Table of Contents
Module 1: Scala for Data Science
Chapter 1: Scala and Data Science
Data science
Programming in data science
Why Scala?
When not to use Scala
Summary
References
Chapter 2: Manipulating Data with Breeze
Code examples
Installing Breeze
Getting help on Breeze
Basic Breeze data types
An example - logistic regression
Towards re-usable code
Alternatives to Breeze
Summary
References
Chapter 3: Plotting with breeze-viz
Diving into Breeze
Customizing plots
Customizing the line type
More advanced scatter plots
Multi-plot example - scatterplot matrix plots
Managing without documentation
Breeze-viz reference
Data visualization beyond breeze-viz
Summary
Chapter 4: Parallel Collections and Futures
Parallel collections
Futures
Summary
References
Chapter 5: Scala and SQL through JDBC
Interacting with JDBC
First steps with JDBC
JDBC summary
Functional wrappers for JDBC
Safer JDBC connections with the loan pattern
Enriching JDBC statements with the "pimp my library" pattern
Wrapping result sets in a stream
Looser coupling with type classes
Creating a data access layer
Summary
References
Chapter 6: Slick - A Functional Interface for SQL
FEC data
Invokers
Operations on columns
Aggregations with "Group by
Accessing database metadata
Slick versus JDBC
Summary
References
Chapter 7: Web APIs
A whirlwind tour of JSON
Querying web APIs
JSON in Scala - an exercise in pattern matching
Extraction using case classes
Concurrency and exception handling with futures
Authentication - adding HTTP headers
Summary
References
Chapter 8: Scala and MongoDB
MongoDB
Connecting to MongoDB with Casbah
Inserting documents
Extracting objects from the database
Complex queries
Casbah query DSL
Custom type serialization
Beyond Casbah
Summary
References
Chapter 9: Concurrency with Akka
GitHub follower graph
Actors as people
Hello world with Akka
Case classes as messages
Actor construction
Anatomy of an actor
Follower network crawler
Fetcher actors
Routing
Message passing between actors
Queue control and the pull pattern
Accessing the sender of a message
Stateful actors
Follower network crawler
Fault tolerance
Custom supervisor strategies
Life-cycle hooks
What we have not talked about
Summary
References
Chapter 10: Distributed Batch Processing with Spark
Installing Spark
Acquiring the example data
Resilient distributed datasets
Building and running standalone programs
Spam filtering
Lifting the hood
Data shuffling and partitions
Summary
Reference
Chapter 11: Spark SQL and DataFrames
DataFrames - a whirlwind introduction
Aggregation operations
Joining DataFrames together
Custom functions on DataFrames
DataFrame immutability and persistence
SQL statements on DataFrames
Complex data types - arrays, maps, and structs
Interacting with data sources
Standalone programs
Summary
References
Chapter 12: Distributed Machine Learning with MLlib
Introducing MLlib - Spam classification
Pipeline components
Evaluation
Regularization in logistic regression
Cross-validation and model selection
Beyond logistic regression
Summary
References
Chapter 13: Web APIs with Play
Client-server applications
Introduction to web frameworks
Model-View-Controller architecture
Single page applications
Building an application
The Play framework
Dynamic routing
Actions
Interacting with JSON
Querying external APIs and consuming JSON
Creating APIs with Play: a summary
Rest APIs: best practice
Summary
References
Chapter 14: Visualization with D3 and the Play Framework
GitHub user data
Do I need a backend?
JavaScript dependencies through web-jars
Towards a web application: HTML templates
Modular JavaScript through RequireJS
Bootstrapping the applications
Client-side program architecture
Drawing plots with NVD3
Summary
References
Chapter 15: Pattern Matching and Extractors
Pattern matching in for comprehensions
Pattern matching internals
Extracting sequences
Summary
Reference
Module 2: Scala for Machine Learning
Chapter 1: Getting Started
Mathematical notation for the curious
Why machine learning?
Why Scala?
Model categorization
Taxonomy of machine learning algorithms
Don't reinvent the wheel!
Tools and frameworks
Source code
Let's kick the tires
Summary
Chapter 2: Hello World!
Modeling
Defining a methodology
Monadic data transformation
A workflow computational model
Profiling data
Assessing a model
Summary
Chapter 3: Data Preprocessing
Time series in Scala
Moving averages
Fourier analysis
The discrete Kalman filter
Alternative preprocessing techniques
Summary
Chapter 4: Unsupervised Learning
Clustering
Dimension reduction
Performance considerations
Summary
Chapter 5: Naïve Bayes Classifiers
Probabilistic graphical models
Naïve Bayes classifiers
The Multivariate Bernoulli classification
Naïve Bayes and text mining
Pros and cons
Summary
Chapter 6: Regression and Regularization
Linear regression
Regularization
Numerical optimization
Logistic regression
Summary
Chapter 7: Sequential Data Models
Markov decision processes
The hidden Markov model
Conditional random fields
Regularized CRFs and text analytics
Comparing CRF and HMM
Performance consideration
Summary
Chapter 8: Kernel Models and Support Vector Machines
Kernel functions
Support vector machines
Support vector classifiers - SVC
Anomaly detection with one-class SVC
Support vector regression
Performance considerations
Summary
Chapter 9: Artificial Neural Networks
Feed-forward neural networks
The multilayer perceptron
Evaluation
Convolution neural networks
Benefits and limitations
Summary
Chapter 10: Genetic Algorithms
Evolution
Genetic algorithms and machine learning
Genetic algorithm components
Implementation
GA for trading strategies
Advantages and risks of genetic algorithms
Summary
Chapter 11: Reinforcement Learning
Reinforcement learning
Learning classifier systems
Summary
Chapter 12: Scalable Frameworks
An overview
Scala
Scalability with Actors
Akka
Apache Spark
Summary
Chapter 13: Basic Concepts
Scala programming
Mathematics
Finances 101
Suggested online courses
References
Module 3: Mastering Scala Machine Learning
Chapter 1: Exploratory Data Analysis
Getting started with Scala
Distinct values of a categorical field
Summarization of a numeric field
Basic, stratified, and consistent sampling
Working with Scala and Spark Notebooks
Basic correlations
Summary
Chapter 2: Data Pipelines and Modeling
Influence diagrams
Sequential trials and dealing with risk
Exploration and exploitation
Unknown unknowns
Basic components of a data-driven system
Optimization and interactivity
Summary
Chapter 3: Working with Spark and MLlib
Setting up Spark
Understanding Spark architecture
Applications
ML libraries
Spark performance tuning
Running Hadoop HDFS
Summary
Chapter 4: Supervised and Unsupervised Learning
Records and supervised learning
Unsupervised learning
Problem dimensionality
Summary
Chapter 5: Regression and Classification
What regression stands for?
Continuous space and metrics
Linear regression
Logistic regression
Regularization
Multivariate regression
Heteroscedasticity
Regression trees
Classification metrics
Multiclass problems
Perceptron
Generalization error and overfitting
Summary
Chapter 6: Working with Unstructured Data
Nested data
Other serialization formats
Hive and Impala
Sessionization
Working with traits
Working with pattern matching
Other uses of unstructured data
Probabilistic structures
Projections
Summary
Chapter 7: Working with Graph Algorithms
A quick introduction to graphs
SBT
Graph for Scala
GraphX
Summary
Chapter 8: Integrating Scala with R and Python
Integrating with R
Integrating with Python
Summary
Chapter 9: NLP in Scala
Text analysis pipeline
MLlib algorithms in Spark
Segmentation, annotation, and chunking
POS tagging
Using word2vec to find word relationships
Summary
Chapter 10: Advanced Model Monitoring
System monitoring
Process monitoring
Model monitoring
Summary
Biblography

System requirements

Save as PDF Copy link into clipboard

Schweitzer Fachinformationen

Scala:Applied Machine Learning

Description

More details

Content

System requirements