
Software Engineering for Data Scientists
From Notebooks to Scalable Systems
Catherine Nelson(Author)
O'Reilly (Publisher)
Published on 30. April 2024
Book
Paperback/Softback
400 pages
978-1-0981-3620-8 (ISBN)
Description
Data science happens in code. The ability to write reproducible, robust, scaleable code is key to a data science project's success-and is absolutely essential for those working with production code. This practical book bridges the gap between data science and software engineering, clearly explaining how to apply the best practices from software engineering to data science.
Examples are provided in Python, drawn from popular packages such as NumPy and pandas. If you want to write better data science code, this guide covers the essential topics you need (and that are often missing from introductory data science or coding classes), including how to:
Understand data structures and object-oriented programming
Clearly and skillfully document your code
Package and share your code
Integrate data science code with a larger codebase
Write APIs
Create secure code
Apply best practices to common tasks such as testing, error handling, and logging
Work more effectively with software engineers
Write more efficient, maintainable, and robust code in Python
Put your data science projects into production
And more
Examples are provided in Python, drawn from popular packages such as NumPy and pandas. If you want to write better data science code, this guide covers the essential topics you need (and that are often missing from introductory data science or coding classes), including how to:
Understand data structures and object-oriented programming
Clearly and skillfully document your code
Package and share your code
Integrate data science code with a larger codebase
Write APIs
Create secure code
Apply best practices to common tasks such as testing, error handling, and logging
Work more effectively with software engineers
Write more efficient, maintainable, and robust code in Python
Put your data science projects into production
And more
More details
Language
English
Place of publication
Sebastopol
United States
Target group
Professional and scholarly
Product notice
Paperback (trade)
Unsewn / adhesive bound
Dimensions
Height: 230 mm
Width: 175 mm
Thickness: 16 mm
Weight
418 gr
ISBN-13
978-1-0981-3620-8 (9781098136208)
Copyright in bibliographic data and cover images is held by Nielsen Book Services Limited or by the publishers or by their respective licensors: all rights reserved.
Schweitzer Classification
Other editions
Additional editions

Catherine Nelson
Software Engineering for Data Scientists
E-Book
04/2024
O'Reilly
€58.99
Available for download

Catherine Nelson
Software Engineering for Data Scientists
E-Book
04/2024
O'Reilly
€58.99
Available for download
Person
Catherine Nelson is a Principal Data Scientist at SAP Concur, where she explores innovative ways to deliver production machine learning applications which improve a business traveler's experience. Her key focus areas range from ML explainability and model analysis to privacy-preserving ML. She is also co-author of the O'Reilly publication "Building Machine Learning Pipelines", and she is an organizer for Seattle PyLadies, supporting women who code in Python. She has been recognized as a Google Developer Expert in machine learning. In her previous career as a geophysicist she studied ancient volcanoes and explored for oil in Greenland. Catherine has a PhD in geophysics from Durham University and a Masters of Earth Sciences from Oxford University.