
Mechanizing Hypothesis Formation
Principles and Case Studies
CRC Press
1st Edition
Published on 9. October 2024
Book
Paperback/Softback
346 pages
978-0-367-54982-4 (ISBN)
Description
Mechanizing hypothesis formation is an approach to exploratory data analysis. Its development started in the 1960s inspired by the question "can computers formulate and verify scientific hypotheses?". The development resulted in a general theory of logic of discovery. It comprises theoretical calculi dealing with theoretical statements as well as observational calculi dealing with observational statements concerning finite results of observation. Both calculi are related through statistical hypotheses tests. A GUHA method is a tool of the logic of discovery. It uses a one-to-one relation between theoretical and observational statements to get all interesting theoretical statements. A GUHA procedure generates all interesting observational statements and verifies them in a given observational data. Output of the procedure consists of all observational statements true in the given data. Several GUHA procedures dealing with association rules, couples of association rules, action rules, histograms, couples of histograms, and patterns based on general contingency tables are involved in the LISp-Miner system developed at the Prague University of Economics and Business. Various results about observational calculi were achieved and applied together with the LISp-Miner system.
The book covers a brief overview of logic of discovery. Many examples of applications of the GUHA procedures to solve real problems relevant to data mining and business intelligence are presented. An overview of recent research results relevant to dealing with domain knowledge in data mining and its automation is provided. Firsthand experiences with implementation of the GUHA method in the Python language are presented.
The book covers a brief overview of logic of discovery. Many examples of applications of the GUHA procedures to solve real problems relevant to data mining and business intelligence are presented. An overview of recent research results relevant to dealing with domain knowledge in data mining and its automation is provided. Firsthand experiences with implementation of the GUHA method in the Python language are presented.
More details
Language
English
Place of publication
London
United Kingdom
Publishing group
Taylor & Francis Ltd
Target group
Professional and scholarly
Academic
Illustrations
213 s/w Abbildungen, 10 farbige Abbildungen, 1 s/w Photographie bzw. Rasterbild, 212 s/w Zeichnungen, 10 farbige Zeichnungen, 61 s/w Tabellen
61 Tables, black and white; 10 Line drawings, color; 212 Line drawings, black and white; 1 Halftones, black and white; 10 Illustrations, color; 213 Illustrations, black and white
Dimensions
Height: 234 mm
Width: 156 mm
Thickness: 20 mm
Weight
550 gr
ISBN-13
978-0-367-54982-4 (9780367549824)
Copyright in bibliographic data and cover images is held by Nielsen Book Services Limited or by the publishers or by their respective licensors: all rights reserved.
Schweitzer Classification
Other editions
Additional editions

Jan Rauch | Milan Simunek | David Chudan
Mechanizing Hypothesis Formation
Principles and Case Studies
Book
10/2022
1st Edition
CRC Press
€259.30
Shipment within 15-20 days

Jan Rauch | Milan Simunek | David Chudan
Mechanizing Hypothesis Formation
Principles and Case Studies
E-Book
10/2022
1st Edition
CRC Press
€82.99
Available for download

Jan Rauch | Milan Simunek | David Chudan
Mechanizing Hypothesis Formation
Principles and Case Studies
E-Book
10/2022
1st Edition
CRC Press
€82.99
Available for download
Persons
Jan Rauch graduated from the Faculty of Mathematics and Physics of Charles University in Prague. He received his Ph.D. in Mathematical Logic in 1987 from the Institute of Mathematics of the Czechoslovak Academy of Sciences. He is a full professor at the Department of Information and Knowledge Engineering, Prague University of Economics and Business since 2011.
Milan Simunek is an associate professor (since 2012) at the Faculty of Informatics and Statistics, Prague University of Economics and Business. His research activities include data mining, databases, virtual reality and software projects development. He is the software project leader of the LISp-Miner system since its launch in 1996 and author of its core-modules implementation.
David Chudan is an assistant professor of Applied Informatics at the Faculty of Informatics and Statistics, Prague University of Economics and Business. He received his Ph.D. in 2015 in the field of Applied informatics. His research interests include data mining and machine learning on different tools and platforms. Another research area is GUHA association rules and their complementary usage with business intelligence.
Petr Masa graduated from the Prague University of Economics and Business and the Faculty of Mathematics and Physics of Charles University in Prague. He received his Ph.D. in 2006. He also works on business projects where he uses data mining, data science, data analytics and he is also business responsible.
Milan Simunek is an associate professor (since 2012) at the Faculty of Informatics and Statistics, Prague University of Economics and Business. His research activities include data mining, databases, virtual reality and software projects development. He is the software project leader of the LISp-Miner system since its launch in 1996 and author of its core-modules implementation.
David Chudan is an assistant professor of Applied Informatics at the Faculty of Informatics and Statistics, Prague University of Economics and Business. He received his Ph.D. in 2015 in the field of Applied informatics. His research interests include data mining and machine learning on different tools and platforms. Another research area is GUHA association rules and their complementary usage with business intelligence.
Petr Masa graduated from the Prague University of Economics and Business and the Faculty of Mathematics and Physics of Charles University in Prague. He received his Ph.D. in 2006. He also works on business projects where he uses data mining, data science, data analytics and he is also business responsible.
Content
1. Introduction 2. Data Sets SECTION I: THE GUHA PROCEDURES 3. Principle and Simple Examples 4. Common Features 5. LISp-Miner System SECTION II: APPLYING THE GUHA PROCEDURES 6. Examples Overview 7. 4ft-Miner - GUHA Association Rules 8. CF-Miner - Histograms 9. KL-Miner - Pairs of Categorical Attributes 10. SD-4ft-Miner - Couples of GUHA Association Rules 11. SDCF-Miner - Couples of Histograms 12. SDKL-Miner - Couples of Pairs of Categorical Attributes 13. Ac4ft-Miner - Action Rules 14. GUHA Procedures and Business Intelligence 15. CleverMiner - GUHA and Python SECTION III: RELATED RESEARCH AND THEORY 16. Artificial Data Generation and LM ReverseMiner Module 17. Applying Domain Knowledge 18. Observational Calculi