Data Mining

Data mining is a process of discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems. Data mining is an interdisciplinary subfield of computer science and statistics with an overall goal to extract information (with intelligent methods) from a data set and transform the information into a comprehensible structure for further use. Data mining is the analysis step of the "knowledge discovery in databases" process, or KDD. Aside from the raw analysis step, it also involves database and data management aspects, data pre-processing, model and inference considerations, interestingness metrics, complexity considerations, post-processing of discovered structures, visualization, and online updating. The term "data mining" is a misnomer, because the goal is the extraction of patterns and knowledge from large amounts of data, not the extraction (mining) of data itself. It also is a buzzword and is frequently applied to any form of large-scale data or information processing (collection, extraction, warehousing, analysis, and statistics) as well as any application of computer decision support system, including artificial intelligence (e.g., machine learning) and business intelligence. The book Data mining: Practical machine learning tools and techniques with Java (which covers mostly machine learning material) was originally to be named just Practical machine learning, and the term data mining was only added for marketing reasons. Often the more general terms (large scale) data analysis and analytics – or, when referring to actual methods, artificial intelligence and machine learning – are more appropriate.

Last Updated on: Apr 22, 2025

Global Scientific Words in General Science

Experts in General Science

Cheng-Li Liu Miguel Acevedo Faramarz Helali Peter Andreas In-Ju Kim Rabiul Ahasan Anil K. Tripathi Isamu Nishida Gregor Harih Stoica George Tal Amasay Brian Rothman Semra Peksoz Ashok Sharma Mahesh Kumar Vyas Arjunsingh Baghel Calatayud Paul Andre MARCO ADAMINA Heba Barazi AEBERSOLD URSULA Susan Brind Justin Carter Thomas Joshua Cooper Dr. Heena V Dave Mary Carrington Dr Patrick Allington CREDENCE BAKER Tiziano Manca Elliot Blair Dr. Anne B. Clark

Experts by Subject

Infectious Diseases [15791] Computer Science [72] Pediatrics [20947] Reproductive Medicine [11797] Psychiatry [56053] Environmental Sciences [60017] Social and Political Sciences [131767] Chemistry [52850] Orthopaedics [13285] Gastroenterology [16820] Medical Sciences [71506] Biochemistry [40242] Dermatology [23076] Engineering [200505] Anesthesiology [17843] Veterinary Sciences [22784] Cardiology [50875] Immunology [22807] Molecular Biology [28455] Agri and Aquaculture [52608] Business and Management [252587] Neurology [52108] Geology and Earth Science [27642] Diabetes and Endocrinology [24071] Chemical Engineering [36667] Surgery [38956] Haematology [11168] General Science [47418] Bioinformatics and Systems Biology [30065] Radiology [26]