Title: Data Preparation for Data Mining Using SAS (The Morgan Kaufmann Series in Data Management Systems)
Author: Mamdouh Refaat
Publisher: Morgan Kaufmann
Keywords: data, series, management, systems, kaufmann, sas, preparation, mining, using, morgan
Number of Pages: 424
Published: 2006-10-13
List price: $74.95
ISBN-10: 0123735777
ISBN-13: 9780123735775
Are you a data mining analyst who spends up to 80% of your time assuring data quality, then preparing that data for developing and deploying predictive models? And do you find lots of literature on data mining theory and concepts, but little "how to" information when it comes to practical advice on developing good mining views? And are you, like most analysts, preparing the data in SAS? This book is intended to fill this gap as your source of practical recipes. It introduces a framework for the process of data preparation for data mining, and presents the detailed implementation of each s
Title: Data Preparation for Data Mining (The Morgan Kaufmann Series in Data Management Systems)
Author: Dorian Pyle
Publisher: Morgan Kaufmann
Keywords: data, management, systems, series, morgan, preparation, mining, kaufmann
Number of Pages: 560
Published: 1999-04-05
List price: $78.95
ISBN-10: 1558605290
ISBN-13: 9781558605299
Data Preparation for Data Mining addresses an issue unfortunately ignored by most authorities on data mining: data preparation. Thanks largely to its perceived difficulty, data preparation has traditionally taken a backseat to the more alluring question of how best to extract meaningful knowledge. But without adequate preparation of your data, the return on the resources invested in mining is certain to be disappointing. Dorian Pyle corrects this imbalance. A twenty-five-year veteran of what has become the data mining industry, Pyle shares his own successful data preparation methodology, offeri
Title: Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data (Data-Centric Systems and Applications)
Author: Bing Liu
Publisher: Springer
Keywords: data, centric, systems, applications, usage, hyperlinks, mining, exploring, web, contents
Number of Pages: 532
Published: 2009-01-21
List price: $59.95
ISBN-10: 3540378812
ISBN-13: 9783540378815
Web mining aims to discover useful information and knowledge from the Web hyperlink structure, page contents, and usage data. Although Web mining uses many conventional data mining techniques, it is not purely an application of traditional data mining due to the semistructured and unstructured nature of the Web data and its heterogeneity. It has also developed many of its own algorithms and techniques. Liu has written a comprehensive text on Web data mining. Key topics of structure mining, content mining, and usage mining are covered both in breadth and in depth. His book brings together all t
Title: Clustering for Data Mining: A Data Recovery Approach (Chapman & Hall/CRC Computer Science & Data Analysis)
Author: Boris Mirkin
Publisher: Chapman and Hall/CRC
Keywords: data, crc, computer, analysis, hall, science, chapman, mining, recovery, approach, clustering
Number of Pages: 296
Published: 2005-04-29
List price: $93.95
ISBN-10: 1584885343
ISBN-13: 9781584885344
Often considered more as an art than a science, the field of clustering has been dominated by learning through examples and by techniques chosen almost through trial-and-error. Even the most popular clustering methods--K-Means for partitioning the data set and Ward’s method for hierarchical clustering--have lacked the theoretical attention that would establish a firm relationship between the two methods and relevant interpretation aids. Rather than the traditional set of ad hoc techniques, Clustering for Data Mining: A Data Recovery Approach presents a theory that not only closes gaps in
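Title: Between Data Science and Applied Data Analysis: Proceedings of the 26th Annual Conference of the Gesellschaft für Klassifikation e.V., University of Mannheim ... Data Analysis, and Knowledge Organization)
Authors: Martin Schader, Wolfgang Gaul, Maurizio Vichi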
Publisher: Springer
Keywords: data, analysis, klassifikation, organization, university, mannheim, fr, knowledge, conference, applied, science, proceedings, 26th, annual, gesellschaft
Number of Pages: 693
Published: 2003-09-10
List price: $149.00
ISBN-10: 354040354X
ISBN-13: 9783540403548
The volume presents new developments in data analysis and classification and gives an overview of the state of the art in these scientific fields and relevant applications. Areas that receive considerable attention in the book are clustering, discrimination, data analysis, and statistics, as well as applications in economics, biology, and medicine. The reader will find material on recent technical and methodological developments and a large number of application papers demonstrating the usefulness of the newly developed techniques.
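Title: Kimball's Data Warehouse Toolkit Classics: The Data Warehouse Toolkit, 2nd Edition; The Data Warehouse Lifecycle Toolkit, 2nd Edition; The Data Warehouse ETL Toolk
Authors: Ralph Kimball, Margy Ross, Bob Becker, Joy Mundy,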
Publisher: Wiley
Keywords: data, warehouse, toolkit, toolk, etl, lifecycle, classics, kimball
Number of Pages: 1628
Published: 2009-04-06
List price: $145.00
ISBN-10: 0470479574
ISBN-13: 9780470479575
· Cowritten by Ralph Kimball, the world’s leading data warehousing authority · Delivers real-world solutions for the most time- and labor-intensive portion of data warehousing: data staging, or the extract, transform, load (ETL) process · Delineates best practices for extracting data from scattered sources, removing redundant and inaccurate data, transforming the remaining data into correctly formatted data structures, and then loading the end product into the data warehouse · Offers proven time-saving ETL techniques
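Title: New Developments in Classification and Data Analysis: Proceedings of the Meeting of the Classification and Data Analysis Group (CLADAG) of the Italian ... Data Analysis, and Knowledge Organization)
Authors: Maurizio Vichi, Paola Monari, Stefania Mignani, An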
Publisher: Springer
Keywords: analysis, data, classification, italian, knowledge, organization, cladag, proceedings, developments, new, meeting, group
Number of Pages: 369
Published: 2005-04-06
List price: $104.00
ISBN-10: 3540238093
ISBN-13: 9783540238096
The volume presents new developments in data analysis and classification. Particular attention is devoted to clustering, discrimination, data analysis and statistics, as well as applications in biology, finance and social sciences. The reader will find theory and algorithms on recent technical and methodological developments and many application papers showing the empirical usefulness of the newly developed solutions.
