Web数据挖掘：挖掘Web内容模型，结构和用途.pdf

资源大小： 6.79MB

发布时间： 2012-11-19

文件格式： pdf

下载次数： 21

分享到：

下载地址：

下载地址1

（本站为飞网专业下载站，域名：down.cfei.net）

资源简介：

中文名: Web数据挖掘：挖掘Web内容模式、结构和用途作者: Zdravko MarkovDaniel T. Larose图书分类: 网络资源格式: PDF版本: 文字版出版社: Wiley Blackwell书号: 0471666556发行时间: 2007年04月01日地区: 美国语言: 英文简介: 内容介绍：This book introduces the reader to methods of data mining on the web, including uncovering patterns in web content (classification, clustering, language processing), structure (graphs, hubs, metrics), and usage (modeling, sequence analysis, performance).内容截图：目录: PREFACEPART I: WEB STRUCTURE MINING1 INFORMATION RETRIEVAL AND WEB SEARCHWeb ChallengesWeb Search EnginesTopic DirectoriesSemantic WebCrawling the WebWeb BasicsWeb CrawlersIndexing and Keyword SearchDocument RepresentationImplementation ConsiderationsRelevance RankingAdvanced Text SearchUsing the HTML Structure in Keyword SearchEvaluating Search QualitySimilarity SearchCosine SimilarityJaccard SimilarityDocument ResemblanceReferencesExercises2 HYPERLINK-BASED RANKINGIntroductionSocial Networks AnalysisPageRankAuthorities and HubsLink-Based Similarity SearchEnhanced Techniques for Page RankingReferencesExercisesPART II: WEB CONTENT MINING3 CLUSTERINGIntroductionHierarchical Agglomerative Clusteringk-Means ClusteringProbabilty-Based ClusteringFinite Mixture ProblemClassification ProblemClustering ProblemCollaborative Filtering (Recommender Systems)ReferencesExercises4 EVALUATING CLUSTERINGApproaches to Evaluating ClusteringSimilarity-Based Criterion FunctionsProbabilistic Criterion FunctionsMDL-Based Model and Feature Evaluation.Minimum Description Length Principle.MDL-Based Model EvaluationFeature SelectionClasses-to-Clusters EvaluationPrecision, Recall, and F-MeasureEntropyReferencesExercises5 CLASSIFICATIONGeneral Setting and Evaluation TechniquesNearest-Neighbor AlgorithmFeature SelectionNaive Bayes AlgorithmNumerical ApproachesRelational LearningReferencesExercisesPART III: WEB USAGE MINING6 INTRODUCTION TO WEB USAGE MININGDefinition of Web Usage MiningCross-Industry Standard Process for Data MiningClickstream AnalysisWeb Server Log FilesRemote Host FieldDate/Time FieldHTTP Request FieldStatus Code FieldTransfer Volume (Bytes) FieldCommon Log FormatIdentification FieldAuthuser FieldExtended Common Log FormatReferrer FieldUser Agent FieldExample of a Web Log RecordMicrosoft IIS Log FormatAuxiliary InformationReferencesExercises7 PREPROCESSING FOR WEB USAGE MININGNeed for Preprocessing the DataData Cleaning and FilteringPage Extension Exploration and FilteringDe-Spidering the Web Log FileUser IdentificationSession IdentificationPath CompletionDirectories and the Basket TransformationFurther Data Preprocessing StepsReferencesExercises8 EXPLORATORY DATA ANALYSIS FOR WEB USAGE MININGIntroductionNumber of Visit ActionsSession DurationRelationship between Visit Actions and Session DurationAverage Time per PageDuration for Individual PagesReferencesExercises9 MODELING FOR WEB USAGE MINING: CLUSTERING, ASSOCIATION, AND CLASSIFICATIONIntroductionModeling MethodologyDefinition of ClusteringThe BIRCH Clustering AlgorithmAffinity Analysis and the A Priori AlgorithmDiscretizing the Numerical Variables: BinningApplying the A Priori Algorithm to the CCSU Web Log DataClassification and Regression TreesThe C4.5 AlgorithmReferencesExercisesINDEX

飞网下载站，免费下载共享资料，内容涉及教育资源、专业资料、IT资源、娱乐生活、经济管理、办公文书、游戏资料等。

Web数据挖掘：挖掘Web内容模型，结构和用途.pdf

下载地址：

资源简介：

相关资源：

飞网精选

《Java常用算法手册》，108个经典示例融入算法思想与高级应用，本书共14章，还列举了算法的一些常见面试题。

《HTML5开发精要与实例详解》，这是一本以综合性案例为导向并辅之以精要知识点讲解的html 5实战教程，内容分为两大部分。百度云盘分享。

一套20美刀的程序猿专用HTML模板-developr1.7，这套风格，很高大上、很炫酷、吊炸天......

马哥linux运维全套面授班培训教程+ppt+工具+视频全套不加密，培训价格几万块的想必大家都知道他的价值

热门下载