最新消息:请大家多多支持

Lynda – Data Analysis on Hadoop

其他教程 dsgsd 203浏览 0评论


Lynda - Data Analysis on Hadoop
Lynda – Data Analysis on Hadoop
Size: 9.41 GB | Duration: 0h 41m | Video: AVC (.mp4) 1280×720 15&30fps | Audio: AAC 48KHz 2ch
Genre: eLearning | Level: Intermediate | Language: English

Hadoop is the cloud computing platform data scientists use to perform highly parallelized operations on big data. If you’ve explored Hadoop, you’ve probably discovered it has many levels of complexity. After getting comfortable with the fundamentals, you’re ready to see how to put additional frameworks and tool sets to use. In this course, software engineer and data scientist Jack Dintruff goes beyond the basic capabilities of Hadoop. He demonstrates hands-on, project-based, practical skills for analyzing data, including how to use Pig to analyze large datasets and how to use Hive to manage large datasets in distributed storage. Learn how to configure the Hadoop distributed file system (HDFS), perform processing and ingestion using MapReduce, copy data from cluster to cluster, create data summarizations, and compose queries.

Topics include:
* Setting up and administrating clusters
* Ingesting data
* Working with MapReduce, YARN, Pig, and Hive
* Selecting and aggregating large datasets
* Defining limits, unions, filters, and joins
* Writing custom user-defined functions (UDFs)
* Creating queries and lookups
Lynda - Data Analysis on HadoopLynda - Data Analysis on Hadoop
Download 百度云

你是VIP 1个月(1 month)赞助会员,

资源下载此资源仅限VIP下载,请先

转载请注明:0daytown » Lynda – Data Analysis on Hadoop

您必须 登录 才能发表评论!