Presentation is loading. Please wait.

Presentation is loading. Please wait.

Lesson 4: Advanced Transforms

Similar presentations


Presentation on theme: "Lesson 4: Advanced Transforms"— Presentation transcript:

1 Lesson 4: Advanced Transforms
Chapter 4C – Maps and Arrays

2 Lesson 4 – Chapter 4C Chapter 4C: Maps and Arrays
In this Chapter, you will: Understand how to use the following transforms: Extractlist Extractkv Countpattern Nest Unnest/Flatten Vertically (to create new rows) Horizontally (to create new colums) A datasourse is a reference to a set of data that has been imported into the system. This source is not modified within the application datasource and can be used in multiple datasets. It is important to note that when you use Trifacta to wrangle a source, or file, the original file is not modified – therefore, it can be used over and over – to prepare output in multiple ways, for example. Datasources are created in the Datasources Page, or when a new dataset is created. There are two ways to add a datasource to your Trifacta instance: You can locate and select a file in HDFS – HDFS stands for Hadoop File System. You can use the file browser to locate and select the file. You can also upload a local file from your machine. Note that there is a 1 GB file size limit for local files. Several file formats are supported: CSV LOG JSON AVRO EXCEL – Note that if you upload an Excel file with multiple worksheets, each worksheet will be imported as a separate source.


Download ppt "Lesson 4: Advanced Transforms"

Similar presentations


Ads by Google