Presentation is loading. Please wait.

Presentation is loading. Please wait.

Lesson 3: Trifacta Basics

Similar presentations


Presentation on theme: "Lesson 3: Trifacta Basics"— Presentation transcript:

1 Lesson 3: Trifacta Basics
Chapter 3A – Filtering: Keep, Delete

2 Lesson 3 – Chapter 3A Chapter 3A – filtering: keep, delete
In this Chapter, you will: Use the Data Quality Bar to Filter your records and generate suggested script lines Understand how to use the following transforms: Keep Delete A datasourse is a reference to a set of data that has been imported into the system. This source is not modified within the application datasource and can be used in multiple datasets. It is important to note that when you use Trifacta to wrangle a source, or file, the original file is not modified – therefore, it can be used over and over – to prepare output in multiple ways, for example. Datasources are created in the Datasources Page, or when a new dataset is created. There are two ways to add a datasource to your Trifacta instance: You can locate and select a file in HDFS – HDFS stands for Hadoop File System. You can use the file browser to locate and select the file. You can also upload a local file from your machine. Note that there is a 1 GB file size limit for local files. Several file formats are supported: CSV LOG JSON AVRO EXCEL – Note that if you upload an Excel file with multiple worksheets, each worksheet will be imported as a separate source. Trifacta. Confidential & Proprietary.


Download ppt "Lesson 3: Trifacta Basics"

Similar presentations


Ads by Google