Data ingestion Identifying input files Sorting input files by file name Sorting input files by file size Sorting input files by file time Sorting input files by special file time Finding file duplicates Using load distribution to distribute files Distributing files to processing chains defined on job submission Using file group split to distribute files Choosing a parser Using many parsers Activating the compression parameter for file readers Activating the encoding parameter for file readers Using file preprocessing