Introduction

...

  1. When doing a large import, first test with a small subset of 10-100 rows. Often the same mistake is repeated in every row, and testing a subset avoids waiting a long time only to get the same error back for every row.
  2. For huge uploads (500K to 1M+ records), refer to "Tips and tricks to import large number of records" below.
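
A small test file can be cut from the full import file with a few lines of Python. The following is a minimal sketch, not part of the application: the function name take_sample and the column names in the demo are hypothetical, and it assumes the first row of the CSV is the header row.

```python
import csv
import io
from itertools import islice


def take_sample(src, dst, max_rows=100):
    """Copy the header row plus at most max_rows data rows from src to dst.

    src and dst are open text streams. Returns the number of data rows copied.
    """
    reader = csv.reader(src)
    writer = csv.writer(dst, lineterminator="\n")
    header = next(reader, None)
    if header is None:
        # Empty input: nothing to copy.
        return 0
    writer.writerow(header)
    copied = 0
    for row in islice(reader, max_rows):
        writer.writerow(row)
        copied += 1
    return copied


if __name__ == "__main__":
    # In-memory demo with made-up columns; real files would be opened with
    # open(path, newline="") for reading and open(path, "w", newline="") for writing.
    demo_src = io.StringIO("First Name,Last Name\nAda,Lovelace\nAlan,Turing\nGrace,Hopper\n")
    demo_dst = io.StringIO()
    print(take_sample(demo_src, demo_dst, max_rows=2))
    print(demo_dst.getvalue())
```

Importing the sample file first shows whether the column headers and value formats are accepted before the full file is submitted.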

...

The data import file has to be in CSV (Comma Separated Values) format.

Can I import any CSV file?

The data in the CSV file must follow specific template formats. The templates can be downloaded from the application under the import option for each type of data.

Can I import participant, visits, specimens, etc. in one go?

Yes, using the 'Master Specimens' template you can import participants, visits, and specimens in one go. For more details on this import, refer to 'Master Template'.

...

The report shows whether each record upload succeeded or failed. In case of failure, you can download the bulk import job file, which shows the import status and the reason for each failure.
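
Once the job file is downloaded, the failed rows can be pulled out for correction with a short script. This is a rough sketch under stated assumptions: the status column name OS_IMPORT_STATUS and the failure value FAIL are guesses passed in as parameters, so check them against the header of your own report file. The function name extract_failed_rows is hypothetical.

```python
import csv
import io


def extract_failed_rows(report, out, status_column="OS_IMPORT_STATUS", failed_value="FAIL"):
    """Copy the header and every failed row from report to out.

    report and out are open text streams. status_column and failed_value are
    assumptions about the report layout. Returns the number of failed rows.
    """
    reader = csv.DictReader(report)
    if reader.fieldnames is None or status_column not in reader.fieldnames:
        raise ValueError(f"column {status_column!r} not found in report")
    writer = csv.DictWriter(
        out,
        fieldnames=reader.fieldnames,
        extrasaction="ignore",  # skip stray cells beyond the header width
        lineterminator="\n",
    )
    writer.writeheader()
    failed = 0
    for row in reader:
        status = (row[status_column] or "").strip().upper()
        if status == failed_value.upper():
            writer.writerow(row)
            failed += 1
    return failed


if __name__ == "__main__":
    # Made-up report content for illustration only.
    demo = io.StringIO(
        "Name,OS_IMPORT_STATUS,OS_ERROR_MESSAGE\n"
        "A,SUCCESS,\n"
        "B,FAIL,Bad date\n"
    )
    result = io.StringIO()
    print(extract_failed_rows(demo, result))
    print(result.getvalue())
```

The output keeps the error column next to each failed row, so the reason for failure stays visible while the record is being corrected.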

...

Can I abort a bulk import job?

If you want to abort a bulk import job, click on the 'Abort' icon on the jobs page for the specific import job:

This helps if you have uploaded a large file (say, 10K records), realize that there is a mistake in the records, and would like to abort the bulk import job instead of waiting for the whole file to process.

...

Email notifications are sent to the user who performed the bulk import after the job is completed, failed, or aborted. The email is also CCed to the 'Administrator Email Address' set under Settings → Email.

Settings of validation before import

All bulk import jobs are validated first by the system. If there are errors in any of the records, none of the records are inserted or updated. You can download the report, correct the errors, and upload the same report file.

...

To disable validation before importing, follow these steps:

  1. Go to the home page and click on the ‘Settings’ card.

  2. Click on the ‘Common’ module and select the property ‘Pre-validate Records Limit’.

  3. Set the ‘New Value’ field to ‘0’ and click on ‘Update’.


Note: When validation is disabled, the system shows errors for failed records but still uploads the successful records.

What is 'Validate and Import'? (v3.4 onwards)

In a regular bulk upload, if 100 records are uploaded, of which 60 fail and only 40 are processed successfully, you have to filter out the failed records, rectify them, and upload them again for reprocessing. The 'Validate and Import' feature validates the complete file before upload.

  • If any record in the input CSV file fails, the whole job fails and nothing is saved in the database until all the records succeed.
  • If there is any error, the system returns a status log file with an error message for each incorrect record, so that you can rectify those records and upload again.
  • The time required to validate the records is the same as that required to upload the records.
  • The maximum number of records that can be validated in one job is set to 10000 by default. It can be changed from Settings → Common → Pre-validate Records Limit.
  • If there are more than 10K records (the default limit), the system shows the message 'Number of records to import are greater than 10000, do you want to proceed without validating input file?'.
  • If you proceed without validation, the records are processed individually.
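
The rules above can be modelled in a few lines to see how the two modes differ. This is an illustrative sketch only, not the application's code: the names ImportMode, choose_mode, and records_saved are hypothetical, and treating a limit of 0 as "validation disabled" follows the settings steps described earlier.

```python
from enum import Enum


class ImportMode(Enum):
    VALIDATE_THEN_IMPORT = "validate the whole file, then import"
    ASK_USER = "ask whether to proceed without validation"
    IMPORT_INDIVIDUALLY = "process each record on its own"


def choose_mode(num_records, prevalidate_limit=10000):
    """Pick the import mode from the record count and the configured limit."""
    if prevalidate_limit <= 0:
        # Assumption: a limit of 0 means validation is switched off.
        return ImportMode.IMPORT_INDIVIDUALLY
    if num_records > prevalidate_limit:
        return ImportMode.ASK_USER
    return ImportMode.VALIDATE_THEN_IMPORT


def records_saved(record_ok, validated):
    """Count the records that end up in the database.

    record_ok holds one boolean per record (True means the record is valid).
    With validation the job is all-or-nothing; without it, each valid record
    is saved on its own.
    """
    if validated:
        return len(record_ok) if all(record_ok) else 0
    return sum(1 for ok in record_ok if ok)


if __name__ == "__main__":
    # The 100-record example from the text: 40 valid records, 60 invalid.
    outcomes = [True] * 40 + [False] * 60
    print(choose_mode(len(outcomes)))
    print(records_saved(outcomes, validated=True))
    print(records_saved(outcomes, validated=False))
```

With validation, the 100-record example saves nothing until all 60 errors are fixed; without it, the 40 valid records are saved and the 60 failed ones must be re-uploaded.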

Can the Super Admin view import jobs of all users?

Yes, the "Super Administrator" can view import jobs of all users. Please note that a user can bulk upload data from two places:

  1. Outside an individual CP, i.e. from the collection protocol list page (Collection Protocols → More → Import)
  2. For a specific collection protocol (Collection Protocols → Participant List → More → Import)

The jobs will be visible to the Super Admin based on how the user uploaded the file. In other words, jobs uploaded at the global level won't be visible under a specific CP and vice versa.

Can the institute admin view import jobs of other users? (v3.4 onwards)

Yes, same as the Super Admin. An Institute Admin can view import jobs of users from his/her institute.

...

Tips and tricks to import large number of records

If you have a very large amount of data to import (say, hundreds of thousands or millions of records), you can follow the steps below to improve the speed of the import:

  1. Do imports via folder import and not via the UI. Refer to 'Auto bulk import' for this.
  2. Break the large file into smaller files, say 100K specimens each. The problem with one large file is that the system takes a very long time just to read it, i.e. before it even starts processing the first row.
  3. If importing via the UI, import the file as a Super Admin user. This tells the system not to spend time on privilege checks. This happens automatically if you do the auto bulk import by dropping the file in the server folder.
  4. Schedule the import during off-peak hours, e.g. daily from 5 PM to 8 AM the next day, or on weekends. You can do this by putting a fixed number of files in the folder, i.e. once you know that 1 file of 100K records takes 1 hour, you can put, say, 14 files.
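
Splitting a large file (step 2) is easy to script. The sketch below is one possible approach, not a tool shipped with the application: the function name split_csv and the output file naming are hypothetical, and it assumes the first row of the input is the header, which is repeated in every part.

```python
import csv
import tempfile
from itertools import islice
from pathlib import Path


def split_csv(src_path, out_dir, rows_per_file=100_000):
    """Split a CSV file into numbered parts of at most rows_per_file data rows.

    Every part starts with the header row of the source file.
    Returns the list of part paths in order.
    """
    src_path = Path(src_path)
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    parts = []
    with src_path.open(newline="") as src:
        reader = csv.reader(src)
        header = next(reader, None)
        if header is None:
            # Empty input: no parts to write.
            return parts
        while True:
            # Read the next block of rows; stop when the file is exhausted.
            chunk = list(islice(reader, rows_per_file))
            if not chunk:
                break
            part = out_dir / f"{src_path.stem}_part{len(parts) + 1:03d}.csv"
            with part.open("w", newline="") as dst:
                writer = csv.writer(dst)
                writer.writerow(header)
                writer.writerows(chunk)
            parts.append(part)
    return parts


if __name__ == "__main__":
    # Demo on a tiny temporary file with made-up columns.
    work = Path(tempfile.mkdtemp())
    source = work / "specimens.csv"
    source.write_text("id,label\n1,a\n2,b\n3,c\n4,d\n5,e\n")
    for path in split_csv(source, work / "parts", rows_per_file=2):
        print(path.name)
```

The numbered file names keep the parts in order, which makes it simple to place a fixed batch of them in the import folder for each off-peak window.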