Skip to content

Data Source Mapping

During onboarding, the same piece of client information — a name, a date of birth, an address — can arrive from several different systems: DigiLocker, the KRA (KYC Registration Agency), the CKYC (Central KYC) registry, a vendor API, or the customer themselves. This page documents which source wins when they conflict. You will refer back to it when debugging data mismatches or trying to understand why a particular field was prefilled with a value the client did not type in.

When multiple sources provide the same field, the system uses a fixed precedence order to decide which value to keep.

1. DigiLocker > 2. CKYC (CERSAI) > 3. KRA Fetch > 4. Vendor API > 5. User Input

Rule: DigiLocker wins for identity fields. KRA/CKYC win for financial profile. User input only for data no source can provide (mobile, email, segment choice).

In plain English: the system trusts government-issued digital documents more than registry records, registry records more than third-party vendors, and vendor data more than what the customer typed in manually.

The table below shows how the roughly 454 fields in the master dataset break down by origin. Notice that the vast majority are system-generated — timestamps, status codes, and audit logs that the customer never sees.

SourceFields%Examples
System Generated~350+77%Timestamps, IDs, status codes, audit logs, API responses, submission tracking
KRA/CKYC Prefill~409%Occupation, income, net worth, FATCA, PEP, signature, CKYC number
DigiLocker~255.5%Name, DOB, gender, photo, full address, father’s name, POI/POA
Vendor APIs~255.5%PAN status, bank name match, AML score, face match, e-Sign metadata
User Types~123%Mobile, PAN, DOB, email, bank a/c, IFSC, segments, marital status

This table connects the nine onboarding screens to their primary data sources. When investigating a bug on a specific screen, this tells you which integration to look at first.

ScreenPrimary SourceFields FilledUser Input
Screen 1User + OTPMobile, email1-2 fields
Screen 2User + async APIsPAN, DOB + KRA/CKYC/AML results2 fields
Screen 3DigiLocker~25 identity fields0 fields (redirect)
Screen 4Pre-filled from Screen 1-3Name, DOB, gender, address confirmed1 field (email)
Screen 5User + Penny DropBank account, IFSC, beneficiary name3 fields
Screen 6User toggles + KRA prefillSegment preferences, income rangeToggles only
Screen 7UserNominee detailsVariable
Screen 8SystemDeclaration checkboxesCheckboxes
Screen 9Review + APIsFace match, e-Sign0 fields (OTP)