Skip to content

Data Source Mapping

During onboarding, the same piece of client information — a name, a date of birth, an address — can arrive from several different systems: DigiLocker, the KRA (KYC Registration Agency), the CKYC (Central KYC) registry, a verification API, or the customer themselves. This page documents which source wins when they conflict. You will refer back to it when debugging data mismatches or trying to understand why a particular field was prefilled with a value the client did not type in.

When multiple sources provide the same field, the system uses a fixed precedence order to decide which value to keep.

1. DigiLocker > 2. CKYC (CERSAI) > 3. KRA Fetch > 4. Verification API > 5. User Input

Rule: DigiLocker wins for identity fields. KRA/CKYC win for financial profile. User input only for data no source can provide (mobile, email, segment choice).

Government-issued digital documents (DigiLocker) are treated as the highest-authority source per SEBI/HO/MIRSD/DOP/CIR/P/2020/73. Registry records (CKYC/KRA) take precedence over third-party verification services, which in turn take precedence over customer-entered data.

The table below shows how the roughly 454 fields in the master dataset break down by origin. The vast majority are system-generated — timestamps, status codes, and audit logs that the customer never sees.

SourceCategoryExamples
System GeneratedLargest share of total fieldsTimestamps, IDs, status codes, audit logs, API responses, submission tracking
KRA/CKYC PrefillRegulatory profile fieldsOccupation, income, net worth, FATCA, PEP, signature, CKYC number
DigiLockerGovernment identity fieldsName, DOB, gender, photo, full address, father’s name, POI/POA documents
Verification APIsVerification and compliance fieldsPAN status, bank name match, AML screening result, face match result, eSign metadata
User InputMinimal manual entryMobile, PAN, DOB, email, bank a/c, IFSC, segments, marital status

This table connects the nine onboarding screens to their primary data sources. When investigating a bug on a specific screen, this tells you which integration to look at first.

ScreenPrimary SourceFields FilledUser Input
Screen 1User + OTPMobile, email1-2 fields
Screen 2User + async APIsPAN, DOB + KRA/CKYC/AML results2 fields
Screen 3DigiLockerIdentity fields (name, DOB, gender, address, photo, father’s name)0 fields (redirect)
Screen 4Pre-filled from Screen 1-3Name, DOB, gender, address confirmed1 field (email)
Screen 5User + Bank VerificationBank account, IFSC, beneficiary name3 fields
Screen 6User toggles + KRA prefillSegment preferences, income rangeToggles only
Screen 7UserNominee detailsVariable
Screen 8SystemDeclaration checkboxesCheckboxes
Screen 9Review + Verification APIsFace match result, eSign0 fields (OTP)