MS-1135 Combine per-modality duplication code by luhmirin-s · Pull Request #1466 · Simprints/Android-Simprints-ID

luhmirin-s · 2025-11-13T14:57:02Z

JIRA ticket
Will be released in: 2026.1.0

Notable changes

Combine most of the instances of parallel "face/fingerprint" parameters.
Implement a way to use the SDK configuration in a more generic way.
Simplify the local record repository interface to have a single method.

Testing guidance

Run all kinds of regular flows. Everything should works exactly as before.

Additional work checklist

Effect on other features and security has been considered
Design document marked as "In development" (if applicable)
External (Gitbook) and internal (Confluence) Documentation is up to date (or ticket created)
Test cases in Testiny are up to date (or ticket created)
Other teams notified about the changes (if applicable)

BurningAXE

Nice! Much simpler this way!

BurningAXE · 2025-11-17T14:23:19Z

    val probeReferenceId: String?,
-    val faceSamples: List<CaptureSample>,
-    val fingerprintSamples: List<CaptureSample>,
+    val samples: Map<Modality, List<CaptureSample>>,


Would it be better if we had a dedicated model ModalitySamples with explicit properties instead of a Map?

We would lose the ability to look up by modality key, which is used in several places.
I cannot think of a benefit other than a shorter declaration.

BurningAXE · 2025-11-17T14:39:18Z

+    JsonSubTypes.Type(value = BiometricDataSource.CommCare::class, name = "BiometricDataSource.CommCare"),
+    JsonSubTypes.Type(value = BiometricDataSource.Simprints::class, name = "BiometricDataSource.Simprints"),
    JsonSubTypes.Type(value = SubjectQuery::class, name = "SubjectQuery"),
+    JsonSubTypes.Type(value = AgeGroup::class, name = "AgeGroup"),


Hm, changes in this file seem suspicious - were the added ones missed previously? And the deleted ones unneeded?

This part was done in parallel with similar fixes in the release branch and later main, so it is messy. IIRC, all remaining records are required to pass the cache integration tests.

BurningAXE · 2025-11-17T14:44:59Z

+     * Combines all of the matching results per SDK and returns up to [maxNbOfReturnedCandidates] results from the SDK with
+     * the highest overall score in descending order. Credential matches take precedence over direct matches.
+     *
+     * If there are any matches of [AppMatchConfidence.HIGH], only those will be returned,


of = above?

AppMatchConfidence is the enum of confidence bands, there is no band above HIGH.

BurningAXE · 2025-11-17T17:23:45Z

        // require format to be set for biometric templates query
-        val format = query.fingerprintSampleFormat ?: query.faceSampleFormat
-        require(format != null) {
+        require(query.format != null) {


Here but also in lots of other places - it was more work to maintain hardcoded modalities than to have it abstracted!

BurningAXE · 2025-11-18T14:16:01Z

+                            samples = (
+                                dbSubject.faceSamples.map { it.toDomain() } +
+                                    dbSubject.fingerprintSamples.map { it.toDomain() }
+                            ).filter { it.format == query.format },


Why move the filter last? Not that it really matters in practice but it's less efficient this way.

Tbh, don't remember. Most likely just to do it once on the combined domain samples list.

alexandr-simprints · 2025-11-19T10:42:51Z

+        scope.launch(dispatcher) {
+            ranges
+                .map { range ->
+                    async { semaphore.withPermit { channel.send(load(range)) } }


In this implementation, the semaphore is acquired first, and then then potentially blocks on channel.send().

If we have more ranges than available processor slots (i.e., 10 ranges and 4 permits), then all 10 async tasks are launched immediately. The first 4 acquire semaphore permits, and if the channel buffer fills, they block on send() while holding the permission. It seems to me that this defeats the purpose of the semaphore for concurrency control

Maybe we should change it to this:

async { val result = semaphore.withPermit { // Only hold semaphore during actual work load(range) } // Release semaphore channel.send(result) }

I don't really see the issue - you'd have 10 coroutines launched "simultaneously". 4 of them would acquire semaphores, load and then send (the Channel also has capacity of 4), then those 4 semaphores would be released and taken by next 4 coroutines. Am I missing anything?

Please see the comment above.

From memory, channel.send() can suspend when the buffer is full. If it suspends, it is still holding the semaphore permit (since it's inside withPermit).

Looking at the 10 ranges and 4 semaphore permits scenario, with channel capacity of 4:

First 4 coroutines acquire permits, load, then hit channel.send() - buffer fills

Those 4 are now blocked on send() while holding their permits

Next 4 coroutines get permits, load, and also blocks on send()

Now all permits are held by coroutines waiting on channel I/O, not doing actual work

That's why it seems that it's better for semaphore to surround only the expensive load() operation, and not the channel 'communication':

async { val result = semaphore.withPermit { load(range) } channel.send(result) // Can block, but permit already released }

With this modification, the permits are released immediately after loading, and it allows the next batch to start work (rather than being stuck behind channel operations).

HOWEVER, according to @luhmirin-s:

While this shows up as "added" code, it is exactly the same as the pre-existing loadIdentities methods. The only change from my side is the renaming and duplication removal.

So it's up to you to change the code or keep it this way.

…hestrator

…e API

sonarqubecloud · 2025-11-20T08:38:14Z

Quality Gate passed

Issues
8 New issues
0 Accepted issues

Measures
0 Security Hotspots
84.6% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

cla-bot Bot added the ... label Nov 13, 2025

luhmirin-s changed the title ~~Feature/ms 1135 combine per modality duplication~~ MS-1135 Combine per-modality duplication code Nov 13, 2025

luhmirin-s requested review from a team, BurningAXE, TristramN, alex-vt, alexandr-simprints, meladRaouf and ybourgery and removed request for a team November 13, 2025 15:15

luhmirin-s marked this pull request as ready for review November 13, 2025 15:15

BurningAXE approved these changes Nov 18, 2025

View reviewed changes

luhmirin-s force-pushed the feature/MS-1135-simplifying-module-api branch from 59b38bf to 1650d12 Compare November 19, 2025 08:44

luhmirin-s force-pushed the feature/MS-1135-combine-per-modality-duplication branch from 1466cce to 6460be2 Compare November 19, 2025 09:05

luhmirin-s force-pushed the feature/MS-1135-simplifying-module-api branch from 1650d12 to b303c4d Compare November 19, 2025 09:14

luhmirin-s force-pushed the feature/MS-1135-combine-per-modality-duplication branch 2 times, most recently from 9648303 to 3363a56 Compare November 19, 2025 09:26

alexandr-simprints requested changes Nov 19, 2025

View reviewed changes

alexandr-simprints approved these changes Nov 19, 2025

View reviewed changes

meladRaouf approved these changes Nov 20, 2025

View reviewed changes

luhmirin-s added 5 commits November 20, 2025 09:57

MS-1135 Simplify per-modality logic based on the configuration in orc…

095b95e

…hestrator

MS-1135 Simplify the local record repo interface by combining methods

d228af3

MS-1135 Combine modality sample lists in the subject model

38ebae0

MS-1135 Combine probe lists in matcher params

17784e9

MS-1135 Combine per-modality parameters in external credentials modul…

e04f020

…e API

luhmirin-s force-pushed the feature/MS-1135-combine-per-modality-duplication branch from 3363a56 to e4f718a Compare November 20, 2025 08:11

luhmirin-s changed the base branch from feature/MS-1135-simplifying-module-api to main November 20, 2025 08:12

MS-1135 Move AgeGroup to core domain

f628d6d

luhmirin-s force-pushed the feature/MS-1135-combine-per-modality-duplication branch from e4f718a to f628d6d Compare November 20, 2025 08:24

luhmirin-s merged commit ab02f16 into main Nov 20, 2025
13 checks passed

luhmirin-s deleted the feature/MS-1135-combine-per-modality-duplication branch November 20, 2025 08:39

Conversation

luhmirin-s commented Nov 13, 2025

Notable changes

Testing guidance

Additional work checklist

Uh oh!

BurningAXE left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

alexandr-simprints Nov 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sonarqubecloud Bot commented Nov 20, 2025

Quality Gate passed

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

alexandr-simprints Nov 19, 2025 •

edited

Loading