LID: several random samples for long file by karpnv · Pull Request #6853 · NVIDIA-NeMo/NeMo

karpnv · 2023-06-12T15:58:59Z

What does this PR do ?

Use several random samples for long files

Collection: ASR

Changelog

Added parameters:

segment_duration (float): random sample duration in seconds
num_segments (int): number of segments of file to use for majority vote

Usage

lang_model = nemo_asr.models.EncDecSpeakerLabelModel.from_pretrained(model_name="langid_ambernet")
lang = lang_model.get_label(filename, segment_duration = np.inf, num_segments = 1, random_seed = None)

PR Type:

[V] New Feature

Who can review?

@fayejf

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

…ation_limit

for more information, see https://pre-commit.ci

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

…ation_limit

github-actions · 2023-06-29T02:03:56Z

This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days.

github-actions · 2023-07-06T02:07:28Z

This PR was closed because it has been inactive for 7 days since being marked as stale.

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

…ation_limit

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

for more information, see https://pre-commit.ci

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

…into karpnv/duration_limit

…ation_limit

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

for more information, see https://pre-commit.ci

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

…into karpnv/duration_limit

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

nithinraok · 2023-10-25T00:16:23Z

nemo/collections/asr/models/label_models.py

            label: label corresponding to the trained model
        """
-        _, logits = self.infer_file(path2audio_file=path2audio_file)
+        audio, sr = librosa.load(path2audio_file, sr=None)


can you replace this with sf.read it is much faster

I know it might not have been originally designed to support reading mp3 or multi-channel (stereo) wav files, but it was able to do so in the past with librosa.load, however, it may result in errors after switching to sf.read. Should we consider adding support for more formats or stick to using librosa.load for consistency?

nithinraok · 2023-10-25T00:17:59Z

nemo/collections/asr/models/label_models.py

-            path2audio_file: path to audio wav file
+            path2audio_file (str): path to audio wav file
+            segment_duration (float): random sample duration in seconds
+            num_segments (int): number of segments of file to use for majority vote


instead of num_segments, just do non-overlap segments from start to end based on 5 sec audio samples? Have you done ablation study on what is best?

I personally didn't, but it was suggested by Fai. This is for very long audio (several hours). We take several segments and get result by majority vote

…ation_limit

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

for more information, see https://pre-commit.ci

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

…into karpnv/duration_limit

nithinraok

LGTM, Please add a random seed for selection of random segments.

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

…ation_limit

for more information, see https://pre-commit.ci

karpnv · 2023-10-26T10:36:01Z

added random_seed parameter

nithinraok

Thanks, LGTM

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

karpnv · 2023-10-30T17:12:26Z

jenkins

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

…ation_limit

nithinraok · 2023-11-03T16:03:21Z

jenkins

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

nithinraok · 2023-11-03T19:59:54Z

jenkins

* add random samlpes (num_segments) with segment_duration for get_label(filename, segment_duration = 60*6, num_segments) --------- Signed-off-by: Nikolay Karpov <karpnv@gmail.com> Signed-off-by: Piotr Żelasko <petezor@gmail.com>

* add random samlpes (num_segments) with segment_duration for get_label(filename, segment_duration = 60*6, num_segments) --------- Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

add duration_limit

44fbcd8

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

github-actions bot added the ASR label Jun 12, 2023

karpnv and others added 4 commits June 12, 2023 08:59

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

3a2303a

…ation_limit

[pre-commit.ci] auto fixes from pre-commit.com hooks

034bb3a

for more information, see https://pre-commit.ci

target_sr

ebd9f1b

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

30cee72

…ation_limit

github-actions bot added the stale label Jun 29, 2023

github-actions bot closed this Jul 6, 2023

karpnv and others added 10 commits July 6, 2023 05:56

limit first

5c1034d

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

facb08d

…ation_limit

soundfile

c6eae3e

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

[pre-commit.ci] auto fixes from pre-commit.com hooks

d936bc6

for more information, see https://pre-commit.ci

rm soudfile

54da5d8

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

Merge branch 'karpnv/duration_limit' of https://github.com/NVIDIA/NeMo …

06ff20e

…into karpnv/duration_limit

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

541ae3a

…ation_limit

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

4125955

…ation_limit

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

20e67ab

…ation_limit

infer_segment

514f66d

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

karpnv reopened this Oct 24, 2023

pre-commit-ci bot and others added 4 commits October 24, 2023 17:44

[pre-commit.ci] auto fixes from pre-commit.com hooks

270789f

for more information, see https://pre-commit.ci

soundfile

68bd103

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

Merge branch 'karpnv/duration_limit' of https://github.com/NVIDIA/NeMo …

c41b09f

…into karpnv/duration_limit

docstring

4a0acc6

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

karpnv marked this pull request as ready for review October 24, 2023 17:52

karpnv changed the title ~~duration limit~~ LID: several random samples for long file Oct 24, 2023

karpnv requested a review from nithinraok October 24, 2023 17:56

nithinraok reviewed Oct 25, 2023

View reviewed changes

github-actions bot removed the stale label Oct 25, 2023

karpnv and others added 3 commits October 25, 2023 00:13

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

961893d

…ation_limit

soundfile

360fcf4

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

[pre-commit.ci] auto fixes from pre-commit.com hooks

9b15855

for more information, see https://pre-commit.ci

karpnv requested a review from nithinraok October 25, 2023 07:30

karpnv added 2 commits October 25, 2023 03:08

type float

26bcfa7

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

Merge branch 'karpnv/duration_limit' of https://github.com/NVIDIA/NeMo …

93ecb72

…into karpnv/duration_limit

nithinraok previously approved these changes Oct 25, 2023

View reviewed changes

karpnv added 2 commits October 26, 2023 03:31

random_seed

783121a

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

5000160

…ation_limit

karpnv dismissed nithinraok’s stale review via 5000160 October 26, 2023 10:32

[pre-commit.ci] auto fixes from pre-commit.com hooks

476dec3

for more information, see https://pre-commit.ci

karpnv requested a review from nithinraok October 26, 2023 10:36

nithinraok previously approved these changes Oct 26, 2023

View reviewed changes

Merge branch 'main' into karpnv/duration_limit

6824fce

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

karpnv added 3 commits October 31, 2023 10:36

Merge branch 'main' into karpnv/duration_limit

58ffd07

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

7619746

…ation_limit

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

bae985e

…ation_limit

to float

3df2df1

Signed-off-by: Nikolay Karpov <karpnv@gmail.com>

karpnv dismissed nithinraok’s stale review via 3df2df1 November 3, 2023 19:57

karpnv requested a review from nithinraok November 3, 2023 19:59

nithinraok approved these changes Nov 3, 2023

View reviewed changes

karpnv merged commit 286e84e into main Nov 6, 2023

karpnv deleted the karpnv/duration_limit branch November 6, 2023 07:51

Conversation

karpnv commented Jun 12, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do ?

Changelog

Usage

Who can review?

Uh oh!

github-actions bot commented Jun 29, 2023

Uh oh!

github-actions bot commented Jul 6, 2023

Uh oh!

nithinraok Oct 25, 2023

Choose a reason for hiding this comment

Uh oh!

karpnv Oct 25, 2023

Choose a reason for hiding this comment

Uh oh!

liuspencersjtu Mar 10, 2024

Choose a reason for hiding this comment

Uh oh!

nithinraok Oct 25, 2023

Choose a reason for hiding this comment

Uh oh!

karpnv Oct 25, 2023

Choose a reason for hiding this comment

Uh oh!

nithinraok left a comment

Choose a reason for hiding this comment

Uh oh!

karpnv commented Oct 26, 2023

Uh oh!

nithinraok left a comment

Choose a reason for hiding this comment

Uh oh!

karpnv commented Oct 30, 2023

Uh oh!

nithinraok commented Nov 3, 2023

Uh oh!

nithinraok commented Nov 3, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

karpnv commented Jun 12, 2023 •

edited

Loading