poonehmousavi · 2024-08-11T19:52:53Z

What does this PR do?

Fix the partial_fit bug introduced in kmeans.py, which causes only the last batch to be used for training the k-means model. It is related to this issue:
#2602
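
To make the failure mode concrete, here is a minimal sketch. It assumes the model is scikit-learn's `MiniBatchKMeans` (the PR text does not name the class), and the toy batches and helper names are illustrative only, not code from kmeans.py. Calling `fit()` once per batch restarts training each time, whereas `partial_fit()` keeps updating the same centroids.

```python
import numpy as np
from sklearn.cluster import MiniBatchKMeans

rng = np.random.default_rng(0)
# Two batches drawn from well-separated 1-D clusters (toy stand-ins for feature batches).
batch_a = rng.normal(loc=0.0, scale=0.1, size=(200, 1))
batch_b = rng.normal(loc=10.0, scale=0.1, size=(200, 1))
batches = [batch_a, batch_b]


def new_model():
    # reassignment_ratio=0 turns off random re-seeding of small clusters,
    # which keeps this toy comparison easy to read.
    return MiniBatchKMeans(
        n_clusters=2, random_state=0, n_init=3, reassignment_ratio=0
    )


def train_with_fit(batches):
    """Buggy pattern: every fit() call restarts training from scratch."""
    model = new_model()
    for batch in batches:
        model.fit(batch)  # discards whatever was learned from earlier batches
    return model


def train_with_partial_fit(batches):
    """Intended pattern: each batch incrementally updates the same centroids."""
    model = new_model()
    for batch in batches:
        model.partial_fit(batch)
    return model


fit_centers = np.sort(train_with_fit(batches).cluster_centers_.ravel())
partial_centers = np.sort(train_with_partial_fit(batches).cluster_centers_.ravel())
print("fit() per batch:        ", fit_centers)
print("partial_fit() per batch:", partial_centers)
```

With `fit()`, both centroids end up describing only the last batch. With `partial_fit()`, one centroid still reflects the first batch while the other has moved toward the second.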

Based on new experiments with k-means trained on the full LibriSpeech 960 data, there is little difference in downstream-task performance. The results of the ablation study will be posted. The k-means models trained on <1% of the data, 10% (using only LibriSpeech100), and 100% of the data are uploaded to the HF repo:
https://huggingface.co/speechbrain/SSL_Quantization/tree/main

Fixes #<issue_number>

Before submitting

Did you read the contributor guideline?
Did you make sure your PR does only one thing, instead of bundling different changes together?
Did you make sure to update the documentation with your changes? (if necessary)
Did you write any new necessary tests? (not for typos and docs)
Did you verify new and existing tests pass locally with your changes?
Did you list all the breaking changes introduced by this pull request?
Does your code adhere to project-specific code style and conventions?

PR review

Reviewer checklist

Is this pull request ready for review? (if not, please submit in draft mode)
Check that all items from Before submitting are resolved
Make sure the title is self-explanatory and the description concisely explains the PR
Add labels and milestones (and optionally projects) to the PR so it can be classified
Confirm that the changes adhere to compatibility requirements (e.g., Python version, platform)
Review the self-review checklist to ensure the code is ready for review

poonehmousavi · 2024-10-11T11:13:37Z

mravanelli · 2024-10-11T12:57:27Z

@poonehmousavi, it looks like the docstring of the new function is not formatted in the same way as the others. Could you please fix it?

poonehmousavi · 2024-10-11T13:13:05Z

@mravanelli It seems all the checks have passed, so I am not sure what to fix.

mravanelli · 2024-10-11T14:29:57Z

My comment concerns the docstring format of the process_chunk.py function. It differs from the standard docstring format used in SpeechBrain.
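
For readers unfamiliar with the convention being requested, the sketch below shows the section layout that SpeechBrain docstrings generally follow (a summary line, then `Arguments` and `Returns` sections underlined with dashes). The signature and body of `process_chunk` here are hypothetical and chosen only to carry the docstring; they are not the code from this PR.

```python
def process_chunk(chunk, kmeans_model):
    """Updates a k-means model with the features of one chunk.

    Arguments
    ---------
    chunk : list
        Feature batches accumulated for this chunk (hypothetical input).
    kmeans_model : object
        Any model exposing a ``partial_fit`` method.

    Returns
    -------
    kmeans_model : object
        The same model after the incremental update.
    """
    # Hypothetical body: feed each batch of the chunk to the model in turn.
    for features in chunk:
        kmeans_model.partial_fit(features)
    return kmeans_model
```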

poonehmousavi · 2024-10-11T19:27:46Z

* data prep scripts update * iterate over utterances * without parallel map * parallel map -> so fast omfg * gigaspeech data prep done * speechcolab extra dep if one must download gigaspeech * create ASR CTC folder * base yaml + update data prep to better reflect potential different naming for csvs * update recipe * update recipe to be compliant with gigaspeech csv * add transformers dep * convert opus to wav * recipe --debug mode works. * typo GRABAGE_UTTERANCE_TAGS -> GARBAGE_UTTERANCE_TAGS * tmp DL file * update DL FILE * add DL file in ASR/CTC * update extra_requirements.txt * add support of savedir within Pretrained subclasses * add wbs requirements * webdataset * remove print * tmp files webdataset * verbosity + metada.json * letzo now label_encoder can actually train + the recipe seems to work. * remove wbs * DL info * HF DL support * remove webdataset as it sucks :p * name * ngram commands * whisper baseline * fix HF * pre-commit + sentencepiece char * remove csv * Add quirks.py, move global PyTorch config and GPU workarounds there * Add support for SB_DISABLE_QUIRKS environment variable * Fetch rework: make savedir optional * bunch of updates to make it run * no download script * fix precommit * fix precommit * readmes * readmes * readmes * readmes * doc update * CI god not happy, make CI god happy * why you here little encoder * adding a tranduscer streaming recipe, because why not * add test for transducer * works better when me not stupid * fix yaml * update req * add warning for cache dir * add warning for cache dir * enable multiprocessing * Minor cleanups to fetching * Change default behavior of inference to not create savedir if not specified * allow data prep without ddp * fix tests * smoll readme update * fix review comments * fixed concat_start_index check (speechbrain#2717) * Ensure adapted models save their parameters (speechbrain#2716) Co-authored-by: Parcollet Titouan <parcollet.titouan@gmail.com> * wtf * update doc * more documentation on 
storage * missing arg * a bit of logs * new schedulers * new schedulers * Fixes speechbrain#2656: Remove EOS from SoundChoice * fix my stupidity * Update non-HF code path for new preprocessing code in GigaSpeech * Fix CSV path for non-HF Gigaspeech * Fix formatting * Kmeans fix (speechbrain#2642) * fix kmeans bug * fix final batch * fix chuncksize * fix * fix * fix precommit * fix doxstrin inconsistency * fix precommit * fix doc string --------- Co-authored-by: Mirco Ravanelli <mirco.ravanelli@gmail.com> * add call on start of fit_batch fn * Update core.py Fix old commit * Update core.py * Fix preprocess_text example * Fix guess_source docstring with up-to-date info * Also remove default savedir from Pretrained * Fix function name for log_applied_quirks * wip audiomnist+gt * Revert "fix normalization for LFB" This reverts commit 3fd0330. * audiomnist classification setup * fix config * add missing file * update dataset load/training * remove unnecessary params * remove sort * remove unnecessary code * fix paths * fix loss computation * add missing flatten * print summary * Explain quirks in docs/experiment.md * ok stupid linter check that hates intentional leading spaces in markdown * add citing in README * add code to pad all wavs to the same length * fix pad call * fix error computation * fix error computation * Make `collect_in` optional for `Pretrainer`, disable it by default * Change more defaults to `savedir=None` and `fetch_strategy=SYMLINK` Since the SYMLINK strategy falls back to NO_LINK whenever `savedir is None`, it makes sense to switch more things to default to `savedir=None`. Should the `savedir` explicitly be set by the user, past behavior is preserved (defaulting to symlinks). * move flatten in audionet * Fix GS transducer test prediction decoding? 
* fix data prep logic and paths * Actually fix GS transducer test prediction decoding * Remove punctuation filtering that is handled elsewhere * HuggingFance * fix skip data prep logic * add original audionet feature extraction * fix pooling for audionet feature extraction * fix audionet shape + remove input norm * try data augmentation * add missing refs * - rework AudioNet to have optional pooling - use official AudioMNIST train/test/valid splits * fix typo in url * update audionet hparams * update audionet custom hparams * update audionet custom hparams * Updated warning for load_collected * Add results and notices for results for GigaSpeech transducer & wavlm * english hard * update audionet custom hparams * fix doc + pre-commit clean * fix code examples * fix consistency tests * fix pre commit * remove config * fix docstring for LFB * fix docstring for GammatoneConv1D --------- Co-authored-by: Adel Moumen <adelmoumen.pro@gmail.com> Co-authored-by: Adel Moumen <88119391+Adel-Moumen@users.noreply.github.com> Co-authored-by: asu <sdelang@sdelang.fr> Co-authored-by: TParcollet <parcollet.titouan@gmail.com> Co-authored-by: Peter Plantinga <plantinga.peter@proton.me> Co-authored-by: gianfranco <62777451+gfdb@users.noreply.github.com> Co-authored-by: Peter Plantinga <plantinga.peter@protonmail.com> Co-authored-by: Titouan Parcollet/Embedded AI /SRUK/Engineer/Samsung Electronics <t.parcollet@sruk-ccn4.eu.corp.samsungelectronics.net> Co-authored-by: flexthink <flexthink@users.noreply.github.com> Co-authored-by: Pooneh Mousavi <moosavi.pooneh@gmail.com> Co-authored-by: Mirco Ravanelli <mirco.ravanelli@gmail.com>

poonehmousavi added 5 commits August 8, 2024 18:56

fix kmeans bug

9e21ca0

fix final batch

bdd13f3

fix chuncksize

fcf82ed

fix

bae0cf6

fix

bc8bb39

poonehmousavi requested a review from mravanelli August 11, 2024 19:53

poonehmousavi and others added 2 commits August 11, 2024 16:00

Merge branch 'speechbrain:develop' into kmeans_fix

e1bdf2a

fix precommit

6f64bfa

poonehmousavi mentioned this pull request Aug 15, 2024

Kmeans .fit() should be changed to .partial_fit() #2602

Closed

asumagic linked an issue Aug 19, 2024 that may be closed by this pull request

Kmeans .fit() should be changed to .partial_fit() #2602

Closed

poonehmousavi self-assigned this Oct 11, 2024

Merge branch 'develop' into kmeans_fix

623bbc2

poonehmousavi and others added 3 commits October 11, 2024 14:34

Merge branch 'speechbrain:develop' into kmeans_fix

9f310a8

fix doxstrin inconsistency

3378739

fix precommit

0079fbe

fix doc string

cd830c6

mravanelli approved these changes Oct 15, 2024

mravanelli merged commit 410fe2f into speechbrain:develop Oct 15, 2024

Kmeans fix #2642

mravanelli merged 12 commits into speechbrain:develop from poonehmousavi:kmeans_fix
