Update private_README.md
Browse files- private_README.md +7 -8
private_README.md
CHANGED
|
@@ -516,21 +516,20 @@ Data collection methods vary across individual datasets. For example, the above
|
|
| 516 |
|
| 517 |
|
| 518 |
## References
|
| 519 |
-
[1] [Speaker Targeting via Self-Speaker Adaptation for Multi-talker ASR](https://arxiv.org/pdf/2506.22646)
|
| 520 |
|
| 521 |
-
[
|
| 522 |
|
| 523 |
-
[
|
| 524 |
|
| 525 |
-
[
|
| 526 |
|
| 527 |
-
[
|
| 528 |
|
| 529 |
-
[
|
| 530 |
|
| 531 |
-
[
|
| 532 |
|
| 533 |
-
[
|
| 534 |
|
| 535 |
## Licence
|
| 536 |
|
|
|
|
| 516 |
|
| 517 |
|
| 518 |
## References
|
|
|
|
| 519 |
|
| 520 |
+
[1] [Sortformer: Seamless Integration of Speaker Diarization and ASR by Bridging Timestamps and Tokens](https://arxiv.org/abs/2409.06656)
|
| 521 |
|
| 522 |
+
[2] [Streaming Sortformer: Speaker Cache-Based Online Speaker Diarization with Arrival-Time Ordering](https://arxiv.org/abs/2507.18446)
|
| 523 |
|
| 524 |
+
[3] [NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks](https://arxiv.org/abs/2408.13106)
|
| 525 |
|
| 526 |
+
[4] [Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition](https://arxiv.org/abs/2305.05084)
|
| 527 |
|
| 528 |
+
[5] [NVIDIA NeMo Framework](https://github.com/NVIDIA/NeMo)
|
| 529 |
|
| 530 |
+
[6] [NeMo speech data simulator](https://arxiv.org/abs/2310.12371)
|
| 531 |
|
| 532 |
+
[7] [Can We Really Repurpose Multi-Speaker ASR Corpus for Speaker Diarization?](https://arxiv.org/abs/2507.09226)
|
| 533 |
|
| 534 |
## Licence
|
| 535 |
|