An unique model for removing crowd sounds from music recordings (applause, clapping, whistling, noise, laugh etc.). Current metrics on our internal dataset for quality control:
| Algorithm name | Crowd dataset | |
| SDR crowd | SDR other | |
| Crowd model MDX23C (v1) | 5.57 | 18.79 |
| Crowd model MDX23C (v2) | 6.06 | 19.28 |
| MelBand Roformer | 6.07 | 19.29 |
| Ensemble (MelRoformer + MDX23C) | 6.27 | 19.49 |
| BS Roformer | 7.21 | 20.43 |