Julien HAURET☆, Thomas JOUBAUD†, Véronique ZIMPFER†, Éric BAVU☆
☆ : LMSSC, Conservatoire national des arts et métiers, Paris, France, HESAM Université
† : Department of Acoustics and Soldier Protection, French-German Research Institute of Saint-Louis (ISL)
[Media placeholder: Reference and In-ear examples]

Speech | PESQ | SI-SDR (dB) | STOI |
---|---|---|---|
Simulated In-ear | 2.42 (0.34) | 8.4 (3.7) | 0.83 (0.05) |
Audio U-net | 2.24 (0.49) | 11.9 (3.7) | 0.87 (0.04) |
Hifi-GAN v3 | 1.32 (0.16) | -25.1 (11.4) | 0.78 (0.04) |
Seanet | 1.92 (0.48) | 11.1 (3.0) | 0.89 (0.04) |
Streaming Seanet | 2.01 (0.46) | 11.2 (3.6) | 0.89 (0.04) |
EBEN (ours) | 2.08 (0.45) | 10.9 (3.3) | 0.89 (0.04) |
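
For readers unfamiliar with the SI-SDR column, the following is a minimal sketch of the standard scale-invariant signal-to-distortion ratio, in dB. It is not the evaluation code behind the table: the function name `si_sdr`, the mean removal step, and the `eps` stabiliser are assumptions made for illustration.

```python
import numpy as np


def si_sdr(estimate, reference, eps=1e-8):
    """Scale-invariant SDR in dB between two 1-D signals of equal length.

    Hypothetical helper: the zero-mean step and eps are illustrative choices.
    """
    estimate = np.asarray(estimate, dtype=np.float64)
    reference = np.asarray(reference, dtype=np.float64)

    # Remove the DC component of both signals (a common convention).
    estimate = estimate - estimate.mean()
    reference = reference - reference.mean()

    # Project the estimate onto the reference to get the optimally scaled target.
    alpha = np.dot(estimate, reference) / (np.dot(reference, reference) + eps)
    target = alpha * reference

    # Everything not explained by the scaled reference counts as distortion.
    distortion = estimate - target

    ratio = (np.dot(target, target) + eps) / (np.dot(distortion, distortion) + eps)
    return 10.0 * np.log10(ratio)


if __name__ == "__main__":
    ref = np.array([1.0, -1.0, 1.0, -1.0])
    noise = np.array([1.0, 1.0, -1.0, -1.0])  # orthogonal to ref
    print(si_sdr(ref + 0.1 * noise, ref))
```

Because the estimate is projected onto the reference before the ratio is taken, rescaling the estimate leaves the score unchanged. A strongly negative value, such as the one reported for Hifi-GAN v3, means that the distortion energy far exceeds the energy of the projected target.
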

Speech | \(P_{gen}\) | \(P_{dis}\) | \(\tau\) (ms) | \(\delta\) (MB) |
---|---|---|---|---|
Audio U-net | 71.0 M | \(\emptyset\) | 37.5 | 1117.3 |
Hifi-GAN v3 | 1.5 M | 70.7 M | 3.1 | 22.2 |
Seanet | 8.3 M | 56.6 M | 13.1 | 89.2 |
Streaming Seanet | 0.7 M | 56.6 M | 7.5 | 10.9 |
EBEN (ours) | 1.9 M | 27.8 M | 4.3 | 20 |