Show HN: Cloning a musical instrument from 16 seconds of audio
19 by abdljasser2 | 1 comments on Hacker News.
In 2020, Magenta released DDSP [1], a machine learning algorithm / python library which made it possible to generate good sounding instrument synthesizers from about 6-10 minutes of data. While working with DDSP for a project, we realised how it was actually quite hard to find 6-10 minute of clean recordings of monophonic instruments. In this project, we have combined the DDSP architecture with a domain adaptation technique from speech synthesis [2]. This domain adaptation technique works by pre-training our model on many different recordings from the Solos dataset [3] first and then fine-tuning parts of the model to the new recording. This allows us to produce decent sounding instrument synthesisers from as little as 16 seconds of target audio instead of 6-10 minutes. [1] https://ift.tt/cdPY8O9 [2] https://ift.tt/xUdJoz8 [3] https://ift.tt/aFO0Pvo We hope to publish a paper on the topic soon.
Subscribe to:
Post Comments (Atom)
A ban on food dye in West Virginia has forged an unlikely alliance
Article URL: https://www.theguardian.com/us-news/2025/mar/30/west-virginia-food-dye-ban Comments URL: https://news.ycombinator.com/item?id=...
-
Article URL: https://flox.dev/blog/simplified-service-management-with-flox Comments URL: https://news.ycombinator.com/item?id=41323550 Poi...
-
Article URL: https://www.discovermagazine.com/mind/how-fonts-affect-learning-and-memory Comments URL: https://news.ycombinator.com/item?id=...
No comments:
Post a Comment