One-shot Text-aligned Virtual Instrument Generation Utilizing Diffusion Transformer

Published in Audio Imagination: NeurIPS 2024 Workshop on AI-Driven Speech, Music, and Sound Generation, 2024

Qihui Yang, Jiahe Lei, Qiuqiang Kong