Diffusion Transformer for Music Generation
This post surveys work that uses DiT for music generation.
Notes on graph neural networks.
RePaint repeatedly replaces the non-masked region of the latent \(x_t\) with the known content.
Dense along the time axis; compressed into tokens along the pitch/instrument axis.
This post discusses several methods for conditional guidance during the DDPM denoising process.
This note covers Lectures 7, 8, and 9 of the MIT 6.007 Signals and Systems course.
This note covers Lectures 4, 5, and 6 of the MIT 6.007 Signals and Systems course.
This note covers Lectures 1, 2, and 3 of the MIT 6.007 Signals and Systems course.
In this work, we present a Python and C++ implementation that replicates the numerical optimization algorithm for designing lattices with minimal normalized second moments (NSM). The algorithm uses...
In this work, I present a facial style transfer project based on DualStyleGAN. Given a portrait, what would it look like in a Buddha style? I adopted DualStyleGAN to answer the question of: human(co...