CVPR (13)
- [Paper Review with Code] ResCLIP: Residual Attention for Training-free Dense Vision-language Inference
- [Paper Review] Escaping Plato's Cave: Towards the Alignment of 3D and Text Latent Spaces
- [Paper Review] SplatFlow
- [Paper Review] UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
- [Paper Review] Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs
- [Paper Review] Ctrl-D
- [Paper Review] Flow Matching in Latent Space
- [Paper Review] Multi-Scale 3D Gaussian Splatting
- [Paper Review] REACT: Learning Customized Visual Models with Retrieval-Augmented Knowledge
- [Paper Review] PromptStyler
- [Paper Review] DreamBooth
- [Paper Review] Pix2pix
- [Paper Review] StarGAN