Rishi
Rishi
Home
Experience
Events
Projects
Posts
Accomplishments
Contact
Light
Dark
Automatic
Computer Vision
DAAMI2I
Extension of DAAM for Image Self-Attention in Diffusion Models! Exploiting Zero-Shot image segmentation capability of Stable Diffusion using a novel Attention Diffusion layer
Code
Slides
PHASE-based Implicit Neural Representation
Unofficial implementation to the paper - “Phase Transitions, Distance Functions, and Implicit Neural Representations” for 3D surface reconstruction based on point clouds
PDF
Code
TITAN
Large-Scale Visual Object Discovery through Diffusion Attentive Attribution Text2Image Heatmaps using Stable Diffusion. Python package - Generate OD Synthetic Data in 30 lines of code!
Code
Visuo-Textual Joint Embedding
Contextual Information-rich joint embedding for image and text in a multi-modal vector space using object-text collocation and Relative Position-based Transformer
Code
Scanned Document Classification
Scanned Document Representation Learning using Image-Text-Loc Fusion CNN-Transformer and leveraging it for clustering and classification into 16 document categories
PDF
Code
Cite
×