Scene graph generation (SGG) analyzes images to extract meaningful information about objects and their relationships. In the dynamic visual world, it is crucial for AI systems to continuously detect new objects and establish their relationships with existing ones.
Nov 6, 2024
Nov 23, 2024 · To fill this gap, we introduce Scene-Bench, a comprehensive benchmark designed to evaluate and enhance the factual consistency in generating natural scenes.
Nov 5, 2024 · A multiview scene graph refers to our proposed task, where a place-object graph is inferred by associating unposed images across views, predicting their ...
7 days ago · In this paper, we leverage the inherent connection between 3D scene graphs and natural language, proposing a 3D scene graph-guided vision-language pre-training ...
2 days ago · Scene graph generation aims to provide a structured and comprehensive representation of the semantic content within an image by identifying objects and their ...
Nov 5, 2024 · [NeurIPS2024] Multiview Scene Graph (topologically representing a scene from unposed images by interconnected place and object nodes) - ai4ce/MSG.
Nov 21, 2024 · Utilizing an embedding of this scene graph enables our model to more explicitly reason over objects and their relations during story generation, compared to the ...
15 hours ago · A scene graph captures detailed scene semantics by explicitly modeling objects, their attributes, and the relationships between paired objects (e.g., “blue ...