G
o
o
g
l
e
×
Please click
here
if you are not redirected within a few seconds.
All
Images
Videos
News
Maps
Shopping
Books
Search tools
Recent
Recent
Past hour
Past 24 hours
Past week
Past month
Past year
Archives
Sorted by relevance
Sorted by relevance
Sorted by date
Interactive Natural Language Grounding via Referring Expression Comprehension and Scene Graph Parsing
Frontiers
We address interactive natural language grounding without auxiliary information. Specifically, we first propose a referring expression comprehension network.
4 months ago
Salesforce AI Research Propose Programmatic VLM Evaluation (PROVE): A New Benchmarking Paradigm for Evaluating VLM Responses to Open-Ended Queries
MarkTechPost
Salesforce AI Research Propose Programmatic VLM Evaluation (PROVE): A New Benchmarking Paradigm for Evaluating VLM Responses to Open-Ended...
2 weeks ago
Researchers improve scene perception with innovative framework
Tech Xplore
Led by Prof. Liu Yong from the Hefei lnstitutes of Physical Science of the Chinese Academy of Sciences, researchers have proposed a novel...
5 months ago
Long-form video representation learning (Part 3: Long-form egocentric video representation learning)
Towards Data Science
The first two blogs in this series described how different architectural motifs ranging from graph neural networks to sparse transformers addressed the...
5 months ago
Scene Graph Technology: High-Impact Use Cases for Government
Nextgov/FCW
Scene graphs allow humans and machines to categorize and query images based on complex relationships among objects in a scene.
30 months ago
Scene Graph Generation and its Application in Robotics
Towards Data Science
Scene graph generation is the process of generating scene graphs and a scene graph contains the visual understanding of an image in the form...
13 months ago
This AI Paper from China Introduces SGTR and SGTR+: An End-to-End CNN-Transformer-Based Scene Graph Generating Framework and Its Extension
MarkTechPost
This paper proposes a novel approach to address these challenges, leveraging the compositional property of scene graphs.
9 months ago
Human-like scene interpretation by a guided counterstream processing
PNAS
Understanding a visual scene is an unsolved and daunting task, since scenes can contain a large number of objects, their properties,...
13 months ago
Recent Advances in Image Captioning, Image-Text Retrieval and Visual Question Answering using Scene Graph Parsing, What Next?
Microsoft
Creating appropriate representation of data is the key for many recent breakthroughs in both language and vision. In natural language, from the structured...
40 months ago
Universal Scene Description
Computer Graphics World
We present Universal Scene Description (USD), Pixar's open-source software for describing, composing, interchanging, and interacting with incredibly complex 3D...
47 months ago