Real World Understanding
❏ A Study on Extraction of Motion Inflection Points Focusing on Objects in an Image
Overview
In recent years, the importance of computationally capturing intuitive physics, the innate human ability to understand physical phenomena, has been widely recognized, and research on making computers understand the real world has been vigorously pursued. There are various approaches to understanding the physical world: for example, methods based on object recognition through visual inference from image information, and approaches that understand real-world events by taking image features extracted from video as input. In contrast, we propose a method for extracting motion inflection points in the real world, as represented in the latent hierarchical structure of the physical relationships among recognized objects. Concretely, we modified the Variational Temporal Abstraction (VTA) model so that it can extract inflection points from a given graph structure that represents the physical relationships among objects through their latent system. We evaluated whether our method correctly detects motion inflection points using a modified CLEVRER dataset [19] and confirmed that it does so with high accuracy.
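To give a concrete sense of what a "motion inflection point" is, the following minimal sketch flags frames where an object's velocity changes abruptly. It is only an illustration under stated assumptions: it uses a simple threshold on the change in velocity rather than the modified VTA model described above, and the function names, the `threshold` parameter, and the toy tracks are all hypothetical.

```python
import math


def velocity(track):
    """Frame-to-frame displacement vectors for one object's (x, y) track."""
    return [(x1 - x0, y1 - y0) for (x0, y0), (x1, y1) in zip(track, track[1:])]


def inflection_frames(track, threshold=0.5):
    """Frame indices where the velocity vector changes by more than `threshold`.

    Hypothetical heuristic for illustration only; this is NOT the
    VTA-based method from the overview.
    """
    v = velocity(track)
    frames = []
    for t in range(1, len(v)):
        # v[t - 1] is the motion into frame t, v[t] is the motion out of it.
        dvx = v[t][0] - v[t - 1][0]
        dvy = v[t][1] - v[t - 1][1]
        if math.hypot(dvx, dvy) > threshold:
            frames.append(t)
    return frames


def scene_inflection_frames(tracks, threshold=0.5):
    """Sorted union of inflection frames over all objects in a scene."""
    frames = set()
    for track in tracks.values():
        frames.update(inflection_frames(track, threshold))
    return sorted(frames)


# Toy scene (assumed data): a ball moves right, then reverses at frame 3,
# while a cube stays at rest.
tracks = {
    "ball": [(0, 0), (1, 0), (2, 0), (3, 0), (2, 0), (1, 0)],
    "cube": [(5, 0)] * 6,
}
print(scene_inflection_frames(tracks))  # [3]
```

In this toy scene the only detected inflection is the frame where the ball reverses direction. A learned model such as the one described above would instead infer these boundaries from latent representations of the object relationships, not from a hand-set threshold.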

Eri Kuroda
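Eri Kuroda, Ichiro Kobayashi, "A Study on Extraction of Motion Inflection Points Focusing on Objects in an Image," The 36th Annual Conference of the Japanese Society for Artificial Intelligence, Kyoto International Conference Center, Kyoto, June 2022. (in Japanese)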