### Abstract

We present an approach for learning stochastic geometric models of object categories from single-view images. We focus here on models expressible as a spatially contiguous assemblage of blocks. Model topologies are learned across groups of images, and one or more such topologies are linked to an object category (e.g., chairs). Fitting learned topologies to an image can be used to identify the object class, as well as to detail its geometry. The latter goes beyond labeling objects, as it provides the geometric structure of particular instances. We learn the models using joint statistical inference over category parameters, camera parameters, and instance parameters. These produce an image likelihood through a statistical imaging model. We use trans-dimensional sampling to explore topology hypotheses, and alternate between Metropolis-Hastings and stochastic dynamics to explore instance parameters. Experiments on images of furniture objects such as tables and chairs suggest that this is an effective approach for learning models that encode simple representations of category geometry and the statistics thereof, and support inferring both category and geometry on held-out single-view images.
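The alternation between Metropolis-Hastings moves and gradient-driven moves mentioned in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the target here is a standard 2-D Gaussian standing in for the image likelihood, the "stochastic dynamics" move is replaced by a Metropolis-adjusted Langevin step as a simple stand-in, and the trans-dimensional topology moves are omitted entirely. All function names and step sizes are hypothetical.

```python
import math
import random


def log_target(x):
    """Toy log-density (standard 2-D Gaussian); an assumed stand-in for the image log-likelihood."""
    return -0.5 * sum(v * v for v in x)


def grad_log_target(x):
    """Gradient of the toy log-density."""
    return [-v for v in x]


def _accept(log_ratio, rng):
    """Metropolis acceptance test; 1 - random() lies in (0, 1], so the log is defined."""
    return math.log(1.0 - rng.random()) < log_ratio


def mh_step(x, rng, scale=0.5):
    """Random-walk Metropolis-Hastings step with a symmetric Gaussian proposal."""
    proposal = [v + rng.gauss(0.0, scale) for v in x]
    return proposal if _accept(log_target(proposal) - log_target(x), rng) else x


def _log_q(to, frm, eps):
    """Log-density (up to a constant) of the Langevin proposal frm -> to."""
    mean = [f + 0.5 * eps * eps * g for f, g in zip(frm, grad_log_target(frm))]
    return -sum((t - m) ** 2 for t, m in zip(to, mean)) / (2.0 * eps * eps)


def langevin_step(x, rng, eps=0.3):
    """Metropolis-adjusted Langevin step: drift along the gradient, add noise, then accept/reject."""
    grad = grad_log_target(x)
    proposal = [v + 0.5 * eps * eps * g + eps * rng.gauss(0.0, 1.0)
                for v, g in zip(x, grad)]
    log_ratio = (log_target(proposal) - log_target(x)
                 + _log_q(x, proposal, eps) - _log_q(proposal, x, eps))
    return proposal if _accept(log_ratio, rng) else x


def sample(n_iters=5000, seed=0):
    """Alternate the two move types and return the visited states."""
    rng = random.Random(seed)
    x = [3.0, -3.0]  # deliberately poor starting point
    chain = []
    for i in range(n_iters):
        x = mh_step(x, rng) if i % 2 == 0 else langevin_step(x, rng)
        chain.append(list(x))
    return chain
```

Calling `sample()` returns a list of 2-D states. After discarding an initial burn-in portion, their empirical mean should drift toward the origin for this toy target. In the setting the abstract describes, the state would instead hold instance parameters for a fixed topology, with a separate trans-dimensional move proposing changes to the topology itself.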

Original language | English (US) |
---|---|
Title of host publication | Advances in Neural Information Processing Systems 22 - Proceedings of the 2009 Conference |
Pages | 1615-1623 |
Number of pages | 9 |
State | Published - Dec 1 2009 |
Event | 23rd Annual Conference on Neural Information Processing Systems, NIPS 2009 - Vancouver, BC, Canada; Duration: Dec 7 2009 → Dec 10 2009 |

### Publication series

Name | Advances in Neural Information Processing Systems 22 - Proceedings of the 2009 Conference |
---|---|

### Other

Other | 23rd Annual Conference on Neural Information Processing Systems, NIPS 2009 |
---|---|
Country | Canada |
City | Vancouver, BC |
Period | 12/7/09 → 12/10/09 |

### ASJC Scopus subject areas

- Information Systems

## Cite this

Learning models of object structure. In *Advances in Neural Information Processing Systems 22 - Proceedings of the 2009 Conference* (pp. 1615-1623). (Advances in Neural Information Processing Systems 22 - Proceedings of the 2009 Conference).