Computer Vision with Open Vocabulary
Published:
Introduction
- What do we mean by open-world vision?
- Referring expression segmentation
- Referring expression matting
- Open-vocabulary panoptic segmentation
- How are these systems trained?
- What are the popular models and their current limitations?
- What are the popular benchmarks?
- How do we accurately measure the performance of such systems?
- What are the novel applications enabled by these systems?
References
- https://www.cs.cmu.edu/~shuk/open-world-vision.html
- https://github.com/nvlabs/odise
- https://github.com/JizhiziLi/RIM
- https://github.com/isl-org/lang-seg
- https://github.com/ngthanhtin/owlvit_segment_anything