1

Aligning Large Multimodal Models with Factually Augmented RLHF
Learn large multimodal models with RLHF augmented by factual information to reduce hallucination.
SOHES: Self-supervised Open-world Hierarchical Entity Segmentation
Learn to segment visual entities and their parts in an open-world with pure self-supervision.