Dioptra is an open-source data curation and management platform for computer vision, natural language processing (NLP), and large language models (LLMs). It helps users curate valuable unlabeled data, register metadata, diagnose model failure modes, and integrate with labeling and retraining stacks.