The VIS30K dataset is a collection of 29,689 images that represents 30 years of figures and tables from each track of the IEEE Visualization conference series (Vis, SciVis, InfoVis, VAST). VIS30K's comprehensive coverage of the scientific literature in visualization not only reflects the progress of the field but also enables researchers to study the evolution of the state-of-the-art and to find relevant work based on graphical content. The paper describes the dataset and its semi-automatic collection process, which couples convolutional neural networks (CNN) with curation.
Link to the paper: https://arxiv.org/abs/2101.01036
Link to the data repository:
https://ieee-dataport.org/open-access/ieee-vis-figures-and-tables-image-dataset