Have a personal or library account? Click to login
PicAxe: Extracting Figures from Structurally and Syntactically Heterogeneous Corpora of PDF Files Cover

PicAxe: Extracting Figures from Structurally and Syntactically Heterogeneous Corpora of PDF Files

Open Access
|Dec 2025

Authors

Anna C. Guerrero

acg@santafe.edu

Santa Fe Institute, Santa Fe, New Mexico

Krishna Kamath

kamathk@uchicago.edu

Master’s Program in Computer Science, University of Chicago, Chicago, Illinois

Qilin Zhou

qilin@uchicago.edu

Master’s Program in Computer Science, University of Chicago, Chicago, Illinois

Bruno Felalaga

brunofelalaga@uchicago.edu

Master’s Program in Computer Science, University of Chicago, Chicago, Illinois

Julia Damerow

jdamerow@asu.edu

Arizona State University

Aaron R. Dinner

dinner@uchicago.edu

Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois
DOI: https://doi.org/10.5334/jors.574 | Journal eISSN: 2049-9647
Language: English
Submitted on: Apr 28, 2025
Accepted on: Dec 1, 2025
Published on: Dec 16, 2025
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2025 Anna C. Guerrero, Krishna Kamath, Qilin Zhou, Bruno Felalaga, Julia Damerow, Aaron R. Dinner, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.