pdf to page file type error using PDF files

#1
by Csplk - opened

Hi, very excited about the Dataset-Creation-Tools organization and the already super potent and simple to use spaces available as I have a rather deep dive into doing the technical design and planning and implementation soon of some real wide coverage worthy document processing project work tasks following initial processing pipeline research up front. So thanks first of all! :)

I was using the pdf to data set space yesterday on some pdfs of research relating to mathematical models of visual hallucinations in the visual cortex I am using to make computational GLSL webcam filter models from and it works great! https://huggingface.co/datasets/Csplk/VisualHallucinations

I went to try the pdf-to-page-images today and it seems to not think the PDFs are PDFs for some reason (Perhaps file type restrictions mixed with file picker support on this machines OS?)

Not a big deal as I usually just POSIX split my PDFs but thought I would make a note of the spaces issue

Screen Shot 2024-09-24 at 11.58.38 AM.png
Screen Shot 2024-09-24 at 12.08.08 PM.png
Screen Shot 2024-09-24 at 12.07.12 PM.png
I shall try it on mobile to verify if it does the same or works there to trouble shoot it and update when able to do so.

Sign up or log in to comment