1234567891011121314151617181920212223242526 |
- Metadata-Version: 2.0
- Name: pdfminer
- Version: 20140328
- Summary: PDF parser and analyzer
- Home-page: http://euske.github.io/pdfminer/index.html
- Author: Yusuke Shinyama
- Author-email: yusuke at cs dot nyu dot edu
- License: MIT/X
- Keywords: pdf parser,pdf converter,layout analysis,text mining
- Platform: UNKNOWN
- Classifier: Development Status :: 4 - Beta
- Classifier: Environment :: Console
- Classifier: Intended Audience :: Developers
- Classifier: Intended Audience :: Science/Research
- Classifier: License :: OSI Approved :: MIT License
- Classifier: Topic :: Text Processing
- PDFMiner is a tool for extracting information from PDF documents.
- Unlike other PDF-related tools, it focuses entirely on getting
- and analyzing text data. PDFMiner allows to obtain
- the exact location of texts in a page, as well as
- other information such as fonts or lines.
- It includes a PDF converter that can transform PDF files
- into other text formats (such as HTML). It has an extensible
- PDF parser that can be used for other purposes instead of text analysis.
|