![]() ![]() A neat example of how to use the library interpreter for PDF pages, which you can use for many different things, like text extraction, images extraction and even…if you got nothing better to do with your time…rendering. PDF Text extraction CLI(C++): parse PDF file for the text it contains, writes the collected text to stdout.It’s insistance on performance makes it great for server side production, or batch production of large PDFs or for quicky parsing and extracting data from PDF files (say for your AI apps). Hummus is very flexible, and can provide for many different requirements around PDF files either in reading, writing or manipulating them. It should help you with common tasks where Hummus API might be a little more advanced.Īs chances are taken i’ll use the remainder of this post to review some repos on my account related to PDF that show what kinds of things you can do with Hummus Lib (or it’s more up to date counterparts). I’ll take this chance to also recommend an accompanying library that chunyenHuang wrote a long while ago which provides simplified API to MuhammaraJS (and HummusJS prior) called Hummus Recipe. The C++ version remains as is…as it manages to maintain itself…C++ don’t do deprecations so much, and you guys been forking it to your needs so we’re good there. The API, as it did for the past 11+ years, will not break, so there’s no immediate todo here for you peeps. There’s currently 18K downloads a week for HummusJS and 10K downloads for MuhammaraJS…So I hope those numbers will change in favor of MuhammaraJS with this encouragement.Īs for HummusJS itself, I’m keeping it up, for those of you who still need to transit, and for my own personal projects. Muhammara, for the very least, provides updates to binaries per NodeJs updates and sercurity corrections at a level which the HummusJS no longer provides, with me no longer supporting it. It’s a simple drop in replacement for Hummus that you Can and Should use instead of HummusJS. ![]() He has been providing updates since with a growing community on the projects repo. It’s been 3 years minus 6 days since I stopped providing support for the C++ and NodeJS versions of Hummus PDF library.īack then julianhille picked up providing support for HummusJS in his drop in replacement Muhammara. Might want some feedback to understand how to better support this use case. That is those that attempt to use system libraries for PDFHummus dependencies. I’m not sure how well it serves installations with USE_BUNDLED option off though. Tested there to make sure that the new PDFHummus install method works both with local install and grabbing and building from the repo. There’s an example in my pdf text extraction project which one can learn from here (which b.t.w now supports extracting tables!!1 check it out). Rather you can either use find_package(PDFHummus) if you installed PDFHummus already or use FetchContent to grab PDFHummus from its github repo and install it in your project (as well as try to find a local copy first), and then its just a matter of target_link_libraries ( PDFHummus::PDFWriter) to link your code to PDFHummus (it does headers includes too…so don’t worry about that). So, thoeretically speaking without knowing much about your current setup, there’s no longer need to do some custom ExternalProject or copy the library code to your project. I decided to sit down and learn some more cmake, which led to an attempt to make PDFHummus into a cmake package…which seems like it succeeded. ![]() Quick note about new stuff with PDFHummus C++ library. PDFHummus C++ Cmake package is now available ![]()
0 Comments
Leave a Reply. |