Alexander Sviridenkov

ANN: HTML Office Library 4.6 released


I'm glad to announce a new release of the HTML Office Library: a 100% native, cross-platform Delphi library for converting and displaying documents of the following types:

  • Rich Text Format (RTF)
  • MS Word 6-2007 binary format (DOC)
  • MS Word XML document (DOCX)
  • MS PowerPoint binary format (PPT)
  • MS PowerPoint XML format (PPTX)
  • MS Excel binary format (XLS)
  • MS Excel XML format (XLSX)
  • MS Excel XML binary format (XLSB)
  • Adobe PDF format (PDF)
  • Supercalc format (SXC)
  • EPUB (electronic books)
  • FB2 (electronic books)
  • Markdown
  • Outlook Message (MSG)
  • MIME message (.EML)
  • Outlook databases (.OST, .PST)
  • The Bat! database (.TBB)
  • RAR archives
  • ZIP archives

 

What's new

1. Improved conversion quality for all document types.
2. Significantly improved conversion speed and memory consumption for all document types.
3. Reduced size of converted documents for all document types.
4. Document-to-text conversion is now much faster (direct text extraction and use of a SAX parser for XML).
5. Added support for MIME mail messages .EML (including attachments).
6. Classes for reading Outlook databases (.PST, .OST).
7. Classes for reading RAR archives.
8. Classes for reading The Bat! message folders (.TBB).
9. New THtOfficeDocument.AsThumbnail function for creating a document thumbnail.
10. Added support for password-protected office files (OnPasswordRequest callback in conversion methods).
11. New THtOfficeDocument.ConvertStreamtoText for converting to text, with support for a callback function that receives the next portion of text during conversion.
12. New MaxLines and MaxPages properties to limit size of converted document.
13. Converted Word/RTF documents can have paged or web layout.
14. Direct PDF export for all platforms (no OS or third-party libraries used).
15. Fully functional text search engine (can index documents from any source: file, archive, database, etc., using an implementation of the IHtVirtualFolder interface).

 

Conversion speed measurements:

[Image: conversion speed chart]

 

There are two compiled demos available (source code of both applications is included):

 

1. Simple document viewer: lets you view any document on the hard drive, with a file tree on the left and an HtPanel on the right.
https://delphihtmlcomponents.com/FileBrowser.zip

 

2. Search Engine demo: creates a full-text search index for documents located in selected folders and lets you find any document from the application or the Web.
https://delphihtmlcomponents.com/SearchEngine.zip
This demo also shows the following features of THtSearchEngine:

 

Date map - number of found documents by year

[Image: Date map screenshot]

 

Type map - found documents by category

[Image: Type map screenshot]

 

Completion

[Image: Completion screenshot]

 

 

Test on a sample database:

 

[Image: sample database test results]

 

Note that search time depends only on the number of found/shown documents and does not depend on the total document count.
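
For readers wondering how a search can scale this way: one common approach is an inverted index, where each word maps directly to the documents that contain it, so a query only touches the lists for its own words instead of scanning every document. The sketch below is a toy illustration in Python, not THtSearchEngine's actual implementation (its internals are not described here); `TinyIndex` and its methods are hypothetical names.

```python
from collections import defaultdict


class TinyIndex:
    """Toy inverted index (hypothetical): word -> set of document ids."""

    def __init__(self):
        self.postings = defaultdict(set)

    def add(self, doc_id, text):
        # Index every whitespace-separated word, case-insensitively.
        for word in text.lower().split():
            self.postings[word].add(doc_id)

    def search(self, query):
        """Return sorted ids of documents containing every query word."""
        words = query.lower().split()
        if not words:
            return []
        # Only the posting lists of the query words are touched;
        # documents that match none of them are never visited.
        lists = sorted((self.postings.get(w, set()) for w in words), key=len)
        result = set(lists[0])
        for other in lists[1:]:
            result &= other
        return sorted(result)


index = TinyIndex()
index.add(1, "Quarterly report PDF")
index.add(2, "Annual report DOCX")
index.add(3, "Meeting notes")
print(index.search("annual report"))
```

In this sketch the work per query is proportional to the posting lists of the query words, so adding unrelated documents to the index does not slow the lookup down.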

 

https://delphihtmlcomponents.com/office.html

 

 
