Resuming

In my final weeks of the Google Summer of Code 2022, I spend my time polishing code and writing and documenting all parts of my code for the final Submission for final integration into the branch master.

I discussed with my mentor throughout these weeks that these tasks mainly focus on :

  • Add a license header on the top of each source file.
  • Drop extra space by using the bash script : https://invent.kde.org/graphics/digikam/-/blob/master/project/scripts/dropextraspaces.sh.
  • Use cppcheck (Cppcheck is an analysis tool for C/C++ code). It provides unique code analysis to detect bugs and focuses on seeing undefined behavior and danger).
  • Reduce the text size of each item in QcomboBox, add notes in context for translators to limit translation sizes, and use a tooltip to host a long string description for each item.
  • Limit digiKam and Qt headers to export the minimum dependencies outside digiKam.

To summarize, I would like to demo the functionalities of the Digikam OCR tool what I have done:

  1. The user can process OCR in multiple documented images by in a items list; if list is empty, a pop-up will appear.

  2. There are four options that users can choose from based on 4 Tesseract basic options.

  3. When the User clicks the button “Start OCR,” The batch process will begin, it finishes 100% in the progress bar, and all results details are displayed in the Text Editor by double click on the item.

  4. Double click on each item list allows users to review recognized text.

  5. With the support of the spell-checking engine, users can adjust the text and store it in separate text files or XMP metadata by clicking on the “Save” button.

  6. The text stored in XMP can translate to another language if stored in another place.

You can view the demo of the plugin here :

demo

Main commits

Improvement

As mentioned in week 1 and 2, the accuracy of OCR need to be enhanced, so a dialog concluding the pre-processing methods is necessary. This feature will be helpful in the future.