Project title:
Tesseract Training
Posted by:
External project from PeoplePerHour
Started:
19-Mar-2025 10:12 GMT
Description:
Description: I have some text, which is single word on tiff file, designed to train eng_custom.traineddata. Currently I use syntax below which seem sane and does not produce any error before last step. Important: I don't want to change current approach as my goal to train each of 1000 tiff files with same parameters, since I prepared corresponding tessRead and boxes for each tiff. #Make lstmf file tesseract test_sample.tiff test_sample \ --tessdata-dir /home/j/img2/tess_files \ --psm 7 --oem 1 -l eng_custom \ /home/j/tesseract/tessdata/configs/lstm.train echo "test_sample.lstmf" single_lstmf_file.txt #Train LSTM model lstmtraining \ --model_output tess_training.lstm \ --continue_from /home/j/img2/tess_files/eng.lstm \ --traineddata /home/j/img2/tess_files/eng_custom.traineddata \ --train_listfile single_lstmf_file.txt \ --max_iterations 1 Stop training and finalize model lstmtraining --stop_training \ --continue_from tess_training.lstm_checkpoint \ --traineddata /home/j/img2/tess_files/eng_custom.traineddata \ --model_output /home/j/img2/tess_files/eng_final.lstm Update traineddata with new LSTM model mkdir -p /home/j/img2/base_model combine_tessdata -u /home/j/img2/tess_files/eng_custom.traineddata /home/j/img2/base_model/eng_custom cp /home/j/img2/tess_files/eng_final.lstm /home/j/img2/base_model/eng.lstm combine_tessdata /home/j/img2/base_model/eng_custom cp /home/j/img2/base_model/eng_custom.traineddata /home/j/img2/tess_files/eng_custom.traineddata But I get problem during final step: j@j:~/t$ tesseract test_sample.tiff stdout -l eng_custom --tessdata-dir /home/j/img2/tess_files/ index = 0:Error:Assert failed:in file /home/j/tesseract4/src/ccutil/strngs.cpp, line 266 Aborted (core dumped) Question: How to amend above commands so I can combine eng_final.lstm with eng_custom.traineddata Environment: /home/j/img2/tess_files/ eng.traineddata eng_custom.traineddata eng.lstm eng_final.lstm /home/j/img2/base_model/ eng_custom.bigram-dawg eng_custom.normproto eng_custom.word-dawg eng_custom.freq-dawg eng_custom.number-dawg eng.lstm eng_custom.inttemp eng_custom.pffmtable eng.lstm-number-dawg eng_custom.lstm eng_custom.punc-dawg eng.lstm-punc-dawg eng_custom.lstm-number-dawg eng_custom.shapetable eng.lstm-recoder eng_custom.lstm-punc-dawg eng_custom.traineddata eng.lstm-unicharset eng_custom.lstm-recoder eng_custom.unicharambigs eng.lstm-word-dawg eng_custom.lstm-unicharset eng_custom.unicharset eng.version eng_custom.lstm-word-dawg eng_custom.version Any guidance would be greatly appreciated. Thanks! Jacob
Project ID:
3426853
Project category:
Project budget:
Project
Started
Course Content Editor / Knowledge Organizer (Transcript Rewriting & Course Structuring)
Category : AI Content Editing, Article Rewriting, Article Writing, Content Writing, Editing, Educational Research, Ghostwriting, Organizational Change Management Budget : ₹750 - ₹1250 INR
18-Mar-2026 05:04 GMT
Commission Based Sales Partner — AI SaaS for Cafe Owners India
Category : Account Management, B2B Marketing, Business Development, Customer Retention Marketing, Customer Strategy, Lead Generation, Sales, Social Media Marketing Budget : ₹12500 - ₹37500 INR
18-Mar-2026 05:03 GMT
Minimalist Social Media Promo Graphics
Category : Adobe Illustrator, Photoshop, Canva, Figma, Graphic Design, Illustration, Logo Design Budget : ₹600 - ₹1500 INR
18-Mar-2026 05:03 GMT
Android Vending Kiosk App Build
Category : Android, Android App Development, API Development, Java, Mobile App Development, Software Architecture, User Interface / IA Budget : ₹12500 - ₹37500 INR
18-Mar-2026 05:02 GMT
Desarrollo de página Web + App para el CONEIC Huancayo 2027 -- 2
Category : App Development, Backend Development, Content Management System (CMS), CSS, HTML, JavaScript, Mobile App Development, Web Design, Web Development Budget : $1000 - $5500 USD
18-Mar-2026 05:02 GMT
n8n Workflow-Fehlerbehebung benötigt
Category : API Integration, Automation, Debugging, German Translator, HTML, N8n, PHP, Translation Budget : €8 - €30 EUR
18-Mar-2026 05:01 GMT
Residential Property Buyer Agent
Category : Market Research, Marketing, Property Development, Property Law, Property Management, Real Estate, Real Estate Management, Real Estate Tax, Research, Sales Budget : $10000 - $20000 AUD
18-Mar-2026 05:01 GMT
Freelance Sourcing Agent
Category : Automation, Compliance, Documentation, Logistics, Machinery, Manufacturing, Procurement, Risk Management, Supplier Sourcing Budget : $5000 - $10000 USD
18-Mar-2026 05:00 GMT
Eagle Mascot Costume
Category : Costume Design, Graphic Design, Textile Design Budget : $750 - $1500 AUD
18-Mar-2026 04:57 GMT
300 Nofollow Links for Jewellery Store
Category : Digital Marketing, Internet Marketing, Keyword Research, Link Building, SEO, Shopify, Website Optimization Budget : $30 - $250 AUD
18-Mar-2026 04:57 GMT
Classic 3BHK SketchUp Interior Model
Category : 3D Modelling, 3D Rendering, 3D Visualization, Architecture, Building Design, Home Design, Interior Design, Lumion, SketchUp, V Ray Budget : ₹1500 - ₹12500 INR
18-Mar-2026 04:57 GMT
Fix Play Store Redirect Error
Category : Android, API, App Development, Backend Development, HTML, JavaScript, MySQL, PHP Budget : $10 - $30 USD
18-Mar-2026 04:56 GMT
Zoho E-Commerce Site Setup
Category : ECommerce, HTML, Payment Gateway Integration, Payment Processing, PHP, Web Design, Web Development, Zoho Budget : ₹12500 - ₹37500 INR
18-Mar-2026 04:56 GMT
Angular Engineer - CI/CD Pipelines & LLM Agent Systems
Category : Angular, AngularJS, Backend Development, CI / CD, JavaScript, LangChain, Typescript Budget : $2 - $8 USD
18-Mar-2026 04:53 GMT
Cross-Platform Currency Detection App
Category : Android, Convolutional Neural Network, Deep Learning, Flutter, Image Recognition, Machine Learning (ML), Mobile App Development, Python Budget : $3 - $10 NZD
18-Mar-2026 04:51 GMT
Browse All Projects