Commit Graph

21 Commits

Author SHA1 Message Date
9d712c921a feat: add PDF document recognition with 10-page pre-hook
- Migrate recognition_results table to JSON schema (meta_data + content),
  replacing flat latex/markdown/mathml/mml columns
- Add TaskTypePDF constant and update all formula read/write paths
- Add PDFRecognitionService using pdftoppm (Poppler) for CGO-free page
  rendering; limits processing to first 10 pages (pre-hook)
- Reuse existing downstream OCR endpoint (cloud.texpixel.com) for each
  page image; stores results as [{page_number, markdown}] JSON array
- Add Redis queue + distributed lock for PDF worker goroutine
- Add REST endpoints: POST /v1/pdf/recognition, GET /v1/pdf/recognition/:task_no
- Add .pdf to OSS upload file type whitelist
- Add migrations/pdf_recognition.sql for safe data migration

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-31 14:17:44 +08:00
liuyuanchuang
81c2767423 feat: add mml from backend 2026-02-05 10:44:11 +08:00
liuyuanchuang
a5f1ad153e refactor: update package path 2026-01-27 21:56:21 +08:00
ece026bea2 feat: add new path for recognize 2026-01-25 09:10:54 +08:00
3293f1f8a5 fix: downgrade error 2025-12-27 22:21:34 +08:00
ff6795b469 feat: convert markdown to mml 2025-12-27 22:06:48 +08:00
bb7403f700 feat: add baidu api 2025-12-25 14:02:29 +08:00
3a86f811d0 feat: add log for time 2025-12-23 22:32:29 +08:00
770c334083 fix: update app key 2025-12-20 22:48:02 +08:00
08d5e37d0e fix: udpate app_id 2025-12-20 22:15:56 +08:00
203c2b64c0 feat: add mathpixel 2025-12-20 21:42:58 +08:00
be00a91637 feat: check login for list 2025-12-19 13:59:47 +08:00
4bb59ecf7e feat: update vlm url 2025-12-19 09:55:26 +08:00
8a6da5b627 feat: add list api 2025-12-18 12:39:50 +08:00
f0449bab25 feat: add user register 2025-12-17 20:43:08 +08:00
9ceb5fe92a feat: update url 2025-12-15 23:32:07 +08:00
liuyuanchuang
50922641a9 feat: update ocr model 2025-12-11 19:51:51 +08:00
liuyuanchuang
696919611c feat: use siliconflow model 2025-12-11 19:39:35 +08:00
0bc77f61e2 feat: update dockerfile 2025-12-10 23:17:24 +08:00
083142491f refact: update oss config 2025-12-10 22:23:05 +08:00
liuyuanchuang
48e63894eb init repo 2025-12-10 18:33:37 +08:00