Free Plan: 1 conversion per hour, 1 file at a time

Convert PDF to TXT

Convert PDF documents to TXT with ease

*Files are deleted after 24 hours

Convert files up to 1 GB for free. Pro users can convert files up to 100 GB.

How to convert PDF to TXT

Step 1: Upload your PDF files using the button above or by dragging and dropping them.

Step 2: Click the 'Convert' button to start the conversion.

Step 3: Download your converted TXT files.


PDF to TXT Conversion

How do I convert a PDF into an editable TXT document?
Upload the PDF and the converter extracts its text. It runs OCR (optical character recognition) when the PDF is a scan or an image, or reads the underlying text layer directly when the PDF contains real text. It then produces an editable TXT file that you can open and edit in Word, Google Docs, or LibreOffice.
Yes. If the PDF is a scanned image or an image-only PDF, the converter uses OCR to recognize the characters and produce matching text in the TXT. If the PDF has a digital text layer, the converter skips OCR and copies the text directly, which is faster and reproduces the embedded text exactly.
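The choice between copying the text layer and running OCR can be sketched as a per-page check. This is a minimal sketch, not the converter's actual logic: the names `needs_ocr` and `plan_pages` and the `min_chars` threshold are hypothetical, and it assumes each page's text layer has already been extracted to a string by some PDF library.

```python
from __future__ import annotations


def needs_ocr(page_text: str, min_chars: int = 20) -> bool:
    """Return True when a page's text layer is too sparse to trust.

    Hypothetical heuristic: a page whose extracted text layer has fewer
    than `min_chars` non-whitespace characters is treated as a scan.
    """
    visible = "".join(page_text.split())  # drop all whitespace
    return len(visible) < min_chars


def plan_pages(page_texts: list[str]) -> list[str]:
    """Label each page 'copy' (use the text layer) or 'ocr' (recognize the image)."""
    return ["ocr" if needs_ocr(text) else "copy" for text in page_texts]
```

A threshold on character count is only one possible signal. A real pipeline might also check whether the page contains a full-page image.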
On clean, high-resolution scans of printed text, OCR accuracy is typically 98-99% or higher. Accuracy drops sharply on low-DPI scans, skewed pages, handwriting, or unusual fonts. For best results, scan the PDF at 300 DPI or higher and keep the pages straight; the converter automatically deskews and de-noises each page before recognition.
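Whether an existing scan meets the 300 DPI recommendation can be checked from its pixel dimensions and the physical page size. The sketch below is illustrative only: the function names are hypothetical, and the example assumes a US Letter page that is 8.5 inches wide.

```python
def effective_dpi(pixel_width: int, page_width_inches: float) -> float:
    """Dots per inch implied by an image width and the physical page width."""
    if page_width_inches <= 0:
        raise ValueError("page width must be positive")
    return pixel_width / page_width_inches


def meets_ocr_target(pixel_width: int, page_width_inches: float,
                     target_dpi: float = 300.0) -> bool:
    """True when the scan reaches the recommended resolution."""
    return effective_dpi(pixel_width, page_width_inches) >= target_dpi


# A 2550-pixel-wide scan of an 8.5-inch page is 300 DPI.
# A 1275-pixel-wide scan of the same page is only 150 DPI.
print(effective_dpi(2550, 8.5), meets_ocr_target(1275, 8.5))
```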
The converter also reconstructs reading order, paragraphs, and headings in the TXT, and preserves simple column and table structure. Visually complex layouts (magazine spreads, complicated forms) are simplified into a clean, editable flow. The priority is accurate, editable text over pixel-perfect layout reproduction.
OCR supports more than 100 languages, including Latin, Cyrillic, Greek, Arabic, Hebrew, and CJK (Chinese/Japanese/Korean) scripts, and can detect the language of the PDF automatically. Mixed-language pages are handled too. The recognized text is saved to the TXT in the correct encoding, ready for editing.
Yes. A multi-page PDF (or a multi-page TIFF) is processed page by page and combined into a single continuous TXT document, with the pages in their original order. Page breaks from the PDF are kept as section breaks in the TXT, so the structure stays clear.
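Combining per-page text into one file while keeping page boundaries can be sketched as follows. The separator the converter actually writes is not stated here; this sketch assumes a form feed character, a common plain-text convention for page breaks, and the function names are hypothetical.

```python
PAGE_BREAK = "\f"  # assumption: form feed marks each page boundary


def join_pages(page_texts):
    """Merge per-page text into one document, pages in their original order."""
    return PAGE_BREAK.join(text.strip("\n") for text in page_texts)


def split_pages(document):
    """Recover the per-page text from a merged document."""
    return document.split(PAGE_BREAK)


merged = join_pages(["Page one\n", "Page two\n"])
print(len(split_pages(merged)))  # number of pages recovered
```

An explicit separator lets an editor or script find page boundaries again after the pages have been merged.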
The converter detects ruled tables in the PDF and rebuilds them as text tables in the TXT, converting them to delimited text where needed. Borderless or purely visually aligned tables are harder to detect and may come through as tab-delimited text. Review and fix tables in your editor after conversion.
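When a table arrives as tab-delimited text, it can be loaded into rows and cells for review with Python's standard `csv` module. This is a sketch under the assumption that each table row is one line and cells are separated by single tab characters; the function name `parse_tab_table` and the sample data are made up for illustration.

```python
import csv
import io


def parse_tab_table(text):
    """Parse tab-delimited text into a list of rows, each a list of cells."""
    reader = csv.reader(io.StringIO(text), delimiter="\t")
    # Skip blank lines and trim stray spaces around each cell.
    return [[cell.strip() for cell in row] for row in reader if row]


sample = "Item\tQty\tPrice\nPaper\t2\t4.50\n"
for row in parse_tab_table(sample):
    print(row)
```

Rows with a different number of cells than the header are a quick sign that a table needs manual repair.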
Text-layer extraction (no OCR needed) is nearly instant. OCR is slower: roughly 1-3 seconds per page, depending on resolution and language. A 50-page scanned PDF typically finishes in under two minutes. Premium runs OCR workers in parallel for large batches.
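The timing figures can be checked with simple arithmetic. At 1 to 3 seconds per page, 50 pages take 50 to 150 seconds on a single worker, so the "under two minutes" figure holds when the average is below about 2.4 seconds per page or when pages are processed in parallel. The sketch below is a rough estimate only: the function name is hypothetical, and the worker count is an assumption, since the number of parallel workers is not stated.

```python
import math


def estimate_ocr_seconds(pages, seconds_per_page, workers=1):
    """Rough OCR time, assuming pages are split evenly across workers."""
    if pages < 0 or seconds_per_page < 0 or workers < 1:
        raise ValueError("pages and seconds must be >= 0, workers >= 1")
    # The busiest worker handles ceil(pages / workers) pages.
    return math.ceil(pages / workers) * seconds_per_page


print(estimate_ocr_seconds(50, 1.0))             # best case, one worker
print(estimate_ocr_seconds(50, 3.0))             # worst case, one worker
print(estimate_ocr_seconds(50, 3.0, workers=4))  # worst case, four workers
```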
Yes. Uploaded PDF files and converted files are processed in isolated jobs and deleted within minutes. We do not read, store, or aggregate document content. See /privacy/ for the retention window.
OCR errors almost always trace back to source quality: low DPI, JPEG compression, dark or poorly printed text, or decorative fonts. Rescan the PDF at 300 DPI in greyscale, keep the pages flat and straight, then run the conversion again. Recognition accuracy improves substantially with a clean source.
Printed text is recognized reliably. Handwriting recognition is much more limited and works only on clean block handwriting, not cursive. For handwritten PDFs, plan to review the TXT carefully. Typeset or machine-printed text is where OCR performs best.
Yes. The whole point of converting a PDF to TXT is that the output is real text, not an image: you can search it, select and copy it, spell-check it, and edit it freely. That is the difference between this and viewing the PDF as an image.

PDF

PDF files preserve their formatting across devices and operating systems, which makes them well suited to sharing documents that need to look the same everywhere.

TXT

TXT files contain plain text only and can be read by any text editor.

