{"id":24218,"date":"2024-09-24T15:18:16","date_gmt":"2024-09-24T13:18:16","guid":{"rendered":"https:\/\/www.itspy.cz\/thesis\/deleni-textu-do-logickych-celku\/"},"modified":"2024-09-24T15:18:16","modified_gmt":"2024-09-24T13:18:16","slug":"deleni-textu-do-logickych-celku","status":"publish","type":"thesis","link":"https:\/\/www.itspy.cz\/sk\/thesis\/deleni-textu-do-logickych-celku\/","title":{"rendered":"D\u011blen\u00ed textu do logick\u00fdch celk\u016f"},"content":{"rendered":"<p>Existuj\u00ed miliardy a miliardy nejr\u016fzn\u011bj\u0161\u00edch dokument\u016f, knih,<br \/>\nslovn\u00edk\u016f, novin, pr\u00e1vn\u00edch dokument\u016f a dal\u0161\u00edch, kter\u00e9 jsou<br \/>\nuchov\u00e1v\u00e1ny knihovnami a jin\u00fdmi institucemi. V dne\u0161n\u00ed dob\u011b<br \/>\nje kladen velk\u00fd d\u016fraz na digitalizaci dokument\u016f s c\u00edlem<br \/>\nuchovat d\u011bdictv\u00ed spole\u010dnosti, ale vzhledem k obrovsk\u00e9mu<br \/>\nmno\u017estv\u00ed dat je tento proces extr\u00e9mn\u011b pomal\u00fd. Pou\u017eit\u00ed<br \/>\nstrojov\u00e9ho u\u010den\u00ed m\u016f\u017ee v\u00fdrazn\u011b zv\u00fd\u0161it rychlost zpracov\u00e1n\u00ed,<br \/>\nco\u017e umo\u017en\u00ed rychlej\u0161\u00ed roz\u0161i\u0159ov\u00e1n\u00ed digit\u00e1ln\u00edch knihoven.<br \/>\nJedn\u00edm krokem v tomto procesu digitalizace, kter\u00fd je<br \/>\nhlavn\u00edm c\u00edlem tohoto projektu, je nalezen\u00ed logick\u00fdch celk\u016f<br \/>\nv dokumentech, jako jsou kapitoly knih, hesla ve slovn\u00edc\u00edch<br \/>\nnebo \u010dl\u00e1nky v novin\u00e1ch.<\/p>\n<p>V\u00fdzvou projektu je kombinace textov\u00e9ho obsahu dokument\u016f s<br \/>\njejich strukturou. Nalezen\u00ed celk\u016f prob\u00edh\u00e1 ve dvou kroc\u00edch:<br \/>\nnejprve se v naskenovan\u00e9m naleznou mal\u00e9 regiony textu s<br \/>\nvyu\u017eit\u00edm detektoru objekt\u016f YOLOv8, kter\u00e9 se n\u00e1sledn\u011b<br \/>\nspojuj\u00ed pomoc\u00ed grafov\u00e9 neuronov\u00e9 s\u00edt\u011b, co\u017e umo\u017en\u00ed nal\u00e9zt i<br \/>\nvizu\u00e1ln\u011b nenavazuj\u00edc\u00ed kusy textu a spojit je do jednoho<br \/>\nlogick\u00e9ho celku. Metoda dosahuje p\u0159esnosti 95.79 % u<br \/>\nslovn\u00edk\u016f a 90.23 % u novin.<\/p>\n","protected":false},"featured_media":23686,"template":"","meta":{"_acf_changed":false,"_links_to":"","_links_to_target":""},"university":[190,188],"thesis-year":[391,407],"class_list":["post-24218","thesis","type-thesis","status-publish","has-post-thumbnail","hentry","thesis-year-391","thesis-year-2024-sk"],"acf":{"autor":"Martin Kosteln\u00edk","portret":"","vedouci":"Ing. Karel Bene\u0161"},"_links":{"self":[{"href":"https:\/\/www.itspy.cz\/sk\/wp-json\/wp\/v2\/thesis\/24218","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.itspy.cz\/sk\/wp-json\/wp\/v2\/thesis"}],"about":[{"href":"https:\/\/www.itspy.cz\/sk\/wp-json\/wp\/v2\/types\/thesis"}],"version-history":[{"count":0,"href":"https:\/\/www.itspy.cz\/sk\/wp-json\/wp\/v2\/thesis\/24218\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.itspy.cz\/sk\/wp-json\/wp\/v2\/media\/23686"}],"wp:attachment":[{"href":"https:\/\/www.itspy.cz\/sk\/wp-json\/wp\/v2\/media?parent=24218"}],"wp:term":[{"taxonomy":"university","embeddable":true,"href":"https:\/\/www.itspy.cz\/sk\/wp-json\/wp\/v2\/university?post=24218"},{"taxonomy":"thesis-year","embeddable":true,"href":"https:\/\/www.itspy.cz\/sk\/wp-json\/wp\/v2\/thesis-year?post=24218"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}