You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
(on draft level, could just be by generalizing from 4=22 location tokens (interpreted as rectangle) to N2 tokens for any arbitrary polygon with N vertices)
converge between <content xml:space="preserve"> and {<content>+ other way of signalizing whitespace preservation e.g. via ATTLIST, XML schema, convention etc.}
❗ High prio ❗
training-relevant
update draft
content,base64, layer classes, picture classes etc)group,br)update schema (XSD + Schematron)
🔸 Medium prio 🔸
consider serialization extensions
consider standard extensions
update deserialization
🔹 Low prio 🔹
DocLang (embedded) archive
page breaks
check formatting:hyperlink (feat(DocLang): add hyperlink support #578)
list languages in draft (as subset of Linguist vX.Y.Z)
codewith unset language orShell?)cross-provenance content
cross-page content
misc
include_formattingserializer switch✅ Done ✅
address key value regions (consolidate
FormItem/KeyValueItem(KEY_VALUE_REGION) (feat: introduce field data model incl. Doclang serialization #519)fix table cell content deserialization (fix(DocLang): fix table cell
<content>deserialization #512)fix document re-indexing (fix: fix document re-indexing #510)
fix DocTags picture meta deserialization (fix(DocTags): fix deserialization to populate picture meta fields #505)
review checkboxes (fix(Doclang): fix checkbox serialization #503)
review chart serialization (compared to table serialization) (test(Doclang): add chart serialization test #498)
align handling of default resolution (location) (fix(IDocTags): fix default location resolution handling #492)
rich tables / nested tables (feat(IDocTags): add rich table support #491)
leading / trailing whitespaces (feat(IDocTags): add content wrapping for handling whitespace #489)
finalize outmost element name (chore: rename IDocTags to Doclang #494)
add radio button support(update: not for now)XML Schema / DTD?
converge between
<content xml:space="preserve">and {<content>+ other way of signalizing whitespace preservation e.g. viaATTLIST, XML schema, convention etc.}line breaks (feat(Doclang): add newline handling #575)
content layer (e.g. furniture) not captured in DocLang (feat(Doclang): add content layer support #568)
map HANDWRITTEN label to new DocLang element (similar to bold, italics) (feat: Add HANDWRITTEN_TEXT label support #561)
inline code issue: conflict between
<inline class="code">and<code class="Python">(chore(Doclang): removeinlineelement #517)use defusedxml? (fix: switch xml parsing #509)
align image ref mode on DocLang serialization (fix(Doclang): align image mode, defaulting to placeholder #506)
image URI serialization (fix(Doclang): fix image URI serialization #504)
track CVAT test output (fix: populate picture meta, track Doclang output docling-cvat-tools#8)