Extract/Transform Document Structure API

Extracted JSON from result odt

Command for transform:

curl -v -k -F "data=@contentControlsExampleOriginal.odt" -F "transform=$(cat contentControlsTransform.JSON)" https://localhost:9980/cool/transform-document-structure > contentControlsResult.odt

Command for extract:

curl -k -F "data=@contentControlsExampleOriginal.odt" -F "filter=contentcontrol" https://localhost:9980/cool/extract-document-structure > contentControlsOriginalExtract.JSON
curl -k -F "data=@contentControlsResult.odt" -F "filter=contentcontrol" https://localhost:9980/cool/extract-document-structure > contentControlsResultExtract.JSON

Extracted JSON from result odt, Pretty printed

Extracted JSON from original odt, Pretty printed

Charts

Extract

Use filter=charts to extract the charts.

Example output: (it is pretty printed here):

{
    "DocStructure": {
        "Charts.ByEmbedIndex.0": {
            "name": "Object1",
            "title": "Paid leave days",
            "subtitle": "Subtitle2",
            "RowDescriptions": [ "James", "Mary", "Patricia", "David"],
            "ColumnDescriptions": [ "2022", "2023"],
            "DataValues": [
                [ "22", "24"],
                [ "18", "16"],
                [ "32", "32"],
                [ "25", "23"]
            ]
        }
    }
}

Data it extracts:

Property	Description
`name`	Name of the embedded object of the chart, can be used as a filter in transform to select the needed chart.
`title`	The title of the chart, as a simple string.
`subtitle`	The Subtitle of the chart, as a simple string.
`RowDescriptions`	Array of strings, containing the descriptions of the rows.
`ColumnDescriptions`	Array of strings, containing the descriptions of the columns.
`DataValues`	Matrix of numbers, containing every cells data.

Note

Some of the data values can be “NaN”, this means they are not set.

Transform

Example Transform:

{
    "Transforms": [
        { "Charts.ByEmbedIndex.0": [
            {"modifyrow.1": [ 19, 15 ]},
            {"datayx.3.1": 37},
            {"deleterow.0": ""},
            {"insertrow.0": [ 15, 17 ]},
            {"setrowdesc.0": "Paul"},
            {"insertcolumn.1": [ 1,2,3,4,5,6 ]},
            {"setcolumndesc.0": "c0"},
            {"deletecolumn.3": ""}
        ]},
        { "Charts.ByEmbedName.Object3": [
            {"resize": [ 3, 3 ]},
            {"setrowdesc": [ "a", "b", "c"]}
        ]},
        { "Charts.ByTitle.Fixed issues": [
            {"data": [ [ 3,1 ],
                       [ 2,0,1 ],
                       [ 3 ] ]},
            {"setrowdesc": ["2023.01",".02",".03"]},
            {"setcolumndesc": ["Jennifer", "Charles", "Thomas"]}
        ]}
    ]
}

To select which chart you want to transform, you can use these selectors:

Selector	Description
`ByEmbedIndex`.<num>	Index of the embedded object counted from 0. It is always unique, but the index may reference embed other than charts.
`ByEmbedName`.<string>	The unique name of the embedded object of the chart.
`ByTitle`.<string>	Title of the chart. (Title is optional)
`BySubTitle`.<string>	Subtitle of the chart. (Subtitle is optional)

Note

While the values of the chart transformations can be represented by an object, it is recommended to use the new form as an array.

To transform a chart you can use these commands:

Command	Value	Description
`deletecolumn`.<num>	NONE	Delete the <num> column
`deleterow`.<num>	NONE	Delete the <num> row
`modifycolumn`.<num>	[<num>,]	Set the column <num> data to the values
`modifyrow`.<num>	[<num>,]	Set the row <num> data to the values
`insertcolumn`.<num>	NONE	Insert an empty column before column <num>
`insertcolumn`.<num>	[<num>,]	Insert a column before column <num>, with values
`insertrow`.<num>	NONE	Insert an empty row before row <num>
`insertrow` .<num>	[<num>,]	Insert a row before row <number>, with values
`setcolumndesc`.<num>	<text>	Set column <num> description to <text>
`setcolumndesc`	[<text>,]	Set the column description to the values from the first.
`setrowdesc`.<num>	<text>	Set the row <num> description to <text>
`setrowdesc`	[<text>,]	Set the row description to the values from the first.
`resize`	[<num>,<num>]	Resize data table <num> row and <num> column. Both numbers are required, and must be greater then 1.
`datayx`.<num>.<num>	<num>	Set the value of the cell row <num> and column <num> to the specified value.
`data`	[[<num>,],]	Set values of the data table to the values. The table size will grow as needed.

Note

Commands that needs an array of values can be used with less values than the destination array. In that case it will only change the provided elements and leave the remaining one untouched.

Screenshot

Example Files

Extracted JSON Pretty printed

Command for transform:

curl -v -k -F "data=@docStructureChartExampleOriginal.odt" -F "transform=$(cat ChartsTransform.JSON)" https://localhost:9980/cool/transform-document-structure > docStructureChartResult.odt

Command for extract:

curl -k -F "data=@docStructureChartExampleOriginal.odt" -F "filter=charts" https://localhost:9980/cool/extract-document-structure > ChartsExtractOriginal.JSON

Extracted JSON

Document Properties

You can extract and modify document properties. These properties include meta data and statistics. You can add arbitrary named meta data properties. Most statistics are recalculated when the document is opened and can’t be modified.

Extract

Use filter=docprops to extract the document properties.

Example output: (it is pretty printed here):

{
    "DocStructure": {
        "DocumentProperties": {
            "Author": "Author TxT",
            "Generator": "Generator TxT",
            "CreationDate": "2024-01-21T14:45:00",
            "Title": "Title TxT",
            "Subject": "Subject TxT",
            "Description": "Description TxT",
            "Keywords": [ ],
            "Language": "en-GB",
            "ModifiedBy": "ModifiedBy TxT",
            "ModificationDate": "2024-05-23T10:05:50.159530766",
            "PrintedBy": "PrintedBy TxT",
            "PrintDate": "0000-00-00T00:00:00",
            "TemplateName": "TemplateName TxT",
            "TemplateURL": "TemplateURL TxT",
            "TemplateDate": "0000-00-00T00:00:00",
            "AutoloadURL": "",
            "AutoloadSecs": 0,
            "DefaultTarget": "DefaultTarget TxT",
            "DocumentStatistics": {
                "PageCount": 300,
                "TableCount": 60,
                "ImageCount": 10,
                "ObjectCount": 0,
                "ParagraphCount": 2880,
                "WordCount": 78680,
                "CharacterCount": 485920,
                "NonWhitespaceCharacterCount": 411520
            },
            "EditingCycles": 12,
            "EditingDuration": 12345,
            "Contributor": [ "Contributor1 TxT", "Contributor2 TXT"],
            "Coverage": "Coverage TxT",
            "Identifier": "Identifier TxT",
            "Publisher": [ "Publisher TxT", "Publisher2 TXT"],
            "Relation": [ "Relation TxT", "Relation2 TXT"],
            "Rights": "Rights TxT",
            "Source": "Source TxT",
            "Type": "Type TxT",
            "UserDefinedProperties": {
                "NewPropName Bool": {
                    "type": "boolean",
                    "value": true
                },
                "NewPropName Numb": {
                    "type": "long",
                    "value": 1245
                },
                "NewPropName Str": {
                    "type": "string",
                    "value": "this is a string"
                },
                "NewPropName float": {
                    "type": "float",
                    "value": 12.45
                }
            }
        }
    }
}

The following properties are extracted:

Property	Description
`Author`	The user name who saved the file first time.
`Generator`	Identifies which application was used to create or last modify the document.
`CreationDate`	The date and time when file was first saved.
`Title`	Title of the document.
`Subject`	Subject of the document. Can be used to group documents with similar contents.
`Description`	Comments to help identify the document.
`Keywords`	`[string,]` Words used to index the content of the document. Can contain white spaces.
`Language`	the default language of the document.
`ModifiedBy`	The user name when the file was last saved in a LibreOffice file format.
`ModificationDate`	The date and time when the file was last saved in a LibreOffice file format.
`PrintedBy`	The user name who printed the file last time.
`PrintDate`	The date and time when the file was last printed.
`TemplateName`	The template that was used to create the file.
`TemplateURL`	The URL of the template from which the document was created. The value is an empty string if the document was not created from a template or if it was detached from the template.
`TemplateDate`	The date and time of when the document was created or updated from the template.
`AutoloadURL`	The URL to load automatically at a specified time after the document is loaded into a desktop frame. An empty URL is valid and describes a case where the document shall be reloaded from its original location. An empty URL together with an `AutoloadSecs` value of 0 describes a case where no autoload is specified.
`AutoloadSecs`	The number of seconds after which a specified URL is to be loaded after the document is loaded into a desktop. A value of 0 is valid and describes a redirection. A value of 0 together with an empty string as `AutoloadURL` describes a case where no autoload is specified.
`DefaultTarget`	The name of the default frame into which links should be loaded if no target is specified.
`DocumentStatistics`	Statistics about the document, as separate properties. They will be recalculated and overwritten at document open. `PageCount` `TableCount` `ImageCount` `ObjectCount` `ParagraphCount` `WordCount` `CharacterCount` `NonWhitespaceCharacterCount`
`EditingCycles`	The number of times that the file has been saved.
`EditingDuration`	The amount of time that the file has been open for editing since the file was created. The editing time is updated when file saved.
`Contributor`	`[string,]` Names of the people, organizations, or other entities that have made contributions to the document.
`Coverage`	Time, place, or jurisdiction that the document is relevant to. For example, a range of dates, a place, or an institution that the document applies to.
`Identifier`	Some unique identifier like ISBN.
`Publisher`	`[string,]` Name of the entity that is making the document available. For example, a company, university, or government body.
`Relation`	`[string,]` Resources related to the document. For example, a set of volumes the document is part of, or the document’s edition number.
`Rights`	Intellectual property rights associated with the document. For example, a copyright statement, or information about who has permission to access the document.
`Source`	Information about other resources from which the document is derived. For example, the name or identifier of a hard copy that the document was scanned from, or a URL that the document was downloaded from.
`Type`	Information about the category or format of the document. For example, whether the document is a text document, image, or multimedia presentation.
`UserDefinedProperties`	List of user defined properties. Date/Time related types are not supported yet. Their Names, and types will be extracted but their values will not.

Note

Extraction of UserDefinedProperties may retrieve other types, based on what type of document properties it has. Unfortunatelly the different parts of the LibreOffice have a bit different limitations for these types:

With the recent LibreOffice these 6 types are found within the UI dialog: string, boolean, double, com.sun.star.util.Date, com.sun.star.util.DateTime, and com.sun.star.util.Duration
There are other ways to make document properties, that can add different types. And probably older versions of LibreOffice allow to add different (deprecated) types as well, that can still be extracted from old documents.
Unfortunatelly the exact limitation for the possible document property types aren’t well documented. Checking from the source code (when a property is added) hints at what types can be expected in some special cases. Seven more types have been identified: typelib_TypeClass_FLOAT, typelib_TypeClass_HYPER, typelib_TypeClass_LONG, typelib_TypeClass_SHORT, Time, DateTimeWithTimezone, and DateWithTimezone.

Transform

Example Transform:

{
    "Transforms": [
        {"DocumentProperties": {
            "Author":"Author TxT",
            "Generator":"Generator TxT",
            "CreationDate":"2024-01-21T14:45:00",
            "Title":"Title TxT",
            "Subject":"Subject TxT",
            "Description":"Description TxT",
            "Keywords": [ ],
            "Language":"en-GB",
            "ModifiedBy":"ModifiedBy TxT",
            "ModificationDate":"2024-05-23T10:05:50.159530766",
            "PrintedBy":"PrintedBy TxT",
            "PrintDate":"0000-00-00T00:00:00",
            "TemplateName":"TemplateName TxT",
            "TemplateURL":"TemplateURL TxT",
            "TemplateDate":"0000-00-00T00:00:00",
            "AutoloadURL":"",
            "AutoloadSecs": 0,
            "DefaultTarget":"DefaultTarget TxT",
            "DocumentStatistics": {
                "PageCount": 300,
                "TableCount": 60,
                "ImageCount": 10,
                "ObjectCount": 0,
                "ParagraphCount": 2880,
                "WordCount": 78680,
                "CharacterCount": 485920,
                "NonWhitespaceCharacterCount": 411520
            },
            "EditingCycles":12,
            "EditingDuration":12345,
            "Contributor":["Contributor1 TxT","Contributor2 TXT"],
            "Coverage":"Coverage TxT",
            "Identifier":"Identifier TxT",
            "Publisher":["Publisher TxT","Publisher2 TXT"],
            "Relation":["Relation TxT","Relation2 TXT"],
            "Rights":"Rights TxT",
            "Source":"Source TxT",
            "Type":"Type TxT",
            "UserDefinedProperties":[
                {"Add.NewPropName Str": {
                    "type": "string",
                    "value": "this is a string"
                }},
                {"Add.NewPropName Str": {
                    "type": "boolean",
                    "value": false
                }},
                {"Add.NewPropName Bool": {
                    "type": "boolean",
                    "value": true
                }},
                {"Add.NewPropName Numb": {
                    "type": "long",
                    "value": 1245
                }},
                {"Add.NewPropName float": {
                    "type": "float",
                    "value": 12.45
                }},
                {"Add.NewPropName Double": {
                    "type": "double",
                    "value": 124.578
                }},
                {"Delete": "NewPropName Double"}
            ]
        }}
    ]
}

To transform a document property you can use the same named commands as the extracted data was named. There are some additional commands for UserDefinedProperties to add a remove properties:

Command	Description
`Delete`	<string> It will delete the user defined property.
`Add`.<string>	`{"type":<string>,"value":<value>}` It adds a new named user defined property overwriting the existing value when already present. The new property will have a type and value. Types are limited, `string`, `boolean`, `long`, `float`. Date / time related types are not supported.

Note

The value of the UserDefinedProperties (the commands) can be either an array or an object.

Note

Some property values are overwritten when the document is opened:

ModifiedBy and ModificationDate are overwritten by any save. (That is why in the screenshot they have wrong values)
DocumentStatistics are recalculated and overwritten when the document is opened, but it does not recalculated on extract.

Screenshot

Example Files

Extracted JSON Pretty printed

Command for transform:

curl -v -k -F "data=@docStructureChartExampleOriginal.odt" -F "transform=$(cat DocPropTransform.JSON)" https://localhost:9980/cool/transform-document-structure > DocPropResult.odt

Command for extract:

curl -k -F "data=@temp2.odt" -F "filter=docprops" https://localhost:9980/cool/extract-document-structure > DocPropExtract.JSON

Extracted JSON

Tracked changes

New in version 25.04.3.2.

You can extract information about tracked changes in documents.

Extract

Use filter=trackchanges to extract the tracked changes list. The filter string may optionally contain arguments after a comma, as a sequence of name:value pairs, separated by commas. The supported arguments are:

Argument	Description
`contextLen`	A non-negative integer, defines maximum text length in `textBefore` and `textAfter` (see below). Default is 200.

Example output (pretty printed):

{
    "DocStructure": {
        "TrackChanges.ByIndex.0": {
            "type": "Delete",
            "dateTime": "2025-06-12T14:15:21",
            "author": "John Doe",
            "description": "Delete “Foo”",
            "comment": "Some comment",
            "textBefore": " preceding text 1, up to contextLen characters ...",
            "textAfter": " following text 1, up to contextLen characters ...",
            "textChanged": "Foo",
        },
        "TrackChanges.ByIndex.1": {
            "type": "Insert",
            "dateTime": "2025-06-12T14:15:24",
            "author": "Jane Smith",
            "description": "Insert “Bar”",
            "comment": "Another comment",
            "textBefore": " preceding text 2, up to contextLen characters ...",
            "textAfter": " following text 2, up to contextLen characters ...",
            "textChanged": "Bar",
        },
        "TrackChanges.ByIndex.2": {
            "type": "Format",
            "dateTime": "2025-06-12T14:15:31",
            "author": "Jane Smith",
            "description": "Attributes changed",
            "comment": "",
            "textBefore": " preceding text 3, up to contextLen characters ...",
            "textAfter": "",
            "textChanged": "Baz",
        },
    }
}

Each tracked change is defined by its properties. type, dateTime, author, description, comment, textBefore, textAfter, textChanged are present (some may be empty) for every tracked change:

Property	Description
`type`	One of the types listed in the following table.
`dateTime`	ISO 8601 datetime string, when the change was made.
`author`	The name of the author of the change.
`description`	Brief auto-generated description, may include (part of) added or removed text.
`comment`	A comment to the change that a reviewer made.
`textBefore`	The text in the document, that immediately precedes the change. Up to `contextLen` characters long. It shows the text in the state it was at the moment when the change occured; i.e., all older changes are shown as if accepted; all later changes are shown as if rejected.
`textAfter`	The text in the document, that goes immediately after the change. Up to `contextLen` characters long. It shows the text in the state it was at the moment when the change occured; i.e., all older changes are shown as if accepted; all later changes are shown as if rejected.
`textChanged`	The text in the document, that constitutes the change (an added, inserted, or formatted text) in full.

Tracked change types:

Type	Content properties
`Insert`	A text was added to the document.
`Delete`	A text was deleted from the document.
`Format`	Formatting of a part of the existing text was changed.

Example Files

Extracted JSON from original fodt, Pretty printed

Command for extract:

curl -F "data=@trackedChangesExampleOriginal.fodt" -F "filter=trackchanges,contextLen:100" https://localhost:9980/cool/extract-document-structure > trackedChangesExampleOriginal.json

Slides

Can be used only on Impress documents (presentations).

Slides are individual pages in presentations that can contain various elements, including text, images, videos, audio, shapes, and more. Master slides, are template pages used for creating slides. Layouts are templates for elements on the slide: type, position, size.

This API can extract the slides and master slides structure, and transform slides in the document. It can create, delete and reorder slides, change their layout, and change text of text based elements.

Extract

Use filter=slides to extract the slides.

Example output (pretty printed):

{
    "DocStructure": {
        "SlideCount": 7,
        "MasterSlideCount": 8,
        "MasterSlides": {
            "MasterSlide 0": {
                "Name": "Topic_Separator_Purple"
            },
            "MasterSlide 1": {
                "Name": "Content_sidebar_White"
            },
            "MasterSlide 2": {
                "Name": "Topic Separator white"
            },
            "MasterSlide 3": {
                "Name": "Content_sidebar_White_"
            },
            "MasterSlide 4": {
                "Name": "Topic_Separator_Purple_"
            },
            "MasterSlide 5": {
                "Name": "Content_White_Purple_Sidebar"
            }
        },
        "Slides": {
            "Slide 0": {
                "SlideName": "Slide3-Renamed",
                "MasterSlideName": "Content_White_Purple_Sidebar",
                "LayoutId": 3,
                "LayoutName": "AUTOLAYOUT_TITLE_2CONTENT",
                "ObjectCount": 4,
                "Objects": {
                    "Objects 0": {
                        "TextCount": 1,
                        "Texts": {
                            "Text 0": {
                                "ParaCount": 1,
                                "Paragraphs": [
                                    "Friendly Open Source Project"
                                ]
                            }
                        }
                    },
                    "Objects 1": {},
                    "Objects 2": {
                        "TextCount": 1,
                        "Texts": {
                            "Text 0": {
                                "ParaCount": 9,
                                "Paragraphs": [
                                    "Real Open Source",
                                    "100% open-source code",
                                    "Built with LibreOffice technology",
                                    "Built with Free Software technology stacks: primarily C++",
                                    "Runs best on Linux",
                                    "Open Development",
                                    "Anyone can contribute & participate",
                                    "Follow commits and tickets",
                                    "Public community calls - forum has details"
                                ]
                            }
                        }
                    },
                    "Objects 3": {
                        "TextCount": 1,
                        "Texts": {
                            "Text 0": {
                                "ParaCount": 5,
                                "Paragraphs": [
                                    "Focus:",
                                    "a non-renewable resource.",
                                    "Office Productivity & Documents",
                                    "Excited about migrating your\u0001documents",
                                    "Grateful to our partners for solving\u0001other problems."
                                ]
                            }
                        }
                    }
                }
            },
            "Slide 1": {
                "SlideName": "Slide 2",
                "MasterSlideName": "Topic_Separator_Purple",
                "LayoutId": 3,
                "LayoutName": "AUTOLAYOUT_TITLE_2CONTENT",
                "ObjectCount": 1,
                "Objects": {
                    "Objects 0": {
                        "TextCount": 1,
                        "Texts": {
                            "Text 0": {
                                "ParaCount": 3,
                                "Paragraphs": [
                                    "Collabora Online",
                                    "",
                                    "Powerful Online Collaboration"
                                ]
                            }
                        }
                    }
                }
            }
        }
    }
}

Extracted properties from the Impress presentation:

Property	Description
`SlideCount`	Number of slides in the presentation.
`MasterSlideCount`	Number of master slides in the presentation. These are real pages in the presentation, only used as template for slides.
`MasterSlides`	List of all the master slides, and some of their data. Currently only extract their name and ID.
`Slides`	List of all the slides, and some of their data. See table below.

Extracted properties from a slide:

Property	Description
`SlideName`	Name of the slide. If a slide doesn’t have a unique name they are named dynamically like “Slide 1”, “Slide 2”, etc.
`MasterSlideName`	Name of the master slide, this slide is made from.
`LayoutId`	The ID number of the actual layout used.
`LayoutName`	Name of the Layout.
`ObjectCount`	Number of elements in the slide. An elemet can be text, image, video, audio, shape and more…
`Objects`	List of all the elements and some of their data. See table below.

Extracted properties from an object. Currently only text based information can be extracted:

Property	Description
`TextCount`	Number of texts in this object. For example table objects can have more texts.
`Texts`	List of all the texts. See table below.

Extracted properties from a text object. Currenbtly only text based information can be extracted:

Property	Description
`ParaCount`	Number of paragraphs in this text object.
`Paragraphs`	Array of all its paragraphs, as simple strings.

Transform

Example Transform:

{
    "Transforms": {
        "SlideCommands": [
            {"JumpToSlideByName": "Slide 3"},
            {"MoveSlide": 0},
            {"RenameSlide": "Slide3-Renamed"},
            {"DeleteSlide": 2},
            {"JumpToSlide": 2},
            {"DeleteSlide": ""},
            {"JumpToSlide": 1},
            {"DuplicateSlide": ""},
            {"RenameSlide": "Slide1-Duplicated"},
            {"InsertMasterSlide": 1},
            {"RenameSlide": "SlideInserted-1"},
            {"ChangeLayout": 18},
            {"JumpToSlide": "last"},
            {"InsertMasterSlideByName": "Topic Separator white"},
            {"RenameSlide": "SlideInserted-Name"},
            {"ChangeLayoutByName": "AUTOLAYOUT_TITLE_2CONTENT"},
            {"SetText.0": "first"},
            {"SetText.1": "second"},
            {"SetText.2": "third object para1\npara2\npara3"},
            {"EditTextObject.2": [
                {"SelectParagraph": 1},
                {"InsertText": "----\n++++"},
                {"UnoCommand": ".uno:DefaultNumbering"},
                {"SelectText": [0,6,0,12] },
                {"InsertText": "-Inserted-"},
                {"UnoCommand": ".uno:Underline"},
                {"UnoCommand": ".uno:Bold"},
                {"UnoCommand": ".uno:Italic"},
                {"UnoCommand": ".uno:Strikeout"},
                {"UnoCommand": ".uno:Shadowed"},
                {"UnoCommand": ".uno:JustifyPara"},
                {"UnoCommand": ".uno:DefaultBullet"},
                {"UnoCommand": ".uno:SuperScript"},
                {"SelectText": [0,17,0,20]},
                {"UnoCommand": ".uno:SubScript"},
                {"UnoCommand": ".uno:Color {\"Color.Color\":{\"type\":\"long\",\"value\":2777241}}"},
                {"UnoCommand": ".uno:CharBackColor {\"CharBackColor.Color\":{\"type\":\"long\",\"value\":6710886}}"},
                {"SelectParagraph": 1},
                {"UnoCommand": ".uno:CenterPara"},
                {"SelectParagraph": 2},
                {"UnoCommand": ".uno:RightPara"},
                {"SelectParagraph": 3},
                {"UnoCommand": ".uno:LeftPara"}
            ]},
            {"DuplicateSlide": 1},
            {"MoveSlide.2": 6}
        ]
    }
}

There is always a current slide that most commands do act on, and some commands that change the current slide. By default the current slide is the slide at index 0.

To transform a slide you can use these commands:

Command	Value	Description
`JumpToSlide`	<num> \| `"last"`	Jump to the slide a index, or to the last slide. The index is 0 based. Using `last` is useful to just add new slides at the end.
`JumpToSlideByName`	<string>	Jump to the named slide. Be careful with default slide names like “Slide 1”. Those names can change during slide deletion or insertion.
`InsertMasterSlide`	<num>	Insert a new slide after the current slide, based on the master slide at index. Jump to the newly created slide, setting the current slide.
`InsertMasterSlideByName`	<string>	Insert a new slide after the current slide, based on the named master slide. Jump to the newly created slide, setting the current slide.
`DeleteSlide`	NONE \| <num>	Delete the slide at index or if none, the current slide. Will jump to the previous slide, or to the new first slide if this was the first slide. If this is the current slide then it will jump to the previous slide. If needed the index of the current slide will be readjusted so the current slide is unchanged. As there must always be one slide left in the presentation, the last remainind slide can not be deleted.
`MoveSlide`	<num>	Move the the current slide to the new positon. The index the current slide is readjusterd to follow.
`MoveSlide`.<num>	<num>	Move the slide at index to a new position. If the index is the one of the current slide then it is like the previous command. Otherwise the current slide will be unchanged, but its index may be adjusted as needed.
`DuplicateSlide`	NONE \| <num>	Duplicate the slide at index, or if none, the current slide, and jump to this new slide.
`ChangeLayout`	<num> \| <string>	Change the layout of the current slide to the layout with the index or the named layout. For Layout names, you can use these: `AUTOLAYOUT_TITLE_CONTENT` `AUTOLAYOUT_TITLE_CONTENT_OVER_CONTENT` `AUTOLAYOUT_TITLE_CONTENT_2CONTENT` `AUTOLAYOUT_TITLE_4CONTENT` `AUTOLAYOUT_ONLY_TEXT` `AUTOLAYOUT_TITLE_ONLY` `AUTOLAYOUT_TITLE_6CONTENT` `AUTOLAYOUT_TITLE` `AUTOLAYOUT_TITLE_2CONTENT_CONTENT` `AUTOLAYOUT_TITLE_2CONTENT_OVER_CONTENT` `AUTOLAYOUT_TITLE_2CONTENT` `AUTOLAYOUT_VTITLE_VCONTENT` `AUTOLAYOUT_VTITLE_VCONTENT_OVER_VCONTENT` `AUTOLAYOUT_TITLE_VCONTENT` `AUTOLAYOUT_TITLE_2VTEXT`
`RenameSlide`	<string>	Rename the current slide. Use unique names. Two slides cannot have the same name. Default names like “Slide 1”, “Slide 43”, cannot be set.
`SetText`.<num>	<text>	Set the object text as index to text. Supported only on text based objects.
`MarkObject`	<num>	Mark (select) the object at index on the current slide. This allows to use UNO commands that work on selected objects.
`UnMarkObject`	<num>	Unmark (deselect) the object at index on the current slide.
`UnoCommand`	<string>	Call the UNO command. For example `.uno:DefaultBullet` will toggle the selected paragraphs bullets, to on/off. There are many more uno commands… Not checked yet wich works here and wich not. Be careful with these, some may even break the mechanism of transform.
`EditTextObject`.<num>	array of commands	Start to edit the object <num> on the current slide. It can contain an array of commands to edit the object

To edit the text object with EditTextObject you can use the following commands:

Command	Value	Description
`SelectParagraph`	<num>	Select the text of paragraph <num> in the edited text object.
`SelectText`	[<num>,<num>, <num>,<num>]	Select text in the edited textobject. Can be used with 0-4 parameter: [1,2,3,4] = select text between 2. character of 1. paragraph and 4 character of 3. para. [1,2,3] = [1,2,3,] = select text between 2. character of 1. paragraph and last character of 3. paragraph [1,2] = [1,2,1,2] = only position the cursor to 2. character of 1. paragraph. Does not select any text [1] = [1,0,1,] = select all text in the 1. paragrah [] = [0,0,,] = select all text of the object. Where * means the last character or paragraph.
`InsertText`	<string>	Insert text <string> into the actual text object to the selected place. It can insert multiple paragraphs. (`"1.\n2."` = 2 paragraph text, `\n` = end of paragraph) If a text is selected, it will replace that If cursor was set without selection, then it will extend the text there. It will select the newly inserted text, so it can be formatted right away
`UnoCommand`	<string>	Call the UNO command. Same as UnoCommand in `SlideCommands` Can be used to format the selected text

Usable (tested) UNO commands by categories:

Toggle (on/off) a format on the selected characters:

".uno:Bold"
".uno:Italic"
".uno:Strikeout"
".uno:Shadowed"
".uno:Underline"
".uno:SuperScript"
".uno:SubScript"
".uno:DefaultBullet" (it will affect whole paragraphs)
".uno:DefaultNumbering" (it will affect whole paragraphs)

Set the horizontal alignment of whole paragraphs:

".uno:CenterPara"
".uno:RightPara"
".uno:LeftPara"
".uno:JustifyPara"

Set color of the selected text:

".uno:Color {\"Color.Color\":{\"type\":\"long\",\"value\":2777241}}" Set the selected text color to 2777241, which is a blue.
".uno:CharBackColor {\"CharBackColor.Color\":{\"type\":\"long\",\"value\":6710886}}" Set the background color of the selected text to 6710886 which is a gray.

Note

There are still more (untested) UNO commands that may can be used.

Note

The value of SlideCommands (the commands) can be an array, or an object.

To obtain the full list of enabled uno commands, you can check: sfx2/source/control/unoctitm.cxx under:

const std::map<std::u16string_view, KitUnoCommand>& GetKitUnoCommandList()

Screenshot

Example Files

Extracted JSON Pretty printed

Command for transform:

curl -v -k -F "data=@SlidesExampleOriginal.odp" -F "transform=$(cat SlidesTransform.JSON)" https://localhost:9980/cool/transform-document-structure > SlidesResult.odp

Command for extract:

curl -k -F "data=@SlidesExampleOriginal.odp" -F "filter=slides" https://localhost:9980/cool/extract-document-structure > SlidesExtractOriginal.JSON

Extracted JSON