Besides the actual content, a document can contain additional data: meta information such as the document title, tags and author, custom key-value pairs, and complete embedded documents. TX Text Control provides access to all of these settings, so this additional information can be read from and written to documents.

Meta Information

Document properties, often known as meta data, are details about the document that describe or identify it. They can include information such as the document title, author name, modification date, subject and specific keywords that identify or categorize the content.

In TX Text Control, meta data can be added using the SaveSettings object that is passed as a parameter to the ServerTextControl.Save method. The SaveSettings class provides properties for advanced settings and information during save operations, and the Save method saves the complete contents of a document in the specified format.

The following properties can be used:

| Property | Value Type | Value description |
|---|---|---|
| Author | String | Sets the document's author, which will be saved in the document. |
| CreationDate | DateTime | Sets the document's creation date, which will be saved in the document. |
| CreatorApplication | String | Sets the application that created the document. |
| DocumentKeywords | String[] | Sets the document's keywords, which will be saved in the document. |
| DocumentSubject | String | Sets the document's subject string, which will be saved in the document. |
| DocumentTitle | String | Sets the document's title, which will be saved in the document. |
| LastModificationDate | DateTime | Sets the date the document was last modified. |

The following code shows how to use the SaveSettings to add meta data to an Office Open XML (DOCX) document:
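The original code listing did not survive on this page, so the following is a minimal sketch of the approach described above, not the article's exact sample. It assumes a project that references TX Text Control .NET Server for ASP.NET; the file name, the document text and all property values are placeholders.

```csharp
using System;

class MetaDataSample
{
    static void Main()
    {
        using (TXTextControl.ServerTextControl tx = new TXTextControl.ServerTextControl())
        {
            tx.Create();
            tx.Text = "Sample document"; // placeholder content

            // Meta data that is written into the saved document.
            // All values below are illustrative placeholders.
            TXTextControl.SaveSettings saveSettings = new TXTextControl.SaveSettings()
            {
                Author = "Jane Doe",
                CreationDate = DateTime.Now,
                CreatorApplication = "Sample Application",
                DocumentKeywords = new string[] { "sample", "meta data" },
                DocumentSubject = "Sample subject",
                DocumentTitle = "Sample title",
                LastModificationDate = DateTime.Now
            };

            // StreamType.WordprocessingML is the Office Open XML (DOCX) format.
            tx.Save("sample.docx", TXTextControl.StreamType.WordprocessingML, saveSettings);
        }
    }
}
```

Passing the same SaveSettings object to Save with a PDF stream type should carry the same meta data into the PDF document properties.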


When opening the Windows document properties dialog, the exported meta data can be seen in the property grid:

Document Properties

If the document is exported to PDF, the data can be found in the document properties dialog in Adobe Acrobat Reader:

Document Properties

Custom Properties

Custom properties can be created to store additional information about the document. These properties remain with a document and can be viewed by all MS Word users who open the document. Several property management servers provide data tracking capabilities to search for, sort, and track documents based on document properties.

In TX Text Control, these properties can be accessed and created using the UserDefinedPropertyDictionary class.

The UserDefinedPropertyDictionary class is used with the LoadSettings.UserDefinedDocumentProperties and SaveSettings.UserDefinedDocumentProperties properties. The former gets a dictionary with all user-defined document properties contained in the loaded document; the latter sets a dictionary with all user-defined document properties that will be saved in the document. Each entry in the dictionary is a key/value pair, where the key is the name of the document property and the value is the document property's value.

The following code shows how to add new custom properties to an MS Word document:
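The original listing is not available here, so this is a hedged sketch of how custom properties might be written with the classes named above. It assumes that UserDefinedPropertyDictionary offers an Add method taking a property name and a string value; the property names, values and file name are invented for illustration.

```csharp
using System;

class CustomPropertiesSample
{
    static void Main()
    {
        using (TXTextControl.ServerTextControl tx = new TXTextControl.ServerTextControl())
        {
            tx.Create();
            tx.Text = "Sample document"; // placeholder content

            // Key/value pairs: the key is the property name,
            // the value is the property's value (placeholders here).
            TXTextControl.UserDefinedPropertyDictionary properties =
                new TXTextControl.UserDefinedPropertyDictionary();
            properties.Add("Department", "Sales");
            properties.Add("ReviewStatus", "Pending");

            TXTextControl.SaveSettings saveSettings = new TXTextControl.SaveSettings();
            saveSettings.UserDefinedDocumentProperties = properties;

            tx.Save("custom.docx", TXTextControl.StreamType.WordprocessingML, saveSettings);
        }
    }
}
```

When the document is loaded again, the same pairs should be available through LoadSettings.UserDefinedDocumentProperties after passing a LoadSettings object to the Load method.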


When opening the advanced properties dialog in MS Word, these key/value pairs are listed:

Document Properties

Embedding Documents

PDF/A-3 permits the embedding of files in any format. PDF/A-3 documents allow the progression from electronic paper to an electronic container that holds the human and machine-readable versions of a document. Applications can extract the machine-readable portion of the PDF document in order to process it. A PDF/A-3 document can contain an unlimited number of embedded documents for different processes.

Learn more: the article "PDF/A-3: The Better Container for Electronic Documents" gives an overview of the advantages of PDF/A-3 as an electronic container.

The following sample code shows how TX Text Control can be used to attach a text file to a PDF document:
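As the original listing is missing on this page, here is a minimal sketch of attaching a text file, under stated assumptions: the EmbeddedFile constructor is assumed to take a file name, the file data and an optional metadata string, and SaveSettings.EmbeddedFiles is assumed to accept an array of EmbeddedFile objects. File names, text and the description are placeholders.

```csharp
using System;

class EmbeddedFileSample
{
    static void Main()
    {
        using (TXTextControl.ServerTextControl tx = new TXTextControl.ServerTextControl())
        {
            tx.Create();
            tx.Text = "Human-readable document content"; // placeholder content

            // The attachment: file name, content, and no additional metadata.
            // Constructor shape is an assumption; verify against the API reference.
            TXTextControl.EmbeddedFile attachment =
                new TXTextControl.EmbeddedFile("data.txt", "Machine-readable content", null)
                {
                    Description = "Sample text attachment",
                    Relationship = "Data",          // one of the predefined values
                    MIMEType = "application/text",
                    CreationDate = DateTime.Now
                };

            TXTextControl.SaveSettings saveSettings = new TXTextControl.SaveSettings();
            saveSettings.EmbeddedFiles = new TXTextControl.EmbeddedFile[] { attachment };

            // StreamType.AdobePDFA writes a PDF/A document.
            tx.Save("container.pdf", TXTextControl.StreamType.AdobePDFA, saveSettings);
        }
    }
}
```

Because EmbeddedFiles is an array in this sketch, several attachments for different processes could be added to the same document.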


The EmbeddedFile object represents the attachment that is embedded in the PDF document. Besides the content, the file name and additional meta data can be added. In this sample, the MIME type of the attachment (application/text), a textual description, a relationship and the creation date are provided.

The relationship is an optional string describing the relationship between the embedded file and the containing document. It can be one of the predefined values or a custom value that follows the rules for second-class names (ISO 32000-1, Annex E). Predefined values are "Source", "Data", "Alternative", "Supplement" and "Unspecified".

When opening the document in Adobe Acrobat Reader, the attachment can be found in the Attachments side-panel.

PDF Attachments

Conclusion

A document is more than its content. Additional meta data, custom properties and embedded documents help processes and workflows find and categorize documents, and embedded machine-readable content allows documents to be processed automatically. TX Text Control provides the required functionality to create digital documents for a complete document automation process.