> you’re reading...

Term of the Month

TOTM: Metadata Write-Back – Embedding metadata into assets

By default, Cumulus stores metadata in catalogs, which are separate from assets. This offers a number of advantages over systems that force you to store metadata inside assets, but sometimes storing metadata inside assets is important. Fortunately, Cumulus enables you to do both!

This Term of the Month explains what metadata write-back is, when it makes sense, and why it’s so important that Cumulus offers write-backs as option, not a requirement.

A hybrid approach to metadata storage

Operating systems store metadata inside asset files. This is how a file’s name, size, modification date and more “travel” with that file between computers. This is also how all that information comes into Cumulus in the first place. When you first think about it, internal storage of metadata sounds great—everything lives right inside the asset, no matter where that asset goes.

But there are important reasons why internal metadata storage isn’t a good option, particularly when it’s your only option:

  • Metadata loss – If you lose or damage a file, you also lose all the file’s metadata. Sure, you might have file backups, but do you really want to keep new copies of a 200MB Photoshop file each time you update some tiny bit of metadata?
  • Metadata searching of offline files – Unless your files are online, you won’t be able to search their metadata. This means all those years of CD/DVD archives need to also be stored on hard drives accessible to the DAM system. Otherwise, you can’t search their contents.
  • Metadata management – When you have two (or more) copies of a file, how do you manage the metadata contents? You might make a metadata update to one copy, but if you’ve got 1,000 other copies in circulation, that doesn’t do you much good, because they’ll all have obsolete metadata—and worse—no indication the metadata they contain is obsolete.
  • Metadata flexibility – File are very limited as to the types of metadata they can contain, and most file formats don’t support embedded custom metadata at all. Example: You can easily add IPTC or XMP metadata standard information to a JPEG image, but you can’t do the same with a PowerPoint presentation. Further, Cumulus supports metadata types as complex and powerful as audio annotations, graphical commenting regions and hierarchical categories. But no file formats support the embedding if information like that.
(Left) Sophisticated metadata types, like the searchable comments available in Cumulus, cannot be written back into any file format, which is one reason Cumulus catalogs are so valuable. (Right) The metadata supported by file formats for viewing in operating systems and other applications is limited to simple text values, such as those shown here for Windows 7. Though useful for copyrights and other simple notices, you wouldn’t want to limit your metadata possibilities to only these values.

The metadata catalogs Cumulus is based upon solve all of these issues, and more. But they don’t solve one important concern: Copyright notices and other important usage restrictions that travel with your files, no matter where those files go.

To address this, Cumulus uses metadata write-backs, through which Cumulus embeds into your assets   metadata you have added or edited inside Cumulus. The process is transparent to the user, so it’s something an organization can rely upon to be handled consistently.

In practice, it’s simple: Users edit metadata as usual, but when they save the asset record, Cumulus writes metadata inside the asset itself. Then, wherever the asset goes, so goes the embedded metadata.

(Left) Cumulus editing windows that write-back metadata into assets appear no different to users. Here, fields are highlighted that will be written back into the asset when this asset record is saved. (Right) Once written into the asset, the metadata can be seen and further edited from the File Info window inside Adobe Photoshop and other applications.

To further increase the benefits you can get from metadata write-backs, Cumulus also enables you to map metadata between fields. This means you can take advantage of popular metadata standards to store and transfer information they’re not specifically intended for.

To learn more about how Cumulus can write-back metadata into assets, don’t miss the Cumulus Power Tip, Add License Info (and More) to Every Photo You Distribute.

Discussion

6 comments for “TOTM: Metadata Write-Back – Embedding metadata into assets”

  1. For the richer metadata types that Canto Cumulus offers – why don’t you advantage of the flexibility of XMP and develop you own XMP schema that can be used for those richer metadata types. A comment on a certain part of the photo or page? Just create an XMP schema for ‘comments related to areas on a page’, containing (a list of the) the actual comment(s) as text combined with a rect (or series of rects, or polygon) field with the position on the page plus maybe name of the author, date/tme the comment was entered etc.

    Just an idea…

    And then it’s up the user/customer to decide whetehr they wish to write back this part of the metadata. Nevertheless, from a technical point of view there is no real reason to stop Canto from enabling this.

    Olaf

    Posted by Olaf Drümmer | January 21, 2010, 5:58 pm
  2. I agree with Olaf – add an additional XMP namespace with items that describe the unique information Cumulus adds to the asset (…or in its catalog).

    XMP can be useful beyond JPG, TIFF, PSD and RAW: just generate sidecar files! This way you can even use it for PowerPoint.

    A full metadata export increases the interoperability of Cumulus with other DAMs and workflows.

    Posted by Hans Fremuth | January 25, 2010, 4:29 pm
    • This sounds like an ideal way to approach this issue. Writing metadata into the file is a ‘must have’ in the today’s market space. Showcasing Canto unique approach to addressing this requirement, making is easier on Cumulus Admin, is a great help.

      v

      Posted by Vincent A. DiPaola | February 3, 2010, 3:51 pm
  3. One of the challenges we’ve had is using custom XMP fields for write back, it seems the only option is complicated munging via formulas. I know of the customXMPfield magic config file for reading metadata, but write support is desperately needed for institutions like ours, who rely on metadata not necessarily in Canto’s default XMP profiles.

    -Ryan Donahue
    George Eastman House

    Posted by Ryan Donahue | March 2, 2010, 3:16 pm
  4. OK, so here’s an interesting question. After upgrading from the 8.13 InDesign plugin to the 8.14 plugin, exporting to PDF is resulting in a file that is literally 30 times bigger than it has been (what used to be 200K, for example, is now 6MB). If that same file is put through Acrobat again, it gets reduced to around 400K.

    Could the new embedding scheme be accidentally taking up THAT MUCH space? Is it possible there is something wrong with the way the embedded data is being transferred to the PDF file? We’ve been pulling our hair out over this.

    Vincent Salzillo, President
    Double Exposure Technical Consulting

    Posted by Vincent Salzillo | March 11, 2011, 9:19 am
  5. I completely agree with Olaf, Cumulus is desperately missing support for custom XMP reading and writing.

    The limited XMP and IPTC fields are just not enough for complex workflows.

    Please please please add support for custom XMP

    Posted by James Kenwood | March 15, 2011, 7:27 am

Post a comment