Enterprise Mashup Markup Language

EMML, or Enterprise Mashup Markup Language, is an XML markup language for creating enterprise mashups, which are software applications that consume and mash data from a variety of sources. These applications often perform logical or mathematical operations on the data as well as present it.

Mashed data produced by enterprise mashups are presented in graphical user interfaces as mashlets, widgets, or gadgets. EMML can also be considered a declarative mashup domain-specific language (DSL). A mashup DSL eliminates the need for complex, time-consuming, and repetitive procedural programming logic to create enterprise mashups. EMML also provides a declarative language for creating visual tools for enterprise mashups.

The primary benefits of EMML are mashup design portability and interoperability of mashup solutions. These benefits are expected to accelerate the adoption of enterprise mashups by creating transferable skills for software developers and reducing vendor lock-in.

The introduction of EMML is expected to help accelerate the trend toward the integration of web-based applications and service-oriented architecture (SOA) technologies. Bank of America was a high-profile early supporter of EMML. Other prominent early supporters included Hewlett-Packard, Capgemini, Adobe Systems, and Intel.

EMML history
Raj Krishnamurthy (chief architect at JackBe Corporation) and Deepak Alur (VP engineering at JackBe Corporation) started working on EMML in 2006. Their objective was to enable user-oriented and user-enabled mashups by creating what was then a new type of middleware called an Enterprise Mashup Platform. Raj Krishnamurthy became the chief language designer and implementer of EMML and also led the team to create an Eclipse-based EMML IDE called Mashup Studio. This work evolved into the EMML reference implementation that was donated to the Open Mashup Alliance. Raj Krishnamurthy continues to be one of the key contributors to EMML through the Open Mashup Alliance.

EMML features
The EMML language provides a rich, high-level mashup-domain vocabulary to consume and mash a variety of web data sources in flexible ways. EMML provides a uniform syntax to invoke heterogeneous service styles: REST, WSDL, RSS/ATOM, RDBMS, and POJO. The language also provides the ability to mix diverse data formats: XML, JSON, JDBC, Java objects, and primitive types.

High-level EMML language features include:
 * Filter and sort data coming from heterogeneous services.
 * Join data across heterogeneous services and data formats.
 * Group and aggregate data using assorted functions.
 * Annotate original service data to enrich its semantic meaning.
 * Merge multiple data streams into consolidated datasets.
 * Split datasets to select individual data fields.
 * Embed scripts written in JavaScript, JRuby, Groovy, or XQuery.
 * Web clipping to scrape data from HTML pages.
 * Conditional statements.
 * Parallel syntax for concurrent processing.

EMML is primarily an XML-based declarative language, but also provides the ability to encode complex logic using embedded scripting engines. XPath is the expression language used in EMML.

Directinvoke statement
The directinvoke statement provides the ability to invoke and consume a variety of data services. These data services may be REST, RSS/ATOM, or SOAP services. The statement also supports web clipping by allowing HTML pages to be specified as service endpoints. Several protocols are supported. HTTP header and cookie support is also available, providing the capability to consume a wide variety of REST and SOAP web services. The statement can also be used with a proxy server.

Code sample of passing attributes as parameters to a service:
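As a rough illustration in Python rather than EMML syntax, the sketch below shows what passing attributes as parameters to a REST service amounts to: the values are encoded into the query string of the request. The endpoint URL, parameter names, and the helper build_service_request are hypothetical, and the request is only constructed, not sent.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_service_request(endpoint, params, headers=None):
    """Build (but do not send) a GET request whose query string carries params."""
    url = f"{endpoint}?{urlencode(params)}"
    return Request(url, headers=headers or {}, method="GET")


# Hypothetical endpoint and parameter names, for illustration only.
request = build_service_request(
    "http://example.com/rest/customers",
    {"region": "west", "limit": 10},
    headers={"Accept": "application/xml"},
)
print(request.full_url)
# → http://example.com/rest/customers?region=west&limit=10
```

Building the request separately from sending it keeps the parameter handling easy to inspect; an actual call would pass the object to urllib.request.urlopen.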

Filter statement
The filter statement filters the content of a variable using an XPath expression and places the result in a new variable.

Code sample for filtering west-coast customers using region data-item:
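A minimal Python sketch of the same idea, not EMML syntax: an XPath predicate selects the customers whose region matches, and the matches are placed in a new variable. The document structure (customers, customer, name, region) and the sample values are assumptions made for the example.

```python
import xml.etree.ElementTree as ET

# Hypothetical customer payload, for illustration only.
CUSTOMERS_XML = """<customers>
  <customer><name>Acme</name><region>west</region></customer>
  <customer><name>Globex</name><region>east</region></customer>
  <customer><name>Initech</name><region>west</region></customer>
</customers>"""


def filter_by_region(root, region):
    """Return the customer elements whose <region> child equals region."""
    # ElementTree supports the XPath subset [child='text'].
    return root.findall(f"./customer[region='{region}']")


customers = ET.fromstring(CUSTOMERS_XML)
west_coast = filter_by_region(customers, "west")
west_names = [customer.findtext("name") for customer in west_coast]
print(west_names)
# → ['Acme', 'Initech']
```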

Sort statement
The sort statement sorts the content of a document-type variable or variable fragment based on key expressions and places the result in another variable.

Code sample that sorts tickets based on created date and customer:
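The behaviour can be sketched in Python (not EMML syntax) as a sort over two key expressions, created date first and customer second, with the result placed in a new document. The element names and ticket data are assumptions; dates are written in ISO format so that string comparison orders them chronologically.

```python
import xml.etree.ElementTree as ET

# Hypothetical ticket payload, for illustration only.
TICKETS_XML = """<tickets>
  <ticket><id>3</id><created>2009-03-02</created><customer>Globex</customer></ticket>
  <ticket><id>1</id><created>2009-03-01</created><customer>Initech</customer></ticket>
  <ticket><id>2</id><created>2009-03-01</created><customer>Acme</customer></ticket>
</tickets>"""


def sort_tickets(root):
    """Return a new root whose tickets are ordered by created date, then customer."""
    ordered = sorted(
        root.findall("ticket"),
        key=lambda t: (t.findtext("created"), t.findtext("customer")),
    )
    sorted_root = ET.Element(root.tag)
    sorted_root.extend(ordered)
    return sorted_root


tickets = ET.fromstring(TICKETS_XML)
sorted_ids = [t.findtext("id") for t in sort_tickets(tickets)]
print(sorted_ids)
# → ['2', '1', '3']
```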

Groupby statement
The groupby statement provides the ability to group and aggregate data sets. Standard XPath aggregation operations can be used, and there is an extension mechanism for adding user-defined functions. Nested grouping of hierarchical data sets is also supported. A clause is available to filter group attributes.

Code sample that groups books by genre and computes total copies for each genre:
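As an illustration in Python rather than EMML, grouping with a sum aggregate can be sketched as follows. The element names (book, genre, copies) and the sample figures are assumptions made for the example.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical book payload, for illustration only.
BOOKS_XML = """<books>
  <book><title>Book one</title><genre>fiction</genre><copies>3</copies></book>
  <book><title>Book two</title><genre>science</genre><copies>2</copies></book>
  <book><title>Book three</title><genre>fiction</genre><copies>5</copies></book>
</books>"""


def total_copies_by_genre(root):
    """Group books by genre and sum the copies within each group."""
    totals = defaultdict(int)
    for book in root.findall("book"):
        totals[book.findtext("genre")] += int(book.findtext("copies"))
    return dict(totals)


totals = total_copies_by_genre(ET.fromstring(BOOKS_XML))
print(totals)
# → {'fiction': 8, 'science': 2}
```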

Merge statement
The merge statement provides the ability to combine various data sources, including RSS/ATOM feeds and XML and JSON payloads. The merge feature operates on hierarchical document structures.

Code sample that merges Yahoo! News, Financial News, and Reuters feeds:
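A rough Python sketch of merging feeds, not EMML syntax: the items of several RSS documents are collected into one consolidated channel. To keep the example self-contained, three small inline documents with placeholder headlines stand in for the live feeds; the function name merge_feeds is hypothetical.

```python
import xml.etree.ElementTree as ET

# Placeholder feed documents standing in for live feeds, for illustration only.
FEEDS = [
    '<rss version="2.0"><channel><title>Feed one</title>'
    "<item><title>Headline A</title></item></channel></rss>",
    '<rss version="2.0"><channel><title>Feed two</title>'
    "<item><title>Headline B</title></item>"
    "<item><title>Headline C</title></item></channel></rss>",
    '<rss version="2.0"><channel><title>Feed three</title>'
    "<item><title>Headline D</title></item></channel></rss>",
]


def merge_feeds(feed_documents):
    """Collect the <item> elements of every feed into a single channel."""
    merged = ET.Element("rss", version="2.0")
    channel = ET.SubElement(merged, "channel")
    for document in feed_documents:
        channel.extend(list(ET.fromstring(document).iter("item")))
    return merged


merged = merge_feeds(FEEDS)
titles = [item.findtext("title") for item in merged.iter("item")]
print(titles)
# → ['Headline A', 'Headline B', 'Headline C', 'Headline D']
```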

Annotate statement
The annotate statement provides the ability to enrich the semantic meaning of source service data with microformat-like elements and attributes. These data annotations can be used by mashlets or gadgets to provide richer visual user interfaces.

Code sample for annotating vendor payload with geo-coordinates:
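The effect of such an annotation can be sketched in Python (not EMML syntax): each vendor element gains a geo child carrying latitude and longitude attributes. The vendor structure, the geo element name, and the city lookup table with approximate coordinates are all assumptions made for the example.

```python
import xml.etree.ElementTree as ET

# Hypothetical vendor payload, for illustration only.
VENDORS_XML = """<vendors>
  <vendor><name>Acme</name><city>Seattle</city></vendor>
  <vendor><name>Globex</name><city>Boston</city></vendor>
  <vendor><name>Initech</name><city>Unknown</city></vendor>
</vendors>"""

# Hypothetical lookup table with approximate coordinates.
CITY_COORDINATES = {
    "Seattle": ("47.61", "-122.33"),
    "Boston": ("42.36", "-71.06"),
}


def annotate_with_geo(root, coordinates):
    """Add a <geo latitude=... longitude=...> child to vendors in known cities."""
    for vendor in root.findall("vendor"):
        point = coordinates.get(vendor.findtext("city"))
        if point is not None:
            ET.SubElement(vendor, "geo", latitude=point[0], longitude=point[1])
    return root


annotated = annotate_with_geo(ET.fromstring(VENDORS_XML), CITY_COORDINATES)
first_geo = annotated.find("vendor/geo")
print(first_geo.attrib)
# → {'latitude': '47.61', 'longitude': '-122.33'}
```

Vendors whose city is missing from the lookup table are left unchanged, so the original payload is enriched rather than filtered.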

Join statement
The join statement defines how disparate, hierarchical data formats are joined and is comparable to inner joins for relational databases.

Code sample where the output variable contains an element with a repeating set of child elements, which are the repeating items. Each item contains one child with data from the variable named movies and two children with data from the variable named reviews:
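An inner join of this kind can be sketched in Python, not EMML syntax. Movies and reviews are matched on title, and each match yields an item with one child taken from movies and two taken from reviews. All element names (results, item, title, rating, critic) and the sample data are assumptions made for the example.

```python
import xml.etree.ElementTree as ET

# Hypothetical payloads, for illustration only.
MOVIES_XML = """<movies>
  <movie><title>Movie A</title></movie>
  <movie><title>Movie B</title></movie>
</movies>"""

REVIEWS_XML = """<reviews>
  <review><title>Movie A</title><rating>4</rating><critic>Lee</critic></review>
  <review><title>Movie A</title><rating>3</rating><critic>Kim</critic></review>
  <review><title>Movie C</title><rating>5</rating><critic>Roy</critic></review>
</reviews>"""


def join_movies_reviews(movies, reviews):
    """Inner-join movies and reviews on title; unmatched rows are dropped."""
    results = ET.Element("results")
    for movie in movies.findall("movie"):
        for review in reviews.findall("review"):
            if review.findtext("title") == movie.findtext("title"):
                item = ET.SubElement(results, "item")
                ET.SubElement(item, "title").text = movie.findtext("title")
                ET.SubElement(item, "rating").text = review.findtext("rating")
                ET.SubElement(item, "critic").text = review.findtext("critic")
    return results


joined = join_movies_reviews(ET.fromstring(MOVIES_XML), ET.fromstring(REVIEWS_XML))
critics = [item.findtext("critic") for item in joined.findall("item")]
print(critics)
# → ['Lee', 'Kim']
```

As with a relational inner join, "Movie B" (no reviews) and the review of "Movie C" (no matching movie) do not appear in the output.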

Scripting in EMML
EMML is a declarative language, but it provides programmatic scripting extensions for performing complex mashup logic. JavaScript, JRuby, Groovy, POJO, and XQuery scripting environments are supported. Data flows seamlessly between EMML and the scripting environments.

Code sample where a JavaScript snippet is used to extract an authentication token that is required for subsequent calls; the "result" variable gets propagated to the JavaScript environment: