Streaming data

Streaming data is data that is continuously generated by different sources. Such data should be processed incrementally using stream processing techniques without having access to all of the data. In addition, it should be considered that concept drift may happen in the data which means that the properties of the stream may change over time.

It is usually used in the context of big data in which it is generated by many different sources at high speed.

Data streaming can also be explained as a technology used to deliver content to devices over the internet, and it allows users to access the content immediately, rather than having to wait for it to be downloaded. Big data is forcing many organizations to focus on storage costs, which brings interest to data lakes and data streams. A data lake refers to the storage of a large amount of unstructured and semi data, and is useful due to the increase of big data as it can be stored in such a way that firms can dive into the data lake and pull out what they need at the moment they need it. Whereas a data stream can perform real-time analysis on streaming data, and it differs from data lakes in speed and continuous nature of analysis, without having to store the data first.

Characteristics and consequences
In digital innovation management theories, five characteristics of digital innovative technologies are mentioned; homogenization and decoupling, modularity, connectivity, digital traces and programmability. Before these characteristics are explained and further elaborated with different examples of data streaming, it is important to understand the difference between digitalization and digitizing. The latter describes encoding from analog information to a digital format, such as light that enters the lens of a camera and transforms to a digital format/image (Yoo et al. 2012). Where digitalization refers to a more socio-technical process, where digitized techniques are applied to broader social and institutional contexts, meaning the process of converting information into a digital format, readable by some sort of computer. Within the context of data streaming this means that media for example, (information) has been digitized since the early 1990s, however the digitalization of ‘information/media’ has started to pick up since the beginning of this century.

Now, firstly homogenization and decoupling. “Because all digital information assumes the same form, it can, at least in principle, be processed by the same technologies. Consequently, digitizing has the potential to remove the tight  couplings between information types and their storage,  transmission, and processing technologies”. Within the context of data streaming, this means in theory that one can stream data now from any digital device. It also reduces the demand and use of music and films on CDs for example. One of the consequences of homogenization & decoupling is the decline of marginal costs. The marginal cost of data streaming is because it solely uses digital information, which can be transmitted, stored, and computed in fast and low-cost ways. An example of an industry that has low marginal costs due to data streaming is the music industry. Producers can now digitize songs and upload them on Spotify, instead of paying for the creation of the physical Albums and distributing these. Another consequence is convergent user experience, meaning that previously separated experiences are now brought together in one product.

Data streaming is also modular, because systems components may be separated and recombined mainly for flexibility and variety. Data streaming works in different application versions and systems such as IOS. It is also possible to change the speed of data streaming. A consequence of modularity is the creation of platforms. Data streaming platforms bring together analysis of information, but more importantly, they are able to integrate data between different sources (Myers, 2016). IBM streams for example is an analytics platform that enables the applications developed by users to gather, analyze and correlate information that comes to them from a variety of sources (IBM).

The third characteristic, connectivity, describes that a digital technology not only connects applications, devices and users but also connects customers and firms. Streaming services for example connects a vast collection of music and films of ‘producers’ with their consumers, so how music on Spotify can easily reach a vast group of consumers. Another example would be data of transport vehicles that can also be connected to firms with streaming applications, via vehicle-to-roadside communications. UPS does this for example to ‘calculate’ the optimal delivery routes by streaming real time big data and thereby reducing time to deliver packages.

Interoperability, which is the ability of a product or system to work with other products or systems, is a consequence of connectivity. For instance, the music industry is interoperable, because some music platforms have integrated social media platforms. Another of connectivity is network externality. This means that the value of a good to a user increases with the number of other users (installed base) of the same or similar good. Data streaming technology can utilize network externalities, because it brings together supply and demand of large networks of creators and consumers. This is very much the case at popcorn time, a service where people can stream latest movies on demand. These streams work better when people have used their content.

The latter has to do with the fact that if one streams content he/she automatically also down/uploads content. While a streaming service is being used it leaves Digital Traces, which simply describes the fact that all digital technologies leave a digital trace from the user. In the past, when media was sold, the seller/provider only had information about the transaction itself. With data streaming it has become possible to actually track the behaviour of the users because it occurs in real time, directly from the distributor/providers. Morris and Powers describe this as opening the 'black box' of consumption. Providers of streaming services, for example, are now able to track detailed consuming behavior of the user, which in turn, they use to influence the user's decision-making process by creating algorithms to further develop a service. This kind of streaming has changed the way people consume media, which in time offered new possibilities for new ideas. These are also referred to as wakes of innovation and occur in places one would not initially expect. For instance, data streaming has enabled the development of sensors, for example that are used in a lot of sectors for different purposes. In the manufacturing sector data streaming is used for real-time analysis to improve operations. In healthcare sector sensors are being used for connected medical devices to create hubs of patients and healthcare providers, that can trigger alerts when a patient has a medical emergency.

Finally, programmability, a characteristic that describes that an innovative digital technology can be reprogrammed, improved and/or updated. Consequences of programmability are emerging functionalities. The most applicable functionality is incompleteness, which means that products and services are never finished, which is the case for data streaming because suppliers will keep refreshing their models . However, a more influential consequence of the programmability, and also of connectivity is the servitization of digital media content. Data streaming has caused a shift towards pay for use instead of pay for ownership;. This is happening in the video and music streaming industry, think of Netflix or Spotify. You have to pay to use the service, instead of owning a product. This was the case with buying an album or DVD, whereas now it is possible to access thousands of songs or movies.

Implications
Data streaming is becoming more useful and necessary in today's world and is being applied in a broad range of industries, some of which that have been already mentioned in examples such as the medical or transportation industry. Other examples of industries or markets, where data streaming is applicable, are:

Finance: where it allows to track changes in the stock market in real time, computes value-at-risk, and automatically rebalances portfolios based on stock price movements.

Real-estate: Websites can track a subset of data from consumers’ mobile devices and makes real-time property recommendations of properties to visit based on their geo-location (Amazon).

Gaming: An online gaming company can collect streaming data about player-game interactions, and feeds the data into its gaming platform (Amazon).

E-commerce/Marketing: Data streaming can provide all clickstream records from its online properties and aggregate and enrich the data with demographic information about users, and optimizes content placement on its site, delivering relevancy and better experience to customers (Amazon).

Besides these examples, there are probably many more applications for data streaming. However, data streaming has had the biggest implications for the audio, video and telecom industry because of the creation of streaming services. Streaming services have majorly influenced how people consume their media nowadays. Since the streaming services have had the most significant impact using the data streaming technology, this will be the main focus further on this page.

Impacted industries
The process of technological convergence, which appears because different industries increasingly rely on the same set of technological skills in their production processes, leads to closer relations between markets that was previously not highly related. For example, social media platforms such as Facebook and Twitter are providing live-streaming services, which allows global news publishers to connect directly with the right audiences as well as a far wider range of audiences than they otherwise would have reached. This has led to a change in which how and where news publishers are interacting with their audiences, and how they use social media services to deliver their service.

An industry that is impacted by data streaming is the Video Streaming industry. Consumers are now demanding videos to be available at immediate request, meaning that it is no longer only the quality resolution of image that acts as important performance metrics in the media industry, but also how quickly video starts to play.

The video industry underwent some of the same changes as the music industry. The video industry gained revenue by selling DVDs to customers and selling rights to cinemas and television channels. In 1997, the first online distributors began, but this was still small over a decade later mainly due to lower quality compared to hardcopy films. The third wave of streaming services such as Netflix, iTunes, Hulu, Amazon and Blockbuster have changed the film market. Netflix started in 1997, but only started to disrupt the market more than a decade later

The digitization, digitalization and underlying technologies of streaming have created these streaming services which essentially caused this disruption. With the rise of streaming firms in the film industry, the sales of physical DVDs vanished completely. An important difference between the music and film industry is that within the film industry, streaming services such as iTunes and Netflix are ‘destroying’ revenue (Sullivan, 2009). Because of this less films are produced and consequently there are less jobs in this industry. On contrary, cinemas are still important in the film industry, but the share of movies and series that are streamed by customers is rising very fast. It replaced the DVD, changed the performance metrics of the incumbents and can thus be seen as disruptive.

Another impacted industry is the Music Streaming industry. In 2017, streaming accounted for 43% of revenues in the music industry, and this was the third year of consecutive growth. New music streaming services such as Spotify and Apple Music challenges the traditional label companies, which are now risking to be outcompeted by new business models. Before the rapid adaption of streaming, in 2000 the music industry was experiencing what turned out to be a 15-year-long continued stagnation in revenue, which was due to the high CD prices needed to cover the costs of record labels. In 2015, the streaming technology overtook the market by allowing revenues to increase by saving costs on labels, and artists to have a more steady income by making money on streams, rather than being reliant on a full album or CD to do well after being published.

Furthermore, data streaming also has an impact on the Game Streaming industry. Game streaming is caused by the considerable growth of cloud computing, which allow gamers to access a greater variety of games without having to own expensive hardware. Cloud computing operates as an enabler to the development of game streaming, where hardware and content is accessed from the cloud, leading to a change in offering greater flexibility in content distribution. Game streaming allowed by cloud technology will drive changes in the gaming industry, where it is the hardware configuration of machines in the cloud that will be the developers, cost and time will be reduced to develop a greater ability of user reach around the world.