Data broker

A data broker is an individual or company that specializes in collecting personal data (such as income, ethnicity, political beliefs, or geolocation data) or data about people, mostly from public records but sometimes sourced privately, and selling or licensing such information to third parties for a variety of uses. Sources, usually Internet-based since the 1990s, may include census and electoral roll records, social networking sites, court reports and purchase histories. The information from data brokers may be used in background checks used by employers and housing.

There are varying regulations around the world limiting the collection of information on individuals; privacy laws vary. In the United States there is no federal regulation protection for the consumer from data brokers, although some states have begun enacting laws individually. In the European Union, GDPR serves to regulate data brokers' operations. Some data brokers report to have large numbers of population data or "data attributes". Acxiom purports to have data from 2.5 billion different people.

Overview
Information broker is sometimes abbreviated to IB, and other terms used for information brokers include data brokers, independent information specialists, information or data agents, data providers, data suppliers, information resellers, data vendors, syndicated data brokers, or information product companies. Information consultants, freelance librarians, and information specialists are also sometimes termed information brokers.

Credit scores were first used in the 1950s, and information brokering emerged as a career for individuals during that decade. However the business of information brokering did not become widely known or specifically regulated until the 1990s. During the 1970s, "information brokers" often had a library science degree; however, towards the end of the 20th century, people with degrees in science, law, business, medicine, or other disciplines entered the profession, and the line between the terms information professional and information broker became more blurred. In 1977, Kelly Warnken published the first fee-based information directory, followed by the Journal of Fee-Based Information Services in 1979 and the book The Information Brokers: How to Start and Operate Your Own Fee-based Service in 1981.

Beginning in the late twentieth century, technological developments such as the development of the Internet, increasing computer processing power, and declining costs of data storage made it much easier for companies to collect, analyze, store and transfer large amounts of data about individuals. This gave rise to the information broker or data broker industry. , there is no required academic qualification for the job of information broker; some people may have a bachelor's degree in business or marketing, while others may have a background in library science, or may have worked for a database provider.

Services
Information brokering has been described as the "business of buying and selling information as a commodity". Information brokers have been defined by the (US) Federal Trade Commission as "companies that collect information, including personal information about consumers, from a wide variety of sources for the purpose of reselling such information to their customers for various purposes, including verifying an individual's identity, differentiating records, marketing products, and preventing financial fraud". Gartner defines an information broker as "a business that aggregates information from a variety of sources; processes it to enrich, cleanse or analyze it; and licenses it to other organizations". It states that data is "licensed for particular or limited uses" rather than sold to a client.

Information brokers (IBs) collect and collate data concerning myriad topics, ranging from the daily communications of an individual to more specialized data such as product registrations, patents and copyright data, mostly from publicly available sources, usually obtained from online databases. They may also provide various other services, such as analysing the data and writing reports on them; creating databases for clients; or updating clients whenever new information on a specific topic or person. Clients use data brokers to save themselves time and money, as the brokers are trained in the skills needed to retrieve such information effectively and efficiently. Information brokers are secondary researchers, who find information on a variety of subjects, including companies (often competitors ), markets, people, and products. Their role includes analysis and synthesis of the data they find, Brokers may find everything else they can about an individual on the Internet, and aggregate that data with information from a variety of other sources.

Information brokers sometimes specialise in a specific area, such as market research, statistics, or scientific data.

Clients of information brokers come from a wide range of industries and professions, including manufacturing, financial institutions, political parties, government agencies and historians. Non-profit organizations might benefit from information which helps them to apply for grant funding, and real estate agents often use IBs to undertake land title searches. Advertising, fraud detection and risk mitigation are three common reasons for using data brokers, and these are the three broad categories defined by the Federal Trade Commission. Information brokers need to screen their clients carefully to avoid criminals obtaining data on individuals for nefarious purposes: US broking companies Lexis-Nexis and ChoicePoint have both been duped by phoney clients, leading in one case to identity theft on a large scale.

Data may be harvested from various sources, including census, change of address, motor vehicle-related records, user-contributed material and social networking sites, media and court reports, voter registration lists, consumer purchase histories, most-wanted lists and terrorist watch lists, bank card transaction records, health care authorities, and Web browsing histories. IBs may also purchase information from other companies (such as a credit card company). The information collected may include name, address, social security number, driver's licence number and other such identifying information, as well as occupation, property ownership, income, etc. Advertising companies are most often only interested in profiles and categories rather than personal information about an individual.

Information from property records, tax filings, etc. may also be available via "people-search" whitepage sites, either for a small fee or no cost. These websites can thereby have implications for stalking, harassment, and domestic violence.

The data are aggregated to create individual profiles, often made up of thousands of pieces of information, such as a person's age, race, gender, height, weight, marital status, religious affiliation, political affiliation, occupation,  household income, net worth, home ownership status, investment habits, product preferences and health-related interests. Brokers then sell the profiles to other organizations that use them mainly to target advertising and marketing towards specific groups, or to verify a person's identity including for purposes of fraud detection, and to sell to individuals and organizations so they can research people for various reasons. Some datasets may also include geolocation data and is included in marketing resources from Acxiom. Experian and Oracle also advertise location-based marketing services.

Many brokers work independently, while others are employees of large companies such as LexisNexis or ProQuest.

In the United States
Data brokers in the United States include Acxiom, Experian, Epsilon, CoreLogic, Datalogix, Intelius, PeekYou, Exactis, and Recorded Future. In 2012, Acxiom claimed to have files on about 500 million active consumers worldwide, with about 1,500 data points per person and, in 2023, Acxiom (renamed LiveRamp) claims to have files on 2.5 billion people and over 3,000 data points per person. The company Oracle has publicly noted it has connections with 80 data broker companies. The US Department of Homeland Security has purchased cell phone location data and home utility data from data brokers to facilitate deportations. The Federal Bureau of Investigation (FBI) has purchased personal data from the company Venntel. Under both of these circumstances, a warrant is not required to acquire this data, due to the fact that it is "open source" or "commercially obtained". Use of the data also has implications in background checks (used in rent/housing and job applications).

In 2012, Spokeo, a people search website, settled with the US Federal Trade Commission for $800,000 over violations of the Fair Credit Reporting Act.

In 2017, Cambridge Analytica claimed that it has psychological profiles of 220 million United States citizens, based on 5,000 separate data sets, with another source reporting 230 million. A scandal emerged after it was found that after 270,000 Facebook users consented to sharing their data, data was scraped from about 50 million profiles on the social media platform. This was seen as breach of trust by Facebook.

In 2018, American companies spent $19 billion acquiring and analyzing consumer data, according to the Interactive Advertising Bureau.

In 2021, The Pillar outed a Catholic priest by purchasing data from a data broker including data usage from Grindr.

Privacy issues and regulation
Information privacy laws are not as strict in the United States as in the European Union, where data brokers work hard to get around the General Data Protection Regulation (GDPR) regulations, brought into operation in 2018. Under GDPR, data can only be collected for re-use on one of six legal bases. The rather vague term "legitimate interest" is often abused or misinterpreted. Explicit consent from users is required for information storage. In addition, data processing related with political opinion and religious belief is prohibited unless the consent of data subject is granted.

In the US, individuals generally cannot find out what data a broker holds on them, how a broker got it, or how it is used. There is no federal law that permits or enables consumers to see, make corrections to, or opt out of data compiled by brokers.

Files on individuals are generally sold in lists; examples cited in testimony to the U.S. Congress include lists of rape survivors, seniors with dementia, financially vulnerable people, people with HIV, police officers (by home address), alcoholics, and people with erectile dysfunction.

Calls for regulation in the US
A 2007 University of California study, after requesting and analyzing information-sharing practices at 86 companies, found many operating under an opt-out model that it described as inconsistent with consumer expectations, and recommended that the California state legislature require companies to disclose their information-sharing policies using clear, unambiguous language, and consider creating a centralized, user-friendly method for consumers to opt out of information-sharing.

The proposed US Data Accountability and Trust Act (introduced in 2009) contained a number of requirements for auditing and verification of accuracy of data held by information brokers, and additional measures in the case of a security breach. The bill also gave identified individuals the means and opportunity to review and correct the data held that related to them. It passed through the United States House of Representatives in the 111th United States Congress, but failed to pass the United States Senate. It was revived by the 112th United States Congress in 2011 as H.R. 1707., but died after being referred to committee. The bill was first introduced by Rep. Bobby Rush [D-IL1] on 30 April 2009, H.R. 2221.

In 2009, the U.S. Federal Trade Commission had recommended the United States Congress develop legislation enabling consumers to see the information that data brokers hold about them, a recommendation it renewed in subsequent reports in 2012 and 2014. In 2013, the U.S. Government Accountability Office also called for Congress to consider legislation.

In October 2019, California Governor Gavin Newsom signed into action statute AB 1202. This bill "would require data brokers to register with, and provide certain information to, the Attorney General. The bill would define a data broker as a business that knowingly collects and sells to third parties the personal information of a consumer with whom the business does not have a direct relationship, subject to specified exceptions". This law was created to safeguard against the "cloak of invisibility" (unregistered, unregulated, untracked information broker) that previous data brokers roamed in. It was also meant to regulate the purchasing of data in commercial third party buyers, and tracks the data brokers information trades.<? Clarity would aid the reader. >

Due to the interest in federal regulation, data broker firms have lobbied and spent $29 million in the year 2020.

Criticisms, consumer rights and breaches
A United States Senate Committee in 2013 published A Review of the Data Broker Industry: Collection, Use, and Sale of Consumer Data for Marketing Purposes. It states that "Today, a wide range of companies known as 'data brokers' collect and maintain data on hundreds of millions of consumers, which they analyze, package, and sell generally without consumer permission or input." Their main findings were that:
 * Data brokers collect a huge volume of detailed information on hundreds of millions of consumers.
 * Data brokers sell products that identify financially vulnerable consumers.
 * Data broker products provide information about consumer offline behavior to tailor online outreach by marketers.
 * Data brokers operate behind a veil of secrecy.

The information produced by data brokers has been criticized for enabling discrimination in pricing, services and opportunities. For example, a May 2014 White House report found that web searches that included black-seeming first names such as Jermaine were more likely to result in ads being displayed that include the word "arrest," compared with web searches including white-seeming first names such as Geoffrey.

An Online Information Broker FAQ is published by Privacy Rights Clearinghouse (PRC), a nonprofit consumer organization in the United States. PRC also maintains a list of information brokers, with links to their privacy policies, terms of service, and opt-out provisions.

Data brokers have also faced legal charges for security breaches due to poor data security practices.

Professional associations
The Association of Independent Information Professionals (AIIP) is a professional association based in Baton Rouge, Louisiana, with members from 20 countries worldwide, representing both primary and secondary researchers.

Fiction
Examples of information brokers in contemporary fiction would be the Shadow Broker in the video game series Mass Effect; Nicholas Wayne, Rachel, Elean Duga, Gustav St. Germain, Carol, and the President of the Daily Days newspaper company in Baccano!; or Izaya Orihara in the light novel series Durarara!!. A few of the characters in Neal Stephenson's novel Snow Crash find work selling data as "stringers" for the Central Intelligence Corporation. Information broker characters play a prominent role in stories published by DC Comics. The character trope is best exemplified by the superhero Oracle, but the trope is later used with the characters Calculator, Proxy, Chloe Sullivan, and Felicity Smoak as well.