Talk:Data integration

Untitled
the concepts of integrated data processing system

Removing commercial links
I removed the entire section of external links under the "Commercial" heading. This section was attracting spam and was in violation of WP:EL and WP:NOT - WP is not a link directory of commercial products. It did not improve the content of the article and it would be impossible to moderate if left in place. If you feel like any of these links were removed in error, please discuss their inclusion here before relinking. Thanks. Nposs 21:16, 19 January 2007 (UTC)


 * ref. Nr. 8 is a commercial brochure and not about enhanced modeling methodology — Preceding unsigned comment added by 128.131.193.208 (talk) 14:48, 4 August 2015 (UTC)

Should not be merged
I am concerned about the possibility of merging this with the general data integration entry - Edge Data Integration, while related and similar in some ways is very different from what most folks consider data integration - different purpose, different tools, different patterns, different data. 71.56.69.30 (talk) 10:38, 22 June 2008 (UTC)

Removal of external link
I've removed an external link to Costs of Data Integration because I have serious concerns that it represents a conflict of interest. The external article seems to suggest cost savings available from using products by Pervasive Software, however one of the authors of the article might very well be the same as a user here on wikipedia who has stated that they worked for Pervasive for many years ( I'm not sure if I'm allowed to state the author and username involved because of policy on outing, will provide the further details to admins if wanted) and then the article is added to this page by Shaw76 (talk • contribs) who has openly stated here that they are an offically authorized spokesperson for Pervasive. TurningWork (talk) 19:16, 21 April 2009 (UTC)

Insufficient Inline References and Possibly Original Research
The "History" section provides a chronology (as of 2009...as of 2011) without providing any citations - the most recent citation in the Bibliography (other than Lane's, which is strictly a news item) is 2002.

An additional concern is that the article appears to set up straw men - thus, the complaint that a centralized data warehouse can go out of date is of less importance in the situation of a single organization, where single-vendor RDBMS technologies that allow near-real-time incremental data updates to materialized views can be deployed. While federated approaches using data mediation and a virtual schema are viable, they are preferentially employed only in those circumstances where the individual local data sources are developed independently of the team that manages the integration task (as in research consortia, where the maintainers of individual sources collaborate - often loosely - but do not give up autonomy). In such conditions, physical data integration is often not politically feasible.

The classic paper of Won Kim et al on the challenges of integrating heterogeneous schemas (Won Kim, Injun Choi, Sunit Gala, and Mark Scheevel. On resolving schematic heterogeneity in multidatabase systems. Distributed and Parallel Databases, 1(3):251–277, July 1993) should be cited.

Prakash Nadkarni (talk) 02:11, 4 May 2012 (UTC) May 3 2012

Proposed merge with Information integration
A user editing from an IP address (76.19.112.164) has proposed that the Information integration page be merged into Data integration for the following reason: "I (Ryan Wisnesky), being an academic working in this field, propose to forward 'information integration' to 'data integration' for the following reasons. 1) Experts use the terms synonymously (e.g., Halevy wrote a book on data integration, but IBM has an information integration department). 2) The first sentence of the information integration article, 'Information integration (II) (also called deduplication and referential integrity)' doesn't make sense - II is not those things, but those things are captured by the techniques of data integration. The rest of the article isn't much better. 3) The data integration article is pretty good and subsumes everything on the information integration page." The original proposal was made at Articles for creation/Redirects. I have no opinion regarding it. Mz7 (talk) 20:38, 11 September 2015 (UTC)

Microsoft requesting edits
Hello, my name is Patricia Wagner and I'm an employee of Microsoft. I work in the Cloud+Enterprise division as a content publisher for Azure products. We are reviewing Wikipedia articles that relate to our areas and would like to update some to better represent the current state and features of our products. Please review the changes below and let me know if they are acceptable to you. Thank you very much for your consideration.

We would like to ask that two entries be added to the list of tools in the data integration article:

Azure Data Factory (ADF) SQL Server Integration Services (SSIS)

Pat MSFT (talk) 18:56, 22 April 2016 (UTC)


 * Done - Yuhong (talk) 10:51, 24 April 2016 (UTC)

External links modified
Hello fellow Wikipedians,

I have just modified 1 one external link on Data integration. Please take a moment to review my edit. If you have any questions, or need the bot to ignore the links, or the page altogether, please visit this simple FaQ for additional information. I made the following changes:
 * Added archive https://web.archive.org/web/20070926211342/http://www.csd.uoc.gr/~hy562/Papers/thesis_final.pdf to http://www.csd.uoc.gr/~hy562/Papers/thesis_final.pdf

When you have finished reviewing my changes, please set the checked parameter below to true or failed to let others know (documentation at ).

Cheers.— InternetArchiveBot  (Report bug) 08:02, 7 December 2016 (UTC)

Semantics
This sentence

"Issues with combining heterogeneous data sources are often referred to as information silos, under a single query interface have existed for some time."

is agrammatical. What is meant?

[Unfortunately such slopiness is in line with the general tone of BBS (Business BullShit) pervading the entire article. This will never be a computer science article if we dont change that.]

Marius63 (talk) 17:24, 17 August 2022 (UTC)