User:Godfred Douglas/sandbox

From Wikipedia, the free encyclopedia

Data ho nyansahu[edit]

Data nyansahu yɛ adesua a ɛfa nneɛma ahorow[1] ho a ɛde akontaabu, nyansahu mu kɔmputa, nyansahu akwan, akwan horow, algorithms ne nhyehyɛe ahorow di dwuma de yi anaa wɔde yi nimdeɛ ne nhumu fi data a ɛyɛ dede, wɔahyehyɛ, ne nea wɔanhyehyɛ no mu.[2]

Comet NEOWISE (a wohu no ha sɛ nsensanee kɔkɔɔ akuwakuw) a wohuu no fi nsoromma mu hwɛ mu nhwehwɛmu ho nsɛm a Wide-field Infrared Survey Explorer satellite telescope no nyae no mu nhwehwɛmu mu.

Data nyansahu nso ka domain nimdeɛ a efiri application domain a ɛwɔ aseɛ no mu (e.g., abɔdeɛ ho nyansahu, nsɛm ho mfiridwuma, ne nnuruyɛ) bom.[3]Data nyansahu wɔ afa horow pii na wobetumi aka ho asɛm sɛ nyansahu, nhwehwɛmu nhwɛso, nhwehwɛmu kwan, nteɛso, adwumayɛ kwan, ne adwuma.[4]

Data nyansahu yɛ "adwene a wɔde bɛka akontaabu, data nhwehwɛmu, informatiks, ne akwan a ɛfa ho abom" de "ate nsɛm a ɛkɔ so ankasa ase na wɔayɛ mu nhwehwɛmu" ne data.[5]Ɛde akwan ne nsusuwii ahorow a wonya fi nnwuma pii mu di dwuma wɔ akontaabu, akontaabu, kɔmputa ho nyansahu, nsɛm ho nyansahu, ne domain nimdeɛ mu.[6]Nanso, ɛsono data ho nyansahu wɔ kɔmputa ho nyansahu ne nsɛm ho nyansahu ho. Turing Award nkonimdifo Jim Gray yɛɛ data nyansahu ho mfonini sɛ nyansahu mu "nhwɛso a ɛto so anan" (empirical, theoretical, computational, na mprempren data-driven) na ɔsii so dua sɛ "biribiara a ɛfa nyansahu ho resakra esiane nkɛntɛnso a nsɛm ho mfiridwuma anya" ne data nsuyiri no nti.[7][8]

Data nyansahufo yɛ obi a ɔyɛ adwumaden a ɔyɛ koodu ho  nhyehyɛe a wɔyɛ na ɔde ka akontaabu ho nimdeɛ bom de yɛ nhumu fi data mu.[9]

Mfapem[edit]

Data nyansahu yɛ adwuma a ɛfa nneɛma ahorow ho[10] a wɔde wɔn adwene si nimdeɛ a wobeyi afi data ahorow a ɛtaa yɛ akɛse mu na wɔde nimdeɛ ne nhumu a efi saa data no mu adi dwuma de adi ɔhaw ahorow ho dwuma wɔ dwumadie domain ahodoɔ pii mu.Saa asɛmti yi fa data a wɔbɛsiesie ama nhwehwɛmu, data nyansahu mu haw ahorow a wɔbɛhyehyɛ, data mu nhwehwɛmu, ano aduru a wɔde data di dwuma a wɔbɛhyehyɛ, ne nea wɔahu a wɔde bɛma de akyerɛ gyinaesi ahorow a ɛkorɔn wɔ dwumadie domain ahodoɔ pii mu.Sɛnea ɛte no, ɛde nimdeɛ a efi kɔmputa nyansahu, akontaabu, nsɛm ho nyansahu, akontaabu, data mfoniniyɛ, nsɛm ho mfoniniyɛ, data sonification, data nkabom, mfoniniyɛ, nhyehyɛe a ɛyɛ den, nkitahodi ne adwumayɛ ka ho.[11][12]Akontaabuo ho nimdefoɔ Nathan Yau, a ɔde Ben Fry di dwuma no nso de data nyansahu bata onipa ne kɔmputa nkitahodiɛ ho: ɛsɛ sɛ wɔn a wɔde di dwuma no tumi de nkateɛ di data so na wɔhwehwɛ mu.[13][14]Wɔ afe 2015 mu no, Amerika Akontaabu Fekuw no kyerɛɛ database sohwɛ, akontabuo ne mfiri adesua, ne akyekyɛ nhyehyɛe ahorow a  ɛyɛ parallel sɛ ɛyɛ adwumayɛfoɔ akuo mmiɛnsa a ɛreba.[15]

Data nyansahu abusuabɔ a ɛda akontaabu ntam[edit]

Akontaabu ho abenfo pii a Nate Silver ka ho aka sɛ data ho nyansahu nyɛ adwuma foforo, na mmom ɛyɛ din foforo a wɔde frɛ akontaabu.[16]Afoforo ka sɛ data ho nyansahu ne akontaabu nnyɛ pɛ efirisɛ data ho nyansahu ɛtwe adwene si ɔhaw ahorow ne akwan horow a ɛyɛ soronko wɔ dijitaal data ho so.[17]Vasant Dhar kyerɛw sɛ akontaabu si data dodow ho nsɛm ne nkyerɛkyerɛmu so dua.Nea ɛne eyi bɔ abira no, data nyansahu di data dodow ne su ho dwuma (e.g., efi mfonini, nsɛm, atwerɛ, nkitahodi, adetɔfo ho nsɛm, ne nea ɛkeka ho) na esi nkɔmhyɛ ne adeyɛ so dua.[18]Andrew Gelman a ɔwɔ Columbia Sukuupɔn mu aka akontaabu ho asɛm sɛ ɛyɛ ade a ɛho nhia wɔ data ho nyansahu mu.[19]

Ɔbenfo David Donoho a ɔwɔ Stanford ka sɛ nhyehyɛe ahorow pii a wɔawie no di atoro hyɛ wɔn nkyerɛkyerɛ wɔ nhwehwɛmu ne akontaabu mu ho nkuran sɛ data-nyansahu nhyehyɛe bi mu ade titiriw, na datasets kɛse anaa akontaabu a wɔde di dwuma no nyɛ nneɛma a ɛtetew data nyansahu ne akontaabu ntam.Sɛnea ɔkyerɛ no, data nyansahu yɛ nteɛso a wɔde di dwuma a efii akontaabu a wɔde di dwuma wɔ amanne kwan so mu bae.[20]

Data Nyansahu ne Data Nhwehwɛmu[edit]

Data nyansahu ne data nhwehwɛmu nyinaa yɛ nteɛso a ɛho hia wɔ data sohwɛ ne nhwehwɛmu mu, nanso ɛsono wɔ akwan titiriw pii so.Bere a nnwuma mmienu no nyinaa hwehwɛ sɛ wɔde data yɛ adwuma no, data nyansahu yɛ adwuma a ɛfa nneɛma ahorow ho kɛse a ɛfa akontaabu, kompuuta, ne mfiri adesua akwan a wɔde di dwuma de yi nhumu fi data mu na wɔyɛ nkɔmhyɛ ahorow ho, bere a data nhwehwɛmu twe adwene si nhwehwɛmu ne nkyerɛase so kɛse.[21][22]

Data nhwehwɛmu taa hwehwɛ sɛ wɔde dataset nketewa a wɔahyehyɛ no bɛyɛ adwuma de abua nsɛmmisa pɔtee bi anaasɛ wobedi ɔhaw pɔtee bi ho dwuma.Eyi betumi ayɛ nnwuma te sɛ data ahotew, data ho mfoniniyɛ, ne data nhwehwɛmu de anya nhumu wɔ data no ho na wɔayɛ nsusuwii hunu a ɛfa twaka a ɛda nsakrae ahorow ntam ho.Akontaabu akwan na data nhwehwɛmufo taa de sɔ saa nsusuwii hunu yi hwɛ na wonya nsɛm firi data mu. Obi a ɔyɛ data mu nhwehwɛmu betumi ahwehwɛ adetɔn ho nsɛm mu de ahu sɛnea adetɔfo nneyɛe te na ɔde nyansahyɛ ahorow ama wɔ dawurubɔ akwan horow ho.[21]

Nea ɛne eyi bɔ abira no, data nyansahu yɛ adeyɛ a ɛyɛ den na wɔsan yɛ no mpɛn pii a ɛhwehwɛ sɛ wodi dataset akɛse a ɛyɛ den a ɛtaa hia akontaabu ne kompuuta akwan a ɛyɛ nwonwa de yɛ nhwehwɛmu ho dwuma.Sɛ wɔde data a wɔanhyehyɛ, a nsɛm anaa mfonini ka ho reyɛ adwuma a, data ho nyansahufo taa de mfiri a wɔde sua ade di dwuma de yɛ nkɔmhyɛ nhwɛso ahorow na wɔpaw nneɛma a wɔde data di dwuma.Data nyansahu taa de dwumadi ahorow te sɛ feature engineering, data preprocessing, ne model selection ka akontaabu nhwehwɛmu ho. Data ho nyansahufo betumi de mfiri adesua nhyehyɛe ahorow adi dwuma de ahyɛ nea ɔde di dwuma no apɛde ho nkɔm na wahwehwɛ nea ɔde di dwuma no nneyɛe mu de ayɛ nhyehyɛe a wɔde bɛkamfo akyerɛ ama e-commerce platform so.[22][23]

Data nyansahu trɛw kɔ akyiri sen data nhwehwɛmu denam nkɔmhyɛ nhwɛso ahorow a wɔbɔ ne nea wɔde di dwuma a wɔde ka bom na ama wɔatumi asi gyinae a ɛboro nhwehwɛmu no so, a ɛtwe adwene si nsɛm a wɔde ba awiei fi nsɛm a ɛwɔ hɔ mu so.Data ho nyansahufo taa hwɛ data a wɔboaboa ano na wosiesie, paw akwan a ɛyɛ sen biara a wɔfa so hwehwɛ nneɛma mu, na wɔde nhwɛso ahorow di dwuma wɔ tebea horow a mfaso wɔ so mu.Wodi nsɛm a ɛyɛ den ho dwuma na wohu nhwɛso ahorow a ahintaw wɔ dataset akɛse mu denam domain nimdeɛ, kɔmputa ho nyansahu, ne akontaabu a wɔde bom so.Wodi nsɛm a ɛyɛ den ho dwuma na wohu nhwɛso ahorow a ahintaw wɔ dataset akɛse mu denam domain nimdeɛ, kɔmputa ho nyansahu, ne akontaabu a wɔde bom so.[22]

Data nyansahu ne data nhwehwɛmu yɛ mmeae a ɛwɔ abusuabɔ kɛse a ɛtaa hwehwɛ sɛ wonya ahokokwaw a ɛte saa ara, ɛmfa ho  nsonsonoe ahorow a ɛda wɔn tam.Nteɛso abien no nyinaa hwehwɛ sɛ wonya nhyehyɛe, akontaabu, ne data mfoniniyɛ mu nimdeɛ a emu yɛ den de ka tumi a wɔde bɛka nea wɔahu no ho asɛm yiye akyerɛ atiefo a wɔwɔ mfiridwuma ho nimdeɛ ne wɔn a wonni bi no.Bio nso, nteɛsoɔ mmienu no nyinaa nya mfasoɔ firi adwene a ɛyɛ katee ne domain ho nimdeɛ mu ɛfiri sɛ nhwehwɛmu ne nhwɛsoɔ a ɛfata gyina nteaseɛ a ɛfa nsɛm a ɛfa ho ne anifereɛ a ɛwɔ data no mu so.[21][22]

Sɛ yɛbɛbɔ no mua a, wɔ asɛmti kɛseɛ a ɛfa data sohwɛ ne nhwehwɛmu mu no, data nyansahu ne data nhwehwɛmu yɛ nnwuma a ɛsono emu biara nanso ɛfa ho.Bere a data nyansahu fa ɔkwan a ɛkɔ akyiri a ɛka akontaabu nhwehwɛmu, kɔmputa akwan, ne mfiri adesua bom de yi nhumu, yɛ nkɔmhyɛ nhwɛso ahorow, na ɛkanyan gyinaesi a egyina data so no, data nhwehwɛmu twe adwene si nhumu a wobenya ne nsɛm a wɔde ba awiei afi data a wɔahyehyɛ mu.Nteɛso abien no nyinaa ho hia na ama wɔatumi de data tumi adi dwuma de ahu nneɛma a ɛrekɔ so, de aba awiei a nyansa wom, na wɔasiesie nsɛm a emu yɛ den wɔ nnwuma ahorow mu.

Hwɛ eyinom nso[edit]

Mmoa nwoma[edit]

  1. ^ Donoho, David (2017). "50 Years of Data Science". Journal of Computational and Graphical Statistics. 26 (4): 745–766. doi:10.1080/10618600.2017.1384734. S2CID 114558008.
  2. ^ Dhar, V. (2013). "Data science and prediction". Communications of the ACM. 56 (12): 64–73. doi:10.1145/2500499. S2CID 6107147. Archived from the original on 9 November 2014. Retrieved 2 September 2015.
  3. ^ Danyluk, A.; Leidig, P. (2021). Computing Competencies for Undergraduate Data Science Curricula (PDF). ACM Data Science Task Force Final Report (Report).
  4. ^ Mike, Koby; Hazzan, Orit (2023-01-20). "What is Data Science?". Communications of the ACM. 66 (2): 12–13. doi:10.1145/3575663. ISSN 0001-0782.
  5. ^ Hayashi, Chikio (1998-01-01). "What is Data Science ? Fundamental Concepts and a Heuristic Example". In Hayashi, Chikio; Yajima, Keiji; Bock, Hans-Hermann; Ohsumi, Noboru; Tanaka, Yutaka; Baba, Yasumasa (eds.). Data Science, Classification, and Related Methods. Studies in Classification, Data Analysis, and Knowledge Organization. Springer Japan. pp. 40–51. doi:10.1007/978-4-431-65950-1_3. ISBN 9784431702085.
  6. ^ Cao, Longbing (2017-06-29). "Data Science: A Comprehensive Overview". ACM Computing Surveys. 50 (3): 43:1–43:42. doi:10.1145/3076253. ISSN 0360-0300. S2CID 207595944.
  7. ^ Tony Hey; Stewart Tansley; Kristin Michele Tolle (2009). The Fourth Paradigm: Data-intensive Scientific Discovery. Microsoft Research. ISBN 978-0-9825442-0-4. Archived from the original on 20 March 2017.
  8. ^ Bell, G.; Hey, T.; Szalay, A. (2009). "Computer Science: Beyond the Data Deluge". Science. 323 (5919): 1297–1298. doi:10.1126/science.1170411. ISSN 0036-8075. PMID 19265007. S2CID 9743327.
  9. ^ Davenport, Thomas H.; Patil, D. J. (October 2012). "Data Scientist: The Sexiest Job of the 21st Century". Harvard Business Review. 90 (10): 70–76, 128. PMID 23074866. Retrieved 2016-01-18.
  10. ^ Emmert-Streib, Frank; Dehmer, Matthias (2018). "Defining data science by a data-driven quantification of the community". Machine Learning and Knowledge Extraction. 1: 235–251. doi:10.3390/make1010015.
  11. ^ "1. Introduction: What Is Data Science?". Doing Data Science [Book]. O’Reilly. Retrieved 2020-04-03.
  12. ^ "the three sexy skills of data geeks". m.e.driscoll: data utopian. 27 May 2009. Retrieved 2020-04-03.
  13. ^ Yau, Nathan (2009-06-04). "Rise of the Data Scientist". FlowingData. Retrieved 2020-04-03.
  14. ^ "Basic Example". benfry.com. Retrieved 2020-04-03.
  15. ^ "ASA Statement on the Role of Statistics in Data Science". AmStatNews. American Statistical Association. 2015-10-01. Archived from the original on 20 June 2019. Retrieved 2019-05-29.
  16. ^ "Nate Silver: What I need from statisticians". Statistics Views. Retrieved 2020-04-03.
  17. ^ "What's the Difference Between Data Science and Statistics?". Priceonomics. 13 October 2015. Retrieved 2020-04-03.
  18. ^ Vasant Dhar (2013-12-01). "Data science and prediction". Communications of the ACM. 56 (12): 64–73. doi:10.1145/2500499. S2CID 6107147.
  19. ^ "Statistics is the least important part of data science « Statistical Modeling, Causal Inference, and Social Science". statmodeling.stat.columbia.edu. Retrieved 2020-04-03.
  20. ^ Donoho, David (18 September 2015). "50 years of Data Science" (PDF). Retrieved 2 April 2020.
  21. ^ a b c Gareth, Hastie; Witten, Tibshira (2017-09-29). "An Introduction to Statistical Learning: with Applications in R." Springer.
  22. ^ a b c d Provost, Foster; Tom Fawcett (2013-08-01). "Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking". O'Reilly Media, Inc.
  23. ^ Han, Kamber; Pei (2011). Data Mining: Concepts and Techniques. ISBN 9780123814791. {{cite book}}: |work= ignored (help)