[ , ] Ukuphazamiseka kwe-Algorithmic kwakuyinkinga nge-Trends ye-Google Flu. Funda iphepha Lazer et al. (2014) , bese ubhala i-imeyili emfushane, ecacile kunjiniyela on Google echaza inkinga futhi enikeza umbono wokuthi ungayilungisa kanjani.
[ ] Bollen, Mao, and Zeng (2011) bathi idatha evela ku-Twitter ingasetshenziswa ukubikezela imakethe yesitoko. Lokhu kutholakala ekudaleni isikhwama semboqo-iMarokent Capital Markets-ukutshala ezimakethe zamasheya ngokusekelwe kwedatha eqoqwe kusuka ku-Twitter (Jordan 2010) . Yibuphi ubufakazi obungathanda ukukubona ngaphambi kokubeka imali yakho kulesi sikhwama?
[ ] Nakuba abanye ababhekele ezempilo zomphakathi babheka u-e-cigarettes usizo oluphumelelayo lokuyeka ukubhema, abanye baxwayisa mayelana nezingozi ezingenzeka, njengezinga eliphezulu le-nicotine. Ake ucabange ukuthi umcwaningi ukhetha ukutadisha umbono womphakathi ku-e-cigarettes ngokuqoqa okuthunyelwe kwe-e-cigarettes ehlobene no-cigaretta nokuqhuba ukuhlaziywa kwamagama.
[ ] NgoNovemba 2009, i-Twitter ishintshe umbuzo ebhokisini le-tweet elivela ku- "Wenzani?" Ku- "Kwenzekani?" (Https://blog.twitter.com/2009/whats-happening).
[ ] "I-Retweets" ivame ukusetshenziswa ukukala ithonya nokusabalalisa ithonya ku-Twitter. Ekuqaleni, abasebenzisi kwakudingeka bakopishe futhi banamathisele i-tweet abayithandile, bhala umlobi wangempela ngesibambo sakhe, bese uthayipha ngesandla "RT" ngaphambi kokuba i-tweet ibonise ukuthi i-retweet. Khona-ke, ngo-2009, i-Twitter yanezela inkinobho "ye-retweet". Ngo-June 2016, i-Twitter yenza ukuba abasebenzisi bakwazi ukubuyisa ama-tweets abo (https://twitter.com/twitter/status/742749353689780224). Ucabanga ukuthi lezi zinguquko kufanele zithinte indlela osebenzisa ngayo "i-repubu" ekucwaningweni kwakho? Kungani noma kungani?
[ , , , ] Emaphephandabeni axoxwa kakhulu, uMichel kanye nozakwethu (2011) bahlaziya okuqukethwe kwamabhuku angaphezu kwezigidi ezinhlanu ezibhalwe ngamakhadi okuzama ukukhomba izitayela zamasiko zesikhathi eside. Idatha ayisebenzisayo manje ikhishwe njengedatha yedatha ye-Google NGrams, ngakho-ke singasebenzisa idatha ukuze iphinde iphinde iphinde isandise enye yomsebenzi wayo.
Kwesinye seziphumo eziningi kuleli phephandaba, uMichel nozakwabo bathi bakhumbula ngokushesha futhi ngokushesha. Ngonyaka othize, bathi "ngo-1883," babalwa inani le-1-grams eyanyatheliswa ngonyaka ngamunye phakathi kuka-1875 no-1975 eyayingu "1883". Babecabanga ukuthi lesi sinani sinesilinganiso senzuzo ezenzakalweni ezenzeka kulowo nyaka. Esikhathini sabo 3a, bahlela ukulandelwa kwemithi iminyaka emithathu: 1883, 1910, no-1950. Le minyaka emithathu bahlanganyela iphethini evamile: ukusetshenziswa kancane ngaphambi kwalo nyaka, bese kuthiwa i-spike, bese ibola. Ngokulandelayo, ukulinganisa izinga lokubola konyaka ngamunye, uMichel kanye nosebenza nabo babalwa "ingxenye yokuphila" yonyaka ngamunye iminyaka yonke phakathi kuka-1875 no-1975. Kubalobo babo 3a (ekhoneni), babonisa ukuthi ingxenye yesigamu ngasinye unyaka wehla, futhi baphikisa ukuthi lokhu kusho ukuthi sikhumbula okusheshayo ngokushesha futhi ngokushesha. Basebenzisa i-Version 1 ye-English corpus, kodwa kamuva i-Google ikhishwe inguqulo yesibili ye-corpus. Sicela ufunde zonke izingxenye zombuzo ngaphambi kokuba uqale ukufaka ikhodi.
Lo msebenzi uzokunika umkhuba wokubhala ikhodi evuselelwayo, ukuhumusha imiphumela, nokuphikisana kwedatha (njengokusebenza ngamafayela angaqondakali nokuphatha idatha elahlekile). Lo msebenzi uzokusiza ukuthi uvuke uphinde usebenze nge-dataset ecebile nesithakazelisayo.
Thola idatha eluhlaza kusuka kuwebhusayithi yeGoogle Books Ngram Viewer. Ngokuyinhloko, kufanele usebenzise inguqulo yesi-2 yesiNgisi corpus, ekhishwe ngoJulayi 1, 2012. Ingaxhunyiwe, leli fayela liyi-1.4GB.
Buyisela ingxenye enkulu yomdwebo 3a Michel et al. (2011) . Ukuze ubuyekeze lesi sibalo, uzodinga amafayela amabili: lowo owalandile engxenyeni (a) kanye nefayela elithi "inani eliphelele," ongayisebenzisa ukuguqula izibalo ezingenakubalwa zibe yizilinganiso. Qaphela ukuthi ifayela elibala inani linesakhiwo esingenza kube nzima kanzima ukufunda. Ingabe i-version 2 yedatha ye-NGram ikhiqiza imiphumela efanayo nalabo abethunyelwe ku- Michel et al. (2011) , esekelwe idatha ye-1?
Manje hlola igrafu yakho ngokumelene negrafu edale i-NGram Viewer.
Ukuphinda uthole isithombe esingu-3a (isibalo esiphezulu), kodwa ushintshe i- \(y\) -axe ukuze kube ukubala okukhulunywe okungekho (hhayi izinga lokukhuluma).
Ingabe umehluko phakathi (b) no (d) ukuholela ekuhloliseni noma yimiphi imiphumela kaMichel et al. (2011). Kungani noma kungani?
Manje, usebenzisa ukulinganisa okukhulunywa ngakho, phendulela okufakiwe kwesibalo 3a. Lokhu kungukuthi, ngonyaka ngamunye phakathi kuka-1875 no-1975, balinganisa ukuphila kwengxenye yalo nyaka. I-half-life ichazwa ukuthi iyinani leminyaka edlulayo ngaphambi kokuba inani lokukhulumisana lifinyelele isigamu salo inani eliphakeme. Qaphela ukuthi Michel et al. (2011) yenza into enzima kakhulu ukulinganisa isigamu-ukuphila-bheka isigaba III.6 se-Information Support Online-kodwa bathi zonke izindlela ziveza imiphumela efanayo. Ingabe i-version 2 yedatha ye-NGram ikhiqiza imiphumela efanayo nalabo abethunyelwe ku- Michel et al. (2011) , esekelwe idatha ye-1? (Ukusikisela: Ungamangali uma kungenjalo.)
Kwakukhona yini iminyaka eyayiyizinyanga ezifana neminyaka eyayikhohlwa ikakhulukazi ngokushesha noma ikakhulukazi kancane? Ake ucabange ngezizathu ngezizathu zaleso siphethini bese uchaza indlela ozibonisile ngayo.
Manje uphinde uphendule lo mphumela wenguqulo yesibili yedatha ye-NGrams ngesiShayina, isiFulentshi, IsiJalimane, IsiHeberu, isi-Italian, IsiRashiya neSpanishi.
Ukuqhathanisa kuzo zonke izilimi, bekukhona yini iminyaka eyayiyizinkampani zangaphandle, njengeminyaka ebeyikhohliwe ikakhulukazi ngokushesha noma ikakhulukazi kancane kancane? Ake ucabange ngokufushane ngezizathu ezingenzeka zephethini.
[ , , , ] U- Penney (2016) uhlole ukuthi ngabe ukusabalalisa okubanzi mayelana nokubhekwa kwe-NSA / PRISM (okungukuthi, izimbulelo ze-Snowden) ngoJuni 2013 kwahlotshaniswa nokunciphisa ngokukhawuleza nangokungazelelwe emotweni kuma-athikili e-Wikipedia ngezihloko eziphakamisa ukukhathazeka kwangasese. Uma kunjalo, lolu shintsho ekuziphatheni luzohambisana nomphumela wokushisa okubangelwa ukubhekwa okukhulu. Indlela Penney (2016) ebizwa ngayo ngezinye izikhathi kuthiwa iphazamise ukuklanywa kwesikhathi sesikhathi , futhi ihlobene nezindlela ezichazwe esigabeni 2.4.3.
Ukuze ukhethe amagama angukhiye esihloko, uPenney okubhekiswe kuhlu olusetshenziswa nguMnyango wezokuThuthukiswa koMakhaya waseMelika ngokulandelela nokuqapha imithombo yezokuxhumana. Uhlu lwe-DHS luhlukanisa amagama athile okusesha ezinkingeni eziningi, okungukuthi, "Ukukhathazeka Kwezempilo," "Ukuphepha Kwengqalasizinda," no "Ubuphekula." Kwaqembu lokutadisha, uPenney wasebenzisa amagama angukhiye angu-48 ahlobene "nobuphekula" (bheka ithebula lesithasiselo 8) ). Wabe esehlanganisa ama-athikili e-Wikipedia okubalwa njalo ngenyanga ngezihloko ezingama-48 ze-Wikipedia ezinyangeni ezingu-32, kusukela ekuqaleni kukaJanuwari 2012 kuze kube sekupheleni kuka-Agasti 2014. Ukuze aqinise izingxabano zakhe, wadala amaqembu ambalwa ngokuqhathanisa ngokulandelela isihloko sibuka kwezinye izihloko.
Manje, uzophindaphinda futhi unwebe Penney (2016) . Yonke idatha eluhlaza oyodinga kulo msebenzi itholakala ku-Wikipedia. Noma ungayithola kusukela ku-R-package wikipediatrend (Meissner and R Core Team 2016) . Uma ubhala izimpendulo zakho, sicela wazi ukuthi iyiphi imithombo yedatha oyisebenzisile. (Qaphela ukuthi lo msebenzi owodwa ubonakala nesahluko 6.) Lo msebenzi uzokunika amandla ekusebenziseni ukudabuka kwedatha nokucabanga ngokuhlolwa kwemvelo kwimithombo emikhulu yedatha. Izophinde ikuvuselele futhi isebenze ngomthombo wedatha ongathandekayo wephrojekthi yamaphrojekthi esizayo.
[ ] U- Efrati (2016) wabika, ngokusekelwe kolwazi oluyimfihlo, ukuthi "ukwabelana okuphelele" ku-Facebook kuye kwehla ngo-5.5% unyaka ngonyaka ngenkathi "ukwabelana kwangempela kokusabalalisa" kwahlehla ngonyaka ka-21% ngonyaka. Lokhu kuncipha kwakunzima kakhulu nabasebenzisi be-Facebook abangaphansi kweminyaka engu-30 ubudala. Lo mbiko ubhekisele ekunciphiseni kwezici ezimbili. Enye ukukhula kwenani labantu "abangani" abanalo ku-Facebook. Enye ukuthi umsebenzi othile wokuhlanganyela uye wasuka ekuthumeleni imibiko nakumncintiswano njenge-Snapchat. Lo mbiko uveze ukuthi amaqhinga ambalwa we-Facebook azame ukukhulisa ukwabelana, kuhlanganise ne-News Feed i-algorithm tweaks eyenza okuthunyelwe kwangempela okuvelele, kanye nezikhumbuzi ngezikhathi ezithile zokuthunyelwe kwangempela nge-"On Today" feature. Yimiphi impendulo, uma ikhona, yenza lokhu okutholakele kwabaphenyi abafuna ukusebenzisa i-Facebook njengomthombo wedatha?
[ ] Uyini umehluko phakathi komphakathi wezenhlalo nomlando? Ngokusho kweGoldthorpe (1991) , umehluko omkhulu ulawula ukuqoqwa kwedatha. Abalando-mlando baphoqeleka ukuba basebenzise iziqu, kanti izakhamuzi zezenhlalakahle zingahle zihlele ukuqoqwa kwedatha ngezinhloso ezithile. Funda i- Goldthorpe (1991) . Umehluko kanjani phakathi kwezenhlalo kanye nomlando ohlobene nomqondo wokugcotshwa kanye nama-readymades?
[ ] Lokhu kwakha ku-quesiton yangaphambilini. Goldthorpe (1991) yathola izimpendulo eziningana ezibucayi, kuhlanganise nenye evela kuNicky Hart (1994) eyabeka inselele ukuzinikela kukaGoldthorpe ekuhambeni kwedatha eyenziwe. Ukuze kucacise ukulinganiselwa okungenzeka okungenzeka kwenziwe kwedatha eyenziwe, i-Hart ichaze iphrojekthi ye-Affluent Worker Project, inhlolovo enkulu yokulinganisa ubuhlobo phakathi kwesigaba somphakathi nokuvota okwenziwa yiGoldthorpe kanye nozakwethu phakathi nawo-1960. Njengoba omunye angalindela isazi esivumelana nedatha eyakhelwe phezu kokuthola idatha, i-Affluent Worker Project iqoqe idatha efanelwe ukubhekana ne-theory esanda kuphakanyiswa mayelana nekusasa lezenhlalo zomphakathi ngenkathi yokwanda kwamazinga okuphila. Kodwa, i-Goldthorpe kanye nabasebenza nabo ngandlela-thile "bakhohlwe" ukuqoqa ulwazi mayelana nokuziphatha kokuvota kwabesifazane. Nansi indlela uNicky Hart (1994) afingqa ngayo isiqephu sonke:
"... kunzima ukugwema isiphetho sokuthi abesifazane bashiywe ngoba lokhu kusetshenziswe" i-dataset "kuvinjelwe ingqondo yeparadigmatic engabandakanyi isipiliyoni sabesifazane. Ebhekene nombono wezinkolelo wokufunda ekilasini kanye nesenzo njengengcindezi yesilisa ..., i-Goldthorpe nabalingani bakhe bakha isethi yobufakazi obunamandla obondla futhi balungiselela ukucabanga kwabo siqu kunokuba baveze ekuvivinyeni okusemthethweni kokufaneleka. "
Hart yaqhubeka:
"Ukutholwa komsebenzi weProvincial Worker Project kusitshela kabanzi ngemikhuba yama-masculinist emphakathini wekhulu leminyaka kunokuba bazise izinqubo zokuqasha, ezombusazwe kanye nokuphila kwempahla."
Ungacabanga ngezinye izibonelo lapho ukuqoqwa kwedatha okwenziwe ngokwezifiso kunemikhuba yokuqoqa idatha eyakhelwe kuyo? Lokhu kuqhathaniswa kanjani ne-algorithmic confounding? Ngabe lokhu kungaba namuphi umphumela lapho abacwaningi kufanele basebenzise i-readymades futhi uma kufanele basebenzise amasosha?
[ ] Kulesi sahluko, ngiye ngahlukanisa idatha eqoqwe abacwaningi kubacwaningi abanezincwadi zokuphatha ezidalwa izinkampani kanye nohulumeni. Abanye abantu babiza amarekhodi okuphatha "atholakala idatha," ahluke "nedatha eyenziwe." Kuyiqiniso ukuthi amarekhodi okuphatha atholakala ngabacwaningi, kodwa nawo aklanyelwe kakhulu. Isibonelo, izinkampani zamanje zenkampani zisebenza kakhulu kanzima ukuqoqa nokuvimbela idatha yazo. Ngakho-ke, la marekhodi okuphatha atholakale futhi aklanyelwe, ngokuya nje kunombono wakho (isibalo 2.12).
Nikeza isibonelo somthombo wedatha lapho ukubona kokubili kutholakala futhi kuklanyelwe kuyasiza uma usebenzisa lo mthombo wedatha ucwaningo.
[ ] Encwadini ekhungathekisayo, umKristu waseSandvig no-Eszter Hargittai (2015) bahlukanisa ucwaningo lwedijithali ngezigaba ezimbili ezibanzi kuye ngokuthi ngabe uhlelo lwedijithali luyinto "noma" into yokutadisha. "Isibonelo sohlobo lokuqala-lapho uhlelo ithuluzi-yilolu cwaningo olwenziwa nguBangtsson nosebenza nabo (2011) ngokusebenzisa idatha yefoni yeselula ukulandelela ukufuduka emva kokuzamazama komhlaba eHaiti ngonyaka ka-2010. Isibonelo somusa wesibili-lapho uhlelo luyinto yokutadisha-lucwaninga nguJensen (2007) ngokuthi ukufakwa kwamafoni omakhalekhukhwini kulo lonke elaseKerala, e-India kwathinta kanjani ukusebenza kwemakethe yezinhlanzi. Ngithola ukuthi ukuhlukana kuyasiza ngoba kucacisa ukuthi izifundo zokusebenzisa imithombo yedatha yedijithali zingaba nemigomo ehlukene kakhulu noma ngabe isebenzisa uhlobo olufanayo lomthombo wedatha. Ukuze uqhubeke ucacise lo mbono, chaza izifundo ezine ozibonayo: ezimbili ezisebenzisa uhlelo lwedijithali njengethuluzi nambili abasebenzisa uhlelo lwedijithali njengento yokutadisha. Ungasebenzisa izibonelo ezivela kulesi sahluko uma uthanda.