Hypothesis

3,504 Matching Annotations

Mar 2017
pharo.org pharo.org

Pharo - Google Summer of Code: Call for Students

1
1. offray 25 Mar 2017
  
  in Public
  
  A first list of projects are available here but more can be found by interacting with mentors from the Pharo community. Join dedicated channels, #gsoc-students for general interactions with students on Pharo slack. In order to get an invitation for pharoproject.slack.com visit the here Discuss with mentors about the complexity and skills required for the different projects. Please help fix bugs, open relevant issues, suggest changes, additional features, help build a roadmap, and interact with mentors on mailing list and/or slack to get a better insight into projects. Better the contributions, Better are the chances of selection. Before applying: Knowledge about OOP Basic idea about Pharo & Smalltalk syntax and ongoing projects Past experience with Pharo & Smalltalk Interaction with organisation You can start with the Pharo MOOC: http://files.pharo.org/mooc/
  
  summer of code data week 8
Visit annotations in context

Tags

data week 8

summer of code

Annotators

offray

URL

pharo.org/news/GSoC17Students
www.linkedin.com www.linkedin.com

5 SXSW Forecasts for the Future of Enterprise Organizations

1
1. SenorG 21 Mar 2017
  
  in Public
  
  Corporate thought leaders have now realized that it is a much greater challenge to actually apply that data. The big takeaways in this topic are that data has to be seen to be acknowledged, tangible to be appreciated, and relevantly presented to have an impact. Connecting data on the macro level across an organization and then bringing it down to the individual stakeholder on the micro level seems to be the key in getting past the fact that right now big data is one thing to have and quite another to unlock.
  
  Simply possessing pools of data is of limited utility. It's like having a space ship but your only access point to it is through a pin hole in the garage wall that lets you see one small, random glint of ship; you (think you) know there's something awesome inside but that sense is really all you've got. Margaret points out that it has to be seen (data visualization), it has to be tangible (relevant to audience) and connected at micro and macro levels (storytelling). For all of the machine learning and AI that helps us access the spaceship, these key points are (for now) human-driven.
  
  Big data AI sxsw
Visit annotations in context

Tags

sxsw

AI

Big data

Annotators

SenorG

URL

linkedin.com/pulse/5-sxsw-forecasts-future-enterprise-organizations-margaret-roth
www.youtube.com www.youtube.com

The Data Center Mural Project: A History of Connection

2
1. dogtrax 18 Mar 2017
  
  in Public
  
  wanted there to be a continuum, a narrative,1:33that tracks the history of people1:37disseminating, collecting, sharing data.
  
  Back to the question: Can data help tell the story? or does it obscure the humanity of the narrative?
  
  data
2. dogtrax 18 Mar 2017
  
  in Public
  
  ail system and the telegraph1:08and made Council Bluffs an enduring anchor1:13of the sharing of information.
  
  The history of data points is on the ground, first, and then in the air, and then in the wires, and now, in the wireless.
  
  data
Visit annotations in context

Tags

data

Annotators

dogtrax

URL

youtube.com/watch
www.npr.org www.npr.org

U.S. Indicts 2 Russian Security Officials Over Yahoo Hack

1
1. daveh70 15 Mar 2017
  
  in Public
  
  The Justice Department has announced charges against four people, including two Russian security officials, over cybercrimes linked to a massive hack of millions of Yahoo user accounts. [500M accounts, in 2014]
  
  Two of the defendants — Dmitry Dokuchaev and his superior Igor Sushchin — are officers of the Russian Federal Security Service, or FSB. According to court documents, they "protected, directed, facilitated and paid" two criminal hackers, Alexsey Belan and Karim Baratov, to access information that has intelligence value. Belan also allegedly used the information obtained for his personal financial gain.
  
  data breach computer security
Visit annotations in context

Tags

data breach

computer security

Annotators

daveh70

URL

npr.org/sections/thetwo-way/2017/03/15/520258402/u-s-indicts-2-russian-security-officials-over-yahoo-hack
blog.outsider.ne.kr blog.outsider.ne.kr

기술 뉴스 #73 : 17-03-01 :: Outsider's Dev Story

1
1. sugeun.oh 15 Mar 2017
  
  in Public
  
  Prophet : Facebook에서 오픈 소스로 공개한 시계열 데이터의 예측 도구로 R과 Python으로 작성되었다.
  
  python statics opensource, also can use R
  
  static visualization data python docker
Visit annotations in context

Tags

data

static

python

visualization

docker

Annotators

sugeun.oh

URL

blog.outsider.ne.kr/1277
www.theguardian.com www.theguardian.com

Big data’s power is terrifying. That could be good news for democracy | George Monbiot

1
1. wiobyrne 10 Mar 2017
  
  in Public
  
  Either we own political technologies, or they will own us. The great potential of big data, big analysis and online forums will be used by us or against us. We must move fast to beat the billionaires.
  
  technology big-data
Visit annotations in context

Tags

technology

big-data

Annotators

wiobyrne

URL

theguardian.com/commentisfree/2017/mar/06/big-data-cambridge-analytica-democracy
hackeducation.com hackeducation.com

Ed-Tech in a Time of Trump

1
1. Laika57 07 Mar 2017
  
  in Public
  
  You can delete the data. You can limit its collection. You can restrict who sees it. You can inform students. You can encourage students to resist. Students have always resisted school surveillance.
  
  The first three of these can be tough for the individual faculty member to accomplish, but informing students and raising awareness around these issues can be done and is essential.
  
  privacy data surveillance #OpenLearning17
Visit annotations in context

Tags

data

surveillance

privacy

#OpenLearning17

Annotators

Laika57

URL

hackeducation.com/2017/02/02/ed-tech-and-trump
cs231n.github.io cs231n.github.io

CS231n Convolutional Neural Networks for Visual Recognition

1
1. ksagou 05 Mar 2017
  
  in Public
  
  Great course
  
  CNN Neural nets Data Science
Visit annotations in context

Tags

Data Science

Neural nets

CNN

Annotators

ksagou

URL

cs231n.github.io/
www.economist.com www.economist.com

Why literature is the ultimate big-data challenge

1
1. heatherstaines 04 Mar 2017
  
  in Public
  
  with the publication of the “New Oxford Shakespeare”, they have shaped the debate about authorship in Elizabethan England.
  
  Interesting how the technology improves.
  
  Shakespeare data
Visit annotations in context

Tags

data

Shakespeare

Annotators

heatherstaines

URL

economist.com/blogs/prospero/2017/03/revenge-maths-mob
www.researchinformation.info www.researchinformation.info

Mining for insight | Research Information

1
1. grolimur 03 Mar 2017
  
  in Public
  
  In addition, Neylon suggested that some low-level TDM goes on below the radar. ‘Text and data miners at universities often have to hide their location to avoid auto cut-offs of traditional publishers. This makes them harder to track. It’s difficult to draw the line between what’s text mining and what’s for researchers’ own use, for example, putting large volumes of papers into Mendeley or Zotero,’ he explained.
  
  Without a clear understanding of what a reference managers can do and what text and data mining is, it seems that some publishers will block the download of fulltexts on their platforms.
  
  reference manager TDM text and data mining
Visit annotations in context

Tags

text and data mining

TDM

reference manager

Annotators

grolimur

URL

researchinformation.info/feature/mining-insight
Feb 2017
www.usnews.com www.usnews.com

U.S. News Ranks the 50 States

1
1. nateangell 28 Feb 2017
  
  in Public
  
  Best States Rankings
  
  rankings of US states based on 7 criteria
  
  data rankings
Visit annotations in context

Tags

data

rankings

Annotators

nateangell

URL

usnews.com/news/best-states/rankings
wiki.dbpedia.org wiki.dbpedia.org

DBpedia

1
1. JanosHaits 28 Feb 2017
  
  in Public
  
  DBpedia data database Semantic web computer science
Visit annotations in context

Tags

data

computer science

database

DBpedia

Semantic web

Annotators

JanosHaits

URL

wiki.dbpedia.org/
semanticweb.org semanticweb.org

semanticweb.org.edu

1
1. JanosHaits 28 Feb 2017
  
  in Public
  
  semantic SemWeb Web3.0 data computer science
Visit annotations in context

Tags

data

computer science

Web3.0

semantic

SemWeb

Annotators

JanosHaits

URL

semanticweb.org/wiki/Main_Page.html
demo.dbpedia-spotlight.org demo.dbpedia-spotlight.org

DBpedia Spotlight

1
1. JanosHaits 28 Feb 2017
  
  in Public
  
  DBpedia Open data data SemWeb Semantic web Web3.0 computer science demo
Visit annotations in context

Tags

data

computer science

demo

SemWeb

DBpedia

Semantic web

Web3.0

Open data

Annotators

JanosHaits

URL

demo.dbpedia-spotlight.org/
lod-cloud.net lod-cloud.net

The Linking Open Data cloud diagram

1
1. JanosHaits 28 Feb 2017
  
  in Public
  
  Semantic web data Open data computer science
Visit annotations in context

Tags

data

Open data

Semantic web

computer science

Annotators

JanosHaits

URL

lod-cloud.net/
www.wikidata.org www.wikidata.org

Wikidata

1
1. JanosHaits 28 Feb 2017
  
  in Public
  
  Wikipedia data SemWeb Web3.0 Semantic web
Visit annotations in context

Tags

data

Web3.0

SemWeb

Wikipedia

Semantic web

Annotators

JanosHaits

URL

wikidata.org/wiki/Wikidata:Main_Page
query.wikidata.org query.wikidata.org

Wikidata Query Service

1
1. JanosHaits 28 Feb 2017
  
  in Public
  
  SemWeb Semantic web data Wiki Wikipedia Web3.0
Visit annotations in context

Tags

data

Web3.0

Wiki

SemWeb

Wikipedia

Semantic web

Annotators

JanosHaits

URL

query.wikidata.org/
cognonto.com cognonto.com

Cognonto - Knowledge Graph

1
1. JanosHaits 28 Feb 2017
  
  in Public
  
  SemWeb Semantic web Web3.0 computer science data Knowledge Graph knowledge
Visit annotations in context

Tags

data

computer science

Web3.0

Knowledge Graph

knowledge

SemWeb

Semantic web

Annotators

JanosHaits

URL

cognonto.com/knowledge-graph/
motherboard.vice.com motherboard.vice.com

Internet of Things Teddy Bear Leaked 2 Million Parent and Kids Message Recordings - Motherboard

1
1. daveh70 28 Feb 2017
  
  in Public
  
  A company that sells internet-connected teddy bears that allow kids and their far-away parents to exchange heartfelt messages left more than 800,000 customer credentials, as well as two million message recordings, totally exposed online for anyone to see and listen.
  
  computer security internet of things data breach
Visit annotations in context

Tags

internet of things

data breach

computer security

Annotators

daveh70

URL

motherboard.vice.com/en_us/article/internet-of-things-teddy-bear-leaked-2-million-parent-and-kids-message-recordings
en.lodlive.it en.lodlive.it

LodLive - browsing the Web of Data

1
1. JanosHaits 26 Feb 2017
  
  in Public
  
  SemWeb semantic Web3.0 computer science RDF Linked Data data
Visit annotations in context

Tags

computer science

data

Web3.0

semantic

Linked Data

SemWeb

RDF

Annotators

JanosHaits

URL

en.lodlive.it/
er.educause.edu er.educause.edu

Compliance, Privacy, and Security...What’s the Difference?

1
1. nateangell 22 Feb 2017
  
  in Public
  
  Compliance, Privacy, and Security
  
  on data compliance, privacy and security in EDU
  
  privacy security compliance data
Visit annotations in context

Tags

data

security

compliance

privacy

Annotators

nateangell

URL

er.educause.edu/blogs/2017/1/compliance-privacy-and-security-whats-the-difference
oaspa.org oaspa.org

Identifying quality in scholarly publishing: Not a black and white issue - OASPA

1
1. micahvandegrift 21 Feb 2017
  
  in Public
  
  Between 2013 and 2015 we accepted fewer than 25% of the total number of applications we received
  
  I'd love to see some stats on what the most common reasons for rejection are. Show me the data!
  
  data
Visit annotations in context

Tags

data

Annotators

micahvandegrift

URL

oaspa.org/guest-post-by-jean-claude-guedon-scholarly-communication-and-scholarly-publishing/
hackeducation.com hackeducation.com

Ed-Tech in a Time of Trump

3
1. jeremydean 16 Feb 2017
  
  in Public
  
  Not in the right major. Not in the right class. Not in the right school. Not in the right country.
  
  There's a bit of a slippery slope here, no? Maybe it's Audrey on that slope, maybe it's data-happy schools/companies. In either case, I wonder if it might be productive to lay claim to some space on that slope, short of the dangers below, aware of them, and working to responsibly leverage machine intelligence alongside human understanding.
  
  big data
2. wiobyrne 03 Feb 2017
  
  in Public
  
  All along the way, or perhaps somewhere along the way, we have confused surveillance for care. And that’s my takeaway for folks here today: when you work for a company or an institution that collects or trades data, you’re making it easy to surveil people and the stakes are high. They’re always high for the most vulnerable. By collecting so much data, you’re making it easy to discipline people. You’re making it easy to control people. You’re putting people at risk. You’re putting students at risk.
  
  privacy security data
3. otterscotter 03 Feb 2017
  
  in Public
  
  Ed-Tech in a Time of Trump
  
  edtech trump big data
Visit annotations in context

Tags

data

privacy

security

edtech

trump

big data

Annotators

jeremydean

wiobyrne

otterscotter

URL

hackeducation.com/2017/02/02/ed-tech-and-trump
oie.gsu.edu oie.gsu.edu

Advisement-GPS.pdf

4
1. jeremydean 16 Feb 2017
  
  in Public
  
  in order to facilitate advisors holding more productive conversations about potential academic directions with their advisees.
  
  Conversations!
  
  big data
2. jeremydean 16 Feb 2017
  
  in Public
  
  Each morning, all alerts triggered over the previous day are automatically sent to the advisor assigned to the impacted students, with a goal of advisor outreach to the student within 24 hours.
  
  Key that there's still a human and human relationships in the equation here.
  
  big data
3. jeremydean 16 Feb 2017
  
  in Public
  
  A single screen for each student offers all of the information that advisors reported was most essential to their work,
  
  Did students have access to the same data?
  
  big data
4. jeremydean 16 Feb 2017
  
  in Public
  
  and Georgia State's IT and legal offices readily accepted the security protocols put in place by EAB to protect the student data.
  
  So it's not as if this was done willy-nilly.
  
  big data
Visit annotations in context

Tags

big data

Annotators

jeremydean

URL

oie.gsu.edu/files/2014/04/Advisement-GPS.pdf
backchannel.com backchannel.com

A Lone Data Whiz Is Fighting Airbnb — and Winning – Backchannel

1
1. heatherstaines 11 Feb 2017
  
  in Public
  
  In his spare time, the documentary photographer had been scraping information on Airbnb listings across the city and displaying them in interactive maps on his website, InsideAirbnb.com.
  
  Quite an undertaking!
  
  data
Visit annotations in context

Tags

data

Annotators

heatherstaines

URL

backchannel.com/a-lone-data-whiz-is-fighting-airbnb-and-winning-7fd49513266e
www.nytimes.com www.nytimes.com

In Age of Trump, Scientists Show Signs of a Political Pulse

2
1. heatherstaines 07 Feb 2017
  
  in Public
  
  After a brief training session, participants spent six hours archiving environmental data from government websites, including those of the National Oceanic and Atmospheric Administration and the Interior Department.
  
  A worthwhile effort.
  
  science data
2. heatherstaines 07 Feb 2017
  
  in Public
  
  An anonymous donor has provided storage on Amazon servers, and the information can be searched from a website at the University of Pennsylvania called Data Refuge. Though the Federal Records Act theoretically protects government data from deletion, scientists who rely on it say would rather be safe than sorry.
  
  Data refuge.
  
  data science
Visit annotations in context

Tags

science

data

Annotators

heatherstaines

URL

nytimes.com/2017/02/06/science/donald-trump-scientists-politics.html
methods-sagepub-com.ezp1.lib.umn.edu methods-sagepub-com.ezp1.lib.umn.edu

Collecting and Managing Network Data - SAGE Research Methods

1
1. bchen 06 Feb 2017
  
  in Public
  
  In the node-list format, the first node in each row is ego, and the remaining nodes in that row are the nodes to which ego is connected (alters).
  
  Please don't do this!
  
  data SNAEd
Visit annotations in context

Tags

data

SNAEd

Annotators

bchen

URL

methods-sagepub-com.ezp1.lib.umn.edu/book/social-network-analysis-and-education/n4.xml
Jan 2017
static1.squarespace.com static1.squarespace.com

latour_back_to_basics_a_list_of_notebooks.pdf

1
1. patelrj 25 Jan 2017
  
  in Public
  
  prospective interviewee,
  
  Just a side spiel: In terms of an interviewee and data, everything really is data. I'll be interviewing freshmen next semester with other SLU students and some things I have already told the group to take note of in notebooks (ha ha) are the different responses the interviewee gives. In a way, the sad little freshmen turn into our experiment. Everyone in the group records a different response. These responses include the obvious oral responses, body language, and tone of voice.
  
  Interviews data experiment
Visit annotations in context

Tags

Interviews

experiment

data

Annotators

patelrj

URL

static1.squarespace.com/static/53713bf0e4b0297decd1ab8b/t/586c04f0ff7c50bb14f1a53f/1483474162961/latour_back_to_basics_a_list_of_notebooks.pdf
arstechnica.com arstechnica.com

Online databases dropping like flies, with >10k falling to ransomware groups

1
1. daveh70 09 Jan 2017
  
  in Public
  
  Thousands of poorly secured MongoDB databases have been deleted by attackers recently. The attackers offer to restore the data in exchange for a ransom -- but they may not actually have a copy.
  
  vulnerability computer security data breach
Visit annotations in context

Tags

data breach

vulnerability

computer security

Annotators

daveh70

URL

arstechnica.com/security/2017/01/more-than-10000-online-databases-taken-hostage-by-ransomware-attackers/
Dec 2016
article.sciencepublishinggroup.com article.sciencepublishinggroup.com

10.11648.j.ajsea.20140301.11

1
1. ElijahRenard 28 Dec 2016
  
  in Public
  
  evidence about obtaining higher productivity by using Agile methods
  
  If higher productivity came from including stakeholders in the frequent development releases, running a complementary scrum team on UX analysis should lead to improvement in quality.
  
  #UX #analysis data
Visit annotations in context

Tags

#analysis

data

#UX

Annotators

ElijahRenard

URL

article.sciencepublishinggroup.com/pdf/10.11648.j.ajsea.20140301.11.pdf
aeon.co aeon.co

If the internet is addictive, why don’t we regulate it? – Michael Schulson | Aeon Essays

2
1. offray 24 Dec 2016
  
  in Public
  
  ‘In the past, if you were an alcohol distiller, you could throw up your hands and say, look, I don’t know who’s an alcoholic,’ he said. ‘Today, Facebook knows how much you’re checking Facebook. Twitter knows how much you’re checking Twitter. Gaming companies know how much you’re using their free-to-play games. If these companies wanted to do something, they could.’
  
  addiction data mining
2. offray 24 Dec 2016
  
  in Public
  
  sites such as Facebook and Twitter automatically and continuously refresh the page; it’s impossible to get to the bottom of the feed.
  
  Well is not. A scrapping web technique used for the Data Selfies project goes to the end of the scrolling page for Twitter (after almost scrolling 3k tweets), which is useful for certain valid users of scrapping (like overwatch of political discourse on twitter).
  
  So, can be infinite scrolling be useful, but not allowed by default on this social networks. Could we change the way information is visualized to get an overview of it instead of being focused on small details all the time in an infitite scroll tread mill.
  
  infinite scroll tread mill data visualization data selfies
Visit annotations in context

Tags

infinite scroll

data selfies

tread mill

data mining

data visualization

addiction

Annotators

offray

URL

aeon.co/essays/if-the-internet-is-addictive-why-don-t-we-regulate-it
www.courthousenews.com www.courthousenews.com

2013-fracking-sites.pdf

1
1. judell 15 Dec 2016
  
  in Public
  
  digipo:analysis:gulf_of_frackwater data
Visit annotations in context

Tags

data

digipo:analysis:gulf_of_frackwater

Annotators

judell

URL

courthousenews.com/2016/06/30/2013-fracking-sites.pdf
www.dropbox.com www.dropbox.com

Gulf offshore fracking

1
1. judell 15 Dec 2016
  
  in Public
  
  digipo:analysis:gulf_of_frackwater data
Visit annotations in context

Tags

data

digipo:analysis:gulf_of_frackwater

Annotators

judell

URL

dropbox.com/sh/3w8dg2fr4bicdhm/AABOU8HvSL5Ryzyfkuxc_ruta
gemstonesoup.wordpress.com gemstonesoup.wordpress.com

Smalltalk is Dead? Long Live Smalltalk

1
1. offray 11 Dec 2016
  
  in Public
  
  Smalltalk doesn’t have to be pragmatic, because it’s better than its imitators and the things that make it different are also the things that give it an advantage.
  
  Smalltalk: advantages data week PhD
Visit annotations in context

Tags

Smalltalk: advantages

PhD

data week

Annotators

offray

URL

gemstonesoup.wordpress.com/2009/02/08/smalltalk-is-dead-long-live-smalltalk/
cplong.org cplong.org

Critical Diversity in a Digital Age

1
1. caseyboyle 10 Dec 2016
  
  in Public
  
  Preserving
  
  Really love the proposal overall and look forward to seeing what comes of the project(s).
  
  One slight thing I'd like to mention here, in the interest of furthering the critical diversity is that, in addition to our need to preserve data/archives, I'm increasingly being persuaded of the need to construct data prevention policies and techniques that would allow many people--protestors, youth, citizens, hospital patients, insurance beneficiaries, et al--much needed space to present clean-ish slates.
  
  data prevention
Visit annotations in context

Tags

data prevention

Annotators

caseyboyle

URL

cplong.org/2016/10/critical-diversity-in-a-digital-age/
blog.oceanconservancy.org blog.oceanconservancy.org

Nation’s First Regional Ocean Plans

1
1. jhh1899 09 Dec 2016
  
  in Public
  
  Northeast Ocean Data Portal
  
  This is so cool.
  
  Northeast Ocean Data Portal
Visit annotations in context

Tags

Northeast Ocean Data Portal

Annotators

jhh1899

URL

blog.oceanconservancy.org/2016/12/07/nations-first-regional-ocean-plans/
Nov 2016
mfeldstein.com mfeldstein.com

Analytics Literacy is a Major Limiter of Ed Tech Growth

1
1. otterscotter 21 Nov 2016
  
  in Public
  
  Data should extend our senses, not be a substitute for them. Likewise, analytics should augment rather than replace our native sense-making capabilities.
  
  data literacy digital literacy learning analytics analytics
Visit annotations in context

Tags

learning analytics

digital literacy

analytics

data literacy

Annotators

otterscotter

URL

mfeldstein.com/analytics-literacy-is-a-major-limiter-of-ed-tech-growth/
Oct 2016
www.whitehouse.gov www.whitehouse.gov

Federally Funded Research Results Are Becoming More Open and Accessible

1
1. otterscotter 31 Oct 2016
  
  in Public
  
  Federally Funded Research Results Are Becoming More Open and Accessible
  
  open access open data
Visit annotations in context

Tags

open data

open access

Annotators

otterscotter

URL

whitehouse.gov/blog/2016/10/28/federally-funded-research-results-are-becoming-more-open-and-accessible
www.troyhunt.com www.troyhunt.com

The Red Cross Blood Service: Australia's largest ever leak of personal data

1
1. daveh70 28 Oct 2016
  
  in Public
  
  A large database of blood donors' personal information from the AU Red Cross was posted on a web server with directory browsing enabled, and discovered by someone scanning randomly. It is unknown whether anyone else downloaded the file before it was removed.
  
  data leak data breach security
Visit annotations in context

Tags

data breach

data leak

security

Annotators

daveh70

URL

troyhunt.com/the-red-cross-blood-service-australias-largest-ever-leak-of-personal-data/
www.jacobinmag.com www.jacobinmag.com

Welcome to the Black Box

1
1. offray 25 Oct 2016
  
  in Public
  
  My hope is that the book I’ve written gives people the courage to realize that this isn’t really about math at all, it’s about power.
  
  data week
Visit annotations in context

Tags

data week

Annotators

offray

URL

jacobinmag.com/2016/09/big-data-algorithms-math-facebook-advertisement-marketing/
medium.com medium.com

Deep Learning Is Going to Teach Us All the Lesson of Our Lives: Jobs Are for Machines – Basic income – Medium

1
1. otterscotter 23 Oct 2016
  
  in Public
  
  Big Data
  
  big data information deep learning machine learning AI
Visit annotations in context

Tags

deep learning

AI

machine learning

information

big data

Annotators

otterscotter

URL

medium.com/basic-income/deep-learning-is-going-to-teach-us-all-the-lesson-of-our-lives-jobs-are-for-machines-7c6442e37a49
news.fastcompany.com news.fastcompany.com

AltSchool opens its personalized learning platform to outside schools

1
1. otterscotter 19 Oct 2016
  
  in Public
  
  AltSchool
  
  altschool big data surveillance
Visit annotations in context

Tags

surveillance

altschool

big data

Annotators

otterscotter

URL

news.fastcompany.com/altschool-opens-its-personalized-learning-platform-to-outside-schools-4022136
www.nytimes.com www.nytimes.com

Wide Sentencing Disparity Found Among U.S. Judges

1
1. libriomancer 19 Oct 2016
  
  in Public
  
  because of the judiciary’s concern that such data could be used to single out judges, who were freed from restrictive sentencing guidelines in 2005
  
  so why is everyone talking about getting rid of mandatory minimums? This makes it sounds like they've already been gotten rid of
  
  judges data sentencing sentencing disparities
Visit annotations in context

Tags

judges

sentencing

sentencing disparities

data

Annotators

libriomancer

URL

nytimes.com/2012/03/06/nyregion/wide-sentencing-disparity-found-among-us-judges.html
www.businessinsider.com www.businessinsider.com

How IoT in Education is Changing the Way We Learn

2
1. Enkerli 18 Oct 2016
  
  in Public
  
  Outside of the classroom, universities can use connected devices to monitor their students, staff, and resources and equipment at a reduced operating cost, which saves everyone money.
  
  #privacy learner data Learner as Product
2. Enkerli 18 Oct 2016
  
  in Public
  
  Devices connected to the cloud allow professors to gather data on their students and then determine which ones need the most individual attention and care.
  
  Learning Analytics Learner Data Student Success
Visit annotations in context

Tags

Learning Analytics

Learner Data

learner data

Student Success

Learner as Product

#privacy

Annotators

Enkerli

URL

businessinsider.com/internet-of-things-education-2016-9
www.theguardian.com www.theguardian.com

Machine learning: why we mustn’t be slaves to the algorithm

1
1. otterscotter 16 Oct 2016
  
  in Public
  
  Machine learning:
  
  machine learning big data algorithm
Visit annotations in context

Tags

algorithm

machine learning

big data

Annotators

otterscotter

URL

theguardian.com/commentisfree/2016/oct/16/slaves-to-algorithm-machine-learning-hidden-bias
m.pnas.org m.pnas.org

PNAS | Mobile

1
1. awakenting 06 Oct 2016
  
  in Public
  
  (courses.csail.mit.edu/18.337/2015/docs/50YearsDataScience.pdf)
  
  nice reference !
  
  data science
Visit annotations in context

Tags

data science

Annotators

awakenting

URL

m.pnas.org/content/113/34/9384.long
www.google.com www.google.com

Google for Education: Tools schools can trust

1
1. Enkerli 05 Oct 2016
  
  in Public
  
  For G Suite users in primary/secondary (K-12) schools, Google does not use any user personal information (or any information associated with a Google Account) to target ads.
  
  In other words, Google does use everyone’s information (Data as New Oil) and can use such things to target ads in Higher Education.
  
  Google GAfE Privacy Learner Data Learning Analytics
Visit annotations in context

Tags

Learner Data

Privacy

Google

GAfE

Learning Analytics

Annotators

Enkerli

URL

google.com/edu/trust/
Sep 2016
Local file Local file

I spent a weekend at Google talking with nerds about charity. I came away … worried.

1
1. offray 28 Sep 2016
  
  in Public
  
  But ultimately you have to stop being meta. As Jeff Kaufman — a developer in Cambridge who's famous among effective altruists for, along with his wife Julia Wise, donating half their household's income to effective charities — argued in a talk about why global poverty should be a major focus, if you take meta-charity too far, you get a movement that's really good at expanding itself but not necessarily good at actually helping people.
  
  "Stop being meta" could be applied in some sense to meta systems like Smalltalk and Lisp, because their tendency to develop meta tools used mostly by developers, instead of "tools" used by by mostly everyone else. Burring the distinction between "everyone else" and developers in their ability to build/use meta tools, means to deliver tools and practices that can be a bridge with meta-tools. This is something we're trying to do with Grafoscopio and the Data Week.
  
  meta sytems meta tools PhD grafoscopio data week
Tags

meta tools

PhD

data week

grafoscopio

meta sytems

Annotators

offray
medium.com medium.com

Transit 4.0 Is Now Live – Transit App – Medium

1
1. Enkerli 28 Sep 2016
  
  in Public
  
  (Crazy app uptake + riding data + math wizardry = many surprises in store.)
  
  Like Waze for public transit? Way to merge official Open Data from municipal authorities with the power of crowdsourcing mass transportation.
  
  Open Data Public Transit crowdsourcing Waze Transit App
Visit annotations in context

Tags

Open Data

Public Transit

Waze

crowdsourcing

Transit App

Annotators

Enkerli

URL

medium.com/transit-app/transit-4-0-is-now-live-2329f60fb3bb
studentprivacy.ed.gov studentprivacy.ed.gov

Protecting Student Privacy While Using Online Educational Services: Model Terms of Service

6
1. jeremydean 21 Sep 2016
  
  in Public
  
  all intellectual property rights, shall remain the exclusive property of the [School/District],
  
  This is definitely not the case. Even in private groups would it ever make sense to say this?
  
  FERPA privacy copyright intellectual property data
2. jeremydean 21 Sep 2016
  
  in Public
  
  Access
  
  This really just extends the issue of "transfer" mentioned in 9.
  
  FERPA data privacy
3. jeremydean 21 Sep 2016
  
  in Public
  
  Data Transfer or Destruction
  
  This is the first line item I don't feel like we have a proper contingency for or understand exactly how we would handle it.
  
  It seems important to address not just due to FERPA but to contracts/collaborations like that we have with eLife:
  
  What if eLife decides to drop h. Would we, could we delete all data/content related to their work with h? Even outside of contract termination, would we/could we transfer all their data back to them?
  
  The problems for our current relationship with schools is that we don't have institutional accounts whereby we might at least technically be able to collect all related data.
  
  Students could be signing up for h with personal email addresses.
  
  They could be using their h account outside of school so that their data isn't fully in the purview of the school.
  
  Question: if AISD starts using h on a big scale, 1) would we delete all AISD related data if they asked--say everything related to a certain email domain? 2) would we share all that data with them if they asked?
  
  FERPA data privacy bizdev
4. jeremydean 21 Sep 2016
  
  in Public
  
  Data cannot be shared with any additional parties without prior written consent of the Userexcept as required by law.”
  
  Something like this should probably be added to our PP.
  
  FERPA privacy data
5. jeremydean 21 Sep 2016
  
  in Public
  
  Data Collection
  
  I'm really pleased with how hypothes.is addresses the issues on this page in our Privacy Policy.
  
  FERPA privacy data
6. jeremydean 21 Sep 2016
  
  in Public
  
  There is nothing wrong with a provider usingde-‐identified data for other purposes; privacy statutes, after all, govern PII, not de-‐identified data.
  
  Key point.
  
  FERPA TOSs data privacy
Visit annotations in context

Tags

data

TOSs

FERPA

copyright

intellectual property

bizdev

privacy

Annotators

jeremydean

URL

studentprivacy.ed.gov/sites/default/files/resource_document/file/TOS_Guidance_Mar2016.pdf
www.sr.ithaka.org www.sr.ithaka.org

Untitled document

8
1. Enkerli 12 Sep 2016
  
  in Public
  
  Application Modern higher education institutions have unprecedentedly large and detailed collections of data about their students, and are growing increasingly sophisticated in their ability to merge datasets from diverse sources. As a result, institutions have great opportunities to analyze and intervene on student performance and student learning. While there are many potential applications of student data analysis in the institutional context, we focus here on four approaches that cover a broad range of the most common activities: data-based enrollment management, admissions, and financial aid decisions; analytics to inform broad-based program or policy changes related to retention; early-alert systems focused on successful degree completion; and adaptive courseware.
  
  Perhaps even more than other sections, this one recalls the trope:
  
  The difference probably comes from the impact of (institutional) “application”.
  
  Responsibility learner data Academic Institutions stakeholders
2. Enkerli 12 Sep 2016
  
  in Public
  
  the risk of re-identification increases by virtue of having more data points on students from multiple contexts
  
  Very important to keep in mind. Not only do we realise that re-identification is a risk, but this risk is exacerbated by the increase in “triangulation”. Hence some discussions about Differential Privacy.
  
  learner data anonymity de-anonymisation re-identification research ethics
3. Enkerli 12 Sep 2016
  
  in Public
  
  the automatic collection of students’ data through interactions with educational technologies as a part of their established and expected learning experiences raises new questions about the timing and content of student consent that were not relevant when such data collection required special procedures that extended beyond students’ regular educational experiences of students
  
  Useful reminder. Sounds a bit like “now that we have easier access to data, we have to be particularly careful”. Probably not the first reflex of most researchers before they start sending forms to their IRBs. Important for this to be explicitly designated as a concern, in IRBs.
  
  research ethics Data Economy
4. Enkerli 12 Sep 2016
  
  in Public
  
  Responsible Use
  
  Again, this is probably a more felicitous wording than “privacy protection”. Sure, it takes as a given that some use of data is desirable. And the preceding section makes it sound like Learning Analytics advocates mostly need ammun… arguments to push their agenda. Still, the notion that we want to advocate for responsible use is more likely to find common ground than this notion that there’s a “data faucet” that should be switched on or off depending on certain stakeholders’ needs. After all, there exists a set of data use practices which are either uncontroversial or, at least, accepted as “par for the course” (no pun intended). For instance, we probably all assume that a registrar should receive the grade data needed to grant degrees and we understand that such data would come from other sources (say, a learning management system or a student information system).
  
  research ethics ethics #privacy learner data Responsibility Responsible Use
5. Enkerli 12 Sep 2016
  
  in Public
  
  Data sharing over open-source platforms can create ambiguous rules about data ownership and publication authorship, or raise concerns about data misuse by others, thus discouraging liberal sharing of data.
  
  Surprising mention of “open-source platforms”, here. Doesn’t sound like these issues are absent from proprietary platforms. Maybe they mean non-institutional platforms (say, social media), where these issues are really pressing. But the wording is quite strange if that is the case.
  
  Open Source Learning Analytics learner data
6. Enkerli 08 Sep 2016
  
  in Public
  
  captures values such as transparency and student autonomy
  
  Indeed. “Privacy” makes it sound like a single factor, hiding the complexity of the matter and the importance of learners’ agency.
  
  Quotables #LearnerAgency learner data #privacy
7. Enkerli 08 Sep 2016
  
  in Public
  
  Activities such as time spent on task and discussion board interactions are at the forefront of research.
  
  Really? These aren’t uncontroversial, to say the least. For instance, discussion board interactions often call for careful, mixed-method work with an eye to preventing instructor effect and confirmation bias. “Time on task” is almost a codeword for distinctions between models of learning. Research in cognitive science gives very nuanced value to “time spent on task” while the Malcolm Gladwells of the world usurp some research results. A major insight behind Competency-Based Education is that it can allow for some variance in terms of “time on task”. So it’s kind of surprising that this summary puts those two things to the fore.
  
  Learning Analytics learner data measurability Time on task #CompetencyBasedEducation Cognitive Science Malcolm Gladwell Discourse Analysis #ConfirmationBias Instructor Effect
8. Enkerli 08 Sep 2016
  
  in Public
  
  Research: Student data are used to conduct empirical studies designed primarily to advance knowledge in the field, though with the potential to influence institutional practices and interventions. Application: Student data are used to inform changes in institutional practices, programs, or policies, in order to improve student learning and support. Representation: Student data are used to report on the educational experiences and achievements of students to internal and external audiences, in ways that are more extensive and nuanced than the traditional transcript.
  
  Ha! The Chronicle’s summary framed these categories somewhat differently. Interesting. To me, the “application” part is really about student retention. But maybe that’s a bit of a cynical reading, based on an over-emphasis in the Learning Analytics sphere towards teleological, linear, and insular models of learning. Then, the “representation” part sounds closer to UDL than to learner-driven microcredentials. Both approaches are really interesting and chances are that the report brings them together. Finally, the Chronicle made it sound as though the research implied here were less directed. The mention that it has “the potential to influence institutional practices and interventions” may be strategic, as applied research meant to influence “decision-makers” is more likely to sway them than the type of exploratory research we so badly need.
  
  learner data Learning Analytics Education Research meta-annotation Chronicle of Higher Education Applied Research
Visit annotations in context

Tags

Responsibility

learner data

Education Research

Quotables

re-identification

research ethics

measurability

ethics

Open Source

#CompetencyBasedEducation

stakeholders

Data Economy

Cognitive Science

Discourse Analysis

meta-annotation

Applied Research

Responsible Use

Chronicle of Higher Education

Malcolm Gladwell

de-anonymisation

Learning Analytics

Time on task

Instructor Effect

#ConfirmationBias

Academic Institutions

anonymity

#privacy

#LearnerAgency

Annotators

Enkerli

URL

sr.ithaka.org/publications/student-data-in-the-digital-era/
www.chronicle.com www.chronicle.com

Group Unveils a 'Model Policy' for Handling Student Data

2
1. Enkerli 08 Sep 2016
  
  in Public
  
  often private companies whose technologies power the systems universities use for predictive analytics and adaptive courseware
  
  #MoneyQuote #BigData learner data Learner as Product Business Models for Higher Education predictive models Learning Analytics Personal-ized Education
2. Enkerli 08 Sep 2016
  
  in Public
  
  the use of data in scholarly research about student learning; the use of data in systems like the admissions process or predictive-analytics programs that colleges use to spot students who should be referred to an academic counselor; and the ways colleges should treat nontraditional transcript data, alternative credentials, and other forms of documentation about students’ activities, such as badges, that recognize them for nonacademic skills.
  
  Useful breakdown. Research, predictive models, and recognition are quite distinct from one another and the approaches to data that they imply are quite different. In a way, the “personalized learning” model at the core of the second topic is close to the Big Data attitude (collect all the things and sense will come through eventually) with corresponding ethical problems. Through projects vary greatly, research has a much more solid base in both ethics and epistemology than the kind of Big Data approach used by technocentric outlets. The part about recognition, though, opens the most interesting door. Microcredentials and badges are a part of a broader picture. The data shared in those cases need not be so comprehensive and learners have a lot of agency in the matter. In fact, when then-Ashoka Charles Tsai interviewed Mozilla executive director Mark Surman about badges, the message was quite clear: badges are a way to rethink education as a learner-driven “create your own path” adventure. The contrast between the three models reveals a lot. From the abstract world of research, to the top-down models of Minority Report-style predictive educating, all the way to a form of heutagogy. Lots to chew on.
  
  Learning Analytics #BigData Data Economy research ethics ethics learner data #LearnerAgency Learner as Product learner-driven education predictive models #OpenBadges
Visit annotations in context

Tags

learner data

#BigData

#MoneyQuote

research ethics

ethics

Personal-ized Education

Data Economy

Business Models for Higher Education

#OpenBadges

Learning Analytics

predictive models

Learner as Product

learner-driven education

#LearnerAgency

Annotators

Enkerli

URL

chronicle.com/article/Group-Unveils-a-Model-Policy/237690
www.theguardian.com www.theguardian.com

Universities are tracking their students. Is it clever or creepy?

1
1. Enkerli 06 Sep 2016
  
  in Public
  
  “We need much more honesty, about what data is being collected and about the inferences that they’re going to make about people. We need to be able to ask the university ‘What do you think you know about me?’”
  
  Quotables #privacy learner data
Visit annotations in context

Tags

learner data

Quotables

#privacy

Annotators

Enkerli

URL

theguardian.com/higher-education-network/2016/aug/03/learning-analytics-universities-data-track-students
worrydream.com worrydream.com

What can a technologist do about climate change? A personal view.

7
1. offray 05 Sep 2016
  
  in Public
  
  The importance of models may need to be underscored in this age of “big data” and “data mining”. Data, no matter how big, can only tell you what happened in the past. Unless you’re a historian, you actually care about the future — what will happen, what could happen, what would happen if you did this or that. Exploring these questions will always require models. Let’s get over “big data” — it’s time for “big modeling”.
  
  big data small data big models model driven
2. offray 05 Sep 2016
  
  in Public
  
  Readers are thus encouraged to examine and critique the model. If they disagree, they can modify it into a competing model with their own preferred assumptions, and use it to argue for their position. Model-driven material can be used as grounds for an informed debate about assumptions and tradeoffs. Modeling leads naturally from the particular to the general. Instead of seeing an individual proposal as “right or wrong”, “bad or good”, people can see it as one point in a large space of possibilities. By exploring the model, they come to understand the landscape of that space, and are in a position to invent better ideas for all the proposals to come. Model-driven material can serve as a kind of enhanced imagination.
  
  This is a part where my previous comments on data activism data journalism (see 1,2 & 3) and more plural computing environments for engagement of concerned citizens on the important issues of our time could intersect with Victor's discourse.
  
  data activism data journalism
3. offray 05 Sep 2016
  
  in Public
  
  The Gamma: Programming tools for data journalism
  
  (b) languages for novices or end-users, [...] If we can provide our climate scientists and energy engineers with a civilized computing environment, I believe it will make a very significant difference.
  
  But data journalists, and in fact, data activist, social scientist, and so on, could be a "different type of novice", one that is more critically and politically involved (in the broader sense of the "politic" word).
  
  The wider dialogue on important matters that is mediated, backed up and understood by dealing data, (as climate change) requires more voices that the ones are involved today, and because they need to be reason and argument using data, we need to go beyond climate scientist or energy engeeners as the only ones who need a "civilized computing environment" to participate in the important complex and urgent matters of today world. Previously, these more critical voices (activists, journalists, scientists) have helped to make policy makers accountable and more sensible on other important and urgent issues.
  
  In that sense my work with reproducible research in my Panama Papers as a prototype of a data continuum environment, or others, like Gamma, could serve as an exploration, invitation and early implementation of what is possible to enrich this data/computing enhanced dialogue.
  
  data week panama papers
4. offray 05 Sep 2016
  
  in Public
  
  I say this despite the fact that my own work has been in much the opposite direction as Julia. Julia inherits the textual interaction of classic Matlab, SciPy and other children of the teletype — source code and command lines.
  
  The idea of a tradition technologies which are "children of teletype" is related to the comparison we do in the data week workshop/hackathon. In our case we talk about "unix fathers" versus "dynabook children" and bifurcation/recombination points of this technologies:
  
  children of teletype data week
5. offray 05 Sep 2016
  
  in Public
  
  If efficiency incentives and tools have been effective for utilities, manufacturers, and designers, what about for end users? One concern I’ve always had is that most people have no idea where their energy goes, so any attempt to conserve is like optimizing a program without a profiler.
  
  end users data visualization
6. offray 05 Sep 2016
  
  in Public
  
  The catalyst for such a scale-up will necessarily be political. But even with political will, it can’t happen without technology that’s capable of scaling, and economically viable at scale. As technologists, that’s where we come in.
  
  May be we come before, by enabling this conversation (as said previously). Political agenda is currently coopted by economical interests far away of a sustainable planet or common good. Feedback loops can be a place to insert counter-hegemonic discourse to enable a more plural and rational dialogue between civil society and goverment, beyond short term economic current interest/incumbents.
  
  data week data selfies
7. offray 04 Sep 2016
  
  in Public
  
  This is aimed at people in the tech industry, and is more about what you can do with your career than at a hackathon. I’m not going to discuss policy and regulation, although they’re no less important than technological innovation. A good way to think about it, via Saul Griffith, is that it’s the role of technologists to create options for policy-makers.
  
  Nice to see this conversation happening between technology and broader socio-political problems so explicit in Bret's discourse.
  
  What we're doing in fact is enabling this conversation between technologist and policy-makers first, and we're highlighting it via hackathon/workshops, but not reducing it only to what happens there (an interesting critique to the techno-solutionism hackathon is here), using the feedback loops in social networks, but with an intention of mobilizing a setup that goes beyond. One example is our twitter data selfies (picture/link below). The necesity of addressing urgent problem that involve techno-socio-political complex entanglements is more felt in the Global South.
  
  ^ Up | Twitter data selfies: a strategy to increase the dialog between technologist/hackers and policy makers (click here for details).
  
  data week data selfies hackathon dialogue data visualization smalltalk data activism
Visit annotations in context

Tags

model driven

end users

data selfies

big models

hackathon

smalltalk

dialogue

data week

children of teletype

small data

data activism

data journalism

panama papers

data visualization

big data

Annotators

offray

URL

worrydream.com/ClimateChange/
Aug 2016
www.dati.gov.it www.dati.gov.it

LG2016_finale.pdf

1
1. cirospat 31 Aug 2016
  
  in Public
  
  DATA GOVERNANCE
  
  la Data Governance fa pensare ad una Pubblica Amministrazione come unico organismo pensante e decisorio. Un concetto facile da metabolizzare, ma che non rispecchia spesso l'architettura reale delle PA di grandi dimensioni come i Comuni capoluogo, ad esempio.
  
  La Data Governance parte da una PA che ha progettato o implementato la sua piattaforma informatica di 1) gestione dei flussi di lavoro interni e 2) gestione di servizi erogati all'utenza, in maniera tale da eliminare totalmente l'uso del supporto cartaceo e da permettere esclusivamente il data entry sia internamente dagli uffici che dall'utenza che richiede servizi pubblici agli enti pubblici. La Data Governance può essere adeguatamente ed efficacemente attuata solo se nella PA si tiene conto di questi elementi anzidetti. In merito colgo l'occasione per citare le 7 piattaforme ICT che le 14 grandi città metropolitane italiane devono realizzare nel contesto del PON METRO. Ecco questa si presenta come un occasione per le 14 grandi città italiane di dotarsi della stessa DATA GOVERNANCE, visto che le 7 piattaforme ICT devono (requisito) essere interoperabili tra loro. La Data Governance si crea insieme alla progettazione delle piattaforme informatiche che permettono alla PA di "funzionare" nei territori. La Data Governance è indissolubilmente legata al "data entry". Il data entry non prevede scansioni di carta o gestione di formati di lavoro non aperti. La Data Governance nelle sue procedure operative quotidiana è alla base della politica open data di qualità. Una Data Governance della PA nel 2016-17-... non può ancora fondarsi nella costruzione manuale del formato CSV e relativa pubblicazione manuale ad opera del dipendente pubblico. Una Data Governance dovrebbe tenere in considerazione che le procedure di pubblicazione dei dataset devono essere automatiche e derivanti dalle funzionalità degli stessi applicativi gestionali (piattaforme ICT) in uso nella PA, senza alcun intervento umano se non nella fase di filtraggio/oscuramento dei dati che afferiscono alla privacy degli individui.
  
  Data Governance PON METRO data entry
Visit annotations in context

Tags

data entry

Data Governance

PON METRO

Annotators

cirospat

URL

dati.gov.it/sites/default/files/LG2016_finale.pdf
www.cdc.gov www.cdc.gov

Facts About ASDs

1
1. Credibull 12 Aug 2016
  
  in Public
  
  Credibull score = 9.60 / 10
  
  To provide feedback on the score fill in the form available here
  
  What is Credibull? getcredibull.com
  
  autism data prevalence
Visit annotations in context

Tags

data

autism

prevalence

Annotators

Credibull

URL

cdc.gov/ncbddd/autism/data.html
books.google.ca books.google.ca

Scholarship in the Digital Age

2
1. daniel.odonnell 01 Aug 2016
  
  in Public
  
  Page 122
  
  Borgman on terms used by the humanities and social sciences to describe data and other types of analysis
  
  humanist and social scientists frequently distinguish between primary and secondary information based on the degree of analysis. Yet this ordering sometimes conflates data, sources, and resources, as exemplified by a report that distinguishes "primary resources, E. G., Books close quotation from quotation secondary resources, eat. Gee., Catalogs close quotation . Resources also categorized as primary or sensor data, numerical data, and field notebooks, all of which would be considered data in the sciences. Rarely would books, conference proceedings, and feces that the report categorizes as primary resources be considered data, except when used for text-or data-mining purposes. Catalogs, subject indices, citation indexes, search engines, and web portals were classified as secondary resources. These are typically viewed as tertiary resources in the library community because they describe primary and secondary resources. The distinctions between data, sources, and resources very by discipline and circumstance. For the purposes of this book, primary resources are data, secondary resources are reports of research, whether publications or intern forms, and tertiary resources are catalogs, indexes, and directories that provide access to primary and secondary resources. Sources are the origins of these resources.
  
  Borgman 2007 Primary sources Secondary sources Data
2. daniel.odonnell 01 Aug 2016
  
  in Public
  
  Page XVIII
  
  Borgman notes that no social framework exist for data that is comparable to this framework that exist for analysis. CF. Kitchen 2014 who argues that pre-big data, we privileged analysis over data to the point that we threw away the data after words . This is what creates the holes in our archives.
  
  He wonders capabilities [of the data management] must be compared to the remarkably stable scholarly communication system in which they exist. The reward system continues to be based on publishing journal articles, books, and conference papers. Peer-reviewed legitimizes scholarly work. Competition and cooperation are carefully balanced. The means by which scholarly publishing occurs is an unstable state, but the basic functions remained relatively unchanged. while capturing and managing the "data deluge" is a major driver of the scholarly infrastructure developments, no Showshow same framework for data exist that is comparable to that for publishing.
  
  Borgman 2007 Kitchin 2014 Data publication Scholarly Communication
Visit annotations in context

Tags

Data publication

Data

Kitchin 2014

Borgman 2007

Primary sources

Scholarly Communication

Secondary sources

Annotators

daniel.odonnell

URL

books.google.ca/books/about/Scholarship_in_the_Digital_Age.html
Jul 2016
books.google.ca books.google.ca

Scholarship in the Digital Age

21
1. daniel.odonnell 31 Jul 2016
  
  in Public
  
  Page 220
  
  Humanistic research takes place in a rich milieu that incorporates the cultural context of artifacts. Electronic text and models change the nature of scholarship in subtle and important ways, which have been discussed at great length since the humanities first began to contemplate the scholarly application of computing.
  
  borgman 2007 electronic texts data humanities data disciplinary difference
2. daniel.odonnell 31 Jul 2016
  
  in Public
  
  Page 217
  
  Methods for organizing information in the humanities follow from their research practices. Humanists fo not rely on subject indexing to locate material to the extent that the social sciences or sciences do. They are more likely to be searching for new interpretations that are not easily described in advance; the journey through texts, libraries, and archives often is the research.
  
  borgman 2007 humanities methodology data humanities data
3. daniel.odonnell 31 Jul 2016
  
  in Public
  
  Page 223
  
  Borgman is discussing here the difference in the way humanists handle data in comparison to the way that scientists and social scientist:
  
  When generating their own data such as interviews or observations, human efforts to describe and represent data are comparable to that of scholars and other disciplines. Often humanists are working with materials already described by the originator or holder of the records, such as libraries, archives, government agencies, or other entities. Whether or not the desired content already is described as data, scholars need to explain its evidentiary value in your own words. That report often becomes part of the final product. While scholarly publications in all fields set data within a context, the context and interpretation are scholarship in the humanities.
  
  borgman 2007 data humanities disciplinary difference
4. daniel.odonnell 31 Jul 2016
  
  in Public
  
  Pages 220-221
  
  Digital Humanities projects result in two general types of products. Digital libraries arise from scholarly collaborations and the initiatives of cultural heritage institutions to digitize their sources. These collections are popular for research and education. … The other general category of digital humanities products consist of assemblages of digitized cultural objects with associated analyses and interpretations. These are the equivalent of digital books in that they present an integrated research story, but they are much more, as they often include interactive components and direct links to the original sources on which the scholarship is based. … Projects that integrate digital records for widely scattered objects are a mix of a digital library and an assemblage.
  
  borgman 2007 digital humanities editions digital libraries data humanities data
5. daniel.odonnell 31 Jul 2016
  
  in Public
  
  Page 219
  
  In the humanities, it is difficult to separate artifacts from practices or publications from data.
  
  borgman 2007 humanities humanities data disciplinary difference
6. daniel.odonnell 31 Jul 2016
  
  in Public
  
  Page 219
  
  Humanities scholars integrate and aggregate data from many sources. They need tools and services to analyze digital data, as others do the sciences and social sciences, but also tools that assist them interpretation and contemplation.
  
  borgman 2007 data humanities citation practices citation disciplinary difference
7. daniel.odonnell 31 Jul 2016
  
  in Public
  
  Page 215
  
  What seems a clear line between publications and data in the sciences and social sciences is a decidedly fuzzy one in the humanities. Publications and other documents are central sources of data to humanists. … Data sources for the humanities are innumerable. Almost any document, physical artifact, or record of human activity can be used to study culture. Humanities scholars value new approaches, and recognizing something as a source of data (e.g., high school yearbooks, cookbooks, or wear patterns in the floor of public places) can be an act of scholarship. Discovering heretofore unknown treasures buried in the world's archives is particularly newsworthy. … It is impossible to inventory, much less digitize, all the data that might be useful scholarship communities. Also distinctive about humanities data is their dispersion and separation from context. Cultural artifacts are bought and sold, looted in wars, and relocated to museums and private collections. International agreements on the repatriation of cultural objects now prevent many items from being exported, but items that were exported decades or centuries ago are unlikely to return to their original site. … Digitizing cultural records and artifacts make them more malleable and mutable, which creates interesting possibilities for analyzing, contextualizing, and recombining objects. Yet digitizing objects separates them from the origins, exacerbating humanists’ problems in maintaining the context. Removing text from its physical embodiment in a fixed object may delete features that are important to researchers, such as line and page breaks, fonts, illustrations, choices of paper, bindings, and marginalia. Scholars frequently would like to compare such features in multiple additions or copies.
  
  borgman 2007 data humanities data humanities disciplinary difference
8. daniel.odonnell 31 Jul 2016
  
  in Public
  
  Page 214
  
  Borgman on information artifacts and communities:
  
  Artifacts in the humanities differ from those of the sciences and social sciences in several respects. Humanist use the largest array of information sources, and as a consequence, the station between documents and data is the least clear. They also have a greater number of audiences for the data and the products of the research. Whereas scientific findings usually must be translated for a general audience, humanities findings often are directly accessible and of immediate interest to the general public.
  
  borgman 2007 humanities data humanities data disciplinary difference
9. daniel.odonnell 31 Jul 2016
  
  in Public
  
  Page 204
  
  Borgman on the different types of data in the social sciences:
  
  Data in the social sciences fall into two general categories. The first is data collected by researchers through experiments, interviews, surveys, observations, or similar names, analogous to scientific methods. … the second category is data collected by other people or institutions, usually for purposes other than research.
  
  borgman 2007 social sciences data disciplinary difference
10. daniel.odonnell 31 Jul 2016
  
  in Public
  
  Page 202
  
  Borgman on information artifacts in the social sciences
  
  like the sciences, the social sciences create and use minimal information. Yet they differ in the sources of the data. While almost all scientific data are created by for scientific purposes, a significant portion of social scientific data consists of records credit for other purposes, by other parties.
  
  borgman 2007 social sciences disciplinary difference Scholarly Communication data
11. daniel.odonnell 29 Jul 2016
  
  in Public
  
  Borgman, Christine L. 2007. Scholarship in the Digital Age: Information, Infrastructure, and the Internet. Cambridge, Mass: MIT Press.
  
  My notes
  
  borgman 2007 data digital scholarship Scholarly Communication digital humanities
12. daniel.odonnell 28 Jul 2016
  
  in Public
  
  Page 147
  
  Borgman on the challenges facing the humanities in the age of Big Data:
  
  Text and data mining offer similar Grand challenges in the humanities and social sciences. Gregory crane provide some answers to the question what do you do with a million books? Two obvious answers include the extraction of information about people, places, and events, and machine translation between languages. As digital libraries of books grow through scanning avert such as Google print, the open content Alliance, million books project, and comparable projects in Europe and China, and as more books are published in digital form technical advances in data description, and now it says, and verification are essential. These large collections differ from earlier, smaller after it's on several Dimensions. They are much larger in scale, the content is more heterogenous in topic and language, the granularity creases when individual words can be tagged and they were noisy then there well curated predecessors, and their audiences more diverse, reaching the general public in addition to the scholarly community. Computer scientists are working jointly with humanist, language, and other demands specialist to pars tax, extract named entities in places, I meant optical character recognition techniques counter and Advance the state of art of information retrieval.
  
  Borgman 2007 Data Humanities
13. daniel.odonnell 28 Jul 2016
  
  in Public
  
  Page 137
  
  Borgman discusses hear the case of NASA which lost the original video recording of the first moon landing in 1969. Backups exist, apparently, but they are lower quality than the originals.
  
  Borgman NASA Data Data preservation
14. daniel.odonnell 28 Jul 2016
  
  in Public
  
  Page 122
  
  Here Borgman suggest that there is some confusion or lack of overlap between the words that humanist and social scientists use in distinguishing types of information from the language used to describe data.
  
  Humanist and social scientists frequently distinguish between primary and secondary information based on the degree of analysis. Yet this ordering sometimes conflates data sources, and resorces, as exemplified by a report that distinguishes quote primary resources, ed books quote from quote secondary resources, Ed catalogs quote. Resorts is also categorized as primary wear sensor data AMA numerical data and filled notebooks, all of which would be considered data in The Sciences. But rarely would book cover conference proceedings, and he sees that the report categorizes as primary resources be considered data, except when used for text or data mining purposes. Catalogs, subject indices, citation index is, search engines, and web portals were classified as secondary resources.
  
  Borgman 2007 Data Classification Disciplinary differences
15. daniel.odonnell 28 Jul 2016
  
  in Public
  
  Pages 119 and 120
  
  Here Borgman discusses the various definitions of data showing them working across the fields
  
  the following definition of data is widely accepted in this context: AT&T portable representation of information in a formalized manner suitable for communication, interpretation, or processing. Examples of data include a sequence of bits, a table of numbers, the characters on a page, recording of sounds made by a person speaking Ori moon rocks specimen. Definitions of data often arise from Individual disciplines, but can apply to data used in science, technology, the social sciences, and the humanities: data are facts, numbers, letters, and symbols that describe an object, idea, condition, situation, or other factors.... Terms data and facts are treated interchangeably, as is the case in legal context. Sources of data includes observations, complications, experiment, and record-keeping. Observational data include weather measurements... And attitude surveys... Or involve multiple places and times. Computational data result from executing a computer model or simulation.... experimental data include results from laboratory studies such as measurements of chemical reactions or from field experiments such as controlled Behavioral Studies.... records of government, business, and public and private life also yield useful data for scientific, social scientific, and humanistic research.
  
  Borgman 2007 Data Definitions
16. daniel.odonnell 28 Jul 2016
  
  in Public
  
  Pages 117 to 1:19
  
  Here Borgman discusses the ability to go back and forth between data and reports on data she cites Phil born 2005 on this for a while medicine. She also discusses how in the pre-digital error data was understood as a support mechanism for final publication and as a result was allowed to deteriorate or be destroyed after the Publications upon which they were based appeared.
  
  Borgman 2007 Data
17. daniel.odonnell 28 Jul 2016
  
  in Public
  
  Page 115
  
  Borgman makes the point here that while there is a Commons in the infrastructure of scholarly publishing there is less of a Commons in the infrastructure 4 data across disciplines.
  
  The infrastructure of scholarly publishing Bridges disciplines: every field produces Journal articles, conference papers, and books albeit in differing ratios. Libraries select, collect organize and make accessible publications of all types, from all fields. No comparable infrastructure exists for data. A few Fields have major mechanisms for publishing data in repositories. Some fields are in the stage of developing standards and practices to activate their data resorces and Nathan were widely accessible. In most Fields, especially Outside The Sciences, data practices remain local idiosyncratic, and oriented to current usage rather than preservation operation, and access. Most data collections Dash where they exist Dash are managed by individual agencies within disciplines, rather than by libraries are archives. Data managers usually are trained within the disciplines they serve. Only a few degree programs and information studies include courses on data management. The lack of infrastructure for data amplifies the discontinuities in scholarly publishing despite common concerns, independent debates continue about access to Publications and data.
  
  Borgman 2007 Scholarly Communication Scholarly Commons Scholarly Data
18. daniel.odonnell 28 Jul 2016
  
  in Public
  
  Page 41
  
  discussions of digital scholarship tend to distinguish implicitly or explicitly between data and documents. Some of you data and documents as a Continuum rather than a dichotomy in this sense data such as numbers images and observations are the initial products of research, and Publications are the final products that set research findings in context.
  
  Borgman 2007 Data Documents
19. daniel.odonnell 27 Jul 2016
  
  in Public
  
  A great paragraph here on the value of interconnection
  
  scholarly data and documents are of most value when they are interconnected rather than independent. The outcomes of a research project could be understood most fully if it were possible to trace an important finding from a grant proposal, to data collection, to a data set, to its publication, to its subsequent review and comment period journal articles are more valuable if one can jump directly from the article to those insights into later articles that cite the source article. Articles are even more valuable if they provide links to data on which they are based. Some of these capabilities already are available, but their expansion depends more on the consistency of the data description, access arrangements, and intellectual property agreement then on technological advances.
  
  I think here of the line from Jim Gill may all your problems be technical
  
  Borgman 2007 Interconnection Data Scholarly Communication
20. daniel.odonnell 27 Jul 2016
  
  in Public
  
  p. 8-actually this is link to p. 7, since 8 is excluded
  
  Another trend is the blurring of the distinction between primary sources, generally viewed as unprocessed or unanalysed data, and secondary sources that set data in context.
  
  Good point about how this is a new thing. On the next page she discusses how we are now collpasing the traditional distinction between primary and secondary sources.
  
  borgman 2007 data primary sources secondary sources method
21. daniel.odonnell 27 Jul 2016
  
  in Public
  
  p. 6
  
  Retrieval methods designed for small databases decline rapidly in effectiveness as collections grow...
  
  This is an interesting point that is missed in the Distant reading controversies: its all very well to say that you prefer close reading, but close reading doesn't scale--or rather the methodologies used to decide what to close read were developed when big data didn't exist. How to you combine that when you can read everything. I.e. You close read Dickins because he's what survived the 19th C as being worth reading. But now, if we could recover everything from the 19th C how do you justify methodologically not looking more widely?
  
  borgman 2007 distant reading algorithmic criticism Moretti Ramsay Fish Culler data humanities data digital humanities
Visit annotations in context

Tags

digital libraries

digital humanities

Data

secondary sources

humanities

Borgman

primary sources

electronic texts

methodology

data

Definitions

citation

disciplinary difference

digital scholarship

Borgman 2007

algorithmic criticism

borgman 2007

Ramsay

NASA

method

Fish

citation practices

Scholarly Commons

Interconnection

humanities data

Scholarly Communication

social sciences

Disciplinary differences

Humanities

Culler

editions

Moretti

Classification

Scholarly

distant reading

Data preservation

Documents

Annotators

daniel.odonnell

URL

books.google.ca/books/about/Scholarship_in_the_Digital_Age.html
books.google.ca books.google.ca

Hermeneutica

2
1. daniel.odonnell 31 Jul 2016
  
  in Public
  
  Page 14
  
  Rockwell and Sinclair note that corporations are mining text including our email; as they say here:
  
  more and more of our private textual correspondence is available for large-scale analysis and interpretation. We need to learn more about these methods to be able to think through the ethical, social, and political consequences. The humanities have traditions of engaging with issues of literacy, and big data should be not an exception. How to analyze interpret, and exploit big data are big problems for the humanities.
  
  rockwell and sinclair 2016 data humanities humanities data big data
2. daniel.odonnell 31 Jul 2016
  
  in Public
  
  Page 14
  
  Rockwell and Sinclair note that HTML and PDF documents account for 17.8% and 9.2% of (I think) all data on the web while images and movies account for 23.2% and 4.3%.
  
  rockwell and sinclair 2016 data web datatypes
Visit annotations in context

Tags

data

humanities data

rockwell and sinclair 2016

datatypes

humanities

web

big data

Annotators

daniel.odonnell

URL

books.google.ca/books/about/Hermeneutica.html
heretothere.trubox.ca heretothere.trubox.ca

Open analysis of open content & pedagogy

1
1. otterscotter 28 Jul 2016
  
  in Public
  
  “knowledge creation”.
  
  The "business" of univeristy?
  
  higher ed knowledge open data student data measure learning
Visit annotations in context

Tags

open data

measure learning

higher ed

knowledge

student data

Annotators

otterscotter

URL

heretothere.trubox.ca/open-analysis-of-open-content-pedagogy/
journals-openedition-org.accesdistant.sorbonne-universite.fr journals-openedition-org.accesdistant.sorbonne-universite.fr

Le profil : une rhétorique dispositive

1
1. laconis 28 Jul 2016
  
  in Public
  
  big data
  
  les algorithmes ont besoin de données soi-disant neutres.. c'est un peu aller dans le sens des discours d'accompagnement de ces algorithmes et services de recommandation qui considèrent leurs données "naturelles", sans valeur intrasèque. (voir Bonenfant 2015)
  
  gouvernementalité big data
Visit annotations in context

Tags

big data

gouvernementalité

Annotators

laconis

URL

journals-openedition-org.accesdistant.sorbonne-universite.fr/itineraires/3056
books.google.ca books.google.ca

The Data Revolution

4
1. daniel.odonnell 27 Jul 2016
  
  in Public
  
  p. 141
  
  Initially, the digital humanities consisted of the curation and analysis of data that were born digital, and the digitisation and archiving projects that sought to render analogue texts and material objects into digital forms that could be organised and searched and be subjects to basic forms of overarching, automated or guided analysis, such as summary visualisations of content or connections between documents, people or places. Subsequently, its advocates have argued that the field has evolved to provide more sophisticated tools for handling, searching, linking, sharing and analysing data that seek to complement and augment existing humanities methods, and facilitate traditional forms of interpretation and theory building, rather than replacing traditional methods or providing an empiricist or positivistic approach to humanities scholarship.
  
  summary of history of digital humanities
  
  Kitchin 2014 data humanities data digital humanities history history of science history of ideas
2. daniel.odonnell 27 Jul 2016
  
  in Public
  
  p. 100
  
  Data are not useful in and of themselves. They only have utility if meaning and value can be extracted from them. In other words, it is what is done with data that is important, not simply that they are generated. The whole of science is based on realising meaning and value from data. Making sense of scaled small data and big data poses new challenges. In the case of scaled small data, the challenge is linking together varied datasets to gain new insights and opening up the data to new analytical approaches being used in big data. With respect to big data, the challenge is coping with its abundance and exhaustivity (including sizeable amounts of data with low utility and value), timeliness and dynamism, messiness and uncertainty, high relationality, semi-structured or unstructured nature, and the fact that much of big data is generated with no specific question in mind or is a by-product of another activity. Indeed, until recently, data analysis techniques have primarily been designed to extract insights from scarce, static, clean and poorly relational datasets, scientifically sampled and adhering to strict assumptions (such as independence, stationarity, and normality), and generated and alanysed with a specific question in mind.
  
  Good discussion of the different approaches allowed/required by small v. big data.
  
  Kitchin 2014 data small data big data
3. daniel.odonnell 27 Jul 2016
  
  in Public
  
  p. 86
  
  25% of data stored in digital form in 2000 (the rest analogue; 94% by 2007
  
  Kitchin 2014 data
4. daniel.odonnell 27 Jul 2016
  
  in Public
  
  Kitchin, Rob. 2014. The Data Revolution. Thousand Oaks, CA: SAGE Publications Ltd.
  
  Kitchin 2014 data
Visit annotations in context

Tags

data

digital humanities

humanities data

history of ideas

Kitchin 2014

small data

history

history of science

big data

Annotators

daniel.odonnell

URL

books.google.ca/books/about/The_Data_Revolution.html
books.google.ca books.google.ca

The Data Revolution

1
1. daniel.odonnell 27 Jul 2016
  
  in Public
  
  Kitchin, Rob. 2014. The Data Revolution. Thousand Oaks, CA: SAGE Publications Ltd.
  
  Kitchin 2014 data
Visit annotations in context

Tags

data

Kitchin 2014

Annotators

daniel.odonnell

URL

books.google.ca/books/about/The_Data_Revolution.html
www.clir.org www.clir.org

pub171

1
1. micahvandegrift 26 Jul 2016
  
  in Public
  
  digital data
  
  are there non-digital data?
  
  data
Visit annotations in context

Tags

data

Annotators

micahvandegrift

URL

clir.org/pubs/reports/pub171/pub171
lawriephipps.co.uk lawriephipps.co.uk

Open Analytics?

2
1. daniellynds 22 Jul 2016
  
  in Public
  
  The visualisation may look like data, but it is a snapshot of how I am connected, it is my rhizomatic digital landscape. For me it reinforces the fact that digital is people.
  
  Really nice way to end the article.
  
  I love Data = People :)
  
  #data #dataviz
2. daniellynds 22 Jul 2016
  
  in Public
  
  Everyone should acquire the skills to understand data, and analytics.
  
  AMEN!
  
  #highered #data #dataviz
Visit annotations in context

Tags

#highered

#dataviz

#data

Annotators

daniellynds

URL

lawriephipps.co.uk/
hybridpedagogy.org hybridpedagogy.org

Teaching as Wayfinding

4
1. Enkerli 21 Jul 2016
  
  in Public
  
  what do we do with that information?
  
  Interestingly enough, a lot of teachers either don’t know that such data might be available or perceive very little value in monitoring learners in such a way. But a lot of this can be negotiated with learners themselves.
  
  Learning Analytics learner data
2. Enkerli 21 Jul 2016
  
  in Public
  
  turn students and faculty into data points
  
  Data=New Oil
  
  learner data
3. Enkerli 21 Jul 2016
  
  in Public
  
  E-texts could record how much time is spent in textbook study. All such data could be accessed by the LMS or various other applications for use in analytics for faculty and students.”
  
  Learning Analytics learner data adaptive learning Personal-ized Education
4. Enkerli 21 Jul 2016
  
  in Public
  
  not as a way to monitor and regulate
  
  Learning Analytics learner data
Visit annotations in context

Tags

adaptive learning

learner data

Personal-ized Education

Learning Analytics

Annotators

Enkerli

URL

hybridpedagogy.org/the-discussion-forum-is-dead-long-live-the-discussion-forum/
ideas.repec.org ideas.repec.org

Open Access to Data: An Ideal Professed but not Practised

1
1. ppival 21 Jul 2016
  
  in Public
  
  Replication data for this study can be found in Harvard's Dataverse
  
  data
Visit annotations in context

Tags

data

Annotators

ppival

URL

ideas.repec.org/p/rsw/rswwps/rswwps215.html
hackeducation.com hackeducation.com

Convivial Tools in an Age of Surveillance

1
1. Enkerli 19 Jul 2016
  
  in Public
  
  demanded by education policies — for more data
  
  Learning Analytics learner data #privacy surveillance society
Visit annotations in context

Tags

learner data

surveillance society

Learning Analytics

#privacy

Annotators

Enkerli

URL

hackeducation.com/2014/11/13/convivial-tools-in-an-age-of-surveillance
medium.com medium.com

A Postcolonial Look at the Future of #EdTech — Not Evenly Distributed — Medium

1
1. Enkerli 19 Jul 2016
  
  in Public
  
  data being collected about individuals for purposes unknown to these individuals
  
  #privacy learner data Learning Analytics busi
Visit annotations in context

Tags

learner data

Learning Analytics

busi

#privacy

Annotators

Enkerli

URL

medium.com/not-evenly-distributed/a-postcolonial-look-at-the-future-of-edtech-4b1c6db7e12e
www.businessinsider.com www.businessinsider.com

Colleges can now figure out which students will be successful — even before classes start

1
1. Enkerli 18 Jul 2016
  
  in Public
  
  Data collection on students should be considered a joint venture, with all parties — students, parents, instructors, administrators — on the same page about how the information is being used.
  
  stakeholders Learning Analytics learner data
Visit annotations in context

Tags

learner data

stakeholders

Learning Analytics

Annotators

Enkerli

URL

businessinsider.com/how-colleges-use-big-data-2016-6
www.educationdive.com www.educationdive.com

The ethics of big data in higher education

1
1. Enkerli 18 Jul 2016
  
  in Public
  
  there is some disparity and implicit bias
  
  BuriedLede diversity Learning Analytics learner data
Visit annotations in context

Tags

learner data

diversity

Learning Analytics

BuriedLede

Annotators

Enkerli

URL

educationdive.com/news/the-ethics-of-big-data-in-higher-education/422022/
motherboard.vice.com motherboard.vice.com

Why the Internet of Things May Change How We View Privacy

1
1. catchick2 02 Jul 2016
  
  in Public
  
  The arrival of quantified self means that it's no longer just what you type that is being weighed and measured, but how you slept last night, and with whom.
  
  data internet of things cyberpunk dystopia
Visit annotations in context

Tags

data

internet of things

cyberpunk dystopia

Annotators

catchick2

URL

motherboard.vice.com/read/internet-of-things-privacy
blog.clever.com blog.clever.com

So you're building an EdTech app? (An intro to data privacy) - Clever Blog

1
1. jeremydean 01 Jul 2016
  
  in Public
  
  Limit retention to what is useful.
  
  So what data does h retain?
  
  username
  
  email address
  
  Do annotations count as data?
  
  FERPA privacy data
Visit annotations in context

Tags

data

FERPA

privacy

Annotators

jeremydean

URL

blog.clever.com/2014/04/data-privacy-for-edtech-vendors/
Jun 2016
idlewords.com idlewords.com

Remarks at the SASE Panel On The Moral Economy of Tech

1
1. jeremydean 29 Jun 2016
  
  in Public
  
  Even if you trust everyone spying on you right now, the data they're collecting will eventually be stolen or bought by people who scare you. We have no ability to secure large data collections over time.
  
  Fair enough.
  
  And "Burn!!" on Microsoft with that link.
  
  data security Microsoft
Visit annotations in context

Tags

data

security

Microsoft

Annotators

jeremydean

URL

idlewords.com/talks/sase_panel.htm
digitalhumanities.org digitalhumanities.org

DHQ: Digital Humanities Quarterly: The Digital Future is Now: A Call to Action for the Humanities

1
1. daniel.odonnell 23 Jun 2016
  
  in Public
  
  Data in Digital Scholarship 23
  
  Data in digital scholarship
  
  data humanities data
Visit annotations in context

Tags

data

humanities data

Annotators

daniel.odonnell

URL

digitalhumanities.org/dhq/vol/3/4/000077/000077.html/000077.html
blog.jonudell.net blog.jonudell.net

Annotation is not (only) web comments

1
1. Enkerli 21 Jun 2016
  
  in Public
  
  Annotation can help us weave that web of linked data.
  
  This pithy statement brings together all sorts of previous annotations. Would be neat to map them.
  
  #LODLAM Linked Data Linked Open Data Semantic Web Semantic Annotation meta-annotation
Visit annotations in context

Tags

Linked Open Data

meta-annotation

Semantic Web

Linked Data

Semantic Annotation

#LODLAM

Annotators

Enkerli

URL

blog.jonudell.net/2016/04/24/annotation-is-not-only-web-comments/
www.forbes.com www.forbes.com

Seven Leadership Lessons From Brexit - Forbes

1
1. mrgunn 21 Jun 2016
  
  in Public
  
  dynamic documents
  
  A group of experts got together last year at Daghstuhl and wrote a white paper about this.
  
  Basically the idea is that the data, the code, the protocol/analysis/method, and the narrative should all exist as equal objects on the appropriate platform. Code in a code repository like Github, Data in a data repo that understands data formats, like Mendeley Data (my company) and Figshare, protocols somewhere like protocols.io and the narrative which ties it all together still at the publisher. Discussion and review can take the form of comments, or even better, annotations just like I'm doing now.
  
  scholcomm annotation data protocols
Visit annotations in context

Tags

scholcomm

protocols

data

annotation

Annotators

mrgunn

URL

forbes.com/sites/ceciliarodriguez/2017/05/29/blacklisting-venice-to-save-it-from-too-many-tourists-and-too-few-venetians/
Local file Local file

Hyperauthorship: A postmodern perversion or evidence of a structural shift in scholarly communication practices?

1
1. daniel.odonnell 16 Jun 2016
  
  in Public
  
  n a sample of 2,101 scientificpapers published between 1665 and 1800, Beaver andRosen found that 2.2% described collaborative work. No-table was the degree of joint authorship in astronomy,especially in situations where scientists were dependentupon observational data.
  
  Astronomy was area of collaboration because they needed to share data
  
  cronin 2001 authorship coauthorship data
Tags

data

cronin 2001

authorship

coauthorship

Annotators

daniel.odonnell
exposingtheinvisible.org exposingtheinvisible.org

Michael Kreil: An Honest Picture of Metadata | Exposing the Invisible

1
1. offray 12 Jun 2016
  
  in Public
  
  What type of team do you need to create these visualisations?  OpenDataCity has a special team of really high-level nerds. Experts on hardware, servers, software development, web design, user experience and so on. I contribute the more mathematical view on the data. But usually a project is done by just one person, who is chief and developer, and the others help him or her. So, it's not like a group project. Usually, it's a single person and a lot of help. That makes it definitely faster, than having a big team and a lot of meetings.
  
  This strengths the idea that data visualization is a field where a personal approach is still viable, as is shown also by a lot of individuals that are highly valuated as data visualizers.
  
  data visualization
Visit annotations in context

Tags

data visualization

Annotators

offray

URL

exposingtheinvisible.org/resources/showing-evidence/michael-kreil
wiki.de.dariah.eu wiki.de.dariah.eu

T7.1 Overview of publications/websites on Open Data - Humanities-at-Scale - DARIAH Wiki

1
1. Natct 02 Jun 2016
  
  in Public
  
  List of publications on open access research data
  
  nice bibliography!
  
  Open Access Research Data
Visit annotations in context

Tags

Open Access Research Data

Annotators

Natct

URL

wiki.de.dariah.eu/pages/viewpage.action
May 2016
en.wikipedia.org en.wikipedia.org

Gary Loveman - Wikipedia, the free encyclopedia

1
1. buluzhai 26 May 2016
  
  in Public
  
  After graduating from MIT at the age of 29, Loveman began teaching at Harvard Business School, where he was a professor for nine years.[8][10] While at Harvard, Loveman taught Service Management and developed an interest in the service industry and customer service.[8][10] He also launched a side career as a speaker and consultant after a 1994 paper he co-authored, titled "Putting the Service-Profit Chain to Work", attracted the attention of companies including Disney, McDonald's and American Airlines. The paper focused on the relationship between company profits and customer loyalty, and the importance of rewarding employees who interact with customers.[7][8] In 1997, Loveman sent a letter to Phil Satre, the then-chief executive officer of Harrah's Entertainment, in which he offered advice for growing the company.[7] Loveman, who had done some consulting work for the company in 1991,[11] again began to consult for Harrah's and, in 1998, was offered the position of chief operating officer.[8] He initially took a two year sabbatical from Harvard to take on the role of COO of Harrah's,[10] at the end of which Loveman decided to remain with the company.[12]
  
  Putting the Service-Profit Chain to Work
  
  data
Visit annotations in context

Tags

data

Annotators

buluzhai

URL

en.wikipedia.org/wiki/Gary_Loveman
blog.deming.org blog.deming.org

Unknown and Unknowable Data « The W. Edwards Deming Institute Blog

1
1. buluzhai 26 May 2016
  
  in Public
  
  the most important figures that one needs for management are unknown or unknowable (Lloyd S. Nelson, director of statistical methods for the Nashua corporation), but successful management must nevertheless take account of them.
  
  分清楚哪些是能知道的，哪些是不能知道的数据
  
  data
Visit annotations in context

Tags

data

Annotators

buluzhai

URL

blog.deming.org/2013/08/unknown-and-unknowable-data/
www.force11.org www.force11.org

Detailed Agenda

1
1. nalankannan 15 May 2016
  
  in Public
  
  From Bits to Narratives: The Rapid Evolution of Data Visualization Engines
  
  It was an amazing presentation by Mr Cesar A Hidalgo, It was an eye opener for me in the area of data visualisation, As the national level organisation, we have huge data, but we never thought about data visualisation. You projects particularly pantheon and immersion is marvelous and I came to know that, you are using D3. It is a great job
  
  data visualisation
Visit annotations in context

Tags

data visualisation

Annotators

nalankannan

URL

force11.org/meetings/force2016/program/agenda-details
www.swissinfo.ch www.swissinfo.ch

Around 40% of Swiss research is open access - SWI swissinfo.ch

1
1. otterscotter 12 May 2016
  
  in Public
  
  Around 40% of Swiss research is open access
  
  open access open data
Visit annotations in context

Tags

open data

open access

Annotators

otterscotter

URL

swissinfo.ch/eng/sci-tech/sharing-knowledge_around-40--of-swiss-research-is-open-access/42144264
www.insidehighered.com www.insidehighered.com

Rutgers Graduate School faculty takes a stand against Academic Analytics

1
1. otterscotter 11 May 2016
  
  in Public
  
  The entirely quantitative methods and variables employed by Academic Analytics -- a corporation intruding upon academic freedom, peer evaluation and shared governance -- hardly capture the range and quality of scholarly inquiry, while utterly ignoring the teaching, service and civic engagement that faculty perform,
  
  data analytics
Visit annotations in context

Tags

data analytics

Annotators

otterscotter

URL

insidehighered.com/news/2016/05/11/rutgers-graduate-school-faculty-takes-stand-against-academic-analytics
datascience.codata.org datascience.codata.org

Are Scientific Data Repositories Coping with Research Data Publishing?

1
1. scotted400 05 May 2016
  
  in Public
  
  What is missing and trends.
  
  Cough GigaScience Cough. See integrated GigaDB repo http://database.oxfordjournals.org/content/2014/bau018.full
  
  GigaDB data publishing
Visit annotations in context

Tags

data publishing

GigaDB

Annotators

scotted400

URL

datascience.codata.org/articles/10.5334/dsj-2016-006/
Apr 2016
googleguacamole.wordpress.com googleguacamole.wordpress.com

I am #IndieEdTech

1
1. Enkerli 26 Apr 2016
  
  in Public
  
  followed a TAGS Explorer of a conference hashtag
  
  Neat use of TAGS Explorer. Did do so, to interesting effects. Regular livetweet events may be more interesting. And there’s something to be said about NodeXL and COSMOS.
  
  Twitter Data Conference Analysis Hashtag Analytics Social Network Analysis NodeXL COSMOS Project TAGS Explorer
Visit annotations in context

Tags

Conference Analysis

Social Network Analysis

Hashtag Analytics

Twitter Data

TAGS Explorer

NodeXL

COSMOS Project

Annotators

Enkerli

URL

googleguacamole.wordpress.com/2016/04/24/i-am-indieedtech/
socialboost.com.ua socialboost.com.ua

SocialBoost

1
1. toka 26 Apr 2016
  
  in Public
  
  SocialBoost — is a tech NGO that promotes open data and coordinates the activities of more than 1,000 IT-enthusiasts, biggest IT-companies and government bodies in Ukraine through hackathons for socially meaningful IT-projects, related to e-government, e-services, data visualization and open government data. SocialBoost has developed dozens of public services, interactive maps, websites for niche communities, as well as state projects such as data.gov.ua, ogp.gov.ua. SocialBoost builds the bridge between civic activists, government and IT-industry through technology. Main goal is to make government more open by crowdsourcing the creation of innovative public services with the help of civic society.
  
  for:tm is:network open data
Visit annotations in context

Tags

for:tm

open data

is:network

Annotators

toka

URL

socialboost.com.ua/
mitpress.mit.edu mitpress.mit.edu

Great Principles of Computing

1
1. daveh70 23 Apr 2016
 
 in Public
 
 Great Principles of Computing Peter J. Denning, Craig H. Martell
 
 This is a book about the whole of computing—its algorithms, architectures, and designs.
 
 Denning and Martell divide the great principles of computing into six categories: communication, computation, coordination, recollection, evaluation, and design.
 
 "Programmers have the largest impact when they are designers; otherwise, they are just coders for someone else's design."
 
 computer science technology programming data science electronics
Visit annotations in context

Tags

computer science

programming

data science

technology

electronics

Annotators

daveh70

URL

mitpress.mit.edu/books/great-principles-computing
techcrunch.com techcrunch.com

Your Algorithmic Self Meets Super-Intelligent AI

1
1. daveh70 21 Apr 2016
  
  in Public
  
  We should have control of the algorithms and data that guide our experiences online, and increasingly offline. Under our guidance, they can be powerful personal assistants.
  
  Big business has been very militant about protecting their "intellectual property". Yet they regard every detail of our personal lives as theirs to collect and sell at whim. What a bunch of little darlings they are.
  
  machine learning deep learning big data personal data internet web artificial intelligence
Visit annotations in context

Tags

artificial intelligence

internet

deep learning

machine learning

big data

web

personal data

Annotators

daveh70

URL

techcrunch.com/2015/12/14/your-algorithmic-self-meets-super-intelligent-ai/
oerresearchhub.org oerresearchhub.org

Data Report 2013-2015: Educators

1
1. otterscotter 21 Apr 2016
  
  in Public
  
  OER Data Report
  
  OER open data
Visit annotations in context

Tags

OER

open data

Annotators

otterscotter

URL

oerresearchhub.org/2015/09/21/data-report-2013-2015-educators/
dauwhe.github.io dauwhe.github.io

Scholarly Publishing in a Connected World

1
1. Enkerli 20 Apr 2016
  
  in Public
  
  Is it possible to add information to a resource without touching it?
  
  That’s something we’ve been doing, yes.
  
  Linked Data Linked Open Data #LODLAM Semantic Annotation Semantic Web Open World Assumption
Visit annotations in context

Tags

Linked Open Data

Open World Assumption

Semantic Web

Linked Data

Semantic Annotation

#LODLAM

Annotators

Enkerli

URL

dauwhe.github.io/epub-zero/tsiegman-presentations/WWW2016reveal.html
wiki.surfnet.nl wiki.surfnet.nl

2. Facilitate text and data mining of content - Amsterdam Call for Action - Collaboration Infrastructure Wiki

2
1. Daniel_Mietchen 14 Apr 2016
  
  in Public
  
  preferably
  
  Delete "preferably". Limiting the scope of text mining to exclude societal and commercial purposes limits the usefulness to enterprises (especially SMEs that cannot mine on their own) as well as to society. These limitations have ramifications in terms of limiting the research questions that researchers can and will pursue.
  
  text mining data mining content mining
2. Daniel_Mietchen 14 Apr 2016
  
  in Public
  
  Encourage researchers not to transfer the copyright on their research outputs before publication.
  
  This statement is more generally applicable than just to TDM. Besides, "Encourage" is too weak a word here, and from a societal perspective, it would be far better if researchers were to retain their copyright (where it applies), but make their copyrightable works available under open licenses that allow publishers to publish the works, and others to use and reuse it.
  
  text m data mining content mining copyright
Visit annotations in context

Tags

copyright

text m

data mining

text mining

content mining

Annotators

Daniel_Mietchen

URL

wiki.surfnet.nl/display/OSCFA/2.+Facilitate+text+and+data+mining+of+content
gigadb.org gigadb.org

GigaDB Dataset - DOI 10.5524/100001 - Genomic data from Escherichia coli O104:H4 isolate TY-2482

1
1. scotted400 14 Apr 2016
  
  in Public
  
  To maximize its utility
  
  The unusual data released strategy involving crowdsourcing on twitter, is discussed in more detail in this blog http://blogs.biomedcentral.com/gigablog/2011/08/03/notes-from-an-e-coli-tweenome-lessons-learned-from-our-first-data-doi/
  
  genomics open data microbiology e. coli
Visit annotations in context

Tags

genomics

microbiology

open data

e. coli

Annotators

scotted400

URL

gigadb.org/dataset/100001
thenewinquiry.com thenewinquiry.com

Fitted

1
1. padminirm 13 Apr 2016
  
  in Public
  
  In December 2014, FitBit released a pledge stating that it “is deeply committed to protecting the security of your data.” Still, we may soon be obliged to turn over the sort of information the device is designed to collect in order to obtain medical coverage or life insurance. Some companies currently offer incentives like discounted premiums to members who volunteer information from their activity trackers. Many health and fitness industry experts say it is only a matter of time before all insurance providers start requiring this information.
  
  personal data protection
Visit annotations in context

Tags

personal data protection

Annotators

padminirm

URL

thenewinquiry.com/essays/fitted/
gigadb.org gigadb.org

GigaDB Dataset - DOI 10.5524/100004 - Genomic data from the giant panda (Ailuropoda melanoleuca).

1
1. scotted400 13 Apr 2016
  
  in Public
  
  Related manuscripts:
  
  See also this population genomics study in Nature Genetics that uses this data: http://www.nature.com/ng/journal/v45/n1/full/ng.2494.html See also this blog posting on data citation of this data (and related problems): http://blogs.biomedcentral.com/gigablog/2012/12/21/promoting-datacitation-in-nature/
  
  data citation genomics panda
Visit annotations in context

Tags

genomics

data citation

panda

Annotators

scotted400

URL

gigadb.org/dataset/100004
www.nature.com www.nature.com

Whole-genome sequencing of giant pandas provides insights into demographic history and local adaptation

1
1. scotted400 13 Apr 2016
  
  in Public
  
  Accession codes
  
  The panda and polar bear datasets should have been included in the data section rather than hidden in the URLs section. Production removed the DOIs and used (now dead) URLs instead, but for the working links and insight see the following blog: http://blogs.biomedcentral.com/gigablog/2012/12/21/promoting-datacitation-in-nature/
  
  data citation polar bear panda genomics
Visit annotations in context

Tags

genomics

panda

data citation

polar bear

Annotators

scotted400

URL

nature.com/ng/journal/v45/n1/full/ng.2494.html
gigadb.org gigadb.org

GigaDB Dataset - DOI 10.5524/100008 - Genomic data from the polar bear (Ursus maritimus).

1
1. scotted400 13 Apr 2016
  
  in Public
  
  doi:10.1016/j.cell.2014.03.054
  
  More on the backstory and other papers using and citing this data before the Cell publication in ths blog posting: http://blogs.biomedcentral.com/gigablog/2014/05/14/the-latest-weapon-in-publishing-data-the-polar-bear/
  
  data citation genomics polar bear
Visit annotations in context

Tags

genomics

data citation

polar bear

Annotators

scotted400

URL

gigadb.org/dataset/100008
gigadb.org gigadb.org

GigaDB Dataset - DOI 10.5524/100043 - Bisulfite-PCR combined with cloning Sanger sequencing data for validating DNA methylation level in ...

1
1. scotted400 12 Apr 2016
  
  in Public
  
  To date 5'-cytosine methylation (5mC) has not been reported in Caenorhabditis elegans, and using ultra-performance liquid chromatography/tandem mass spectrometry (UPLC-MS/MS) the existence of DNA methylation in T. spiralis was detected, making it the first 5mC reported in any species of nematode.
  
  As a novel and potentially controversial finding, the huge amounts of supporting data are depositedhere to assist others to follow on and reproduce the results. This won the BMC Open Data Prize, as the judges were impressed by the numerous extra steps taken by the authors in optimizing the openness and easy accessibility of this data, and were keen to emphasize that the value of open data for such breakthrough science lies not only in providing a resource, but also in conferring transparency to unexpected conclusions that others will naturally wish to challenge. You can see more in the blog posting and interview with the authors here: http://blogs.biomedcentral.com/gigablog/2013/10/02/open-data-for-the-win/
  
  open data epigenomics
Visit annotations in context

Tags

open data

epigenomics

Annotators

scotted400

URL

gigadb.org/dataset/100043
biosharing.org biosharing.org

BioSharing: biodbcore-000595: GigaDB

1
1. scotted400 12 Apr 2016
  
  in Public
  
  Giga Science Database
  
  For more about GigaDB, see the paper in Database Journal: http://database.oxfordjournals.org/content/2014/bau018.full
  
  GigaDB databases open data
Visit annotations in context

Tags

databases

GigaDB

open data

Annotators

scotted400

URL

biosharing.org/biodbcore-000595
edtechdigest.wordpress.com edtechdigest.wordpress.com

The New Politics of Educational Data

1
1. otterscotter 11 Apr 2016
  
  in Public
  
  The New Politics of Educational Data
  
  assessment big data DoOO groom agency student centered
Visit annotations in context

Tags

agency

student centered

assessment

groom

DoOO

big data

Annotators

otterscotter

URL

edtechdigest.wordpress.com/2016/04/11/the-new-politics-of-educational-data/
Mar 2016
www.jonbecker.net www.jonbecker.net

Ranty Blog Post about Big Data, Learning Analytics, & Higher Ed

1
1. otterscotter 24 Mar 2016
  
  in Public
  
  Ranty Blog Post about Big Data, Learning Analytics, & Higher Ed
  
  big dat assessment data analytics learning analytics
Visit annotations in context

Tags

learning analytics

data analytics

big dat

assessment

Annotators

otterscotter

URL

jonbecker.net/ranty-blog-post-about-big-data-learning-analytics-higher-ed/
medium.com medium.com

EdTech’s culture problem — Medium

1
1. otterscotter 21 Mar 2016
  
  in Public
  
  There is a human story behind every data point and as educators and innovators we have to shine a light on it.
  
  data edtech
Visit annotations in context

Tags

data

edtech

Annotators

otterscotter

URL

medium.com/@fjmubeen/edtech-s-culture-problem-c6e37e6cbba2
www.ncbi.nlm.nih.gov www.ncbi.nlm.nih.gov

Diagnostic and Sex Effects on Limbic Volumes in Early-Onset Bipolar Disorder and Schizophrenia

1
1. chaselgrove 17 Mar 2016
  
  in Public
  
  three-dimensional inversion recovery-prepped spoiled grass coronal series
  
  ID: BPwPsyStructuralData SubjectGroup: BPwPsy Acquisition: Anatomical DOI: 10.18116/C6159Z
  
  ID: BPwoPsyStructuralData SubjectGroup: BPwoPsy Acquisition: Anatomical DOI: 10.18116/C6159Z
  
  ID: HCStructuralData SubjectGroup: HC Acquisition: Anatomical DOI: 10.18116/C6159Z
  
  ID: SZStructuralData SubjectGroup: SZ Acquisition: Anatomical DOI: 10.18116/C6159Z
  
  CANDISharePub Data
Visit annotations in context

Tags

Data

CANDISharePub

Annotators

chaselgrove

URL

ncbi.nlm.nih.gov/pmc/articles/PMC2632388/
opentextbc.ca opentextbc.ca

10.3 Open textbooks, open research and open data | Teaching in a Digital Age

1
1. Enkerli 16 Mar 2016
  
  in Public
  
  Open data
  
  Sadly, there may not be much work on opening up data in Higher Education. For instance, there was only one panel at last year’s international Open Data Conference. https://www.youtube.com/watch?v=NUtQBC4SqTU
  
  Looking at the interoperability of competency profiles, been wondering if it could be enhanced through use of Linked Open Data.
  
  #OpenData Open Education #IODC15 #LODLAM Linked Open Data Semantic Web
Visit annotations in context

Tags

Linked Open Data

Open Education

#OpenData

Semantic Web

#IODC15

#LODLAM

Annotators

Enkerli

URL

opentextbc.ca/teachinginadigitalage/chapter/10-8-open-textbooks/
amstat.tandfonline.com amstat.tandfonline.com

The ASA's statement on p-values: context, process, and purpose

1
1. daveh70 07 Mar 2016
  
  in Public
  
  American Statistical Association statement on p-values
  
  statistics data analysis science
Visit annotations in context

Tags

science

statistics

data analysis

Annotators

daveh70

URL

amstat.tandfonline.com/doi/abs/10.1080/00031305.2016.1154108
www.genomeweb.com www.genomeweb.com

Baylor Researchers Make Seven Consented Cancer Patients' Genomic Data Available Open Access

1
1. otterscotter 02 Mar 2016
  
  in Public
  
  right to privacy, while allowing them to make an informed choice about taking reasonable risks to their privacy in order to help advance research
  
  open data dna
Visit annotations in context

Tags

dna

open data

Annotators

otterscotter

URL

genomeweb.com/sequencing-technology/baylor-researchers-make-seven-consented-cancer-patients-genomic-data-available
Feb 2016
chronicle.com chronicle.com

As Big-Data Companies Come to Teaching, a Pioneer Issues a Warning

1
1. otterscotter 27 Feb 2016
  
  in Public
  
  As Big-Data Companies Come to Teaching, a Pioneer Issues a Warning
  
  big data
Visit annotations in context

Tags

big data

Annotators

otterscotter

URL

chronicle.com/article/As-Big-Data-Companies-Come-to/235400/
www.whitehouse.gov www.whitehouse.gov

Increasing Access to the Results of Federally Funded Science

1
1. otterscotter 22 Feb 2016
  
  in Public
  
  federally funded research publicly accessible are becoming the norm
  
  open access gov open data
Visit annotations in context

Tags

open data

gov

open access

Annotators

otterscotter

URL

whitehouse.gov/blog/2016/02/22/increasing-access-results-federally-funded-science
blog.databaseanimals.com blog.databaseanimals.com

Paul Houle - The trouble with DBpedia

1
1. almereyda 21 Feb 2016
  
  in Public
  
  I read my first books on data mining back in the early 1990's and one thing I read was that "80% of the effort in a data mining project goes into data cleaning."
  
  transformap data mining
Visit annotations in context

Tags

data mining

transformap

Annotators

almereyda

URL

blog.databaseanimals.com/the-trouble-with-dbpedia
www.techdirt.com www.techdirt.com

Beyond Open Access And Open Data: Open Science -- And No Patents | Techdirt

1
1. otterscotter 03 Feb 2016
  
  in Public
  
  "It comes down to what is the reason for our existence? It's to accelerate science, not to make money."
  
  open research open data
Visit annotations in context

Tags

open research

open data

Annotators

otterscotter

URL

techdirt.com/articles/20160129/09420033460/beyond-open-access-open-data-open-science-no-patents.shtml
leanpub.com leanpub.com

Roger D. Peng

1
1. daveh70 02 Feb 2016
  
  in Public
  
  Books on data science and R programming by Roger D. Peng of Johns Hopkins.
  
  statistics data science data analysis data visualization
Visit annotations in context

Tags

statistics

data visualization

data analysis

data science

Annotators

daveh70

URL

leanpub.com/u/rdpeng
blog.cloudera.com blog.cloudera.com

Common Probability Distributions: The Data Scientist's Crib Sheet - Cloudera Engineering Blog

1
1. daveh70 01 Feb 2016
  
  in Public
  
  Great explanation of 15 common probability distributions: Bernouli, Uniform, Binomial, Geometric, Negative Binomial, Exponential, Weibull, Hypergeometric, Poisson, Normal, Log Normal, Student's t, Chi-Squared, Gamma, Beta.
  
  statistics probability data science
Visit annotations in context

Tags

probability

statistics

data science

Annotators

daveh70

URL

blog.cloudera.com/blog/2015/12/common-probability-distributions-the-data-scientists-crib-sheet/
f1000research.com f1000research.com

Software Carpentry: lessons learned

1
1. daveh70 01 Feb 2016
 
 in Public
 
 Since its start in 1998, Software Carpentry has evolved from a week-long training course at the US national laboratories into a worldwide volunteer effort to improve researchers' computing skills. This paper explains what we have learned along the way, the challenges we now face, and our plans for the future.
 
 http://software-carpentry.org/lessons/ Basic programming skills for scientific researchers. SQL, and Python, R, or MATLAB.
 
 http://www.datacarpentry.org/lessons/ Managing and analyzing data.
 
 programming science data analysis
Visit annotations in context

Tags

science

programming

data analysis

Annotators

daveh70

URL

f1000research.com/articles/3-62/v2
Jan 2016
www.readability.com www.readability.com

The Winnower: An Interview with Josh Nicholson

1
1. offray 31 Jan 2016
  
  in Public
  
  The journal will accommodate data but should be presented in the context of a paper. The Winnower should not act as a forum for publishing data sets alone. It is our feeling that data in absence of theory is hard to interpret and thus may cause undue noise to the site.
  
  This will be the case also for the data visualizations showed here, once the data is curated and verified properly. Still data visualizations can start a global conversation without having the full paper translated to English.
  
  data context data provenance
Visit annotations in context

Tags

data provenance

data context

Annotators

offray

URL

readability.com/articles/cruynnku
courses.csail.mit.edu courses.csail.mit.edu

50YearsDataScience.pdf

1
1. daveh70 31 Jan 2016
 
 in Public
 
 50 Years of Data Science, David Donoho 2015, 41 pages
 
 This paper reviews some ingredients of the current "Data Science moment", including recent commentary about data science in the popular media, and about how/whether Data Science is really different from Statistics.
 
 The now-contemplated field of Data Science amounts to a superset of the fields of statistics and machine learning which adds some technology for 'scaling up' to 'big data'.
 
 data science data analysis statistics science big data
Visit annotations in context

Tags

science

data science

statistics

data analysis

big data

Annotators

daveh70

URL

courses.csail.mit.edu/18.337/2015/docs/50YearsDataScience.pdf
www.stm-assoc.org www.stm-assoc.org

STM Report 2015 Final 2015-02-20

1
1. offray 25 Jan 2016
  
  in Public
  
  The explosion of data-intensive research is challenging publishers to create new solutions to link publications to research data (and vice versa), to facilitate data mining and to manage the dataset as a potential unit of publication. Change continues to be rapid, with new leadership and coordination from the Research Data Alliance (launched 2013): most research funders have introduced or tightened policies requiring deposit and sharing of data; data repositories have grown in number and type (including repositories for “orphan” data); and DataCite was launched to help make research data cited, visible and accessible. Meanwhile publishers have responded by working closely with many of the community-led projects; by developing data deposit and sharing policies for journals, and introducing data citation policies; by linking or incorporating data; by launching some pioneering data journals and services; by the development of data discovery services such as Thomson Reuters’ Data Citation Index (page 138).
  
  data intensive research
Visit annotations in context

Tags

data intensive research

Annotators

offray

URL

stm-assoc.org/2015_02_20_STM_Report_2015.pdf
www.whitehouse.gov www.whitehouse.gov

Remarks of President Barack Obama – State of the Union Address As Delivered

1
1. SeamusKraft 13 Jan 2016
  
  in Public
  
  It doesn’t work if we think the people who disagree with us are all motivated by malice, or that our political opponents are unpatriotic. Democracy grinds to a halt without a willingness to compromise; or when even basic facts are contested, and we listen only to those who agree with us.
  
  C'mon, civic technologists, government innovators, open data advocates: this can be a call to arms. Isn't the point of "open government" to bring people together to engage with their leaders, provide the facts, and allow more informed, engaged debate?
  
  opengov sotu civic tech civic engagement open source open data
Visit annotations in context

Tags

opengov

civic engagement

open source

open data

civic tech

sotu

Annotators

SeamusKraft

URL

whitehouse.gov/the-press-office/2016/01/12/remarks-president-barack-obama-–-prepared-delivery-state-union-address
quoracast.quora.com quoracast.quora.com

Dima Korolev: Engineering, Entrepreneurship, an... - The Quoracast - Quora

1
1. johngravesdm 12 Jan 2016
  
  in Public
  
  "A friend of mine said a really great phrase: 'remember those times in early 1990's when every single brick-and-mortar store wanted a webmaster and a small website. Now they want to have a data scientist.' It's good for an industry when an attitude precedes the technology."
  
  data science
Visit annotations in context

Tags

data science

Annotators

johngravesdm

URL

quoracast.quora.com/Dima-Korolev-Engineering-Entrepreneurship-and-Big-Data
wilkelab.org wilkelab.org

SDS 348, Spring 2015

1
1. daveh70 11 Jan 2016
  
  in Public
  
  UT Austin SDS 348, Computational Biology and Bioinformatics. Course materials and links: R, regression modeling, ggplot2, principal component analysis, k-means clustering, logistic regression, Python, Biopython, regular expressions.
  
  data analysis data visualization machine learning
Visit annotations in context

Tags

data visualization

machine learning

data analysis

Annotators

daveh70

URL

wilkelab.org/classes/SDS348_spring_2015.html
phys.org phys.org

Why too much evidence can be a bad thing

1
1. daveh70 10 Jan 2016
  
  in Public
  
  paradox of unanimity - Unanimous or nearly unanimous agreement doesn't always indicate the correct answer. If agreement is unlikely, it indicates a problem with the system.
  
  Witnesses who only saw a suspect for a moment are not likely to be able to pick them out of a lineup accurately. If several witnesses all pick the same suspect, you should be suspicious that bias is at work. Perhaps these witnesses were cherry-picked, or they were somehow encouraged to choose a particular suspect.
  
  science statistics data analysis probability
Visit annotations in context

Tags

science

probability

statistics

data analysis

Annotators

daveh70

URL

phys.org/news/2016-01-evidence-bad.html
www.slate.com www.slate.com

What’s Even Creepier Than Target Guessing That You’re Pregnant?

1
1. jeremydean 08 Jan 2016
  
  in Public
  
  Wow!
  
  Big data
Visit annotations in context

Tags

Big data

Annotators

jeremydean

URL

slate.com/blogs/how_not_to_be_wrong/2014/06/09/big_data_what_s_even_creepier_than_target_guessing_that_you_re_pregnant.html
rpy2.readthedocs.org rpy2.readthedocs.org

Documentation for rpy2 — rpy2 2.7.6 documentation

1
1. daveh70 08 Jan 2016
 
 in Public
 
 Python interface to the R programming language. Use R functions and packages from Python. https://pypi.python.org/pypi/rpy2
 
 statistics data analysis data visualization machine learning
Visit annotations in context

Tags

statistics

data visualization

machine learning

data analysis

Annotators

daveh70

URL

rpy2.readthedocs.org/en/version_2.7.x/
matthewlincoln.net matthewlincoln.net

Some problems with GLAM data on GitHub

1
1. daveh70 07 Jan 2016
 
 in Public
 
 Guidelines for publishing GLAM data (galleries, libraries, archives, museums) on GitHub. It applies to publishing any kind of data anywhere.
 
 Document the schema of the data.
 
 Make the usage terms and conditions clear.
 
 Tell people how to report issues. Or, tell them that they're on their own.
 
 Tell people whether you accept pull requests (user-contributed edits and additions), and how.
 
 Tell people how often the data will be updated, even if the answer is "sporadically" or "maybe never".
 
 https://en.wikipedia.org/wiki/Open_Knowledge http://openglam.org/faq/
 
 open access open data digital archives
Visit annotations in context

Tags

open data

digital archives

open access

Annotators

daveh70

URL

matthewlincoln.net/2016/01/06/some-problems-with-glam-data-on-github.html
manual.calibre-ebook.com manual.calibre-ebook.com

Editing E-books — calibre User Manual

1
1. Enkerli 05 Jan 2016
  
  in Public
  
  Set Semantics¶ This tool is used to set semantics in EPUB files. Semantics are simply, links in the OPF file that identify certain locations in the book as having special meaning. You can use them to identify the foreword, dedication, cover, table of contents, etc. Simply choose the type of semantic information you want to specify and then select the location in the book the link should point to. This tool can be accessed via Tools->Set semantics.
  
  Though it’s described in such a simple way, there might be hidden power in adding these tags, especially when we bring eBooks to the Semantic Web. Though books are the prime example of a “Web of Documents”, they can also contribute to the “Web of Data”, if we enable them. It might take long, but it could happen.
  
  Semantic Web Semantic Annotation Web of Data #OpenWeb Web of Documents eBooks eBooks vs. Apps
Visit annotations in context

Tags

Web of Documents

Semantic Web

Web of Data

eBooks

Semantic Annotation

#OpenWeb

eBooks vs. Apps

Annotators

Enkerli

URL

manual.calibre-ebook.com/edit.html
Dec 2015
rainystreets.wikity.cc rainystreets.wikity.cc

Big Data and OxyContin – Rainy Streets

1
1. daveh70 30 Dec 2015
  
  in Public
  
  The idea was to pinpoint the doctors prescribing the most pain medication and target them for the company’s marketing onslaught. That the databases couldn’t distinguish between doctors who were prescribing more pain meds because they were seeing more patients with chronic pain or were simply looser with their signatures didn’t matter to Purdue.
  
  drugs medicine big data ethics
Visit annotations in context

Tags

drugs

medicine

ethics

big data

Annotators

daveh70

URL

rainystreets.wikity.cc/big-data-and-oxycontin/
www.edsurge.com www.edsurge.com

BYU’s Bold Plan to Give Students Control of Their Data (EdSurge News)

1
1. Enkerli 23 Dec 2015
  
  in Public
  
  Users publish coursework, build portfolios or tinker with personal projects, for example.
  
  Useful examples. Could imagine something like Wikity, FedWiki, or other forms of content federation to work through this in a much-needed upgrade from the “Personal Home Pages” of the early Web. Do see some connections to Sandstorm and the new WordPress interface (which, despite being targeted at WordPress.com users, also works on self-hosted WordPress installs). Some of it could also be about the longstanding dream of “keeping our content” in social media. Yes, as in the reverse from Facebook. Multiple solutions exist to do exports and backups. But it can be so much more than that and it’s so much more important in educational contexts.
  
  Federated Wiki Federated WordPress Federated OER OER portfolios #OpenBadges Student Data Data Economy User-generated content
Visit annotations in context

Tags

Federated WordPress

User-generated content

#OpenBadges

Student Data

Federated Wiki

portfolios

Federated OER

OER

Data Economy

Annotators

Enkerli

URL

edsurge.com/news/2015-12-18-byu-s-bold-plan-to-give-students-control-of-their-data
medium.com medium.com

Which Students Get to Have Privacy? — The Message — Medium

1
1. Enkerli 23 Dec 2015
  
  in Public
  
  (Not surprisingly, none of the bills provide for funding to help schools come up to speed.)
  
  #privacy Student Data Data Economy
Visit annotations in context

Tags

Student Data

Data Economy

#privacy

Annotators

Enkerli

URL

medium.com/message/which-students-get-to-have-privacy-e9773f9a064
bavatuesdays.com bavatuesdays.com

Domains as Ground Zero for the Struggle over Agency

1
1. daveh70 22 Dec 2015
  
  in Public
  
  A personal API builds on the domain concept—students store information on their site, whether it’s class assignments, financial aid information or personal blogs, and then decide how they want to share that data with other applications and services. The idea is to give students autonomy in how they develop and manage their digital identities at the university and well into their professional lives
  
  web internet data personal data privacy
Visit annotations in context

Tags

data

internet

privacy

web

personal data

Annotators

daveh70

URL

bavatuesdays.com/domains-as-ground-zero-for-the-struggle-of-agency/
mfeldstein.com mfeldstein.com

Why Big Data (Mostly) Can’t Help Improve Teaching

3
1. Enkerli 21 Dec 2015
  
  in Public
  
  sufficiently rich information
  
  Thick data
  
  Thick Data #BigData
2. Enkerli 21 Dec 2015
  
  in Public
  
  who owns the data
  
  Student Data
3. Enkerli 21 Dec 2015
  
  in Public
  
  It’s educators who come up with hypotheses and test them using a large data set.
  
  And we need an ever-larger data set, right?
  
  #BigData Student Data Learning Analytics
Visit annotations in context

Tags

Thick Data

Student Data

#BigData

Learning Analytics

Annotators

Enkerli

URL

mfeldstein.com/why-big-data-mostly-cant-help-improve-teaching/
bits.blogs.nytimes.com bits.blogs.nytimes.com

InBloom Student Data Repository to Close

1
1. Enkerli 21 Dec 2015
  
  in Public
  
  nearly $8 billion prekindergarten through 12th-grade education technology software market
  
  Business Models for Education Data Economy EdTech EncodableFactoid
Visit annotations in context

Tags

EncodableFactoid

Business Models for Education

EdTech

Data Economy

Annotators

Enkerli

URL

bits.blogs.nytimes.com/2014/04/21/inbloom-student-data-repository-to-close/
hackeducation.com hackeducation.com

Top Ed-Tech Trends of 2015: The Compulsion for Data

1
1. Enkerli 21 Dec 2015
  
  in Public
  
  As usual, @AudreyWatters puts things in proper perspective.
  
  Learning Analytics @AudreyWatters #privacy #BigData Student Data Data Economy
Visit annotations in context

Tags

Data Economy

Student Data

@AudreyWatters

#BigData

Learning Analytics

#privacy

Annotators

Enkerli

URL

hackeducation.com/2015/12/16/trends-data
www.theatlantic.com www.theatlantic.com

Can Academics Trust Their Research With For-Profit Companies?

1
1. otterscotter 18 Dec 2015
  
  in Public
  
  “What we’re seeing is that the general public wants to read scholarly papers.”
  
  open access open data
Visit annotations in context

Tags

open data

open access

Annotators

otterscotter

URL

theatlantic.com/education/archive/2015/12/the-convoluted-profits-of-academic-publishing/421047/
mfeldstein.com mfeldstein.com

Personalized Learning and the Teacher

1
1. Enkerli 14 Dec 2015
  
  in Public
  
  increased investment in professional development and teaching-friendly tenure and promotion practices
  
  Even those who adopt a taylorist model to education may understand that “it takes money to save money”.
  
  Learning Analytics Teaching Analytics Personal Learning Network personalization personalized learning Student-Driven Student Data
Visit annotations in context

Tags

personalized learning

Student-Driven

Teaching Analytics

Student Data

personalization

Learning Analytics

Personal Learning Network

Annotators

Enkerli

URL

mfeldstein.com/personalized-learning-and-the-teacher/
code.facebook.com code.facebook.com

Facebook to open-source AI hardware design

1
1. daveh70 11 Dec 2015
  
  in Public
  
  Big Sur is our newest Open Rack-compatible hardware designed for AI computing at a large scale. In collaboration with partners, we've built Big Sur to incorporate eight high-performance GPUs
  
  ai artificial intelligence machine learning data science
Visit annotations in context

Tags

ai

artificial intelligence

machine learning

data science

Annotators

daveh70

URL

code.facebook.com/posts/1687861518126048/facebook-to-open-source-ai-hardware-design/
support.vitalsource.com support.vitalsource.com

Accessibility – Bookshelf Support

1
1. Enkerli 10 Dec 2015
  
  in Public
  
  The EDUPUB Initiative VitalSource regularly collaborates with independent consultants and industry experts including the National Federation of the Blind (NFB), American Foundation for the Blind (AFB), Tech For All, JISC, Alternative Media Access Center (AMAC), and others. With the help of these experts, VitalSource strives to ensure its platform conforms to applicable accessibility standards including Section 508 of the Rehabilitation Act and the Accessibility Guidelines established by the Worldwide Web Consortium known as WCAG 2.0. The state of the platform's conformance with Section 508 at any point in time is made available through publication of Voluntary Product Accessibility Templates (VPATs). VitalSource continues to support industry standards for accessibility by conducting conformance testing on all Bookshelf platforms – offline on Windows and Macs; online on Windows and Macs using standard browsers (e.g., Internet Explorer, Mozilla Firefox, Safari); and on mobile devices for iOS and Android. All Bookshelf platforms are evaluated using industry-leading screen reading programs available for the platform including JAWS and NVDA for Windows, VoiceOver for Mac and iOS, and TalkBack for Android. To ensure a comprehensive reading experience, all Bookshelf platforms have been evaluated using EPUB® and enhanced PDF books.
  
  Could see a lot of potential for Open Standards, including annotations. What’s not so clear is how they can manage to produce such ePub while maintaining their DRM-focused practice. Heard about LCP (Lightweight Content Protection). But have yet to get a fully-accessible ePub which is also DRMed in such a way.
  
  #OpenAnnotation Open Standards Open Data EDUPUB ePub #a11y Accessibility OpenTextbooks textbooks #DRM
Visit annotations in context

Tags

Open Data

OpenTextbooks

ePub

Accessibility

#DRM

EDUPUB

#OpenAnnotation

#a11y

textbooks

Open Standards

Annotators

Enkerli

URL

support.vitalsource.com/hc/en-us/categories/200184597-Accessibility
www.w3.org www.w3.org

New Scholarly Coalition Embraces W3C Web Annotations | W3C Blog

1
1. Enkerli 07 Dec 2015
  
  in Public
  
  add tags for categorization and search
  
  Well-structured annotations can pave the way towards Linked Open Data.
  
  Semantic Annotation Semantic Web Linked Data #LODLAM
Visit annotations in context

Tags

Semantic Web

Linked Data

Semantic Annotation

#LODLAM

Annotators

Enkerli

URL

w3.org/blog/2015/12/annotation-coalition-launched/
iso-sc36.auf.org iso-sc36.auf.org

Migration de Normetic 2.0 vers MLR : exemple à suivre en Francophonie | Liaison AUF/SC36

1
1. Enkerli 07 Dec 2015
  
  in Public
  
  tout enregistrement MLR conforme au profil Normetic 2.0 est automatiquement conforme au profil d’application MLR de base.
  
  L’interopérabilité est essentielle à l’avènement du Web des données liées (en éducation comme ailleurs).
  
  Semantic Web #OpenWeb Web of Data Linked Data Linked Open Data #LODLAM interoperability Open Standards MLR
Visit annotations in context

Tags

#LODLAM

MLR

Web of Data

Linked Data

interoperability

Linked Open Data

Semantic Web

Open Standards

#OpenWeb

Annotators

Enkerli

URL

iso-sc36.auf.org/2015/12/migration-de-normetci-2-0-vers-mlr-un-exemple-pour-un-profil-dapplication-francophone/
math.mit.edu math.mit.edu

CT4S.pdf

1
1. bbarker 06 Dec 2015
  
  in Public
  
  Data gathering is ubiquitous in science. Giant databases are currently being minedfor unknown patterns, but in fact there are many (many) known patterns that simplyhave not been catalogued. Consider the well-known case of medical records. A patient’smedical history is often known by various individual doctor-offices but quite inadequatelyshared between them. Sharing medical records often means faxing a hand-written noteor a filled-in house-created form between offices.
  
  category theory patterns EHR EMR data databases
Visit annotations in context

Tags

patterns

category theory

data

EHR

databases

EMR

Annotators

bbarker

URL

math.mit.edu/~dspivak/teaching/sp13/CT4S.pdf
blogs.edweek.org blogs.edweek.org

Textbooks Out of Step With Scientists on Climate Change, Study Says

1
1. otterscotter 03 Dec 2015
  
  in Public
  
  Textbooks Out of Step With Scientists on Climate Change, Study Says
  
  OER open textbooks open data
Visit annotations in context

Tags

OER

open data

open textbooks

Annotators

otterscotter

URL

blogs.edweek.org/edweek/curriculum/2015/12/textbooks_out_of_step_with_scientists_on_climate_change.html

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators