Hypothesis

4,144 Matching Annotations

Dec 2019
github.com github.com

sanpii/effitask

3
1. TylerRick 30 Dec 2019
  
  in Public
  
  You can create sub-projects (or sub-contexts) by adding a backslash
  
  tree data todo.txt
2. TylerRick 30 Dec 2019
  
  in Public
  
  Double click on a project/context select all there sub-projects/contexts, therefore show their tasks
  
  tree data todo.txt
3. TylerRick 30 Dec 2019
  
  in Public
  
  arborescence
  
  First sighting of word arborescence. I thought they were just doing that for fun, as a play on "tree", but I guess it's a real graph theory concept (https://en.wikipedia.org/wiki/Arborescence_(graph_theory)).
  
  tree data graph theory
Visit annotations in context

Tags

tree data

graph theory

todo.txt

Annotators

TylerRick

URL

github.com/sanpii/effitask
swiftodoapp.com swiftodoapp.com

SwiftoDo Desktop - task list app for todo.txt on macOS

2
1. TylerRick 30 Dec 2019
  
  in Public
  
  Your task list is a plain text file, not some proprietary format owned by a company or locked to a specific application.
  
  storing data in plain-text files proprietary format under my control open file format
2. TylerRick 30 Dec 2019
  
  in Public
  
  A simple and timeless format Plain text is the simplest file format there is. It will always be accessible, by some kind of application, forever.
  
  storing data in plain-text files future-proof data
Visit annotations in context

Tags

future-proof data

open file format

under my control

proprietary format

storing data in plain-text files

Annotators

TylerRick

URL

swiftodoapp.com/desktop/
www.maketecheasier.com www.maketecheasier.com

QTodoTxt - A todo.txt GUI Client for Linux - Make Tech Easier

1
1. TylerRick 30 Dec 2019
  
  in Public
  
  And since it’s just a client, you can always use the todo.txt format text file created by it in the very beginning with any other client. Give it a try.
  
  data freedom app lock-in separation of concerns distinction between data format and client to interact with data compatibility
Visit annotations in context

Tags

distinction between data format and client to interact with data

separation of concerns

compatibility

data freedom

app lock-in

Annotators

TylerRick

URL

maketecheasier.com/qtodotxt-todo-txt-gui-client-for-linux/
plaintext-productivity.net plaintext-productivity.net

Introduction

3
1. TylerRick 30 Dec 2019
  
  in Public
  
  Avoiding complicated outlining or mind-mapping software saves a bunch of mouse clicks or dreaming up complicated visualizations (it helps if you are a linear thinker).
  
  Hmm. I'm not sure I agree with this thought/sentiment (though it's hard to tell since it's an incomplete sentence). I think visualizations and mind-mapping software might be an even better way to go, in terms of efficiency of editing (since they are specialized for the task), enjoyment of use, etc.
  
  The main thing text files have going for them is flexibility, portability, client-neutrality, the ability to get started right now without researching and evaluating a zillion competing GUI app alternatives.
  
  data visualization user interface design
2. TylerRick 30 Dec 2019
  
  in Public
  
  Plaintext files are tiny, simple, quick to work with, editable by tons of great programs, searchable by all modern operating systems, easy to back up, perfect for versioning, trivial to sync between devices, and are amazingly flexible in their uses and formats.
  
  storing data in plain-text files
3. TylerRick 30 Dec 2019
  
  in Public
  
  In this system, plaintext files are used for most of the backbone of your organizational system.
  
  app data stored in text file simple text-based file format storing data in plain-text files
Visit annotations in context

Tags

user interface design

data visualization

simple text-based file format

app data stored in text file

storing data in plain-text files

Annotators

TylerRick

URL

plaintext-productivity.net/
burnsoftware.wordpress.com burnsoftware.wordpress.com

P.S. Notes.

1
1. TylerRick 30 Dec 2019
  
  in Public
  
  made to work alongside the various plain-text, Dropbox syncing mobile notes apps such as Denote for Android and Jottings for iPhone from an app for the Ubuntu desktop. Plain text notes anywhere you want. Easily synced between your desktop and phone. Notes, plain and simple.
  
  plain text storing data in plain-text files
Visit annotations in context

Tags

storing data in plain-text files

plain text

Annotators

TylerRick

URL

burnsoftware.wordpress.com/p-s-notes/
burnsoftware.wordpress.com burnsoftware.wordpress.com

DayJournal

1
1. TylerRick 30 Dec 2019
  
  in Public
  
  Future proofs your journal entries by saving them as plain text and organizing them as you go. This means you can read or create entries when you don’t have DayJournal.
  
  plain text app data stored in text file future-proof data
Visit annotations in context

Tags

future-proof data

app data stored in text file

plain text

Annotators

TylerRick

URL

burnsoftware.wordpress.com/dayjournal/
www.howtogeek.com www.howtogeek.com

Every To-Do List App Sucks, Switch To todo.txt Instead

2
1. TylerRick 29 Dec 2019
  
  in Public
  
  It’s flexible in precisely the way so many modern apps aren’t, and if you like tweaking things until they’re just right, I can’t recommend it enough.
  
  flexibility todo.txt plain text app data stored in text file
2. TylerRick 29 Dec 2019
  
  in Public
  
  And if all else fails, you can just use a text editor.
  
  plain text app data stored in text file failsafe
Visit annotations in context

Tags

failsafe

flexibility

todo.txt

app data stored in text file

plain text

Annotators

TylerRick

URL

howtogeek.com/355890/every-to-do-list-app-sucks-switch-to-todo.txt-instead/
github.com github.com

todotxt/todo.txt

1
1. TylerRick 29 Dec 2019
  
  in Public
  
  Plain text is software and operating system agnostic. It's searchable, portable, lightweight, and easily manipulated. It's unstructured. It works when someone else's web server is down or your Outlook .PST file is corrupt. There's no exporting and importing, no databases or tags or flags or stars or prioritizing or insert company name here-induced rules on what you can and can't do with it.
  
  app data stored in text file plain text
Visit annotations in context

Tags

app data stored in text file

plain text

Annotators

TylerRick

URL

github.com/todotxt/todo.txt
todotxt.org todotxt.org

Todo.txt: Future-proof task tracking in a file you control

1
1. TylerRick 29 Dec 2019
  
  in Public
  
  Countless productivity apps and sites store your tasks in their own proprietary database and file format. But you can work with your todo.txt file in every text editor ever made, regardless of operating system or vendor.
  
  compatibility app data stored in text file proprietary format
Visit annotations in context

Tags

compatibility

proprietary format

app data stored in text file

Annotators

TylerRick

URL

todotxt.org/
zapier.com zapier.com

The 11 best to do list apps of 2020

1
1. TylerRick 29 Dec 2019
  
  in Public
  
  Most to-do lists give you no control over your data. Your tasks live inside the app, not in a document you can edit, and syncing is handled by whichever company made the app. If you don't like this, todo.txt is a great alternative.
  
  at mercy of software publisher platform lock-in data lives in app proprietary format open file format simple text-based file format app data stored in text file
Visit annotations in context

Tags

open file format

at mercy of software publisher

platform lock-in

data lives in app

proprietary format

simple text-based file format

app data stored in text file

Annotators

TylerRick

URL

zapier.com/blog/best-todo-list-apps/
wellcomeopenresearch.org wellcomeopenresearch.org

Diffusion of ethical governance policy on sharing of biological materials and related data for biomedical research

2
1. Daniel_Mietchen 24 Dec 2019
  
  in Public
  
  greater integration of data, data security, and data sharing through the establishment of a searchable database.
  
  Would be great to connect these efforts with others who work on this from the data end, e.g. RDA as mentioned above.
  
  Also, the presentation at http://www.gfbr.global/wp-content/uploads/2018/12/PG4-Alpha-Ahmadou-Diallo.pptx states
  
  This data will be made available to the public and to scientific and humanitarian health communities to disseminate knowledge about the disease, support the expansion of research in West Africa, and improve patient care and future response to an outbreak.
  
  but the notion of public access is not clearly articulated in the present article.
  
  data integration data management data security data sharing
2. Daniel_Mietchen 24 Dec 2019
  
  in Public
  
  platform
  
  Does it have a name and online presence? The details provided here go beyond what's given in reference 13, but some more detail would still be useful, e.g. to connect the initiative to efforts directed at data management and curation more generally, for instance in the framework of the Research Data Alliance, https://www.rd-alliance.org/ .
  
  data management data sharing platform
Visit annotations in context

Tags

data sharing platform

data management

data sharing

data security

data integration

Annotators

Daniel_Mietchen

URL

wellcomeopenresearch.org/articles/4-170

Practical highlights in my opinion:

It's important to know about data padding in PG.
Be conscious when modelling data tables about columns ordering, but don't be pure-school and do it in a best-effort basis.
Gains up to 25% in wasted storage are impressive but always keep in mind the scope of the system. For me, gains are not worth it in the short-term. Whenever a system grows, it is possible to migrate data to more storage-efficient tables but mind the operative burder.

Here follows my own commands on trying the article points. I added - pg_column_size(row()) on each projection to have clear absolute sizes.

-- How does row function work?

SELECT pg_column_size(row()) AS empty,
       pg_column_size(row(0::SMALLINT)) AS byte2,
       pg_column_size(row(0::BIGINT)) AS byte8,
       pg_column_size(row(0::SMALLINT, 0::BIGINT)) AS byte16,
       pg_column_size(row(''::TEXT)) AS text0,
       pg_column_size(row('hola'::TEXT)) AS text4,
       0 AS term
;

-- My own take on that

SELECT pg_column_size(row()) AS empty,
       pg_column_size(row(uuid_generate_v4())) AS uuid_type,
       pg_column_size(row('hola mundo'::TEXT)) AS text_type,
       pg_column_size(row(uuid_generate_v4(), 'hola mundo'::TEXT)) AS uuid_text_type,
       pg_column_size(row('hola mundo'::TEXT, uuid_generate_v4())) AS text_uuid_type,
       0 AS term
;

CREATE TABLE user_order (
  is_shipped    BOOLEAN NOT NULL DEFAULT false,
  user_id       BIGINT NOT NULL,
  order_total   NUMERIC NOT NULL,
  order_dt      TIMESTAMPTZ NOT NULL,
  order_type    SMALLINT NOT NULL,
  ship_dt       TIMESTAMPTZ,
  item_ct       INT NOT NULL,
  ship_cost     NUMERIC,
  receive_dt    TIMESTAMPTZ,
  tracking_cd   TEXT,
  id            BIGSERIAL PRIMARY KEY NOT NULL
);

SELECT a.attname, t.typname, t.typalign, t.typlen
  FROM pg_class c
  JOIN pg_attribute a ON (a.attrelid = c.oid)
  JOIN pg_type t ON (t.oid = a.atttypid)
 WHERE c.relname = 'user_order'
   AND a.attnum >= 0
 ORDER BY a.attnum;

-- What is it about pg_class, pg_attribute and pg_type tables? For future investigation.

-- SELECT sum(t.typlen)
-- SELECT t.typlen
SELECT a.attname, t.typname, t.typalign, t.typlen
  FROM pg_class c
  JOIN pg_attribute a ON (a.attrelid = c.oid)
  JOIN pg_type t ON (t.oid = a.atttypid)
 WHERE c.relname = 'user_order'
   AND a.attnum >= 0
 ORDER BY a.attnum
;

-- Whoa! I need to master mocking data directly into db.

INSERT INTO user_order (
    is_shipped, user_id, order_total, order_dt, order_type,
    ship_dt, item_ct, ship_cost, receive_dt, tracking_cd
)
SELECT true, 1000, 500.00, now() - INTERVAL '7 days',
       3, now() - INTERVAL '5 days', 10, 4.99,
       now() - INTERVAL '3 days', 'X5901324123479RROIENSTBKCV4'
  FROM generate_series(1, 1000000);

-- New item to learn, pg_relation_size. 

SELECT pg_relation_size('user_order') AS size_bytes,
       pg_size_pretty(pg_relation_size('user_order')) AS size_pretty;

SELECT * FROM user_order LIMIT 1;

SELECT pg_column_size(row(0::NUMERIC)) - pg_column_size(row()) AS zero_num,
       pg_column_size(row(1::NUMERIC)) - pg_column_size(row()) AS one_num,
       pg_column_size(row(9.9::NUMERIC)) - pg_column_size(row()) AS nine_point_nine_num,
       pg_column_size(row(1::INT2)) - pg_column_size(row()) AS int2,
       pg_column_size(row(1::INT4)) - pg_column_size(row()) AS int4,
       pg_column_size(row(1::INT2, 1::NUMERIC)) - pg_column_size(row()) AS int2_one_num,
       pg_column_size(row(1::INT4, 1::NUMERIC)) - pg_column_size(row()) AS int4_one_num,
       pg_column_size(row(1::NUMERIC, 1::INT4)) - pg_column_size(row()) AS one_num_int4,
       0 AS term
;

SELECT pg_column_size(row(''::TEXT)) - pg_column_size(row()) AS empty_text,
       pg_column_size(row('a'::TEXT)) - pg_column_size(row()) AS len1_text,
       pg_column_size(row('abcd'::TEXT)) - pg_column_size(row()) AS len4_text,
       pg_column_size(row('abcde'::TEXT)) - pg_column_size(row()) AS len5_text,
       pg_column_size(row('abcdefgh'::TEXT)) - pg_column_size(row()) AS len8_text,
       pg_column_size(row('abcdefghi'::TEXT)) - pg_column_size(row()) AS len9_text,
       0 AS term
;

SELECT pg_column_size(row(''::TEXT, 1::INT4)) - pg_column_size(row()) AS empty_text_int4,
       pg_column_size(row('a'::TEXT, 1::INT4)) - pg_column_size(row()) AS len1_text_int4,
       pg_column_size(row('abcd'::TEXT, 1::INT4)) - pg_column_size(row()) AS len4_text_int4,
       pg_column_size(row('abcde'::TEXT, 1::INT4)) - pg_column_size(row()) AS len5_text_int4,
       pg_column_size(row('abcdefgh'::TEXT, 1::INT4)) - pg_column_size(row()) AS len8_text_int4,
       pg_column_size(row('abcdefghi'::TEXT, 1::INT4)) - pg_column_size(row()) AS len9_text_int4,
       0 AS term
;

SELECT pg_column_size(row(1::INT4, ''::TEXT)) - pg_column_size(row()) AS int4_empty_text,
       pg_column_size(row(1::INT4, 'a'::TEXT)) - pg_column_size(row()) AS int4_len1_text,
       pg_column_size(row(1::INT4, 'abcd'::TEXT)) - pg_column_size(row()) AS int4_len4_text,
       pg_column_size(row(1::INT4, 'abcde'::TEXT)) - pg_column_size(row()) AS int4_len5_text,
       pg_column_size(row(1::INT4, 'abcdefgh'::TEXT)) - pg_column_size(row()) AS int4_len8_text,
       pg_column_size(row(1::INT4, 'abcdefghi'::TEXT)) - pg_column_size(row()) AS int4_len9_text,
       0 AS term
;

SELECT pg_column_size(row()) - pg_column_size(row()) AS empty_row,
       pg_column_size(row(''::TEXT)) - pg_column_size(row()) AS no_text,
       pg_column_size(row('a'::TEXT)) - pg_column_size(row()) AS min_text,
       pg_column_size(row(1::INT4, 'a'::TEXT)) - pg_column_size(row()) AS two_col,
       pg_column_size(row('a'::TEXT, 1::INT4)) - pg_column_size(row()) AS round4;

SELECT pg_column_size(row()) - pg_column_size(row()) AS empty_row,
       pg_column_size(row(1::SMALLINT)) - pg_column_size(row()) AS int2,
       pg_column_size(row(1::INT)) - pg_column_size(row()) AS int4,
       pg_column_size(row(1::BIGINT)) - pg_column_size(row()) AS int8,
       pg_column_size(row(1::SMALLINT, 1::BIGINT)) - pg_column_size(row()) AS padded,
       pg_column_size(row(1::INT, 1::INT, 1::BIGINT)) - pg_column_size(row()) AS not_padded;

SELECT a.attname, t.typname, t.typalign, t.typlen
  FROM pg_class c
  JOIN pg_attribute a ON (a.attrelid = c.oid)
  JOIN pg_type t ON (t.oid = a.atttypid)
 WHERE c.relname = 'user_order'
   AND a.attnum >= 0
 ORDER BY t.typlen DESC;

DROP TABLE user_order;

CREATE TABLE user_order (
  id            BIGSERIAL PRIMARY KEY NOT NULL,
  user_id       BIGINT NOT NULL,
  order_dt      TIMESTAMPTZ NOT NULL,
  ship_dt       TIMESTAMPTZ,
  receive_dt    TIMESTAMPTZ,
  item_ct       INT NOT NULL,
  order_type    SMALLINT NOT NULL,
  is_shipped    BOOLEAN NOT NULL DEFAULT false,
  order_total   NUMERIC NOT NULL,
  ship_cost     NUMERIC,
  tracking_cd   TEXT
);

-- And, what about other varying size types as JSONB?

SELECT pg_column_size(row('{}'::JSONB)) - pg_column_size(row()) AS empty_jsonb,
       pg_column_size(row('{}'::JSONB, 0::INT4)) - pg_column_size(row()) AS empty_jsonb_int4,
       pg_column_size(row(0::INT4, '{}'::JSONB)) - pg_column_size(row()) AS int4_empty_jsonb,
       pg_column_size(row('{"a": 1}'::JSONB)) - pg_column_size(row()) AS basic_jsonb,
       pg_column_size(row('{"a": 1}'::JSONB, 0::INT4)) - pg_column_size(row()) AS basic_jsonb_int4,
       pg_column_size(row(0::INT4, '{"a": 1}'::JSONB)) - pg_column_size(row()) AS int4_basic_jsonb,
       0 AS term;

postgresql optimization storage data padding

Visit annotations in context

Annotators

jomendoz

URL

2ndquadrant.com/en/blog/on-rocks-and-sand/

medium.com medium.com

How I put the World(map) in a Graph

1
1. mavery 18 Dec 2019
  
  in Public
  
  mapping data global
Visit annotations in context

Tags

data

global

mapping

Annotators

mavery

URL

medium.com/neo4j/how-i-put-the-world-map-in-a-graph-422b651780e9
www.civilsociety.co.uk www.civilsociety.co.uk

Foundation finds its grantees 'significantly outperform' similar charities

1
1. mlenc 13 Dec 2019
  
  in Public
  
  Foundation finds its grantees 'significantly outperform' similar charities
  
  nonprofit data charity data npdata data analysis
Visit annotations in context

Tags

npdata

nonprofit data

data analysis

charity data

Annotators

mlenc

URL

civilsociety.co.uk/news/foundations-finds-grantees-significantly-outperform-similar-charities.html
theodi.org theodi.org

ODI-Data-Ecosystem-Mapping-A1-fold-to-A5-2019-1.pdf

1
1. mlenc 13 Dec 2019
  
  in Public
  
  open data ecosystem odi open data
Visit annotations in context

Tags

open data ecosystem

open data

odi

Annotators

mlenc

URL

theodi.org/wp-content/uploads/2019/07/ODI-Data-Ecosystem-Mapping-A1-fold-to-A5-2019-1.pdf
academic.oup.com academic.oup.com

Post–Modern Epidemiology: When Methods Meet Matter

1
1. marivas 12 Dec 2019
  
  in Public
  
  Remarkably, studies receiving mainly public funding can, a quarter of a century on, still survive without making their data available in a useful way. In the UK a series of studies—the Avon Longitudinal Study of Parents and Children (ALSPAC) (100), UK Biobank (101), and Born in Bradford (102), among others—have surely been exemplary in promoting data accessibility.
  
  Critical points!
  
  George Davey Smith accessibility of data
Visit annotations in context

Tags

George Davey Smith

accessibility of data

Annotators

marivas

URL

academic.oup.com/aje/article/188/8/1410/5381900
github.com github.com

TylerRick/button_to_form

1
1. TylerRick 12 Dec 2019
  
  in Public
  
  view-helpers form-helpers form-helper view-helper button buttons form forms
  
  Since I didn't know which variant was canonical, I tagged with both/all variants. Gross.
  
  crowd-sourced data is messy crowd-sourced tags
Visit annotations in context

Tags

crowd-sourced tags

crowd-sourced data is messy

Annotators

TylerRick

URL

github.com/TylerRick/button_to_form
www.ncbi.nlm.nih.gov www.ncbi.nlm.nih.gov

Why common carrier and network neutrality principles apply to the Nationwide Health Information Network (NWHIN)

1
1. mlenc 09 Dec 2019
  
  in Public
  
  net neutrality common carriage common carrier admin data health data
Visit annotations in context

Tags

health data

admin data

common carriage

common carrier

net neutrality

Annotators

mlenc

URL

ncbi.nlm.nih.gov/pmc/articles/PMC3912707/
www.ourcommunity.com.au www.ourcommunity.com.au

CLASSIEfier: Using machine learning to paint a picture of social sector trends - ourcommunity.com.au

1
1. mlenc 06 Dec 2019
  
  in Public
  
  taxonomy copyright open data nonprofit sector ml classifier
Visit annotations in context

Tags

open data

copyright

ml

nonprofit sector

taxonomy

classifier

Annotators

mlenc

URL

ourcommunity.com.au/general/general_article.jsp
jamanetwork.com jamanetwork.com

The Challenges of Sharing Data in an Era of Politicized Science

1
1. mlenc 06 Dec 2019
  
  in Public
  
  admin data challenges risks admin data risks
Visit annotations in context

Tags

risks

admin data risks

admin data

challenges

Annotators

mlenc

URL

jamanetwork.com/journals/jama/fullarticle/2756117
www.inkandswitch.com www.inkandswitch.com

Local-first software: You own your data, in spite of the cloud

1
1. almereyda 04 Dec 2019
  
  in Public
  
  local-first software data cloud offline CRDT
Visit annotations in context

Tags

data

CRDT

offline

local-first

software

cloud

Annotators

almereyda

URL

inkandswitch.com/local-first.html
www.ag-grid.com www.ag-grid.com

ag-Grid

1
1. TylerRick 02 Dec 2019
  
  in Public
  
  ag-grid: cell editing data table: cell editor
Visit annotations in context

Tags

data table: cell editor

ag-grid: cell editing

Annotators

TylerRick

URL

ag-grid.com/javascript-grid-cell-editing/
academic.ouprc.silverchair.com academic.ouprc.silverchair.com

Open Humans: A platform for participant-centered research and personal data exploration

1
1. Lgoodman 02 Dec 2019
  
  in Gigascience Annotations
  
  Bastian Greshake Tzovaras
  
  See Author Q&A "Power to the People..."with B.G. Tzorvaras and M.P. Ball at http://gigasciencejournal.com/blog/open-humans-qa
  
  Author Q&A GigaBlog Data Sharing Human data
Visit annotations in context

Tags

GigaBlog

Author Q&A

Data Sharing

Human data

Annotators

Lgoodman

URL

academic.ouprc.silverchair.com/gigascience/article/8/6/giz076/5523201
engineering.linkedin.com engineering.linkedin.com

The Log: What every software engineer should know about real-time data's unifying abstraction

3
1. rufuspollock 01 Dec 2019
  
  in Public
  
  I'll give a little bit of the history to provide context. My own involvement in this started around 2008 after we had shipped our key-value store. My next project was to try to get a working Hadoop setup going, and move some of our recommendation processes there. Having little experience in this area, we naturally budgeted a few weeks for getting data in and out, and the rest of our time for implementing fancy prediction algorithms. So began a long slog. We originally planned to just scrape the data out of our existing Oracle data warehouse. The first discovery was that getting data out of Oracle quickly is something of a dark art. Worse, the data warehouse processing was not appropriate for the production batch processing we planned for Hadoop—much of the processing was non-reversable and specific to the reporting being done. We ended up avoiding the data warehouse and going directly to source databases and log files. Finally, we implemented another pipeline to load data into our key-value store for serving results. This mundane data copying ended up being one of the dominate items for the original development. Worse, any time there was a problem in any of the pipelines, the Hadoop system was largely useless—running fancy algorithms on bad data just produces more bad data. Although we had built things in a fairly generic way, each new data source required custom configuration to set up. It also proved to be the source of a huge number of errors and failures. The site features we had implemented on Hadoop became popular and we found ourselves with a long list of interested engineers. Each user had a list of systems they wanted integration with and a long list of new data feeds they wanted. ETL in Ancient Greece. Not much has changed.
  
  A great anecdote / story on the (pains) of data integration
  
  data-integration anecdote
2. rufuspollock 01 Dec 2019
  
  in Public
  
  Effective use of data follows a kind of Maslow's hierarchy of needs. The base of the pyramid involves capturing all the relevant data, being able to put it together in an applicable processing environment (be that a fancy real-time query system or just text files and python scripts). This data needs to be modeled in a uniform way to make it easy to read and process. Once these basic needs of capturing data in a uniform way are taken care of it is reasonable to work on infrastructure to process this data in various ways—MapReduce, real-time query systems, etc. It's worth noting the obvious: without a reliable and complete data flow, a Hadoop cluster is little more than a very expensive and difficult to assemble space heater. Once data and processing are available, one can move concern on to more refined problems of good data models and consistent well understood semantics. Finally, concentration can shift to more sophisticated processing—better visualization, reporting, and algorithmic processing and prediction. In my experience, most organizations have huge holes in the base of this pyramid—they lack reliable complete data flow—but want to jump directly to advanced data modeling techniques. This is completely backwards. So the question is, how can we build reliable data flow throughout all the data systems in an organization?
  
  +100 data-integration
3. rufuspollock 01 Dec 2019
  
  in Public
  
  Data integration is making all the data an organization has available in all its services and systems.
  
  data data-integration
Visit annotations in context

Tags

data-integration

data

anecdote

+100

Annotators

rufuspollock

URL

engineering.linkedin.com/distributed-systems/log-what-every-software-engineer-should-know-about-real-time-datas-unifying
Nov 2019
rstudio-pubs-static.s3.amazonaws.com rstudio-pubs-static.s3.amazonaws.com

Manipulating Time Series Data in R with xts & zoo

1
1. udaybhaskar 30 Nov 2019
  
  in Public
  
  1. Introduction to eXtensible Time Series, using xts and zoo for time series Introducing xts and
  
  question?
  
  data analysis data science
Visit annotations in context

Tags

data analysis

data science

Annotators

udaybhaskar

URL

rstudio-pubs-static.s3.amazonaws.com/288218_117e183e74964557a5da4fc5902fc671.html
www.culturecreates.com www.culturecreates.com

Culture Creates Inc

1
1. mlenc 26 Nov 2019
  
  in Public
  
  linked data arts digital strategy
Visit annotations in context

Tags

linked data

arts

digital strategy

Annotators

mlenc

URL

culturecreates.com/fr/index.html
www.ag-grid.com www.ag-grid.com

ag-Grid

2
1. TylerRick 26 Nov 2019
  
  in Public
  
  Filter
  
  ag-grid data table: filtering
2. TylerRick 26 Nov 2019
  
  in Public
  
  Cell Editor
  
  ag-grid data table: cell editor
Visit annotations in context

Tags

data table: cell editor

data table: filtering

ag-grid

Annotators

TylerRick

URL

ag-grid.com/react-hooks/
www.theregister.co.uk www.theregister.co.uk

Sure, we made your Wi-Fi routers phone home with telemetry, says Ubiquiti. What of it?

1
1. TylerRick 25 Nov 2019
  
  in Public
  
  telemetry sending user data to server without user's consent GDPR fallacies: just because it meets some certification/compliance doesn't mean it can be trusted/is okay
Visit annotations in context

Tags

sending user data to server without user's consent

GDPR

telemetry

fallacies: just because it meets some certification/compliance doesn't mean it can be trusted/is okay

Annotators

TylerRick

URL

theregister.co.uk/2019/11/07/ubiquiti_networks_phone_home/
github.com github.com

gorhill/uBlock

2
1. TylerRick 25 Nov 2019
  
  in Public
  
  Second, uBlock Origin does not have a dedicated server, it can't "phone home" with your browsing data, there is only GitHub, and GitHub is completely unrelated to uBlock Origin.
  
  sending user data to server without user's consent browser extensions
2. TylerRick 25 Nov 2019
  
  in Public
  
  Is there a home server?
  
  sending user data to server without user's consent
Visit annotations in context

Tags

sending user data to server without user's consent

browser extensions

Annotators

TylerRick

URL

github.com/gorhill/uBlock/wiki/Can-you-trust-uBlock-Origin
trackmenot.io trackmenot.io

TrackMeNot

1
1. TylerRick 25 Nov 2019
  
  in Public
  
  TrackMeNot is user-installed and user-managed, residing wholly on users' system and functions without the need for 3rd-party servers or services. Placing users in full control is an essential feature of TrackMeNot, whose purpose is to protect against the unilateral policies set by search companies in their handling of our personal information.
  
  data resides locally on user's computer
Visit annotations in context

Tags

data resides locally on user's computer

Annotators

TylerRick

URL

trackmenot.io/
theodi.org theodi.org

Data Ecosystem Mapping tool – The ODI

1
1. mlenc 18 Nov 2019
  
  in Public
  
  data ecosystem mapping odi
Visit annotations in context

Tags

data ecosystem mapping

odi

Annotators

mlenc

URL

theodi.org/article/data-ecosystem-mapping-tool/
github.com github.com

pandas-dev/pandas

1
1. bourbakis 18 Nov 2019
  
  in Public
  
  data-analysis pandas flexible alignment python
Visit annotations in context

Tags

data-analysis

flexible

alignment

pandas

python

Annotators

bourbakis

URL

github.com/pandas-dev/pandas
depictdatastudio.com depictdatastudio.com

The Data Visualization Checklist, 2016 Edition | Depict Data Studio

1
1. pbk1 16 Nov 2019
  
  in Public
  
  landscape vs. portrait.
  
  slides are landscape, reports are portrait!
  
  data visualization
Visit annotations in context

Tags

data visualization

Annotators

pbk1

URL

depictdatastudio.com/checklist/
github.com github.com

davidguttman/react-pivot

1
1. TylerRick 15 Nov 2019
  
  in Public
  
  react data table
Visit annotations in context

Tags

data table

react

Annotators

TylerRick

URL

github.com/davidguttman/react-pivot
github.com github.com

reactabular/reactabular

1
1. TylerRick 15 Nov 2019
  
  in Public
  
  data table react react component Reactabular
Visit annotations in context

Tags

data table

react

Reactabular

react component

Annotators

TylerRick

URL

github.com/reactabular/reactabular
reactabular.js.org reactabular.js.org

Reactabular 8.17.0 – Introduction

1
1. TylerRick 15 Nov 2019
  
  in Public
  
  data table react canonical website Reactabular
Visit annotations in context

Tags

canonical website

data table

react

Reactabular

Annotators

TylerRick

URL

reactabular.js.org/
rsuite.github.io rsuite.github.io

RSUITE Table

1
1. TylerRick 15 Nov 2019
  
  in Public
  
  react data table tree component rsuite component library
Visit annotations in context

Tags

data table

tree component

react

rsuite component library

Annotators

TylerRick

URL

rsuite.github.io/rsuite-table/
www.ag-grid.com www.ag-grid.com

ag-Grid

1
1. TylerRick 14 Nov 2019
  
  in Public
  
  how to nest grids inside grids using a Master / Detail configuration
  
  ag-grid data table
Visit annotations in context

Tags

data table

ag-grid

Annotators

TylerRick

URL

ag-grid.com/javascript-grid-master-detail/
github.com github.com

sematext/sematable

1
1. TylerRick 14 Nov 2019
  
  in Public
  
  react data table
Visit annotations in context

Tags

data table

react

Annotators

TylerRick

URL

github.com/sematext/sematable
shine.wiki shine.wiki

Shineout api document 1.3.x

1
1. TylerRick 14 Nov 2019
  
  in Public
  
  data table react Shineout
Visit annotations in context

Tags

data table

Shineout

react

Annotators

TylerRick

URL

shine.wiki/1.3.x/en/components/Table
github.com github.com

gregnb/mui-datatables

1
1. TylerRick 14 Nov 2019
  
  in Public
  
  Too bad I have to choose between Material Design (Material UI) and https://www.ag-grid.com (which has a lot more features).
  
  data table @material-ui/core integration
Visit annotations in context

Tags

data table

integration

@material-ui/core

Annotators

TylerRick

URL

github.com/gregnb/mui-datatables
www.theverge.com www.theverge.com

Google reveals ‘Project Nightingale’ after being accused of secretly gathering personal health records

1
1. pivic 12 Nov 2019
  
  in Public
  
  Google has confirmed that it partnered with health heavyweight Ascension, a Catholic health care system based in St. Louis that operates across 21 states and the District of Columbia.
  
  What happened to 'thou shalt not steal'?
  
  google surveillance capitalism abuse privacy healthcare data
Visit annotations in context

Tags

data

surveillance capitalism

abuse

privacy

google

healthcare

Annotators

pivic

URL

theverge.com/2019/11/11/20959771/google-health-records-project-nightingale-privacy-ascension
thenextweb.com thenextweb.com

Amazon's roadmap for Alexa is scarier than anything Facebook or Twitter is doing

1
1. pivic 11 Nov 2019
  
  in Public
  
  Speaking with MIT Technology Review, Rohit Prasad, Alexa’s head scientist, has now revealed further details about where Alexa is headed next. The crux of the plan is for the voice assistant to move from passive to proactive interactions. Rather than wait for and respond to requests, Alexa will anticipate what the user might want. The idea is to turn Alexa into an omnipresent companion that actively shapes and orchestrates your life. This will require Alexa to get to know you better than ever before.
  
  This is some next-level onslaught.
  
  surveillance capitalism amazon alexa capitalism data privacy abuse
Visit annotations in context

Tags

data

surveillance capitalism

alexa

abuse

privacy

capitalism

amazon

Annotators

pivic

URL

thenextweb.com/artificial-intelligence/2019/11/08/amazons-roadmap-for-alexa-is-scarier-than-anything-facebook-or-twitter-is-doing/
www.srdc.org www.srdc.org

adult-learning-final-report.pdf

1
1. jab678 09 Nov 2019
  
  in Public
  
  This article is a great example of a research model in measuring outcomes of adult learning.
  
  ETC556 adult learning adult learning research model framework for research data needs
Visit annotations in context

Tags

ETC556

adult learning

data needs

framework for research

adult learning research model

Annotators

jab678

URL

srdc.org/media/199726/adult-learning-final-report.pdf
www.thirdsectorcap.org www.thirdsectorcap.org

Outcomes-Based Financing Data Advisory Council | Third Sector Capital Partners

1
1. mlenc 08 Nov 2019
  
  in Public
  
  Outcomes-Based Financing Data Advisory Council
  
  outcomes outcomes-based admin data social services
Visit annotations in context

Tags

outcomes-based

admin data

outcomes

social services

Annotators

mlenc

URL

thirdsectorcap.org/news/outcomes-based-financing-data-advisory-council/
www.publicsafety.gc.ca www.publicsafety.gc.ca

Research Summary - Developing a Common Data Standard for Measuring Attitudes toward the Police in Canada

1
1. mlenc 08 Nov 2019
  
  in Public
  
  open data police security canada federal government data standard
Visit annotations in context

Tags

police

federal government

canada

security

data standard

open data

Annotators

mlenc

URL

publicsafety.gc.ca/cnt/rsrcs/pblctns/2019-s003/index-en.aspx
osp.od.nih.gov osp.od.nih.gov

NIH Data Management and Sharing Activities Related to Public Access and Open Science

1
1. mlenc 08 Nov 2019
  
  in Public
  
  Draft NIH Policy for Data Management and Sharing
  
  data managment nih funders data infrastructure open data
Visit annotations in context

Tags

data infrastructure

data managment

funders

nih

open data

Annotators

mlenc

URL

osp.od.nih.gov/scientific-sharing/nih-data-management-and-sharing-activities-related-to-public-access-and-open-science/
www.publicsafety.gc.ca www.publicsafety.gc.ca

Developing a Common Data Standard for Measuring Attitudes toward the Police in Canada

1
1. mlenc 08 Nov 2019
  
  in Public
  
  open data data standard canada federal government police survey data
Visit annotations in context

Tags

police

federal government

canada

data standard

survey data

open data

Annotators

mlenc

URL

publicsafety.gc.ca/cnt/rsrcs/pblctns/2019-r003/index-en.aspx
codeactsineducation.wordpress.com codeactsineducation.wordpress.com

Psychodata

1
1. mrkrndvs 08 Nov 2019
  
  in Public
  
  SEL measurement is being done in myriad ways, involving multiple different conceptualizations of SEL, different political positions, and different sectoral interests.
  
  Here I am reminded of the book Counting What Counts
  
  SEL Data
Visit annotations in context

Tags

Data

SEL

Annotators

mrkrndvs

URL

codeactsineducation.wordpress.com/2019/10/07/psychodata/
a-little-book-of-r-for-time-series.readthedocs.io a-little-book-of-r-for-time-series.readthedocs.io

Using R for Time Series Analysis — Time Series 0.2 documentation

1
1. udaybhaskar 07 Nov 2019
  
  in Public
  
  This booklet itells you how to use the R statistical software to carry out some simple analyses that are common in analysing time series data.
  
  what is time series?
  
  time series Statistics data analysis
Visit annotations in context

Tags

Statistics

data analysis

time series

Annotators

udaybhaskar

URL

a-little-book-of-r-for-time-series.readthedocs.io/en/latest/src/timeseries.html
www.bleepingcomputer.com www.bleepingcomputer.com

Google Bans AdNauseam from Chrome, the Ad Blocker That Clicks on All Ads

1
1. TylerRick 05 Nov 2019
  
  in Public
  
  "While we hope that Google will lift these unwarranted sanctions for AdNauseam, it highlights a much more serious problem for Chrome users," the AdNauseam team adds. "It is frightening to think that at any moment Google can quietly make your extensions and data disappear, without so much as a warning."
  
  ability for 3rd party to delete your data Google: evil Chrome browser
Visit annotations in context

Tags

ability for 3rd party to delete your data

Chrome browser

Google: evil

Annotators

TylerRick

URL

bleepingcomputer.com/news/google/google-bans-adnauseam-from-chrome-the-ad-blocker-that-clicks-on-all-ads/
Oct 2019
reacttraining.com reacttraining.com

Reach UI - Styling

1
1. TylerRick 31 Oct 2019
  
  in Public
  
  "Element" SelectorsEach component has a data-reach-* attribute on the underlying DOM element that you can think of as the "element" for the component.
  
  style-related JavaScript libraries HTML: data- attributes reach-ui
Visit annotations in context

Tags

style-related JavaScript libraries

reach-ui

HTML: data- attributes

Annotators

TylerRick

URL

reacttraining.com/reach-ui/styling/
stackoverflow.blog stackoverflow.blog

Research update: Coding on the Weekends - Stack Overflow Blog

1
1. TylerRick 30 Oct 2019
  
  in Public
  
  Data Scientist
  
  data scientist
Visit annotations in context

Tags

data scientist

Annotators

TylerRick

URL

stackoverflow.blog/2019/10/28/research-update-coding-on-the-weekends/
blogs.worldbank.org blogs.worldbank.org

New resources for sovereign ESG data and investors

1
1. mlenc 29 Oct 2019
  
  in Public
  
  sdg data sdg reporting worldbank world bank data portal api nonprofit data
Visit annotations in context

Tags

nonprofit data

reporting

worldbank

data portal

sdg data

api

sdg

world bank

Annotators

mlenc

URL

blogs.worldbank.org/opendata/new-resources-sovereign-esg-data-and-investors
www.data.gov www.data.gov

Tools - Data.gov

1
1. SamRose 28 Oct 2019
  
  in Public
  
  food21 data source food resilience
Visit annotations in context

Tags

data source

food resilience

food21

Annotators

SamRose

URL

data.gov/climate/foodresilience/foodresilience-tools
www.fast.ai www.fast.ai

fast.ai · Making neural nets uncool again

1
1. fuelpress 20 Oct 2019
  
  in Public
  
  I frequently talk with people who are not that concerned about surveillance, or who feel that the positives outweigh the risks. Here, I want to share some important truths about surveillance: Surveillance can facilitate human rights abuses and even genocide Data is often used for different purposes than why it was collected Data often contains errors Surveillance typically operates with no accountability Surveillance changes our behavior Surveillance disproportionately impacts the marginalized Data privacy is a public good We don’t have to accept invasive surveillance
  
  #data
Visit annotations in context

Tags

#data

Annotators

fuelpress

URL

fast.ai/
press.anu.edu.au press.anu.edu.au

Indigenous Data Sovereignty

1
1. mlenc 15 Oct 2019
  
  in Public
  
  indigenous data sovereignty data managment toread
Visit annotations in context

Tags

indigenous data sovereignty

toread

data managment

Annotators

mlenc

URL

press.anu.edu.au/publications/series/caepr/indigenous-data-sovereignty
www.iste.co.uk www.iste.co.uk

Geographic Data Imperfection 1 - ISTE

1
1. almereyda 15 Oct 2019
  
  in Public
  
  uncertainty gis book data
Visit annotations in context

Tags

data

book

gis

uncertainty

Annotators

almereyda

URL

iste.co.uk/book.php
mutabit.com mutabit.com

Untitled document

3
1. offray 09 Oct 2019
  
  in Public
  
  Terminar los proyectos que empezamos en 2019, con prioridad en Documentatón, ya que no es un cover, sino que es nuestro propio libro.
  
  Para mí el tema de acabarlo son recursos (tiempo y dinero, etc). Podemos ir avanzando de a trozos un capítulo a la vez, haciéndolo de encuentro en encuentro, pero esto daría un ritmo muy lento. La experiencia previa muestra que esto no es sostenible y que si queremos un libro terminado, más que un esfuerzo colectivo, se requerirá un alto esfuerzo individual. Como muestra la gráfica de reportes de la documentatón, una persona puede hacer más que la suma de las restantes (hablando de no temerle a la soledad):
  
  Sin embargo, si esto es lo que está pasando, reflejando las métricas de muchos proyectos de software libre, dependemos fuertemente de los tiempos de esos individuos. En mi caso, no puedo continuar con la documentatón hasta no resolver el tema de los artículos de mi graduación, que se volvió una verdadera telenovela (eso merece su entrada de blog aparte) y preferí asuntos como los Data Haiku, precisamente porque son actividades más puntuales que todo un libro, que luego pueden convertirse en capítulos de uno (por ejemplo el de Datactivismos) pero transmitiendo ese aire de lo ágil y de lo terminado, que precisamente quisiéramos comunicar.
  
  Creo que tenemos que reconocer en las dinámicas comunitarias, qué podemos hacer en ellas y en cuáles ritmos, y entender que lo otro requerirá de recursos extra (económicos, personales, etc) que tendremos que proveer como personas naturales o jurídicas, con nuestro propio esfuerzo o el de nuestras empresas/fundaciones.
  
  data haiku documentatón
2. offray 06 Oct 2019
  
  in Public
  
  Selección comunitaria de temas para Data Weeks o Data Rodas. Apoyo de proyectos de los participantes de la comunidad. Reuniones periódicas de la Comunidad, algo así como Data Roda el primer viernes de cada mes, así sea para saludarnos síncronamente y ver en qué andamos y hacer un encera - brilla de bacanes.
  
  Estos tres puntos se podrían juntar con la idea de que los participantes propongan sus propios proyectos y se apropien de la planeación y ejecución de las Data Rodas o Data Weeks venideros.
  
  Sólo quitaría el carácter periódico, pues creo que una de las potencias de nuestra comunidad es responder flexiblemente a lo eventual. Por ejemplo, ahora tenemos un periodo electoral en Colombia. De allí surgió mi preocupación por visualizar financiación de campañas, pero los eventos de la semana pasada derivaron en blikis, con soporte de comentarios. Una reacción ágil a la contingencia y no el seguimiento riguroso de algo pre-planeado (a mi me gustaría retomar lo de financiación de campañas, pero será luego).
  
  De nuevo la sugerencia, como dije en mi entrada de respuesta a esta, y en otras ocasiones es sustituir la planeación por la coordinación. Mi propuesta de coordinación es la siguiente:
  
  Los miembros que quieren ver otras temáticas las proponen en los canales comunitarios y se apersonan de su preparación y ejecución.
  
  Los otros miembros respondemos a esas iniciativas autónomas, en solidaridad, acompañando esas sesiones y aportándoles.
  
  Al final de cada evento, miramos hacia dónde podemos llevar los otros.
  
  planeación vs coordinación data weeks data rodas
3. offray 05 Oct 2019
  
  in Public
  
  Si puedo aportar una herramienta más a Grafoscopio, quiero que sea la querendura.
  
  Me parece muy potente la querendura como metodología. Sin embargo, por lo pronto siento que es un listado amplio de ideas sueltas en el enlace que nos presentas y me gustaría indagar por las prácticas concretas que la hacen posible.
  
  querendura data weeks
Visit annotations in context

Tags

documentatón

data rodas

data weeks

data haiku

planeación vs coordinación

querendura

Annotators

offray

URL

mutabit.com/repos.fossil/dataweek/doc/tip/Participantes/Hiperterminal/blog/dataweek15.md.html
www.ers.usda.gov www.ers.usda.gov

Independent Grocery Stores in the Changing Landscape of the U.S. Food Retail Industry

1
1. SamRose 04 Oct 2019
  
  in Public
  
  https://www.ers.usda.gov/webdocs/publications/85783/err-240.pdf?v=0
  
  food21 grocery data
Visit annotations in context

Tags

data

food21

grocery

Annotators

SamRose

URL

ers.usda.gov/webdocs/publications/85783/err-240.pdf
Sep 2019
rupress-org.ezproxy.rice.edu rupress-org.ezproxy.rice.edu

Error bars in experimental biology. J Cell Biol

1
1. pbk1 30 Sep 2019
  
  in Public
  
  if n is very small (for example n = 3), rather than showing error bars and statistics, it is better to simply plot the individual data points.
  
  plot-individual-data-points
Visit annotations in context

Tags

plot-individual-data-points

Annotators

pbk1

URL

rupress-org.ezproxy.rice.edu/jcb/article/177/1/7/34602/Error-bars-in-experimental-biology
codeburst.io codeburst.io

The Curious Case of Mobx State Tree

1
1. TylerRick 25 Sep 2019
  
  in Public
  
  Keep the ergonomics of stable reference and directly mutable objects. In other words; be able to have a variable pointing to an object, and make subsequent reads or writes to it. Without needing to fear that you’re working with old data. While, in the background,..State is stored in an immutable, structurally shared tree.
  
  immutable data
Visit annotations in context

Tags

immutable data

Annotators

TylerRick

URL

codeburst.io/the-curious-case-of-mobx-state-tree-7b4e22d461f
mobx.js.org mobx.js.org

Introduction | MobX

1
1. TylerRick 25 Sep 2019
  
  in Public
  
  With MobX you don't need to normalize your data.
  
  flip side: https://codeburst.io/the-curious-case-of-mobx-state-tree-7b4e22d461f:
  
  MobX cannot guarantee your data is JSON serializable,
  
  normalizing data
Visit annotations in context

Tags

normalizing data

Annotators

TylerRick

URL

mobx.js.org/
arthurperret.fr arthurperret.fr

Hyperdocumentation: origin and evolution of a concept

1
1. loupbrun 24 Sep 2019
  
  in Public
  
  the absence of a social contract
  
  actual level of consent of individuals being documented (and by whom? by private corporations, mostly)
  
  privacy data collection data trail mass surveillance hyperdocumentation
Visit annotations in context

Tags

mass surveillance

hyperdocumentation

data collection

data trail

privacy

Annotators

loupbrun

URL

arthurperret.fr/publications/hyperdocumentation/
lateraleconomics.com.au lateraleconomics.com.au

Untitled document

2
1. cpsupolicyresearch 24 Sep 2019
  
  in Public
  
  Estimated economic benefit of data linkage
  
  the potential value from linking Census data to administrative data sets is only beginning to be realised and holds immense potential.(In other work for the Population Health Research Network, Lateral Economics concluded that data linkage generated over $16 for every dollar invested).
  
  ABS Census Data Value 2019 Lateral Economics Cost Benefit Analysis
2. cpsupolicyresearch 24 Sep 2019
  
  in Public
  
  Cost reduction suggestion
  
  there may be ways to reduce costs associated with the development of Census-equivalent statistics, including relying less on the general public to answer questions every five years
  
  ABS Census Efficiency Data 2019 Lateral Economics Cost reduction
Visit annotations in context

Tags

Cost Benefit Analysis

Cost reduction

Efficiency

Census

2019

Lateral Economics

ABS

Data

Value

Annotators

cpsupolicyresearch

URL

lateraleconomics.com.au/wp-content/uploads/LE-Census-Report-ABS-Full-19-Sept.pdf
theonn.ca theonn.ca

ONN in conversation: Yes, government funding will get easier for nonprofits

1
1. mlenc 19 Sep 2019
  
  in Public
  
  transfer payments grants ontario open data
Visit annotations in context

Tags

transfer payments

grants ontario

open data

Annotators

mlenc

URL

theonn.ca/onn-conversation-yes-government-funding-will-get-easier-nonprofits/
wisc.pb.unizin.org wisc.pb.unizin.org

Protected: Victorian(ists), Reader Engagement, and… Surveillance Capital?

1
1. Naomi.Salmon 18 Sep 2019
  
  in UW-Madison Pressbooks
  
  “But then again,” a person who used information in this way might say, “it’s not like I would be deliberately discriminating against anyone. It’s just an unfortunate proxy variable for lack of privilege and proximity to state violence.
  
  In the current universe, Twitter also makes a number of predictions about users that could be used as proxy variables for economic and cultural characteristics. It can display things like your audience's net worth as well as indicators commonly linked to political orientation. Triangulating some of this data could allow for other forms of intended or unintended discrimination.
  
  I've already been able to view a wide range (possibly spurious) information about my own reading audience through these analytics. On September 9th, 2019, I started a Twitter account for my 19th Century Open Pedagogy project and began serializing installments of critical edition, The Woman in White: Grangerized. The @OPP19c Twitter account has 62 followers as of September 17th.
  
  Having followers means I have access to an audience analytics toolbar. Some of the account's followers are nineteenth-century studies or pedagogy organizations rather than individuals. Twitter tracks each account as an individual, however, and I was surprised to see some of the demographics Twitter broke them down into. (If you're one of these followers: thank you and sorry. I find this data a bit uncomfortable.)
  
  Within this dashboard, I have a "Consumer Buying Styles" display that identifies categories such as "quick and easy" "ethnic explorers" "value conscious" and "weight conscious." These categories strike me as equal parts confusing and problematic: (Link to image expansion)
  
  I have a "Marital Status" toolbar alleging that 52% of my audience is married and 49% single.
  
  I also have a "Home Ownership" chart. (I'm presuming that the Elizabeth Gaskell House Museum's Twitter is counted as an owner...)
  
  ....and more
  
  undissertation reader analytics data analytics
Visit annotations in context

Tags

undissertation

data analytics

reader analytics

Annotators

Naomi.Salmon

URL

wisc.pb.unizin.org/undissertating19c/chapter/engagement-surveillance-capital/
batumi.estate batumi.estate

Open Graph protocol

1
1. TylerRick 18 Sep 2019
  
  in Public
  
  metadata Open Graph Protocol linked data semantic web
Visit annotations in context

Tags

linked data

semantic web

metadata

Open Graph Protocol

Annotators

TylerRick

URL

batumi.estate/ru
www.semantic-web-journal.net www.semantic-web-journal.net

swj282_1.pdf

1
1. TylerRick 18 Sep 2019
  
  in Public
  
  Open Graph Protocol linked data semantic web
Visit annotations in context

Tags

linked data

semantic web

Open Graph Protocol

Annotators

TylerRick

URL

semantic-web-journal.net/sites/default/files/swj282_1.pdf
github.com github.com

MarioRuiz/nice_hash

1
1. TylerRick 03 Sep 2019
  
  in Public
  
  ruby data generation
Visit annotations in context

Tags

data generation

ruby

Annotators

TylerRick

URL

github.com/MarioRuiz/nice_hash
www.w3.org www.w3.org

Web Architecture: Generic Resources

1
1. tilgovi 02 Sep 2019
  
  in Public
  
  On the other hand, a resource may be generic in that as a concept it is well specified but not so specifically specified that it can only be represented by a single bit stream. In this case, other URIs may exist which identify a resource more specifically. These other URIs identify resources too, and there is a relationship of genericity between the generic and the relatively specific resource.
  
  I was not aware of this page when the Web Annotations WG was working through its specifications. The word "Specific Resource" used in the Web Annotations Data Model Specification always seemed adequate, but now I see that it was actually quite a good fit.
  
  annotation Web Annotation Web Annotation Data Model HTTP TimBL
Visit annotations in context

Tags

Web Annotation Data Model

Web Annotation

TimBL

HTTP

annotation

Annotators

tilgovi

URL

w3.org/DesignIssues/Generic.html
Aug 2019
docutopia.tupale.co docutopia.tupale.co

Data Roda 34: Cambiar Juntos - CodiMD

1
1. edycop 29 Aug 2019
  
  in Public
  
  Data Roda 34: Cambiar Juntos
  
  Data Roda 34: Cambiar Juntos
  
  data roda
Visit annotations in context

Tags

data roda

Annotators

edycop

URL

docutopia.tupale.co/dataroda34
github.com github.com

Way to do "fetchMore"/"pagination"/"infinite scroll"? · Issue #36 · marcin-piela/react-fetching-library

1
1. TylerRick 28 Aug 2019
  
  in Public
  
  fetching data react-fetching-library example
Visit annotations in context

Tags

react-fetching-library

fetching data

example

Annotators

TylerRick

URL

github.com/marcin-piela/react-fetching-library/issues/36
mutabit.com mutabit.com

Data Week: Lectura anotada

2
1. edycop 28 Aug 2019
  
  in Public
  
  Hi, this is en example of annotation.
  
  data-week
2. edycop 28 Aug 2019
  
  in Public
  
  Hi, this is en example of annotation.
  
  data-week
Visit annotations in context

Tags

data-week

Annotators

edycop

URL

mutabit.com/repos.fossil/dataweek/doc/tip/wiki/lectura-anotada.md
codesandbox.io codesandbox.io

CodeSandbox

1
1. TylerRick 27 Aug 2019
  
  in Public
  
  fetching data react-fetching-library runnable example
Visit annotations in context

Tags

react-fetching-library

fetching data

runnable example

Annotators

TylerRick

URL

codesandbox.io/s/github/marcin-piela/react-fetching-library/tree/master/examples/use-query-hook
www.nature.com www.nature.com

Do no harm: a roadmap for responsible machine learning for health care

1
1. mlenc 27 Aug 2019
  
  in Public
  
  admin data machine learning
Visit annotations in context

Tags

machine learning

admin data

Annotators

mlenc

URL

nature.com/articles/s41591-019-0548-6
www.robinwieruch.de www.robinwieruch.de

How to fetch data in React - RWieruch

1
1. TylerRick 27 Aug 2019
  
  in Public
  
  supersededBy: https://www.robinwieruch.de/react-hooks-fetch-data
  
  react fetching data excellent: tutorial excellent technical writing
Visit annotations in context

Tags

excellent technical writing

fetching data

react

excellent: tutorial

Annotators

TylerRick

URL

robinwieruch.de/react-fetching-data
github.com github.com

CharlesStover/fetch-suspense

1
1. TylerRick 27 Aug 2019
  
  in Public
  
  react suspense fetching data
Visit annotations in context

Tags

fetching data

react suspense

Annotators

TylerRick

URL

github.com/CharlesStover/fetch-suspense
github.com github.com

marcin-piela/react-fetching-library

1
1. TylerRick 26 Aug 2019
  
  in Public
  
  inspiredBy: https://github.com/CharlesStover/fetch-suspense
  
  react-fetching-library fetching data react suspense react library
Visit annotations in context

Tags

react-fetching-library

fetching data

react suspense

react library

Annotators

TylerRick

URL

github.com/marcin-piela/react-fetching-library
www.robinwieruch.de www.robinwieruch.de

How to fetch data with React Hooks? - RWieruch

1
1. TylerRick 26 Aug 2019
  
  in Public
  
  Suspense
  
  supersedes: https://www.robinwieruch.de/react-fetching-data
  
  fetching data react-final-form react hooks
Visit annotations in context

Tags

fetching data

react hooks

react-final-form

Annotators

TylerRick

URL

robinwieruch.de/react-hooks-fetch-data
codesandbox.io codesandbox.io

CodeSandbox

1
1. TylerRick 26 Aug 2019
  
  in Public
  
  runnableExampleOf: https://github.com/dai-shi/react-hooks-async/tree/master/examples/05_axios/src
  
  runnable example react hooks async fetching data react-hooks-async
Visit annotations in context

Tags

fetching data

react hooks

async

runnable example

react-hooks-async

Annotators

TylerRick

URL

codesandbox.io/s/github/dai-shi/react-hooks-async/tree/master/examples/05_axios
github.com github.com

dai-shi/react-hooks-async

1
1. TylerRick 26 Aug 2019
  
  in Public
  
  hasRunnableExample: https://codesandbox.io/s/github/dai-shi/react-hooks-async/tree/master/examples/05_axios
  
  react hooks async fetching data react-hooks-async
Visit annotations in context

Tags

async

fetching data

react-hooks-async

react hooks

Annotators

TylerRick

URL

github.com/dai-shi/react-hooks-async/tree/master/examples/05_axios/src
itnext.io itnext.io

How to create React custom hooks for data fetching with useEffect

1
1. TylerRick 26 Aug 2019
  
  in Public
  
  AbortController useReducer react hooks fetching data aborting requests
Visit annotations in context

Tags

fetching data

react hooks

aborting requests

AbortController

useReducer

Annotators

TylerRick

URL

itnext.io/how-to-create-react-custom-hooks-for-data-fetching-with-useeffect-74c5dc47000a
storybook.grommet.io storybook.grommet.io

Storybook

1
1. TylerRick 14 Aug 2019
  
  in Public
  
  data table react component
Visit annotations in context

Tags

data table

react component

Annotators

TylerRick

URL

storybook.grommet.io/
github.com github.com

mbrn/material-table

1
1. TylerRick 13 Aug 2019
  
  in Public
  
  data table react component
Visit annotations in context

Tags

data table

react component

Annotators

TylerRick

URL

github.com/mbrn/material-table
material.io material.io

Data visualization

1
1. TylerRick 12 Aug 2019
  
  in Public
  
  Material Design Material System Introduction Material studies About our Material studies Basil Crane Fortnightly Owl Rally Reply Shrine Material Foundation Foundation overview Environment Surfaces Elevation Light and shadows Layout Understanding layout Pixel density Responsive layout grid Spacing methods Component behavior Applying density Navigation Understanding navigation Navigation transitions Search Color The color system Applying color to UI Color usage Text legibility Dark theme Typography The type system Understanding typography Language support Sound About sound Applying sound to UI Sound attributes Sound choreography Sound resources Iconography Product icons System icons Animated icons Shape About shape Shape and hierarchy Shape as expression Shape and motion Applying shape to UI Motion Understanding motion Speed Choreography Customization Interaction Gestures Selection States Material Guidelines Communication Confirmation & acknowledgement Data formats Data visualization Principles Types Selecting charts Style Behavior Dashboards Empty states Help & feedback Imagery Launch screen Onboarding Offline states Writing Guidelines overview Material Theming Overview Implementing your theme Components App bars: bottom App bars: top Backdrop Banners Bottom navigation Buttons Buttons: floating action button Cards Chips Data tables Dialogs Dividers Image lists Lists Menus Navigation drawer Pickers Progress indicators Selection controls Sheets: bottom Sheets: side Sliders Snackbars Tabs Text fields Tooltips Usability Accessibility Bidirectionality Platform guidance Android bars Android fingerprint Android haptics Android icons Android navigating between apps Android notifications Android permissions Android settings Android slices Android split-screen Android swipe to refresh Android text selection toolbar Android widget Cross-platform adaptation Data visualization Data visualization depicts information in graphical form. Contents Principles Types Selecting charts Style Behavior Dashboards Principles Data visualization is a form of communication that portrays dense and complex information in graphical form. The resulting visuals are designed to make it easy to compare data and use it to tell a story – both of which can help users in decision making. Data visualization can express data of varying types and sizes: from a few data points to large multivariate datasets. AccuratePrioritize data accuracy, clarity, and integrity, presenting information in a way that doesn’t distort it. HelpfulHelp users navigate data with context and affordances that emphasize exploration and comparison. ScalableAdapt visualizations for different device sizes, while anticipating user needs on data depth, complexity, and modality. Types Data visualization can be expressed in different forms. Charts are a common way of expressing data, as they depict different data varieties and allow data comparison.The type of chart you use depends primarily on two things: the data you want to communicate, and what you want to convey about that data. These guidelines provide descriptions of various different types of charts and their use cases.Types of chartsChange over time charts show data over a period of time, such as trends or comparisons across multiple categories. Common use cases include: Category comparison...Read MoreChange over timeChange over time charts show data over a period of time, such as trends or comparisons across multiple categories.Common use cases include: Stock price performanceHealth statisticsChronologies Change over time charts include:1. Line charts 2. Bar charts 3. Stacked bar charts 4. Candlestick charts 5. Area charts 6. Timelines 7. Horizon charts 8. Waterfall charts Category comparisonCategory comparison charts compare data between multiple distinct categories. Use cases include: Income across different countriesPopular venue timesTeam allocations Category comparison charts include: 1. Bar charts 2. Grouped bar charts 3. Bubble charts 4. Multi-line charts 5. Parallel coordinate charts 6. Bullet charts RankingRanking charts show an item’s position in an ordered list.Use cases include: Election resultsPerformance statistics Ranking charts include: 1. Ordered bar charts 2. Ordered column charts 3. Parallel coordinate charts Part-to-wholePart-to-whole charts show how partial elements add up to a total.Use cases include: Consolidated revenue of product categoriesBudgets Part-to-whole charts include: 1. Stacked bar charts 2. Pie charts 3. Donut charts 4. Stacked area charts 5. Treemap charts 6. Sunburst charts CorrelationCorrelation charts show correlation between two or more variables.Use cases include: Income and life expectancy Correlation charts include: 1. Scatterplot charts 2. Bubble charts 3. Column and line charts 4. Heatmap charts DistributionDistribution charts show how often each values occur in a dataset. Use cases include: Population distributionIncome distribution Distribution charts include: 1. Histogram charts 2. Box plot charts 3. Violin charts 4. Density charts FlowFlow charts show movement of data between multiple states.Use cases include: Fund transfersVote counts and election results Flow charts include: 1. Sankey charts 2. Gantt charts 3. Chord charts 4. Network charts RelationshipRelationship charts show how multiple items relate to one other.Use cases includeSocial networksWord charts Relationship charts include: 1. Network charts 2. Venn diagrams 3. Chord charts 4. Sunburst charts Selecting charts Multiple types of charts can be suitable for depicting data. The guidelines below provide insight into how to choose one chart over another. Showing change over timeChange over time can be expressed using a time series chart, which is a chart that represents data points in chronological order. Charts that express...Read MoreChange over time can be expressed using a time series chart, which is a chart that represents data points in chronological order. Charts that express change over time include: line charts, bar charts, and area charts.Type of chartUsageBaseline value * Quantity of time seriesData typeLine chartTo express minor variations in dataAny valueAny time series (works well for charts with 8 or more time series)ContinuousBar chartTo express larger variations in data, how individual data points relate to a whole, comparisons, and rankingZero4 or fewerDiscrete or categoricalArea chartTo summarize relationships between datasets, how individual data points relate to a wholeZero (when there’s more than one series)8 or fewerContinuous* The baseline value is the starting value on the y-axis.Bar and pie chartsBoth bar charts and pie charts can be used to show proportion, which expresses a partial value in comparison to a total value. Bar charts,...Read MoreBoth bar charts and pie charts can be used to show proportion, which expresses a partial value in comparison to a total value. Bar charts express quantities through a bar’s length, using a common baselinePie charts express portions of a whole, using arcs or angles within a circleBar charts, line charts, and stacked area charts are more effective at showing change over time than pie charts. Because all three of these charts share the same baseline of possible values, it’s easier to compare value differences based on bar length. Do.Use bar charts to show changes over time or differences between categories. Don’t.Don’t use multiple pie charts to show changes over time. It’s difficult to compare the difference in size across each slice of the pie. Area chartsArea charts come in several varieties, including stacked area charts and overlapped area charts: Overlapping area charts are not recommended with more than two time...Read MoreArea charts come in several varieties, including stacked area charts and overlapped area charts:Stacked area charts show multiple time series (over the same time period) stacked on top of one another Overlapped area charts show multiple time series (over the same time period) overlapping one anotherOverlapping area charts are not recommended with more than two time series, as doing so can obscure the data. Instead, use a stacked area chart to compare multiple values over a time interval (with time represented on the horizontal axis). Do.Use a stacked area chart to represent multiple time series and maintain a good level of legibility. Don’t.Don’t use overlapped area charts as it obscures data values and reduces readability. Style Data visualizations use custom styles and shapes to make data easier to understand at a glance, in ways that suit the user’s needs and context.Charts can benefit from customizing the following: Graphical elementsTypographyIconographyAxes and labelsLegends and annotationsStyling different types of dataVisual encoding is the process of translating data into visual form. Unique graphical attributes can be applied to both quantitative data (such as temperature, price,...Read MoreVisual encoding is the process of translating data into visual form. Unique graphical attributes can be applied to both quantitative data (such as temperature, price, or speed) and qualitative data (such as categories, flavors, or expressions). These attributes include:ShapeColorSizeAreaVolumeLengthAnglePosition DirectionDensityExpressing multiple attributesMultiple visual treatments can be applied to more than one aspect of a data point. For example, a bar color can represent a category, while a bar’s length can express a value (like population size). Shape can be used to represent qualitative data. In this chart, each category is represented by a specific shape (circles, squares, and triangles), which makes it easy to compare data both within a specific range or against other categories. ShapeCharts can use shapes to display data in a range of ways. A shape can be styled as playful and curvilinear, or precise and high-fidelity,...Read MoreCharts can use shapes to display data in a range of ways. A shape can be styled as playful and curvilinear, or precise and high-fidelity, among other ways in between. Level of shape detailCharts can represent data at varying levels of precision. Data intended for close exploration should be represented by shapes that are suitable for interaction (in terms of touch target size and related
  
  data visualization
Visit annotations in context

Tags

data visualization

Annotators

TylerRick

URL

material.io/design/communication/data-visualization.html
www.youtube.com www.youtube.com

(89) React.js Conf 2015 - Immutable Data and React - YouTube

1
1. TylerRick 05 Aug 2019
  
  in Public
  
  .
  
  immutable data javascript react
Visit annotations in context

Tags

immutable data

javascript

react

Annotators

TylerRick

URL

youtube.com/watch
labsblog.f-secure.com labsblog.f-secure.com

Adversarial Attacks Against AI

1
1. Pictor 02 Aug 2019
  
  in Public
  
  Security Issues, Dangers And Implications Of Smart Systems
  
  machine-learning data-analysis artificial-intelligence
Visit annotations in context

Tags

machine-learning

data-analysis

artificial-intelligence

Annotators

Pictor

URL

labsblog.f-secure.com/2019/07/11/adversarial-attacks-against-ai/
Jul 2019
medium.com medium.com

How can Indigenous Data Sovereignty (IDS) be promoted and mainstreamed within open data movements?

1
1. mlenc 25 Jul 2019
  
  in Public
  
  Indigenous Data Sovereignty
  
  Indigenous Data Sovereignty ids open data
Visit annotations in context

Tags

open data

Indigenous Data Sovereignty

ids

Annotators

mlenc

URL

medium.com/@opendevmekong/how-can-indigenous-data-sovereignty-ids-be-promoted-and-mainstreamed-within-open-data-movements-e70464846b34
www.theatlantic.com www.theatlantic.com

The Economist Who Would Fix the American Dream

1
1. mlenc 24 Jul 2019
  
  in Public
  
  admin data evidence based policy
Visit annotations in context

Tags

admin data

evidence based policy

Annotators

mlenc

URL

theatlantic.com/magazine/archive/2019/08/raj-chettys-american-dream/592804/
mikeindustries.com mikeindustries.com

Superhuman is Spying on You » Mike Industries

1
1. Enkerli 24 Jul 2019
  
  in Public
  
  Every time your child opens the email, that person knows generally where they are (or specifically, if they have other info to triangulate against).
  
  Metadata Data Aggregation
Visit annotations in context

Tags

Data Aggregation

Metadata

Annotators

Enkerli

URL

mikeindustries.com/blog/archive/2019/06/superhuman-is-spying-on-you
journals.sagepub.com journals.sagepub.com

The Case for Alternative Social Media - Robert W. Gehl, 2015

1
1. chrisaldrich 21 Jul 2019
  
  in Public
  
  In contrast to such pseudonymous social networking, Facebook is notable for its longstanding emphasis on real identities and social connections.
  
  Lack of anonymity also increases Facebook's ability to properly link shadow profiles purchased from other data brokers.
  
  data brokerage surveillance capitalism Facebook
Visit annotations in context

Tags

surveillance capitalism

Facebook

data brokerage

Annotators

chrisaldrich

URL

journals.sagepub.com/doi/full/10.1177/2056305115604338
www.khanacademy.org www.khanacademy.org

Residual plots

1
1. Pictor 19 Jul 2019
  
  in Public
  
  our sum of squares is 41.187941.187941.1879
  
  Just considering the Y, and not the X. Calculating the residuals from the average/mean Y.
  
  data-analysis
Visit annotations in context

Tags

data-analysis

Annotators

Pictor

URL

khanacademy.org/math/ap-statistics/bivariate-data-ap/assessing-fit-least-squares-regression/a/r-squared-intuition
www.sthda.com www.sthda.com

FAMD - Factor Analysis of Mixed Data in R: Essentials - Articles - STHDA

1
1. intelligence.refinery 18 Jul 2019
  
  in Public
  
  it acts as PCA quantitative variables and as MCA for qualitative variables.
  
  Factor analysis of mixed data
Visit annotations in context

Tags

Factor analysis of mixed data

Annotators

intelligence.refinery

URL

sthda.com/english/articles/31-principal-component-methods-in-r-practical-guide/115-famd-factor-analysis-of-mixed-data-in-r-essentials/
sebastianraschka.com sebastianraschka.com

About Feature Scaling and Normalization

2
1. intelligence.refinery 17 Jul 2019
  
  in Public
  
  in clustering analyses, standardization may be especially crucial in order to compare similarities between features based on certain distance measures. Another prominent example is the Principal Component Analysis, where we usually prefer standardization over Min-Max scaling, since we are interested in the components that maximize the variance
  
  Use standardization, not min-max scaling, for clustering and PCA.
  
  Clustering PCA Data normalization
2. intelligence.refinery 05 Jul 2019
  
  in Public
  
  As a rule of thumb I’d say: When in doubt, just standardize the data, it shouldn’t hurt.
  
  Data normalization
Visit annotations in context

Tags

Clustering

PCA

Data normalization

Annotators

intelligence.refinery

URL

sebastianraschka.com/Articles/2014_about_feature_scaling.html
www.data4sdgs.org www.data4sdgs.org

Administrative Data Initiative

1
1. mlenc 17 Jul 2019
  
  in Public
  
  admin data sdg
Visit annotations in context

Tags

sdg

admin data

Annotators

mlenc

URL

data4sdgs.org/initiatives/administrative-data-initiative
www.data4sdgs.org www.data4sdgs.org

The Most Unsexy Data Could Hold the Most Promise

1
1. mlenc 17 Jul 2019
  
  in Public
  
  admin data sdg
Visit annotations in context

Tags

sdg

admin data

Annotators

mlenc

URL

data4sdgs.org/news/most-unsexy-data-could-hold-most-promise
www.gatesfoundation.org www.gatesfoundation.org

K-12 Education

1
1. jeremydean 16 Jul 2019
  
  in Public
  
  driven by data—where schools use data to identify a problem, select a strategy to address the problem, set a target for improvement, and iterate to make the approach more effective and improve student achievement.
  
  Gates data model.
  
  education data annotation Gates
Visit annotations in context

Tags

education

data

Gates

annotation

Annotators

jeremydean

URL

gatesfoundation.org/What-We-Do/US-Program/K-12-Education
www.sthda.com www.sthda.com

Visualizing Multivariate Categorical Data - Articles - STHDA

1
1. intelligence.refinery 09 Jul 2019
  
  in Public
  
  Balloon plot
  
  Balloon plot
  
  Data visualization
Visit annotations in context

Tags

Data visualization

Annotators

intelligence.refinery

URL

sthda.com/english/articles/32-r-graphics-essentials/129-visualizing-multivariate-categorical-data/
towardsdatascience.com towardsdatascience.com

Scale, Standardize, or Normalize with Scikit-Learn – Towards Data Science

1
1. intelligence.refinery 06 Jul 2019
  
  in Public
  
  how the features are all on the same relative scale. The relative spaces between each feature’s values have been maintained.
  
  Data scaling
Visit annotations in context

Tags

Data scaling

Annotators

intelligence.refinery

URL

towardsdatascience.com/scale-standardize-or-normalize-with-scikit-learn-6ccc7d176a02
scikit-learn.org scikit-learn.org

4.3. Preprocessing data — scikit-learn 0.19.dev0 documentation

1
1. intelligence.refinery 06 Jul 2019
  
  in Public
  
  many elements used in the objective function of a learning algorithm (such as the RBF kernel of Support Vector Machines or the l1 and l2 regularizers of linear models) assume that all features are centered around zero and have variance in the same order. If a feature has a variance that is orders of magnitude larger than others, it might dominate the objective function and make the estimator unable to learn from other features correctly as expected.
  
  Data normalization Data standardization
Visit annotations in context

Tags

Data standardization

Data normalization

Annotators

intelligence.refinery

URL

scikit-learn.org/stable/modules/preprocessing.html
Jun 2019
mutabit.com mutabit.com

bitacoraPorTemas.pdf

2
1. offray 29 Jun 2019
  
  in Public
  
  4ExportardatosdesdeTwitter
  
  data selfies
2. offray 29 Jun 2019
  
  in Public
  
  1Microblogsymicropolítica:retratosdedatosydataselfiesenTwitter
  
  data selfies
Visit annotations in context

Tags

data selfies

Annotators

offray

URL

mutabit.com/repos.fossil/dataweek/uv/Artefactos/DataSelfies/Visualizar2018/bitacoraPorTemas.pdf
docs.google.com docs.google.com

Food Abundance Index

1
1. SamRose 27 Jun 2019
  
  in Public
  
  commons engine regenerative agriculture data commons supply chain
Visit annotations in context

Tags

supply chain

regenerative agriculture

commons engine

data commons

Annotators

SamRose

URL

docs.google.com/document/d/148J2sx4UbOTgbhblpdnNrSHflshGrnPxxlWQ5uK6y-4/edit
varsellcm.r-forge.r-project.org varsellcm.r-forge.r-project.org

VarSelLCM

1
1. intelligence.refinery 25 Jun 2019
  
  in Public
  
  missing values are managed, without any pre-processing, by the model used to cluster with the assumption that values are missing completely at random.
  
  VarSelLCM package
  
  R Mixed-type data clustering
Visit annotations in context

Tags

R

Mixed-type data clustering

Annotators

intelligence.refinery

URL

varsellcm.r-forge.r-project.org/
Local file Local file

Practical Data Science with R, Second Edition MEAP V05

1
1. intelligence.refinery 23 Jun 2019
  
  in Public
  
  Success ina data science project comes not from access to any one exotic tool, but from having quantifiablegoals, good methodology, crossdiscipline interactions, and a repeatable workflow.
  
  Data science
Tags

Data science

Annotators

intelligence.refinery
Local file Local file

Invisible Women

1
1. nahmedin 20 Jun 2019
  
  in Public
  
  Academicsarealsoatfaulthere:arecentanalysisof29millionpapersinover15,000peer-reviewedtitlespublishedaroundthetimeoftheZikaandEbolaepidemicsfoundthatlessthan1%exploredthegenderedimpactoftheoutbreaks
  
  How do we prevent this pattern here at Georgia Tech? There is a very obvious gender gap, especially in STEM where bad data in medicine and engineering are collected? What are some mini steps we can take to encourage pursuing data for different backgrounds? Education is always first, starting with class similar to this one informing people about how gender plays a role. Perhaps then we can create projects exploring the issue in data related to each person's major.
  
  gendered data
Tags

gendered data

Annotators

nahmedin
www.threesixtygiving.org www.threesixtygiving.org

Informing policy discussions - 360Giving

1
1. mlenc 13 Jun 2019
  
  in Public
  
  policy grants data open grants data npdata
Visit annotations in context

Tags

policy grants data

npdata

open grants data

Annotators

mlenc

URL

threesixtygiving.org/2019/06/10/informing-policy-discussions/
sebastianraschka.com sebastianraschka.com

About Feature Scaling and Normalization

2
1. intelligence.refinery 11 Jun 2019
  
  in Public
  
  However, this doesn’t mean that Min-Max scaling is not useful at all! A popular application is image processing, where pixel intensities have to be normalized to fit within a certain range (i.e., 0 to 255 for the RGB color range). Also, typical neural network algorithm require data that on a 0-1 scale.
  
  Use min-max scaling for image processing & neural networks.
  
  Min-max scaling Neural networks Data normalization
2. intelligence.refinery 11 Jun 2019
  
  in Public
  
  The result of standardization (or Z-score normalization) is that the features will be rescaled so that they’ll have the properties of a standard normal distribution with μ=0μ=0\mu = 0 and σ=1σ=1\sigma = 1 where μμ\mu is the mean (average) and σσ\sigma is the standard deviation from the mean
  
  Data normalization Definitions
Visit annotations in context

Tags

Data normalization

Neural networks

Min-max scaling

Definitions

Annotators

intelligence.refinery

URL

sebastianraschka.com/Articles/2014_about_feature_scaling.html
link.springer.com link.springer.com

A semiparametric method for clustering mixed data

1
1. intelligence.refinery 11 Jun 2019
  
  in Public
  
  Threshold values of 0.8-0.9 are recommended for well separated clusters; to allow for overlapping clusters, we chose a threshold of 0.6.
  
  Mixed data clustering clustMixType
Visit annotations in context

Tags

clustMixType

Mixed data clustering

Annotators

intelligence.refinery

URL

link.springer.com/article/10.1007/s10994-016-5575-7
www.thirdsectorcap.org www.thirdsectorcap.org

Integrated Data Systems and Outcomes-Oriented Contracting: A Powerful Combination for Improving Outcomes | Third Sector Capital Partners

1
1. mlenc 06 Jun 2019
  
  in Public
  
  a two-generation approach to outcomes by linking data on individuals within a family or household unit.
  
  admin data integrated systems ids integrated data systems two-generation approach
Visit annotations in context

Tags

integrated data systems

two-generation approach

admin data

integrated systems

ids

Annotators

mlenc

URL

thirdsectorcap.org/blog/integrated-data-systems-and-outcomes-oriented-contracting-a-powerful-combination-for-improving-outcomes/
www.blagravetrust.org www.blagravetrust.org

Power and vulnerability in the charity-funder relationship - The Blagrave Trust

1
1. mlenc 05 Jun 2019
  
  in Public
  
  evaluation anonymous collective responsibility data
Visit annotations in context

Tags

anonymous

evaluation

data

collective responsibility

Annotators

mlenc

URL

blagravetrust.org/listening/power-and-vulnerability-in-the-charity-funder-relationship/
May 2019
engl201.opened.ca engl201.opened.ca

Visualize-This_Chapter1.pdf

2
1. eden_collyer 31 May 2019
  
  in Public
  
  1RWDOOPRYLHVKDYHWREHGRFXPHQWDULHVDQGQRWDOOYLVXDOL]DWLRQKDVWREHWUDGLWLRQDOFKDUWVDQGJUDSKV
  
  This is an interesting fact, usually when I think of visualization and data I go to the classic default charts and data. I'll have to keep this iin mind.
  
  Digital Humanities data visualization engl201
2. eden_collyer 31 May 2019
  
  in Public
  
  7KHEDVHRIWKHJUDSKLFLVVLPSO\DOLQHFKDUW+RZHYHUGHVLJQHOHPHQWVKHOSWHOOWKHVWRU\EHWWHU/DEHOLQJDQGSRLQWHUVSURYLGHFRQWH[WDQGKHOS\RXVHHZK\WKHGDWDLVLQWHUHVWLQJDQGOLQHZLGWKDQGFRORUGLUHFW\RXUH\HVWRZKDW¶VLPSRUWDQW
  
  I really like this because I don't see it often and it actually does draw my eye to the data and capture my interest.
  
  Digital Humanities engl201 data interesting
Visit annotations in context

Tags

data

visualization

Digital Humanities

interesting

engl201

Annotators

eden_collyer

URL

engl201.opened.ca/wp-content/uploads/sites/57/2019/05/Visualize-This_Chapter1.pdf
www.itweb.co.za www.itweb.co.za

Eight steps to success when designing document-centric workflows in financial institutions

1
1. malcolmjmr 24 May 2019
  
  in Public
  
  Virtually all BPMs have utilities for creating simple, data-gathering forms. And in many types of workflows, these simple forms may be adequate. However, in any workflow that includes complex document assembly (such as loan origination workflows), BPM forms are not likely to get the job done. Automating the assembly of complex documents requires ultra-sophisticated data-gathering forms, which can only be designed and created after the documents themselves have been automated. Put another way, you won't know which questions need to be asked to generate the document(s) until you've merged variables and business logic into the documents themselves. The variables you merge into the document serve as question fields in the data gathering forms. And here's the key point - since you have to use the document assembly platform to create interviews that are sophisticated enough to gather data for your complex documents, you might as well use the document assembly platform to generate all data-gathering forms in all of your workflows.
  
  data acquisition document benefits
Visit annotations in context

Tags

data

benefits

acquisition

document

Annotators

malcolmjmr

URL

itweb.co.za/content/XGxwQDM1bKRqlPVo
mutabit.com mutabit.com

Microsoft Word - Categoría dimensiones.docx

1
1. offray 19 May 2019
  
  in Public
  
  El ritmo de las actividades de diseño e instalación de redes comunitarias en veredas del municipio de Fusagasugá se ve acrecentado por las convocatorias internas de investigación de la Universidad de Cundinamarca que a lo largo del tiempo de vida de Red FusaLibrehan sido un músculo financiero que les permite acelerar los proc
  
  Interesante vínculo entre comunidad y universidad. En nuestro caso, no hemos logrado un vínculo permanente y si bien algunos dineros de convocatorias de investigación universitaria y convocatorias internacionales permitieron pagar parte de los Data Weeks, junto con una contribución menor de algunos asistentes, en general ha sido un proyecto financiado con recursos propios y préstamos familiares.
  
  data week financiación sostenibilidad
Visit annotations in context

Tags

data week

financiación

sostenibilidad

Annotators

offray

URL

mutabit.com/repos.fossil/offray-blog/uv/categoría-dimensiones.pdf
www.datacoalition.org www.datacoalition.org

An Executive Perspective on Open Data and Evidence-Based Policymaking

1
1. mlenc 17 May 2019
  
  in Public
  
  open data evidence-based policymaking data coalition
Visit annotations in context

Tags

open data

evidence-based policymaking

data coalition

Annotators

mlenc

URL

datacoalition.org/an-executive-perspective-on-open-data-and-evidence-based-policymaking/
www.ecfoundation.org www.ecfoundation.org

ECF Embraces Open Access | Edmonton Community Foundation

1
1. mlenc 17 May 2019
  
  in Public
  
  canada open data open grantmaking open grants data funder
Visit annotations in context

Tags

funder

open grants data

open grantmaking

canada

open data

Annotators

mlenc

URL

ecfoundation.org/blog/ecf-embraces-open-access/
medium.com medium.com

A data trust for Canada & the free flow of data between inter-provincial borders — lessons from…

1
1. mlenc 16 May 2019
  
  in Public
  
  data infrastructure powered by data admin data
Visit annotations in context

Tags

powered by data

data infrastructure

admin data

Annotators

mlenc

URL

medium.com/@natalie.mcgee/a-data-trust-for-canada-the-free-flow-of-data-between-inter-provincial-borders-lessons-from-b6f5cbb0a7d2
sustainablecopper.org sustainablecopper.org

ICA-Summary-Document-The-Impacts-of-Copper-Mining-in-Chile-FV-04.04.2018.pdf

1
1. jsch2202 15 May 2019
  
  in Public
  
  Chile has developed from 1990 to date, a considerable decrease in poverty rates, which fell from 40.5% in 1990 to 8.5% in 2015.
  
  this is a benefit
  
  k-graph k-data c-copper c-social
Visit annotations in context

Tags

c-social

c-copper

k-data

k-graph

Annotators

jsch2202

URL

sustainablecopper.org/wp-content/uploads/2018/05/ICA-Summary-Document-The-Impacts-of-Copper-Mining-in-Chile-FV-04.04.2018.pdf
www.nytimes.com www.nytimes.com

Opinion | The Trauma of Sanctuary

1
1. mlenc 14 May 2019
  
  in Public
  
  admin data pbd toread
Visit annotations in context

Tags

pbd

admin data

toread

Annotators

mlenc

URL

nytimes.com/2019/05/14/opinion/undocumented-immigrants.html
rctom.hbs.org rctom.hbs.org

State-owned copper mining: climate change vs. country development – Technology and Operations Management

2
1. ktim2201 14 May 2019
  
  in Public
  
  Developing economies’ copper demand has steadily grown over the last decades, fueling economic and social improvement. By 2011, China already represented 40% of the demand.
  
  Why does China need so much.
  
  c-copper c-conflict k-fact k-data
2. ktim2201 14 May 2019
  
  in Public
  
  Codelco is a state-owned Chilean mining company and the world’s largest copper producer. Based on their annual report and USGS statistics, they produced ~10% of the world’s copper in 2015 and own 8% of global reserves. They are also a large producer of greenhouse gas emissions. Last year, Codelco produced 3,2 t CO2e/millions tmf from both indirect and direct effects, and in 2011 it consumed 12% of the total national electricity supply.
  
  Goddamn they should start recylcling
  
  c-mine c-copper c-social-costs k-fact k-problem k-data
Visit annotations in context

Tags

c-social-costs

k-fact

k-data

k-problem

c-copper

c-mine

c-conflict

Annotators

ktim2201

URL

rctom.hbs.org/submission/state-owned-copper-mining-climate-change-vs-country-development/
www.montrealdatalicense.com www.montrealdatalicense.com

Montreal Data License

1
1. mlenc 13 May 2019
  
  in Public
  
  reading group open data license
Visit annotations in context

Tags

open data license

reading group

Annotators

mlenc

URL

montrealdatalicense.com/en
www.randhome.io www.randhome.io

2019 OSINT Guide

1
1. dem 11 May 2019
  
  in Public
  
  Methodology The classic OSINT methodology you will find everywhere is strait-forward: Define requirements: What are you looking for? Retrieve data Analyze the information gathered Pivoting & Reporting: Either define new requirements by pivoting on data just gathered or end the investigation and write the report.
  
  Etienne's blog! Amazing resource for OSINT; particularly focused on technical attacks.
  
  OSINT Open-Source Intelligence Data Analysis Investigation methodology tools Technology
Visit annotations in context

Tags

tools

OSINT

Open-Source

Intelligence

Analysis

Technology

methodology

Data

Investigation

Annotators

dem

URL

randhome.io/blog/2019/01/05/2019-osint-guide/
wellsky.com wellsky.com

WellSky Strengthens Interoperability by Supporting the Human Service Data Specification | WellSky

1
1. mlenc 10 May 2019
  
  in Public
  
  open referral data standards hifis hmis knowledge infrastructure social sector interoperability
Visit annotations in context

Tags

hifis

knowledge infrastructure

interoperability

open referral

hmis

social sector

data standards

Annotators

mlenc

URL

wellsky.com/blog/2019/05/09/wellsky-strengthens-interoperability-by-supporting-the-human-service-data-specification/
link.springer.com link.springer.com

Social Construction of Knowledge

1
1. LCS 03 May 2019
  
  in Public
  
  important distinction, information vs knowledge
  
  social construction of knowledge literacy data literacy
Visit annotations in context

Tags

social construction of knowledge

data literacy

literacy

Annotators

LCS

URL

link.springer.com/chapter/10.1007/978-3-663-05852-6_9
assets.publishing.service.gov.uk assets.publishing.service.gov.uk

Open aid, open societies: A vision for a transparent world

1
1. jfnb 02 May 2019
  
  in Public
  
  open aid - argument from dfid
  
  dfid open data transparency
Visit annotations in context

Tags

transparency

open data

dfid

Annotators

jfnb

URL

assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/682143/Open-Aid-Open-Societies.pdf
Apr 2019
mapthesystem.sbs.ox.ac.uk mapthesystem.sbs.ox.ac.uk

Canada – Map the System

1
1. mlenc 24 Apr 2019
  
  in Public
  
  landscape map the system open data
Visit annotations in context

Tags

open data

landscape

map the system

Annotators

mlenc

URL

mapthesystem.sbs.ox.ac.uk/canada/
www.canada.ca www.canada.ca

Minister of Health announces $81M initiative to increase access to health research data - Canada.ca

1
1. mlenc 23 Apr 2019
  
  in Public
  
  admin data funding cihr
Visit annotations in context

Tags

funding

admin data

cihr

Annotators

mlenc

URL

canada.ca/en/institutes-health-research/news/2019/04/minister-of-health-announces-81m-initiative-to-increase-access-to-health-research-data.html
www.globaldevhub.org www.globaldevhub.org

About Consultation for development of the 2019-2022 IATI Strategic Plan | Global Dev Hub

1
1. mlenc 20 Apr 2019
  
  in Public
  
  toread iati strategy data strategy
Visit annotations in context

Tags

data strategy

strategy

iati

toread

Annotators

mlenc

URL

globaldevhub.org/iati-sp-2019
yro.slashdot.org yro.slashdot.org

Canadian

1
1. mlenc 20 Apr 2019
  
  in Public
  
  admin data scanda hrsdc onboarding
Visit annotations in context

Tags

hrsdc

admin data

scanda

onboarding

Annotators

mlenc

URL

yro.slashdot.org/story/00/05/30/1214212/canadian-big-brother-database-scrapped
www.theglobeandmail.com www.theglobeandmail.com

Phillips slew more than Big Brother

1
1. mlenc 20 Apr 2019
  
  in Public
  
  longitudinal labour force file hrsdc scandal privacy admin data onbording admindata
Visit annotations in context

Tags

hrsdc

scandal

admin data

longitudinal labour force file

privacy

admindata

onbording

Annotators

mlenc

URL

theglobeandmail.com/news/national/phillips-slew-more-than-big-brother/article768007/
www.go-fair.org www.go-fair.org

Discovery - GO FAIR

1
1. mlenc 18 Apr 2019
  
  in Public
  
  open science fair data strategy academic strategies research outputs
Visit annotations in context

Tags

open science

academic strategies

data strategy

research outputs

fair

Annotators

mlenc

URL

go-fair.org/implementation-networks/overview/discovery/
blog.wikimedia.org blog.wikimedia.org

Wikimedia 2030: A draft strategic direction for our movement – Wikimedia Blog

1
1. mlenc 18 Apr 2019
  
  in Public
  
  wikipedia strategies data strategy
Visit annotations in context

Tags

data strategy

wikipedia strategies

Annotators

mlenc

URL

blog.wikimedia.org/2017/08/10/wikimedia-2030-draft-strategic-direction/
www.societybyte.swiss www.societybyte.swiss

How Wikidata Is Solving Its Chicken-or-Egg-Problem in the Field of Cultural Heritage

1
1. mlenc 18 Apr 2019
  
  in Public
  
  wikipedia strategies data strategies wikibase wikidata
Visit annotations in context

Tags

data strategies

wikibase

wikipedia strategies

wikidata

Annotators

mlenc

URL

societybyte.swiss/2018/11/07/how-wikidata-is-solving-its-chicken-or-egg-problem-in-the-field-of-cultural-heritage/
www.cigionline.org www.cigionline.org

Reclaiming Data Trusts

1
1. mlenc 18 Apr 2019
  
  in Public
  
  data trusts good criticism
Visit annotations in context

Tags

good criticism

data trusts

Annotators

mlenc

URL

cigionline.org/articles/reclaiming-data-trusts
pfc.ca pfc.ca

PFC Publications - Philanthropic Foundations Canada

1
1. mlenc 18 Apr 2019
  
  in Public
  
  Powered by Data wrote 4 of the resources on this page. "Measuring Outcomes" is about admin data. "Understanding the Philanthropic Landscape" is about open data - sp. open grants data. "Effective Giving" is an intro. And "Emerging Data Practices" is a tech backgrounder from June 2015.
  
  onboarding @pwrd_by_data philanthropy data data for impact data4impact
Visit annotations in context

Tags

data for impact

philanthropy data

@pwrd_by_data

data4impact

onboarding

Annotators

mlenc

URL

pfc.ca/resources/pfc-publications/
www.arl.org www.arl.org

ARL White Paper on Wikidata: Opportunities and Recommendations | Association of Research Libraries® | ARL®

1
1. mlenc 18 Apr 2019
  
  in Public
  
  wikidata linked data glam information infrastructure
Visit annotations in context

Tags

linked data

information infrastructure

wikidata

glam

Annotators

mlenc

URL

arl.org/publications-resources/4751-arl-white-paper-on-wikidata-opportunities-and-recommendations
www.philanthropy.com www.philanthropy.com

No Equity Without Everyone: Philanthropy Must Fully Include People With Disabilities (Opinion)

1
1. mlenc 17 Apr 2019
  
  in Public
  
  grantstory landscape open data
Visit annotations in context

Tags

open data

landscape

grantstory

Annotators

mlenc

URL

philanthropy.com/article/No-Equity-Without-Everyone-/245991
medium.com medium.com

Transforming (Digital) Government in Ontario: Part 1

1
1. mlenc 17 Apr 2019
  
  in Public
  
  open data ontario grantstory
Visit annotations in context

Tags

grantstory

open data

ontario

Annotators

mlenc

URL

medium.com/@amandaerinclarke/transforming-digital-government-in-ontario-part-1-1ef2e5157108
www.instituteforgovernment.org.uk www.instituteforgovernment.org.uk

Canada shows the way on government financial transparency

1
1. mlenc 17 Apr 2019
  
  in Public
  
  infobase transparency open data canada performance data results-based management accountability
Visit annotations in context

Tags

transparency

infobase

performance data

canada

open data

accountability

results-based management

Annotators

mlenc

URL

instituteforgovernment.org.uk/blog/canada-shows-way-government-financial-transparency
data.unicef.org data.unicef.org

Data for Children Strategic Framework - UNICEF DATA

1
1. mlenc 15 Apr 2019
  
  in Public
  
  data strategy
Visit annotations in context

Tags

data strategy

Annotators

mlenc

URL

data.unicef.org/resources/data-children-strategic-framework/
blog.socialcops.com blog.socialcops.com

Technology Archives - SocialCops

1
1. d3vr 13 Apr 2019
  
  in Public
  
  Interesting data science / development / technology blog from an Indian Start up
  
  programming data science development
Visit annotations in context

Tags

data science

development

programming

Annotators

d3vr

URL

blog.socialcops.com/category/technology/
wso2.com wso2.com

Conceptualizing the Knowledge Graph Construction Pipeline

1
1. mlenc 12 Apr 2019
  
  in Public
  
  pretty great intro to knowledge graphs
  
  knowledge graphs linked data knowledge infrastructure
Visit annotations in context

Tags

linked data

knowledge graphs

knowledge infrastructure

Annotators

mlenc

URL

wso2.com/blog/research/conceptualizing-the-knowledge-graph-construction-pipeline
smethur.st smethur.st

Mithering about the unmodellable

1
1. jfnb 11 Apr 2019
  
  in Public
  
  modelling UK parliament
  
  data model data standards
Visit annotations in context

Tags

data model

data standards

Annotators

jfnb

URL

smethur.st/posts/176135867
medium.com medium.com

Data-sharing in government: why it’s time for a new social contract

1
1. mlenc 08 Apr 2019
  
  in Public
  
  Instead of encouraging more “data-sharing”, the focus should be the cultivation of “data infrastructure”,¹⁴ maintained for the public good by institutions with clear responsibilities and lines of accountability.
  
  social contract data data infrastructure identity admin data
Visit annotations in context

Tags

data

data infrastructure

identity

admin data

social contract

Annotators

mlenc

URL

medium.com/@richardjpope/data-sharing-in-government-why-its-time-for-a-new-social-contract-7260bbe2372a
Mar 2019
www.archivogeneral.gov.co www.archivogeneral.gov.co

ADN_AGN.pdf

1
1. 098 27 Mar 2019
  
  in Public
  
  Normalización de las entradas descriptivas: Personas, Lugares, Instituciones (utilización de Linked Open Data (LOD) cuando sea posible.
  
  ¿Qué sistema de organización de conocimiento se los posibilita? ¿Qué están usando para enlazar datos y en qué formato?
  
  Linked Open Data AGN Archivo digital Colombia
Visit annotations in context

Tags

AGN

Colombia

Linked Open Data

Archivo digital

Annotators

098

URL

archivogeneral.gov.co/sites/default/files/Estructura_Web/magazine/ADN/ADN_AGN.pdf
www.gdeltproject.org www.gdeltproject.org

The GDELT Project

1
1. mlenc 26 Mar 2019
  
  in Public
  
  all the data
  
  data source open data
Visit annotations in context

Tags

data source

open data

Annotators

mlenc

URL

gdeltproject.org/
www.internationalpolicydigest.org www.internationalpolicydigest.org

Building a Smart Nation: A Nuanced Understanding of Hyper-Connected Singapore

1
1. kureshii 21 Mar 2019
  
  in Public
  
  The government needs to place tough restrictions on data collection and storage by businesses to limit the amount of damage in the event of a cyber breach.
  
  I find it hard to imagine how this could be usefully implemented. How is monitoring of data collection going to be done?
  
  Even simpler ideas, like the Do Not Call registry, have difficulty clamping down on businesses that breach regulations.
  
  #data
Visit annotations in context

Tags

#data

Annotators

kureshii

URL

internationalpolicydigest.org/2015/08/26/building-a-smart-nation-a-nuanced-understanding-of-hyper-connected-singapore/
smethur.st smethur.st

Mithering about the unmodellable

1
1. mlenc 19 Mar 2019
  
  in Public
  
  Mithering about the unmodellable. "Sometime late last year I went to the Euro IA conference with Anya and Silver to give a talk on the domain modelling work we've been doing in UK Parliament."
  
  data modelling mithering mither parliament uk parliament
Visit annotations in context

Tags

mither

uk parliament

data modelling

mithering

parliament

Annotators

mlenc

URL

smethur.st/posts/176135867
www.broadbentinstitute.ca www.broadbentinstitute.ca

Progress Summit 2019 Awards

1
1. mlenc 15 Mar 2019
  
  in Public
  
  to consider applying for this for Admin Data Coalition work.
  
  powered by data pbd policy award apply
Visit annotations in context

Tags

powered by data

pbd

apply

policy award

Annotators

mlenc

URL

broadbentinstitute.ca/summit2019_awards
dxtera.org dxtera.org

Dxtera - Home - DXtera Institute

1
1. nateangell 09 Mar 2019
  
  in Public
  
  DXtera Institute is a non-profit, collaborative member-based consortium dedicated to transforming student and institutional outcomes in higher education.
  
  DXtera Institute is a non-profit, collaborative member-based consortium dedicated to transforming student and institutional outcomes in higher education. We specialize in helping higher education professionals drive more efficient access to information and insights for effective decision-making and realize long-term cost savings, by simplifying and removing barriers to systems integration and improving data aggregation and control.
  
  With partners across the U.S. and Europe, our consortium includes some of the brightest minds in education and technology, all working together to solve critical higher education issues on a global scale.
  
  dxtera jeffmerriman data systems edu okp consortia
Visit annotations in context

Tags

data

dxtera

consortia

okp

edu

jeffmerriman

systems

Annotators

nateangell

URL

dxtera.org/
journalistsresource.org journalistsresource.org

Data journalism at two elite news outlets lacked transparency: Research

3
1. shelleyberry 06 Mar 2019
  
  in Public
  
  Data journalism produced by two of the nation’s most prestigious news organizations — The New York Times and The Washington Post — has lacked transparency, often failing to explain the methods journalists or others used to collect or analyze the data on which the articles were based, a new study finds. In addition, the news outlets usually did not provide the public with access to that data
  
  While this is a worthwhile topic, I would like to see more exploration of data journalism in the 99.99999 percent of news organizations that are NOT the New York Times or the Washington Post and don't have the resources to publish so many data stories despite the desperate need for them across the nation. Also, why no digital news outlets included?
  
  data journalism research newspapers
2. shelleyberry 06 Mar 2019
  
  in Public
  
  Worse yet, it wouldn’t surprise me if we saw more unethical people publish data as a strategic communication tool, because they know people tend to believe numbers more than personal stories. That’s why it’s so important to have that training on information literacy and methodology.”
  
  Like the way unethical people use statistics in general? This should be a concern, especially as government data, long considered the gold standard of data, undergoes attacks that would skew the data toward political ends. (see the census 2020)
  
  data journalism data research ethics
3. shelleyberry 06 Mar 2019
  
  in Public
  
  fall short of the ideal of data journalism
  
  Is this the ideal of data journalism? Where is this ideal spelled out, and is there any sign that the NYT and WaPo have agreed to abide by this ideal?
  
  data journalism transparency
Visit annotations in context

Tags

data

newspapers

ethics

transparency

research

data journalism

Annotators

shelleyberry

URL

journalistsresource.org/studies/society/news-media/data-journalism-at-two-elite-news-outlets-lacked-transparency-research/
Feb 2019
paleorxiv.org paleorxiv.org

Sookias_Homoiology_Main_Text_and_Figures.pdf

1
1. dasGrimm 25 Feb 2019
  
  in Public
  
  set; if this is higher, the tree 2can be considered to fit the data less well
  
  To test the fit between data and more than one alternative tree, you can just do a bootstrap analysis, and map the results on a neighbour-net splits graph based on the same data.
  
  Note that the phangorn library includes functions to transfer information between trees/tree samples and trees and networks:<br/> Schliep K, Potts AJ, Morrison DA, Grimm GW. 2017. Intertwining phylogenetic trees and networks. Methods in Ecology and Evolution (DOI:10.1111/2041-210X.12760.)[http://onlinelibrary.wiley.com/doi/10.1111/2041-210X.12760/full] – the basic functions and script templates are provided in the associated vignette.
  
  data compatibility exploratory data analysis EDA topological ambiguity
Visit annotations in context

Tags

exploratory data analysis

data compatibility

EDA

topological ambiguity

Annotators

dasGrimm

URL

paleorxiv.org/5swd6
getpocket.com getpocket.com

Pocket: Interview: Adam Hyde, FLOSS Manuals

1
1. offray 20 Feb 2019
  
  in Public
  
  These models are emerging, which is why its exciting to be involved in the ground floor of this sector, however some models clearly make sense already and thats largely because they closely follow the models free software itself has shaped. If you want status, then you can make a name for yourself by leading a team to write the docs ala free software itself, if you want money then build the reputation for the documentation team and contract out your knowledge (eg. extend the docs on contract ala free software).
  
  Creo que hay que conectarlo con modelos de microfinanciación y tiendas independientes tipo Itch.io y que el experimento debería ser progresivo pero dejar un mapa posible de su propio futuro. Algo así intentaremos en la edición 13a del Data Week.
  
  microfinanciación data week auto publicación
Visit annotations in context

Tags

auto publicación

data week

microfinanciación

Annotators

offray

URL

getpocket.com/a/read/17792780
mbio.asm.org mbio.asm.org

Dissecting Flavivirus Biology in Salivary Gland Cultures from Fed and Unfed Ixodes scapularis (Black-Legged Tick)

1
1. heatherstaines 08 Feb 2019
  
  in Public
  
  Dissecting Flavivirus Biology in Salivary Gland Cultures from Fed and Unfed Ixodes scapularis (Black-Legged Tick)
  
  Data worth viewing: a tick trachea with viral infection in its salivary glands.
  
  Data Worth Viewing
Visit annotations in context

Tags

Data Worth Viewing

Annotators

heatherstaines

URL

mbio.asm.org/content/10/1/e02628-18
static1.squarespace.com static1.squarespace.com

hume_of_the_standard_of_taste.pdf

1
1. kmurphy1 03 Feb 2019
  
  in Public
  
  !..�P'�r\0CA \= e,;4 ��'-"-'
  
  Could empirical data made up of experiences present in the form of an ethnography? Or autoethnography? I'm not sure if this is what you were getting at here, but it is a thought that came to mind!
  
  Experience Data Ethnography Autoethnography
Visit annotations in context

Tags

Experience

Autoethnography

Data

Ethnography

Annotators

kmurphy1

URL

static1.squarespace.com/static/53713bf0e4b0297decd1ab8b/t/5c436e4dc74c5024873bc2ff/1547923027682/hume_of_the_standard_of_taste.pdf
Jan 2019
muse.jhu.edu muse.jhu.edu

Motivated Reasoning, Political Information, and Information Literacy Education

1
1. JoeMurphy 30 Jan 2019
  
  in Public
  
  Nyhan and Reifler also found that presenting challenging information in a chart or graph tends to reduce disconfirmation bias. The researchers concluded that the decreased ambiguity of graphical information (as opposed to text) makes it harder for test subjects to question or argue against the content of the chart.
  
  Amazingly important double-edged finding for discussions of data visualization!
  
  information literacy data visualization
Visit annotations in context

Tags

information literacy

data visualization

Annotators

JoeMurphy

URL

muse.jhu.edu/article/624187
hackr.io hackr.io

Hackr.io - Find & share the best online programming courses & tutorials

1
1. tjfwalker 22 Jan 2019
  
  in Public
  
  resources ∋ learning ∋ CS / programming / design / data / dev
Visit annotations in context

Tags

resources ∋ learning ∋ CS / programming / design / data / dev

Annotators

tjfwalker

URL

hackr.io/
tutormentor.blogspot.com tutormentor.blogspot.com

How I'll Honor ML King Jr. Holiday

1
1. sheri42 22 Jan 2019
  
  in Public
  
  doing the research
  
  I tried to look up info for WA State at census.gov and found the site unavailable due to the shut down and no funding. It will be very difficult if the politics of this country eliminates our access to accurate data.
  
  https://www.census.gov/did/www/saipe/data/interactive/cedr/cdr.html?s_appName=saipe&map_yearSelector=2013&map_geoSelector=aa_c&s_state=53&s_measures=aa_snc&menu=map_proxy
  
  census data government shutdown
Visit annotations in context

Tags

government shutdown

census data

Annotators

sheri42

URL

tutormentor.blogspot.com/2019/01/how-ill-honor-ml-king-jr-holiday.html
sia.tech sia.tech

Technology - Sia

1
1. tjfwalker 21 Jan 2019
  
  in Public
  
  where any 10 of 30 segments can fully recover a user's files
  
  Reed–Solomon error correction
  
  explore further dev ∋ data ∋ error correction ∋ instance
Visit annotations in context

Tags

dev ∋ data ∋ error correction ∋ instance

explore further

Annotators

tjfwalker

URL

sia.tech/technology
demandlab.weebly.com demandlab.weebly.com

AUDIENCE

1
1. Eric.Hollebone 15 Jan 2019
  
  in Public
  
  y bosses want to see quick wins, but I know we can achieve big w
  
  add "My data (database) quality sucks"
  
  data
Visit annotations in context

Tags

data

Annotators

Eric.Hollebone

URL

demandlab.weebly.com/audience.html
bookbook.pubpub.org bookbook.pubpub.org

Terms of Service · MIT Press Open

1
1. offray 04 Jan 2019
  
  in Public
  
  You may not access or use the Site in any manner that could damage or overburden any MIT server, or any network connected to any MIT server. You may not use the Site in any manner that would interfere with any other party’s use of the Site.
  
  Vamos a realizar pequeños scrapping, que no sobrecargarán el servidor, así que estamos cumpliendo con esta parte y de hecho, después de que trabajemos, permitiran repartir la carga del servidor, pues una copia estará en nuestros servidores.
  
  data activism enactive citizenship
Visit annotations in context

Tags

enactive citizenship

data activism

Annotators

offray

URL

bookbook.pubpub.org/tos
hcommons.org hcommons.org

Microsoft Word - The Visibility of Open Access Monographs in a European Context_KUR_format[LM].docx

1
1. micahvandegrift 03 Jan 2019
  
  in Public
  
  Adoption of good practice to generate high quality data will depend on sharing the burden of capacity building in some way. That in turn, can-not happen until there is a framework that provides sufficient trust to allow the sharing and compar-ison of data and its management.
  
  harkening to the 'data trust' concept being discussed from U.S. Mellon-funded projects, also co-authored by the authors of this paper.
  
  data
Visit annotations in context

Tags

data

Annotators

micahvandegrift

URL

hcommons.org/deposits/objects/hc:18270/datastreams/CONTENT/content
Dec 2018
inst-fs-iad-prod.inscloudgate.net inst-fs-iad-prod.inscloudgate.net

Dumais-2014-Understanding-User-Behavior-Through-Log-Data-and-Analysis.pdf

12
1. wendynorris 31 Dec 2018
  
  in Public
  
  Outliers : All data sets have an expected range of values, and any actual data set also has outliers that fall below or above the expected range. (Space precludes a detailed discussion of how to handle outliers for statistical analysis purposes, see: Barnett & Lewis, 1994 for details.) How to clean outliers strongly depends on the goals of the analysis and the nature of the data.
  
  Outliers can be signals of unanticipated range of behavior or of errors.
  
  log data human-computer interaction
2. wendynorris 31 Dec 2018
  
  in Public
  
  Understanding the structure of the data : In order to clean log data properly, the researcher must understand the meaning of each record, its associated fi elds, and the interpretation of values. Contextual information about the system that produced the log should be associated with the fi le directly (e.g., “Logging system 3.2.33.2 recorded this fi le on 12-3-2012”) so that if necessary the specifi c code that gener-ated the log can be examined to answer questions about the meaning of the record before executing cleaning operations. The potential misinterpretations take many forms, which we illustrate with encoding of missing data and capped data values.
  
  Context of the data collection and how it is structured is also a critical need.
  
  Example, coding missing info as "0" risks misinterpretation rather than coding it as NIL, NDN or something distinguishable from other data
  
  log data human-computer interaction
3. wendynorris 31 Dec 2018
  
  in Public
  
  Data transformations : The goal of data-cleaning is to preserve the meaning with respect to an intended analysis. A concomitant lesson is that the data-cleaner must track all transformations performed on the data .
  
  Changes to data during clean up should be annotated.
  
  Incorporate meta data about the "chain of change" to accompany the written memo
  
  log data human-computer interaction
4. wendynorris 31 Dec 2018
  
  in Public
  
  Data Cleaning A basic axiom of log analysis is that the raw data cannot be assumed to correctly and completely represent the data being recorded. Validation is really the point of data cleaning: to understand any errors that might have entered into the data and to transform the data in a way that preserves the meaning while removing noise. Although we discuss web log cleaning in this section, it is important to note that these principles apply more broadly to all kinds of log analysis; small datasets often have similar cleaning issues as massive collections. In this section, we discuss the issues and how they can be addressed. How can logs possibly go wrong ? Logs suffer from a variety of data errors and distortions. The common sources of errors we have seen in practice include:
  
  Common sources of errors:
  
  • Missing events
  
  • Dropped data
  
  • Misplaced semantics (encoding log events differently)
  
  log data human-computer interaction
5. wendynorris 31 Dec 2018
  
  in Public
  
  In addition, real world events, such as the death of a major sports fi gure or a political event can often cause people to interact with a site differently. Again, be vigilant in sanity checking (e.g., look for an unusual number of visitors) and exclude data until things are back to normal.
  
  Important consideration for temporal event RQs in refugee study -- whether external events influence use of natural disaster metaphors.
  
  log data human-computer interaction refugee
6. wendynorris 31 Dec 2018
  
  in Public
  
  Recording accurate and consistent time is often a challenge. Web log fi les record many different timestamps during a search interaction: the time the query was sent from the client, the time it was received by the server, the time results were returned from the server, and the time results were received on the client. Server data is more robust but includes unknown network latencies. In both cases the researcher needs to normalize times and synchronize times across multiple machines. It is common to divide the log data up into “days,” but what counts as a day? Is it all the data from midnight to midnight at some common time reference point or is it all the data from midnight to midnight in the user’s local time zone? Is it important to know if people behave differently in the morning than in the evening? Then local time is important. Is it important to know everything that is happening at a given time? Then all the records should be converted to a common time zone.
  
  Challenges of using time-based log data are similar to difficulties in the SBTF time study using Slack transcripts, social media, and Google Sheets
  
  log data human-computer interaction time
7. wendynorris 31 Dec 2018
  
  in Public
  
  Log Studies collect the most natural observations of people as they use systems in whatever ways they typically do, uninfl uenced by experimenters or observers. As the amount of log data that can be collected increases, log studies include many different kinds of people, from all over the world, doing many different kinds of tasks. However, because of the way log data is gathered, much less is known about the people being observed, their intentions or goals, or the contexts in which the observed behaviors occur. Observational log studies allow researchers to form an abstract picture of behavior with an existing system, whereas experimental log stud-ies enable comparisons of two or more systems.
  
  Benefits of log studies:
  
  • Complement other types of lab/field studies
  
  • Provide a portrait of uncensored behavior
  
  • Easy to capture at scale
  
  Disadvantages of log studies:
  
  • Lack of demographic data
  
  • Non-random sampling bias
  
  • Provide info on what people are doing but not their "motivations, success or satisfaction"
  
  • Can lack needed context (software version, what is displayed on screen, etc.)
  
  Ways to mitigate: Collecting, Cleaning and Using Log Data section
  
  log data human-computer interaction
8. wendynorris 31 Dec 2018
  
  in Public
  
  Two common ways to partition log data are by time and by user. Partitioning by time is interesting because log data often contains signifi cant temporal features, such as periodicities (including consistent daily, weekly, and yearly patterns) and spikes in behavior during important events. It is often possible to get an up-to-the- minute picture of how people are behaving with a system from log data by compar-ing past and current behavior.
  
  Bookmarked for time reference.
  
  Mentions challenges of accounting for time zones in log data.
  
  log data human-computer interaction time
9. wendynorris 31 Dec 2018
  
  in Public
  
  An important characteristic of log data is that it captures actual user behavior and not recalled behaviors or subjective impressions of interactions.
  
  Logs can be captured on client-side (operating systems, applications, or special purpose logging software/hardware) or on server-side (web search engines or e-commerce)
  
  log data human-computer interaction
10. wendynorris 31 Dec 2018
  
  in Public
  
  Table 1 Different types of user data in HCI research
  
  log data human-computer interaction
11. wendynorris 31 Dec 2018
  
  in Public
  
  Large-scale log data has enabled HCI researchers to observe how information diffuses through social networks in near real-time during crisis situations (Starbird & Palen, 2010 ), characterize how people revisit web pages over time (Adar, Teevan, & Dumais, 2008 ), and compare how different interfaces for supporting email organi-zation infl uence initial uptake and sustained use (Dumais, Cutrell, Cadiz, Jancke, Sarin, & Robbins, 2003 ; Rodden & Leggett, 2010 ).
  
  Wide variety of uses of log data
  
  log data human-computer interaction
12. wendynorris 29 Dec 2018
  
  in Public
  
  Behavioral logs are traces of human behavior seen through the lenses of sensors that capture and record user activity.
  
  Definition of log data
  
  log data human-computer interaction
Visit annotations in context

Tags

log data

human-computer interaction

time

refugee

Annotators

wendynorris

URL

inst-fs-iad-prod.inscloudgate.net/files/8b7d7174-04d4-4573-9cc2-810b10794da9/978-1-4939-0378-8_14.pdf
inst-fs-iad-prod.inscloudgate.net inst-fs-iad-prod.inscloudgate.net

Geertz - 1973 - Thick Description

1
1. wendynorris 30 Dec 2018
  
  in Public
  
  Ethnographic findings are not privileged, just particular: another country heard from. To regard them as anything more (or anything less) than that distorts both them and their implications, which are far profounder than mere primitivity, for social theory.
  
  This tension exists in HCI as well.
  
  Interpreted data vs empirical data and how each is systematically analyzed.
  
  ethnography thick description data analysis
Visit annotations in context

Tags

data analysis

ethnography

thick description

Annotators

wendynorris

URL

inst-fs-iad-prod.inscloudgate.net/files/07d76fb9-01b2-4f43-a0d1-64eb91e29af8/Geertz-1973-Thick-Description_-Toward-an-interpretive-theory-of-cultures.pdf
www-sciencedirect-com.ezproxy.rice.edu www-sciencedirect-com.ezproxy.rice.edu

The copy-number of plasmids and other genetic elements can be determined by SYBR-Green-based quantitative real-time PCR

1
1. pbk1 22 Dec 2018
  
  in Public
  
  Fig. 4
  
  Graph is extremely unclear. Bad usage of point shapes
  
  data visualization simple graph
Visit annotations in context

Tags

data visualization

simple graph

Annotators

pbk1

URL

www-sciencedirect-com.ezproxy.rice.edu/science/article/pii/S0167701205002848
bid.berkeley.edu bid.berkeley.edu

Main Page - CS 294-1 Spring 2012

1
1. ildar 03 Dec 2018
  
  in Public
  
  data science @course
Visit annotations in context

Tags

@course

data science

Annotators

ildar

URL

bid.berkeley.edu/cs294-1-spring12/index.php/Main_Page
bcourses.berkeley.edu bcourses.berkeley.edu

Introduction to Data Science Fall 2015

1
1. ildar 03 Dec 2018
  
  in Public
  
  data science @course
Visit annotations in context

Tags

@course

data science

Annotators

ildar

URL

bcourses.berkeley.edu/courses/1377158/

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators