Hypothesis

109 Matching Annotations

Apr 2026
epoch.ai epoch.ai

https://epoch.ai/blog/have-ai-capabilities-accelerated

1
1. fxp007 25 Apr 2026
  
  in Public
  
  The minimum training cutoffs are: ECI (June 2024), METR Time Horizon (January 2024), Combined Math (September 2024), and WeirdML V2 (January 2025).
  
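  These cutoff dates show that the datasets used in the study differ in length, ranging from early 2024 to mid-2024. Shorter training datasets (such as WeirdML V2, which has only about one year of pre-reasoning-model data) may limit the ability to detect acceleration, which explains why that metric failed to show an accelerating trend. The differences in time span also reflect the different development histories of the various AI capability metrics.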
  
  data-point time-span dataset-limits
Visit annotations in context

Tags

dataset-limits

time-span

data-point

Annotators

fxp007

URL

epoch.ai/blog/have-ai-capabilities-accelerated
Aug 2024
www.youtube.com www.youtube.com

Semantic Folding for Natural Language Understanding with Francisco Webber - #451

3
1. stopresetgo 23 Aug 2024
  
  in Public
  
  for example our standard english language model is trained with something like maybe 100 gigabytes or so of text um that gives it a strength as if you would throw BERT at it with the google corpus so the other thing is of course uh a small corpus like that is computed in two hours or three hours on a laptop yeah so that's the other thing uh by the way i didn't mention our fingerprints are actually boolean so when we train as i said we are not using floating points
  
  for - comparison - cortical io vs normal AI - training dataset size and time
  
  comparison - cortical io vs normal AI - training dataset size and time
2. stopresetgo 23 Aug 2024
  
  in Public
  
  we basically grow models of let's say same quality like all the others by using thousand time or ten thousand times less training data
  
  for - comparison - semantic folding vs normal machine learning - training dataset sizes and times
  
  comparison - semantic folding vs normal machine learning - training dataset sizes and times
3. stopresetgo 23 Aug 2024
  
  in Public
  
  in that bitmap representation at the end i can look at every position in my bitmap and i can refer it back explicitly to the bits of reference information that i trained it with
  
  for - semantic fingerprint bitmap - tracing bitmap to training dataset
  
  semantic fingerprint bitmap - tracing bitmap to training dataset
Visit annotations in context

Tags

comparison - cortical io vs normal AI - training dataset size and time

semantic fingerprint bitmap - tracing bitmap to training dataset

comparison - semantic folding vs normal machine learning - training dataset sizes and times

Annotators

stopresetgo

URL

youtube.com/watch
Mar 2024
datadryad.org datadryad.org

Dryad | Data -- Exploring phenotypic diversity of pigmented traits and iris features in Pakistani population

1
1. lmichan 02 Mar 2024
  
  in Public
  
  Hub Biocolores🌈 spar/fabio/Dataset
Visit annotations in context

Tags

spar/fabio/Dataset

Hub Biocolores🌈

Annotators

lmichan

URL

datadryad.org/stash/dataset/doi:10.5061/dryad.sbcc2fr6n
Oct 2023
arxiv.org arxiv.org

2301.05169.pdf

1
1. mark.crowley 25 Oct 2023
  
  in Public
  
  "Causal Triplet: An Open Challenge for Intervention-centric Causal Representation Learning" Yuejiang Liu1, 2,* YUEJIANG.LIU@EPFL.CH Alexandre Alahi2 ALEXANDRE.ALAHI@EPFL.CH Chris Russell1 CMRUSS@AMAZON.DE Max Horn1 HORNMAX@AMAZON.DE Dominik Zietlow1 ZIETLD@AMAZON.DE Bernhard Schölkopf1, 3 BS@TUEBINGEN.MPG.DE Francesco Locatello1 LOCATELF@AMAZON.DE
  
  causality causal-inference open-dataset dataset student-shayan
Visit annotations in context

Tags

open-dataset

causal-inference

causality

dataset

student-shayan

Annotators

mark.crowley

URL

arxiv.org/pdf/2301.05169.pdf
Oct 2022
repositorio.usp.br repositorio.usp.br

Free ions in kerosene-based ferrofluid detected by impedance spectroscopy

1
1. Brunopcl 18 Oct 2022
  
  in Public
  
  Free ions in kerosene-based ferrofluid detected by impedance spectroscopy (2021)
  
  Related data - click here -
  
  Dados de pesquisa Dataset Conjunto de dados
Visit annotations in context

Tags

Conjunto de dados

Dataset

Dados de pesquisa

Annotators

Brunopcl

URL

repositorio.usp.br/item/003021307
www.alice.cnptia.embrapa.br www.alice.cnptia.embrapa.br

Tempo de cultivo contínuo de cana-de-açúcar e influência nas características físicas e carbono orgânico de latossolos vermelhos distróficos em Guaíra/SP.

1
1. Brunopcl 17 Oct 2022
  
  in Public
  
  Tempo de cultivo contínuo de cana-de-açúcar e influência nas características físicas e carbono orgânico de latossolos vermelhos distróficos em Guaíra/SP.
  
  Related data - click here -
  
  Dados de pesquisa Dataset Conjunto de dados
Visit annotations in context

Tags

Conjunto de dados

Dataset

Dados de pesquisa

Annotators

Brunopcl

URL

alice.cnptia.embrapa.br/alice/handle/doc/1116235
www.arca.fiocruz.br www.arca.fiocruz.br

An old drug and different ways to treat cutaneous leishmaniasis: Intralesional and intramuscular meglumine antimoniate in a reference center, Rio de Janeiro, Brazil

2
1. Brunopcl 17 Oct 2022
  
  in Public
  
  An old drug and different ways to treat cutaneous leishmaniasis: Intralesional and intramuscular meglumine antimoniate in a reference center, Rio de Janeiro, Brazil.
  
  Related data - click here -
  
  Dados de pesquisa Dataset Conjunto de dados
2. Brunopcl 17 Oct 2022
  
  in Public
  
  AN OLD DRUG AND DIFFERENT WAYS TO TREAT CUTANEOUS LEISHMANIASIS: INTRALESIONAL AND INTRAMUSCULAR MEGLUMINE ANTIMONIATE IN A REFERENCE CENTER, RIO DE JANEIRO, BRAZIL
  
  Related data - click here -
  
  Dados de pesquisa Dataset Conjunto de dados
Visit annotations in context

Tags

Dataset

Conjunto de dados

Dados de pesquisa

Annotators

Brunopcl

URL

arca.fiocruz.br/handle/icict/50184
Jun 2022
data-feminism.mitpress.mit.edu data-feminism.mitpress.mit.edu

6. The Numbers Don’t Speak for Themselves

1
1. pvu23 25 Jun 2022
  
  in Public
  
  The major issue with much of the data that can be downloaded from web portals or through APIs is that they come without context or metadata. If you are lucky you might get a paragraph about where the data are from or a data dictionary that describes what each column in a particular spreadsheet means. But more often than not, you get something that looks like figure 6.3.
  
  I think the reason data lack context is a reluctance to add an extra column describing the data, together with the inconsiderate and misleading view held by people in technology that data should be clean and concise.
  
  I have run into insufficiently documented data multiple times, and I found it extremely inconvenient when I tried to use downloaded online reports and attach them to my work experience to illustrate how effectively I had grown the audience of a social media page (Facebook). I used to help run a Facebook page for a student organization. After finishing the role, I went to the "Insights" section of Facebook, hoping to download a report of the increases in Page Likes, Visits, and Interactions during the period when I was an admin of the page. It took several attempts, with glitches, to download the report (because it covered a year-long term). When the PDF file was ready to view, I was surprised: it did not mention the years I was working, the name of the student organization, or other categorizations that should have been highlighted. It would not have been hard to include the years or the name: the years were in the filter I used to extract that part of the report, and the page itself was the source of the data. This laziness in presenting data fit for analysis was frustrating, and I had to add extra analysis myself. Even after I finished the "extra work", I started to question the validity of the report I had downloaded. Could it still be trusted? Without my clarification, no analysis could be made, even by someone in the data science field. Even if they could, it would take them a while to collect other external information before making sense of the data presented to them.
  
  Understanding this ongoing problem, and being constantly bothered by it, gives me justification to call for a more thorough data translation and presentation process. More questions should be raised and answered about what a user might wonder when encountering a dataset.
  
  context explanation justification dataset background
Visit annotations in context

Tags

background

justification

context

dataset

explanation

Annotators

pvu23

URL

data-feminism.mitpress.mit.edu/pub/czq9dfs5
May 2022
www.gwern.net www.gwern.net

1988-lang.pdf

1
1. mshook 18 May 2022
  
  in Public
  
  Such a highly non-linear problem would clearly benefit from the computational power of many layers. Unfortunately, back-propagation learning generally slows down by an order of magnitude every time a layer is added to a network.
  
  The problem in 1988
  
  1988 ml nn nonlinear spiral dataset c vintage
Visit annotations in context

Tags

vintage

spiral

dataset

nn

c

1988

nonlinear

ml

Annotators

mshook

URL

gwern.net/docs/ai/1988-lang.pdf
Apr 2022
www.abc.net.au www.abc.net.au

Charting the COVID-19 spread: How Australia is faring

1
1. Marlene_Wulf 27 Apr 2022
  
  in BehSci
  
  Charting the COVID-19 spread: How Australia is faring. (2020, March 16). ABC News. https://www.abc.net.au/news/2020-03-17/coronavirus-cases-data-reveals-how-covid-19-spreads-in-australia/12060704
  
  is:webpage lang:en COVID-19 Australia spread hospital dashboard chart pandemic dataset daily update age gender test case hospitalization infection
Visit annotations in context

Tags

chart

age

dashboard

daily update

Australia

test

is:webpage

infection

pandemic

dataset

lang:en

gender

spread

COVID-19

hospitalization

hospital

case

Annotators

Marlene_Wulf

URL

abc.net.au/news/2020-03-17/coronavirus-cases-data-reveals-how-covid-19-spreads-in-australia/12060704
Nov 2021
arxiv.org arxiv.org

Can I use this publicly available dataset to build commercial AI software? Most likely not

1
1. michael_rowe 09 Nov 2021
  
  in Public
  
  Just because a dataset is publicly available doesn't mean that you can use it to build commercial AI software.
  
  artificial intelligence license rights open source data dataset
Visit annotations in context

Tags

license

artificial intelligence

dataset

open source

data

rights

Annotators

michael_rowe

URL

arxiv.org/abs/2111.02374
Jun 2021
www.medrxiv.org www.medrxiv.org

https://medrxiv.org/cgi/content/10.1101/2021.01.27.21250604

1
1. jackiekrauss 09 Jun 2021
  
  in BehSci
  
  Karlinsky, A., & Kobak, D. (2021). The World Mortality Dataset: Tracking excess mortality across countries during the COVID-19 pandemic. MedRxiv, 2021.01.27.21250604. https://doi.org/10.1101/2021.01.27.21250604
  
  is:article lang:en COVID-19 mortality World Mortality Dataset testing capacity reporting policy social distancing excess mortality infectious mortality underreporting
Visit annotations in context

Tags

excess mortality

mortality

World Mortality Dataset

reporting policy

lang:en

COVID-19

testing capacity

underreporting

is:article

infectious mortality

social distancing

Annotators

jackiekrauss

URL

medrxiv.org/content/10.1101/2021.01.27.21250604v3
May 2021
moodle.southwestern.edu moodle.southwestern.edu

Revenge of the Radical Right

1
1. hiebelc 20 May 2021
  
  in Public
  
  To investigate these hypotheses, I created an election-year-country dataset covering the period from the early 1990s to the present for all post-communist democracies.7 The dataset is structured as a quasi-time series of 93 parliamentary elections in 17 countries from 1991 to 2012, and the dependent variable is the natural log of the radical right party’s combined vote share in elections held at time t.
  
  this is the data, her explanation of the dataset she created
  
  data dataset
Visit annotations in context

Tags

dataset

data

Annotators

hiebelc

URL

moodle.southwestern.edu/pluginfile.php/473175/mod_resource/content/1/bustikova revenge of the radical right.pdf
Mar 2021
arxiv.org arxiv.org

A temporal network version of Watts's cascade model

1
1. n.parfitt 26 Mar 2021
  
  in BehSci
  
  Karimi, Fariba, and Petter Holme. ‘A Temporal Network Version of Watts’s Cascade Model’. ArXiv:2103.13604 [Physics], 25 March 2021. http://arxiv.org/abs/2103.13604.
  
  lang:en is:other temporal network cascade modelling threshold social science economics opinion innovation agent interaction resistance structure timing influence dataset
Visit annotations in context

Tags

resistance

threshold

interaction

timing

influence

social science

structure

temporal

innovation

economics

opinion

dataset

is:other

modelling

lang:en

network

cascade

agent

Annotators

n.parfitt

URL

arxiv.org/abs/2103.13604
data.cdc.gov data.cdc.gov

COVID-19 Case Surveillance Public Use Data with Geography | Data | Centers for Disease Control and Prevention

1
1. n.parfitt 26 Mar 2021
  
  in BehSci
  
  Calgary, Open. ‘COVID-19 Case Surveillance Public Use Data with Geography | Data | Centers for Disease Control and Prevention’. Accessed 26 March 2021. https://data.cdc.gov/Case-Surveillance/COVID-19-Case-Surveillance-Public-Use-Data-with-Ge/n8mc-b4w4.
  
  lang:en is:report COVID-19 data dataset CDC demographic geography county state exposure antigen serologic public health jurisdiction reporting
Visit annotations in context

Tags

CDC

data

is:report

geography

serologic

jurisdiction

reporting

state

public health

county

antigen

demographic

dataset

lang:en

COVID-19

exposure

Annotators

n.parfitt

URL

data.cdc.gov/Case-Surveillance/COVID-19-Case-Surveillance-Public-Use-Data-with-Ge/n8mc-b4w4
www.ncbi.nlm.nih.gov www.ncbi.nlm.nih.gov

GEO Accession viewer

9
1. mehu 24 Mar 2021
  
  in Public
  
  14 of which were sampled at multiple timepoints
  
  COVID-19 Dataset
2. mehu 24 Mar 2021
  
  in Public
  
  RNA sequencing on samples from 46 individuals with PCR-positive, symptomatic SARS-CoV-2 infection
  
  COVID-19 Dataset
3. mehu 24 Mar 2021
  
  in Public
  
  77 peripheral blood samples across 46 subjects with COVID-19 and compared them to subjects with seasonal coronavirus, influenza, bacterial pneumonia, and healthy controls.
  
  COVID-19 Dataset
4. mehu 24 Mar 2021
  
  in Public
  
  seasonal coronavirus (n=59)
  
  COVID-19 Dataset
5. mehu 24 Mar 2021
  
  in Public
  
  divided based on disease severity and time from symptom onset
  
  COVID-19 Dataset
6. mehu 24 Mar 2021
  
  in Public
  
  elucidate novel aspects of the host response to SARS-CoV-2
  
  COVID-19 Dataset
7. mehu 24 Mar 2021
  
  in Public
  
  influenza (n=17)
  
  COVID-19 Dataset
8. mehu 24 Mar 2021
  
  in Public
  
  bacterial pneumonia (n=20)
  
  COVID-19 Dataset
9. mehu 24 Mar 2021
  
  in Public
  
  healthy controls (n=19)
  
  COVID-19 Dataset
Visit annotations in context

Tags

Dataset

COVID-19

Annotators

mehu

URL

ncbi.nlm.nih.gov/geo/query/acc.cgi
www.ncbi.nlm.nih.gov www.ncbi.nlm.nih.gov

GEO Accession viewer

2
1. mehu 24 Mar 2021
  
  in Public
  
  elucidate key pathways in the host transcriptome of patients infected with SARS-CoV-2, we used RNA sequencing (RNA Seq) to analyze nasopharyngeal (NP) swab and whole blood (WB) samples from 333 COVID-19 patients and controls, including patients with other viral and bacterial infections.
  
  COVID-19 Dataset
2. mehu 24 Mar 2021
  
  in Public
  
  host response biosignature for COVID-19 from RNA profiling of nasal swabs and blood
  
  COVID-19 Dataset
Visit annotations in context

Tags

Dataset

COVID-19

Annotators

mehu

URL

ncbi.nlm.nih.gov/geo/query/acc.cgi
www.nature.com www.nature.com

COVID-19 Government Response Event Dataset (CoronaNet v.1.0)

1
1. SIYANYE 19 Mar 2021
  
  in BehSci
  
  Cheng, C., Barceló, J., Hartnett, A. S., Kubinec, R., & Messerschmidt, L. (2020). COVID-19 Government Response Event Dataset (CoronaNet v.1.0). Nature Human Behaviour, 1–13. https://doi.org/10.1038/s41562-020-0909-7
  
  is:article lang:en COVID-19 policy pandemic dataset model
Visit annotations in context

Tags

policy

lang:en

COVID-19

dataset

is:article

model

pandemic

Annotators

SIYANYE

URL

nature.com/articles/s41562-020-0909-7
Dec 2020
saveriomiroddi.github.io saveriomiroddi.github.io

Installing Ubuntu on a ZFS root, with encryption and mirroring

1
1. almereyda 26 Dec 2020
  
  in Public
  
  Databases: If databases data is stored on a ZFS filesystem, it’s better to create a separate dataset with several tweaks:
  
  zfs create -o recordsize=8K -o primarycache=metadata -o logbias=throughput -o mountpoint=/path/to/db_data rpool/db_data
  
  recordsize: match the typical RDBMSs page size (8 KiB)
  primarycache: disable ZFS data caching, as RDBMSs have their own
  logbias: essentially, disabled log-based writes, relying on the RDBMSs’ integrity measures (see detailed Oracle post)
  
  ZFS database dataset
Visit annotations in context

Tags

ZFS

database

dataset

Annotators

almereyda

URL

saveriomiroddi.github.io/Installing-Ubuntu-on-a-ZFS-root-with-encryption-and-mirroring/
Oct 2020
ourworldindata.org ourworldindata.org

Coronavirus Disease (COVID-19) – the data

1
1. lmichan 27 Oct 2020
  
  in Public
  
  charts
  
  unidad_COVID2019 schema.org/Dataset Linfodemia META schema.org/ImageObject prioridad1 ⭕Act_now
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Dataset

schema.org/ImageObject

prioridad1

Linfodemia

⭕Act_now

META

Annotators

lmichan

URL

ourworldindata.org/coronavirus-data
docs.google.com docs.google.com

Dimensions COVID-19 publications, data sets, clinical trials - updated daily

1
1. lmichan 27 Oct 2020
  
  in Public
  
  publications clinical trials datasets
  
  prioridad1 unidad_COVID2019 schema.org/Dataset ⭕Act_now META Linfodemia hub/observatory_infomationCOVID19 nodoCOVID19_datos nodoCOVID19/datos_investigacion
Visit annotations in context

Tags

unidad_COVID2019

nodoCOVID19_datos

prioridad1

Linfodemia

hub/observatory_infomationCOVID19

⭕Act_now

schema.org/Dataset

META

nodoCOVID19/datos_investigacion

Annotators

lmichan

URL

docs.google.com/spreadsheets/d/1-kTZJZ1GAhJ2m4GAIhw1ZdlgO46JpvX0ZQa232VWRmw/edit
www.kaggle.com www.kaggle.com

Novel Corona Virus 2019 Dataset

1
1. lmichan 13 Oct 2020
  
  in Public
  
  unidad_COVID2019 META schema.org/Dataset
Visit annotations in context

Tags

unidad_COVID2019

META

schema.org/Dataset

Annotators

lmichan

URL

kaggle.com/sudalairajkumar/novel-corona-virus-2019-dataset
github.com github.com

echen102/COVID-19-TweetIDs

1
1. lmichan 13 Oct 2020
  
  in Public
  
  unidad_COVID2019 META schema.org/Dataset
Visit annotations in context

Tags

unidad_COVID2019

META

schema.org/Dataset

Annotators

lmichan

URL

github.com/echen102/COVID-19-TweetIDs
storymaps.arcgis.com storymaps.arcgis.com

Mapping the novel coronavirus outbreak

1
1. lmichan 13 Oct 2020
  
  in Public
  
  unidad_COVID2019 schema.org/Map schema.org/ImageObject prioridad1 schema.org/Dataset Linfodemia META
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Dataset

schema.org/ImageObject

schema.org/Map

prioridad1

Linfodemia

META

Annotators

lmichan

URL

storymaps.arcgis.com/stories/4fdc0d03d3a34aa485de1fb0d2650ee0
nextstrain.org nextstrain.org

Nextstrain / ncov

1
1. lmichan 13 Oct 2020
  
  in Public
  
  1442 genomes
  
  unidad_COVID2019 aplicacion schema.org/Dataset Linfodemia META txid2697049(SARS-CoV-2) hub/observatory_infomationCOVID19 nodoCOVID19_datos_coleccion nodoCOVID19/datos_investigacion
Visit annotations in context

Tags

unidad_COVID2019

aplicacion

txid2697049(SARS-CoV-2)

Linfodemia

hub/observatory_infomationCOVID19

nodoCOVID19_datos_coleccion

schema.org/Dataset

META

nodoCOVID19/datos_investigacion

Annotators

lmichan

URL

nextstrain.org/ncov
www.arcgis.com www.arcgis.com

Coronavirus COVID-19 (2019-nCoV)

1
1. lmichan 13 Oct 2020
  
  in Public
  
  441187 total confirmed cases 111933 recovered 19784 deaths
  
  schema.org/Map unidad_COVID2019 schema.org/Dataset META Linfodemia hub/observatory_infomationCOVID19
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Dataset

hub/observatory_infomationCOVID19

schema.org/Map

Linfodemia

META

Annotators

lmichan

URL

arcgis.com/apps/opsdashboard/index.html
Sep 2020
github.com github.com

Uncaught (in promise) TypeError: Cannot read property 'removeChild' of null · Issue #2086 · sveltejs/svelte

1
1. TylerRick 30 Sep 2020
  
  in Public
  
  I forgot to mention in the original issue way back that I have a lot of data. Like 1 to 3 MB that is being passed around via export let foo.
  
  large objects javascript large dataset
Visit annotations in context

Tags

javascript

large dataset

large objects

Annotators

TylerRick

URL

github.com/sveltejs/svelte/issues/2086
arxiv.org arxiv.org

Google COVID-19 Search Trends Symptoms Dataset: Anonymization Process Description (version 1.0)

1
1. katietaylor_99 07 Sep 2020
  
  in BehSci
  
  Bavadekar, Shailesh, Andrew Dai, John Davis, Damien Desfontaines, Ilya Eckstein, Katie Everett, Alex Fabrikant, et al. ‘Google COVID-19 Search Trends Symptoms Dataset: Anonymization Process Description (Version 1.0)’. ArXiv:2009.01265 [Cs], 2 September 2020. http://arxiv.org/abs/2009.01265.
  
  is:preprint lang:en COVID-19 Google search trend symptom dataset anonymization aggregation
Visit annotations in context

Tags

is:preprint

aggregation

anonymization

Google

dataset

symptom

lang:en

COVID-19

search trend

Annotators

katietaylor_99

URL

arxiv.org/abs/2009.01265
Jul 2020
osf.io osf.io

Human-dog relationships during COVID-19 pandemic; booming dog adoption during social isolation

1
1. ErikStuchly 15 Jul 2020
  
  in BehSci
  
  Morgan, L., Protopopova, A., Birkler, R. I. D., Itin-Shwartz, B., Sutton, G. A., gamliel, alexandra, Yakobson, B., & Raz, T. (2020). Human-dog relationships during COVID-19 pandemic; booming dog adoption during social isolation [Preprint]. SocArXiv. https://doi.org/10.31235/osf.io/s9k4y
  
  is:preprint lang:en COVID-19 human-dog relationship dog adoption social isolation pet wellbeing companion animal social relationship stress prospective study retrospective dataset quality of life benefit
Visit annotations in context

Tags

is:preprint

social relationship

prospective study

dog adoption

benefit

companion animal

social isolation

wellbeing

lang:en

stress

COVID-19

human-dog relationship

pet

quality of life

retrospective dataset

Annotators

ErikStuchly

URL

osf.io/preprints/socarxiv/s9k4y/
psyarxiv.com psyarxiv.com

Depression symptoms during the COVID-19 pandemic in different regions in Germany.

1
1. Marlene_Wulf 14 Jul 2020
  
  in BehSci
  
  Schelhorn, I., Ecker, A., Bereznai, J., Tran, T., Rehm, S., Lugo, R., Sütterlin, S., Kinateder, M., & Shiban, Y. (2020). Depression symptoms during the COVID-19 pandemic in different regions in Germany. [Preprint]. PsyArXiv. https://doi.org/10.31234/osf.io/p9wz8
  
  is:preprint lang:en COVID-19 depression online survey Germany psychological service availability accessibility restrictive measure dataset
Visit annotations in context

Tags

is:preprint

online survey

accessibility

availability

depression

dataset

restrictive measure

Germany

lang:en

COVID-19

psychological service

Annotators

Marlene_Wulf

URL

psyarxiv.com/p9wz8/
Jun 2020
www.youtube.com www.youtube.com

EU Datathon 2020 - Webinar on COVID-19 and air quality

1
1. edampf 25 Jun 2020
  
  in BehSci
  
  EU Datathon 2020—Webinar on COVID-19 and media and data monitoring. (2020, April 22). https://www.youtube.com/watch?v=wyNgmEfi_vk&feature=youtu.be
  
  is:other webinar COVID-19 lang:en air quality environment dataset EU Datathon 2020
Visit annotations in context

Tags

environment

webinar

dataset

EU

lang:en

COVID-19

Datathon 2020

air quality

is:other

Annotators

edampf

URL

youtube.com/watch
www.youtube.com www.youtube.com

EU Datathon 2020 - Webinar on COVID-19 and media and data monitoring

1
1. edampf 25 Jun 2020
  
  in BehSci
  
  EU Datathon 2020—Webinar on COVID-19 and media and data monitoring. (2020, April 22). https://www.youtube.com/watch?v=wyNgmEfi_vk&feature=youtu.be
  
  is:other webinar lang:en COVID-19 media data monitoring dataset EU Datathon 2020
Visit annotations in context

Tags

webinar

dataset

EU

data monitoring

lang:en

media

COVID-19

Datathon 2020

is:other

Annotators

edampf

URL

youtube.com/watch
www.youtube.com www.youtube.com

EU Datathon 2020 - Webinar dedicated to COVID-19 data

1
1. edampf 25 Jun 2020
  
  in BehSci
  
  EU Datathon 2020—Webinar dedicated to COVID-19 data. (2020, April 9). https://www.youtube.com/watch?v=JIy6NO7QRQM&list=PLT5rARDev_rlAZ21iedz0ynnN4Na3UIoW&index=14&t=270s
  
  is:other webinar lang:en COVID-19 EU Datathon 2020 dataset
Visit annotations in context

Tags

lang:en

COVID-19

Datathon 2020

webinar

dataset

EU

is:other

Annotators

edampf

URL

youtube.com/watch
eml.berkeley.edu eml.berkeley.edu

NudgeToScale2020-03-20.pdf

1
1. edampf 12 Jun 2020
  
  in BehSci
  
  DellaVigna, S & Linos E. (2020). RCTs to scale: Comprehensive evidence from two nudge units. UC Berkeley. https://eml.berkeley.edu/~sdellavi/wp/NudgeToScale2020-03-20.pdf
  
  is:article lang:en pdf nudge intervention behavior motivation nudge units USA government dataset RCT trials publication academic prediction forecast intervention practitioner study
Visit annotations in context

Tags

intervention

nudge intervention

practitioner

trials

USA

nudge units

publication

RCT

is:article

forecast

behavior

prediction

dataset

motivation

government

pdf

lang:en

study

academic

Annotators

edampf

URL

eml.berkeley.edu/~sdellavi/wp/NudgeToScale2020-03-20.pdf
psyarxiv.com psyarxiv.com

COVIDiSTRESS Global Survey dataset on psychological and behavioural consequences of the COVID-19 outbreak

1
1. Marlene_Wulf 03 Jun 2020
  
  in BehSci
  
  Yamada, Y., Ćepulić, D.-B., Coll-Martín, T., Debove, S., Gautreau, G., Han, H., Rasmussen, J., Tran, T. P., Travaglino, G. A., & Lieberoth, A. (2020). COVIDiSTRESS Global Survey dataset on psychological and behavioural consequences of the COVID-19 outbreak [Preprint]. PsyArXiv. https://doi.org/10.31234/osf.io/v7cep
  
  is:preprint lang:en compliance COVID-19 stress social support personality trsut in institution preventive measure social science dataset global survey open science human experience pandemic demographic background variable
Visit annotations in context

Tags

human experience

social support

open science

global

social science

preventive measure

personality

demographic background variable

is:preprint

dataset

pandemic

survey

stress

lang:en

COVID-19

compliance

trsut in institution

Annotators

Marlene_Wulf

URL

psyarxiv.com/v7cep/
May 2020
docs.google.com docs.google.com

Proposal for collaboration

1
1. edampf 29 May 2020
  
  in BehSci
  
  is:webpage COVID-19 lang:en collaboration proposal CoMuNe Lab dataset data Twitter Infodemic Observatory project
Visit annotations in context

Tags

CoMuNe Lab

Infodemic Observatory

collaboration

dataset

data

lang:en

project

COVID-19

Twitter

is:webpage

proposal

Annotators

edampf

URL

docs.google.com/forms/d/e/1FAIpQLSeqZsfAGIpTbDV62-LNf8xt3MjJRtfVmM8DLInLCTlZPo5hjA/viewform
www.ukcdr.org.uk www.ukcdr.org.uk

COVID-19 Research Project Tracker by UKCDR & GloPID-R

1
1. edampf 26 May 2020
  
  in BehSci
  
  UKCDR - COVID-19 Research Project Tracker
  
  is:webpage lang:en COVID-19 UK database research project tracker future investment coordination information sharing live interactive map dataset clinical trial resources
Visit annotations in context

Tags

coordination

future investment

interactive

live

resources

is:webpage

clinical trial

information sharing

database

dataset

tracker

lang:en

UK

COVID-19

map

research project

Annotators

edampf

URL

ukcdr.org.uk/funding-landscape/covid-19-research-project-tracker/
ai.googleblog.com ai.googleblog.com

Understanding the Shape of Large-Scale Data

1
1. edampf 13 May 2020
  
  in BehSci
  
  Tsitsulin, A. & Perozzi B. Understanding the Shape of Large-Scale Data. (2020 May 05). Google AI Blog. http://ai.googleblog.com/2020/05/understanding-shape-of-large-scale-data.html
  
  is:blog lang:en Google large-scale data dataset graph mathematics modeling relationship learning data evaluation data visualization spectrum DDGK data analysis time-varying
Visit annotations in context

Tags

time-varying

relationship

data analysis

learning

spectrum

mathematics

is:blog

large-scale data

data visualization

Google

dataset

DDGK

lang:en

data evaluation

graph

modeling

Annotators

edampf

URL

ai.googleblog.com/2020/05/understanding-shape-of-large-scale-data.html
www.kaggle.com www.kaggle.com

COVID-19 Open Research Dataset Challenge (CORD-19)

1
1. Marlene_Wulf 07 May 2020
  
  in BehSci
  
  COVID-19 Open Research Dataset Challenge (CORD-19). (n.d.). Retrieved May 6, 2020, from https://kaggle.com/allen-institute-for-ai/CORD-19-research-challenge
  
  is:webpage lang:en COVID-19 research dataset challenge community resources literature White House
Visit annotations in context

Tags

literature

dataset

community

White House

resources

lang:en

COVID-19

challenge

is:webpage

research

Annotators

Marlene_Wulf

URL

kaggle.com/allen-institute-for-ai/CORD-19-research-challenge
leoferres.info leoferres.info

Leo's blog · COVID19 Mobility Reports

1
1. edampf 07 May 2020
  
  in BehSci
  
  Ferres, L. (2020 April 10). COVID19 mobility reports. Leo's Blog. https://leoferres.info/blog/2020/04/10/covid19-mobility-reports/
  
  is:blog COVID-19 lang:en mobility geographic information research public health dataset data metrics analysis smartphone social media technology
Visit annotations in context

Tags

technology

is:blog

metrics

mobility

dataset

data

smartphone

lang:en

social media

COVID-19

research

public health

analysis

geographic information

Annotators

edampf

URL

leoferres.info/blog/2020/04/10/covid19-mobility-reports/
coviz.apps.allenai.org coviz.apps.allenai.org

About

1
1. Marlene_Wulf 06 May 2020
  
  in BehSci
  
  About. (n.d.). Retrieved May 6, 2020, from https://coviz.apps.allenai.org/
  
  is:webpage lang:en COVID-19 network science research dataset acceleration visualization literature concept connection biomedical
Visit annotations in context

Tags

science

acceleration

literature

dataset

visualization

connection

lang:en

COVID-19

network

biomedical

concept

is:webpage

research

Annotators

Marlene_Wulf

URL

coviz.apps.allenai.org/
epjdatascience.springeropen.com epjdatascience.springeropen.com

News and the city: understanding online press consumption patterns through mobile data

1
1. edampf 06 May 2020
  
  in BehSci
  
  Vilella, S., Paolotti, D., Ruffo, G. et al. News and the city: understanding online press consumption patterns through mobile data. EPJ Data Sci. 9, 10 (2020). https://doi.org/10.1140/epjds/s13688-020-00228-9
  
  is:article lang:en news consumption mobile data deep packet inspection urban geo-referenced analysis connectivity information media dataset Chile city smartphone demographics education age online behavior digital media
Visit annotations in context

Tags

age

urban

geo-referenced analysis

information

online behavior

media

mobile data

is:article

digital media

connectivity

education

deep packet inspection

city

consumption

Chile

dataset

smartphone

demographics

lang:en

news

Annotators

edampf

URL

epjdatascience.springeropen.com/articles/10.1140/epjds/s13688-020-00228-9
Apr 2020
rajpurkar.github.io rajpurkar.github.io

The Stanford Question Answering Dataset

1
1. zajo 28 Apr 2020
  
  in Public
  
  dataset
Visit annotations in context

Tags

dataset

Annotators

zajo

URL

rajpurkar.github.io/SQuAD-explorer/
arxiv.org arxiv.org

A County-level Dataset for Informing the United States' Response to COVID-19

1
1. edampf 23 Apr 2020
  
  in BehSci
  
  Killeen, B.D., et al. (2020, April 1). A county-level dataset for informing the United States' response to COVID-19. Cornell University. arXiv:2004.00756.
  
  citation COVID-19 lang:en is:preprint dataset intervention aggregation USA response metrics information data code
Visit annotations in context

Tags

citation

is:preprint

intervention

response

metrics

dataset

USA

data

information

code

lang:en

COVID-19

aggregation

Annotators

edampf

URL

arxiv.org/abs/2004.00756
www.ofcom.org.uk www.ofcom.org.uk

Covid-19 news and information: consumption and attitudes

1
1. edampf 23 Apr 2020
  
  in BehSci
  
  Ofcom. (2020, April 9). Covid-19 news and information: consumption and attitudes. https://www.ofcom.org.uk/research-and-data/tv-radio-and-on-demand/news-media/coronavirus-news-consumption-attitudes-behaviour
  
  is:webpage lang:en COVID-19 response information survey dataset BARB comScore news access attitude consumption misinformation interactive
Visit annotations in context

Tags

consumption

response

comScore

interactive

dataset

BARB

information

misinformation

survey

access

lang:en

attitude

COVID-19

is:webpage

news

Annotators

edampf

URL

ofcom.org.uk/research-and-data/tv-radio-and-on-demand/news-media/coronavirus-news-consumption-attitudes-behaviour
www.pnas.org www.pnas.org

Measuring the predictability of life outcomes with a scientific mass collaboration

1
1. Marlene_Wulf 23 Apr 2020
  
  in BehSci
  
  Salganik, M. J., Lundberg, I., Kindel, A. T., Ahearn, C. E., Al-Ghoneim, K., Almaatouq, A., Altschul, D. M., Brand, J. E., Carnegie, N. B., Compton, R. J., Datta, D., Davidson, T., Filippova, A., Gilroy, C., Goode, B. J., Jahani, E., Kashyap, R., Kirchner, A., McKay, S., … McLanahan, S. (2020). Measuring the predictability of life outcomes with a scientific mass collaboration. Proceedings of the National Academy of Sciences. https://doi.org/10.1073/pnas.1915006117
  
  is:article lang:en prediction collaboration social science dataset prediction life outcome
Visit annotations in context

Tags

collaboration

prediction

dataset

outcome

life

lang:en

social science

is:article

Annotators

Marlene_Wulf

URL

pnas.org/content/117/15/8398
trello.com trello.com

Collective Intelligence and COVID-19 | Trello

1
1. Marlene_Wulf 23 Apr 2020
  
  in BehSci
  
  Collective Intelligence and COVID-19 | Trello. (n.d.). Retrieved April 20, 2020, from https://trello.com/b/STdgEhvX/collective-intelligence-and-covid-19
  
  is:webpage lang:en COVID-19 collective intelligence modeling crowdprediction dataset data analysis science project collaboration mapping crowdsourcing symptom self-assessment contact tracing community network hackathon repository
Visit annotations in context

Tags

crowdsourcing

science

collaboration

self-assessment

hackathon

data

collective

symptom

project

repository

contact

intelligence

mapping

crowdprediction

tracing

dataset

community

lang:en

network

COVID-19

modeling

analysis

is:webpage

Annotators

Marlene_Wulf

URL

trello.com/b/STdgEhvX/collective-intelligence-and-covid-19
arxiv.org arxiv.org

Standardizing and Benchmarking Crisis-related Social Media Datasets for Humanitarian Information Processing

1
1. Marlene_Wulf 23 Apr 2020
  
  in BehSci
  
  Alam, F., Sajjad, H., Imran, M., & Ofli, F. (2020). Standardizing and Benchmarking Crisis-related Social Media Datasets for Humanitarian Information Processing. ArXiv:2004.06774 [Cs]. http://arxiv.org/abs/2004.06774
  
  is:article lang:en social media dataset humanitarian information processing analysis organization response disaster standardizing
Visit annotations in context

Tags

standardizing

response

organization

dataset

information

humanitarian

social media

lang:en

is:article

analysis

processing

disaster

Annotators

Marlene_Wulf

URL

arxiv.org/abs/2004.06774
github.com github.com

indirect/unpwn

1
1. TylerRick 22 Apr 2020
  
  in Public
  
  haveibeenpwned.com ruby library password validation/policy uses: offline dataset
Visit annotations in context

Tags

ruby library

haveibeenpwned.com

password validation/policy

uses: offline dataset

Annotators

TylerRick

URL

github.com/indirect/unpwn
github.com github.com

DanielHeath/has_unpublished_password

1
1. TylerRick 20 Apr 2020
  
  in Public
  
  uses: offline dataset haveibeenpwned.com
Visit annotations in context

Tags

haveibeenpwned.com

uses: offline dataset

Annotators

TylerRick

URL

github.com/DanielHeath/has_unpublished_password
experience.arcgis.com experience.arcgis.com

Novel coronavirus (COVID-19) situation

1
1. TylerRick 02 Apr 2020
  
  in Public
  
  data visualization interesting visualizations world map coronavirus schema.org/Dataset schema.org/Map
Visit annotations in context

Tags

schema.org/Dataset

coronavirus

schema.org/Map

interesting visualizations

data visualization

world map

Annotators

TylerRick

URL

experience.arcgis.com/experience/685d0ace521648f8a5beeeee1b9125cd
Mar 2020
Local file Local file

Untitled document

2
1. zhentg 24 Mar 2020
  
  in Public
  
  All datasets were supplied by Sutherland in the Supporting Information as 3D geometries aligned according to the original literature, namely by flexible alignment on one or more templates obtained by crystallographic enzyme-inhibitor complexes
  
  https://pubs.acs.org/doi/10.1021/jm0497141
  
  dataset
2. zhentg 24 Mar 2020
  
  in Public
  
  eight comprehensive datasets
  
  What do the datasets look like? This may help in understanding the application domain of this tool.
  
  dataset
Tags

dataset

Annotators

zhentg
ourworldindata.org ourworldindata.org

Coronavirus Disease (COVID-19)

1
1. lmichan 19 Mar 2020
  
  in Public
  
  favorito,data_science
  
  unidad_COVID2019 schema.org/Dataset schema.org/ImageObject hub
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Dataset

hub

schema.org/ImageObject

Annotators

lmichan

URL

ourworldindata.org/coronavirus
multimedia.scmp.com multimedia.scmp.com

Coronavirus: the new disease Covid-19 explained

1
1. lmichan 17 Mar 2020
  
  in Public
  
  unidad_COVID2019,favorita
  
  unidad_COVID2019 difusion schema.org/Dataset prioridad1 UCOVID19_infografías schema.org/ImageObject
Visit annotations in context

Tags

unidad_COVID2019

difusion

schema.org/Dataset

schema.org/ImageObject

prioridad1

UCOVID19_infografías

Annotators

lmichan

URL

multimedia.scmp.com/infographics/news/china/article/3047038/wuhan-virus/index.html
serendipia.digital serendipia.digital

Datos abiertos Coronavirus México: descarga aquí la información

1
1. lmichan 17 Mar 2020
  
  in Public
  
  schema.org/Dataset unidad_COVID2019 mx
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Dataset

mx

Annotators

lmichan

URL

serendipia.digital/2020/03/datos-abiertos-sobre-casos-de-coronavirus-covid-19-en-mexico/
www.visualcapitalist.com www.visualcapitalist.com

Visualizing the History of Pandemics

1
1. lmichan 16 Mar 2020
  
  in Public
  
  favorito,hermoso
  
  unidad_COVID2019 difusion imagen UCOVID19_infografías schema.org/Dataset
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Dataset

difusion

imagen

UCOVID19_infografías

Annotators

lmichan

URL

visualcapitalist.com/history-of-pandemics-deadliest/
avatorl.org avatorl.org

Coronavirus COVID-19 updates: dashboard and report

1
1. lmichan 13 Mar 2020
  
  in Public
  
  unidad_COVID2019 schema.org/Map schema.org/Dataset schema.org/ImageObject prioridad1
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Dataset

schema.org/ImageObject

prioridad1

schema.org/Map

Annotators

lmichan

URL

avatorl.org/covid-19/
coronavirus.thebaselab.com coronavirus.thebaselab.com

Coronavirus: Real-time News Updates and Data

1
1. lmichan 13 Mar 2020
  
  in Public
  
  schema.org/Dataset unidad_COVID2019
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Dataset

Annotators

lmichan

URL

coronavirus.thebaselab.com/
experience.arcgis.com experience.arcgis.com

Novel coronavirus (COVID-19) situation

1
1. lmichan 13 Mar 2020
  
  in Public
  
  unidad_COVID2019 schema.org/Map schema.org/Dataset
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Map

schema.org/Dataset

Annotators

lmichan

URL

experience.arcgis.com/experience/685d0ace521648f8a5beeeee1b9125cd
www.apprise.org.au www.apprise.org.au

APPRISE – Covid-19 resources

1
1. lmichan 13 Mar 2020
  
  in Public
  
  pais,
  
  unidad_COVID2019 schema.org/Dataset
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Dataset

Annotators

lmichan

URL

apprise.org.au/resources/2019-novel-coronavirus-2019-ncov/
www.gov.uk www.gov.uk

COVID-19: track coronavirus cases

1
1. lmichan 13 Mar 2020
  
  in Public
  
  pais
  
  unidad_COVID2019 schema.org/Dataset
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Dataset

Annotators

lmichan

URL

gov.uk/government/publications/covid-19-track-coronavirus-cases
github.com github.com

simonw/covid-19-datasette

1
1. lmichan 13 Mar 2020
  
  in Public
  
  unidad_COVID2019
  
  unidad_COVID2019 schema.org/Dataset aplicacion
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Dataset

aplicacion

Annotators

lmichan

URL

github.com/simonw/covid-19-datasette
coronavirus.jhu.edu coronavirus.jhu.edu

Johns Hopkins Coronavirus Resource Center

1
1. lmichan 13 Mar 2020
  
  in Public
  
  unidad_COVID2019
  
  schema.org/Map schema.org/Dataset schema.org/ImageObject
Visit annotations in context

Tags

schema.org/Map

schema.org/Dataset

schema.org/ImageObject

Annotators

lmichan

URL

coronavirus.jhu.edu/
www.worldometers.info www.worldometers.info

Coronavirus Cases: Statistics and Charts - Worldometer

1
1. lmichan 13 Mar 2020
  
  in Public
  
  schema.org/Dataset unidad_COVID2019
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Dataset

Annotators

lmichan

URL

worldometers.info/coronavirus/coronavirus-cases/
bnonews.com bnonews.com

Tracking coronavirus: Map, data and timeline

1
1. lmichan 13 Mar 2020
  
  in Public
  
  linea_tiempo
  
  unidad_COVID2019 schema.org/Dataset schema.org/Map
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Map

schema.org/Dataset

Annotators

lmichan

URL

bnonews.com/index.php/2020/02/the-latest-coronavirus-cases/
covid2019.app covid2019.app

COVID-2019

1
1. lmichan 12 Mar 2020
  
  in Public
  
  acceso_abierto
  
  unidad_COVID2019 schema.org/Dataset
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Dataset

Annotators

lmichan

URL

covid2019.app/
www.consulta.mx www.consulta.mx

Encuesta: Coronavirus en México

1
1. lmichan 12 Mar 2020
  
  in Public
  
  unidad_COVID2019,encuesta
  
  unidad_COVID2019 schema.org/ImageObject schema.org/Dataset mx
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Dataset

schema.org/ImageObject

mx

Annotators

lmichan

URL

consulta.mx/index.php/encuestas-e-investigaciones/item/1339-encuesta-coronavirus-en-mexico
coronavirus-disasterresponse.hub.arcgis.com coronavirus-disasterresponse.hub.arcgis.com

COVID-19 GIS Hub

1
1. lmichan 12 Mar 2020
  
  in Public
  
  unidad_COVID2019,imprescindible
  
  unidad_COVID2019 hub prioridad1 schema.org/Dataset schema.org/Map schema.org/ImageObject
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Dataset

schema.org/ImageObject

prioridad1

schema.org/Map

hub

Annotators

lmichan

URL

coronavirus-disasterresponse.hub.arcgis.com/
www.kff.org www.kff.org

Coronavirus (COVID-19)

1
1. lmichan 12 Mar 2020
  
  in Public
  
  unidad_COVID2019 hub prioridad1 schema.org/ImageObject schema.org/Dataset
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Dataset

schema.org/ImageObject

prioridad1

hub

Annotators

lmichan

URL

kff.org/coronavirus-covid-19/
www.worldometers.info www.worldometers.info

Coronavirus Update (Live): 31,532 Cases and 638 Deaths from the Wuhan China Virus Outbreak - Worldometer

1
1. lmichan 12 Mar 2020
  
  in Public
  
  unidad_COVID2019 schema.org/Dataset prioridad1
Visit annotations in context

Tags

unidad_COVID2019

prioridad1

schema.org/Dataset

Annotators

lmichan

URL

worldometers.info/coronavirus/
www.kff.org www.kff.org

COVID-19 Coronavirus Tracker – Updated as of March 11, 2020

1
1. lmichan 12 Mar 2020
  
  in Public
  
  schema.org/Map schema.org/Dataset unidad_COVID2019 prioridad1
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Map

schema.org/Dataset

prioridad1

Annotators

lmichan

URL

kff.org/global-health-policy/fact-sheet/coronavirus-tracker/
www.ecdc.europa.eu www.ecdc.europa.eu

Situation update worldwide, as of 11 March 2020 08:00

1
1. lmichan 12 Mar 2020
  
  in Public
  
  schema.org/Map schema.org/ImageObject unidad_COVID2019 schema.org/Dataset prioridad1
Visit annotations in context

Tags

unidad_COVID2019

schema.org/Dataset

schema.org/ImageObject

prioridad1

schema.org/Map

Annotators

lmichan

URL

ecdc.europa.eu/en/geographical-distribution-2019-ncov-cases
Feb 2019
iphysresearch.github.io iphysresearch.github.io

A Paper A Day

4
1. Herb 11 Feb 2019
  
  in Public
  
  Impact of Fully Connected Layers on Performance of Convolutional Neural Networks for Image Classification
  
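  The authors conclude: (1) the fewer layers a CNN has, the more nodes its FC layers need; conversely, a deeper CNN gets by with fewer FC nodes; (2) besides needing more FC nodes, a shallow CNN should have more FC layers the more classes the dataset has, and vice versa; (3) for datasets with more samples per class, deeper networks do better, but when the number of classes is large, shallower networks perform better.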
  
  dataset structure
2. Herb 06 Feb 2019
  
  in Public
  
  Do we train on test data? Purging CIFAR of near-duplicates
  
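  The authors take a close look at the CIFAR test sets, arguing that some test samples are so similar to training samples that they raise an overfitting concern. They replace the suspect samples to build a new test set and, after re-running well-known models on it, are relieved to report that those models do not appear to have overfitted in a way that distorted their evaluation~ (a bit of a self-inflicted slap in the face~)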
  
  dataset
3. Herb 02 Feb 2019
  
  in Public
  
  Semantic Redundancies in Image-Classification Datasets: The 10% You Don't Need
  
  A deep-neural-network version of "feature engineering"~ [doge]
  
  dataset
4. Herb 01 Feb 2019
  
  in Public
  
  Deep Learning on Small Datasets without Pre-Training using Cosine Loss
  
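  In contemporary deep learning, two things seem beyond dispute:
  
  categorical cross-entropy loss after softmax activation is the method of choice for classification;
  
  training a CNN classifier from scratch on small datasets does not work well. In this paper the authors show that, when there are only a few samples per class, the cosine loss gives better performance than cross-entropy.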
  
  loss dataset
Visit annotations in context

Tags

structure

loss

dataset

Annotators

Herb

URL

iphysresearch.github.io/paper_summary/APaperADay.html
towardsdatascience.com towardsdatascience.com

Top Sources For Machine Learning Datasets – Towards Data Science

1
1. aerobius 01 Feb 2019
  
  in Public
  
  Top Sources For Machine Learning Datasets
  
  dataset ML
Visit annotations in context

Tags

dataset

ML

Annotators

aerobius

URL

towardsdatascience.com/top-sources-for-machine-learning-datasets-bb6d0dc3378b
Jan 2019
iphysresearch.github.io iphysresearch.github.io

A Paper A Day

2
1. Herb 28 Jan 2019
  
  in Public
  
  Fitting A Mixture Distribution to Data: Tutorial
  
  At first glance, this looks like a lovingly written tutorial!
  
  Tutorial dataset
2. Herb 18 Jan 2019
  
  in Public
  
  Optimization Models for Machine Learning: A Survey
  
  For me, probably the only truly valuable part of this paper is the collected dataset tables in the appendix at the end...
  
  dataset Optimization review
Visit annotations in context

Tags

Optimization

Tutorial

review

dataset

Annotators

Herb

URL

iphysresearch.github.io/paper_summary/APaperADay.html
Dec 2018
iphysresearch.github.io iphysresearch.github.io

A Paper A Day

2
1. Herb 18 Dec 2018
  
  in Public
  
  Are All Training Examples Created Equal? An Empirical Study
  
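  From this paper I learned about an interesting concept called active learning, which seems quite close to the continuous-parameter training-data sampling pool I designed myself...
  
  The paper's main contribution is a study of the importance of training samples in image classification, measuring sample importance with a gradient-based method. Its conclusions suggest that active learning may not always be effective in deep learning.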
  
  dataset active learning
2. Herb 07 Dec 2018
  
  in Public
  
  Image Score: How to Select Useful Samples
  
  The semi-supervised learning idea it proposes is fairly interesting. Scoring every sample in the dataset might help a little with interpretability...
  
  sample dataset
Visit annotations in context

Tags

sample

dataset

active learning

Annotators

Herb

URL

iphysresearch.github.io/paper_summary/APaperADay.html
Nov 2018
iphysresearch.github.io iphysresearch.github.io

A Paper A Day

2
1. Herb 15 Nov 2018
  
  in Public
  
  Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift
  
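  The paper's experiments explore how models perform after shifts (a kind of controlled perturbation) are applied to the dataset, and it proposes a classifier-based method/pipeline for observing and evaluating them.
  
  For my gravitational-wave data research, I can borrow its data-shift methods and its evaluation mechanism (two-sample tests).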
  
  performace model evaluation dataset
2. Herb 12 Nov 2018
  
  in Public
  
  Training neural audio classifiers with few data
  
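  This is a fairly preliminary, simple experiment.
  
  The conclusions from the figures are not really surprising: more data of course gives better performance; transfer learning does well with very small amounts of data; prototypical models may show some advantage thanks to their particular structure; and the less data there is, the worse the overfitting...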
  
  dataset audio Transfer learning
Visit annotations in context

Tags

performace

Transfer learning

model evaluation

audio

dataset

Annotators

Herb

URL

iphysresearch.github.io/paper_summary/APaperADay.html
Sep 2016
www.ukbiobank.ac.uk www.ukbiobank.ac.uk

Untitled document

1
1. tal 06 Sep 2016
  
  in Public
  
  UK Biobank
  
  Large UK dataset containing extensive phenotypic, genotypic, and neuroimaging data.
  
  License: Unclear, but restrictive.
  
  Access: Human, ?
  
  Needs data use agreement: Yes
  
  Needs institutional signature for access: No (?)
  
  NHW16 Dataset
Visit annotations in context

Tags

Dataset

NHW16

Annotators

tal

URL

ukbiobank.ac.uk/about-biobank-uk/
openfmri.org openfmri.org

OpenfMRI

1
1. tal 06 Sep 2016
  
  in Public
  
  View Data Sets
  
  Public fMRI dataset repository.
  
  License: PDDL v.1.0
  
  Access: Human, s3
  
  Needs data use agreement: No
  
  Needs institutional signature for access: No
  
  NHW16 Dataset
Visit annotations in context

Tags

Dataset

NHW16

Annotators

tal

URL

openfmri.org/
dataverse.harvard.edu dataverse.harvard.edu

Brain Genomics Superstruct Project (GSP) - Brain Genomics Superstruct Project (GSP) Dataverse

1
1. satra 06 Sep 2016
  
  in Public
  
  Brain Genomics Superstruct Project (GSP)
  
  License: Data use agreement
  
  Access: Human, API
  
  Needs data use agreement: Yes
  
  Needs institutional signature for access: No
  
  NHW16 Dataset
Visit annotations in context

Tags

Dataset

NHW16

Annotators

satra

URL

dataverse.harvard.edu/dataset.xhtml
studyforrest.org studyforrest.org

studyforrest.org

1
1. satra 06 Sep 2016
  
  in Public
  
  What is studyforrest?
  
  Rich multimodal dataset on naturalistic stimuli
  
  License: PDDL v.1.0
  
  Access: Human, rsync, git annex
  
  Needs data use agreement: No
  
  Needs institutional signature for access: No
  
  NHW16 Dataset
Visit annotations in context

Tags

Dataset

NHW16

Annotators

satra

URL

studyforrest.org/
myconnectome.org myconnectome.org

Data sharing |

1
1. dankessler 06 Sep 2016
  
  in Public
  
  License: PDDL v.1.0
  
  Access: Human, s3, openfmri
  
  Needs data use agreement: No
  
  Needs institutional signature for access: No
  
  NHW16 DataSet
Visit annotations in context

Tags

DataSet

NHW16

Annotators

dankessler

URL

myconnectome.org/wp/data-sharing/
May 2016
www.jstage.jst.go.jp www.jstage.jst.go.jp

The data set of song frequency of forest bird

1
1. fbaumgardt 13 May 2016
  
  in Public
  
  Bird song data set
  
  bioacoustics dataset
Visit annotations in context

Tags

bioacoustics

dataset

Annotators

fbaumgardt

URL

jstage.jst.go.jp/article/birdresearch/8/0/8_R1/_article
Aug 2015
europepmc.org europepmc.org

Sizing the Problem of Improving Discovery and Access to NIH-Funded Data: A Preliminary Study

1
1. mavery 11 Aug 2015
  
  in Public
  
  the definition of a “dataset,”
  
  this is interesting, and will be interesting to track within and across disciplines
  
  data dataset
Visit annotations in context

Tags

dataset

data

Annotators

mavery

URL

europepmc.org/abstract/MED/26207759
