- Jul 2023
-
arxiv.org
-
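Paper that introduced the PPO algorithm. PPO is, in a way, a response to TRPO: it keeps the core idea (limit how far each update moves the policy) but implements it in a simpler, more efficient algorithm.
TRPO frames each policy update as a constrained optimization problem, maximizing a surrogate objective subject to a KL-divergence constraint, which is comparatively expensive to solve. PPO replaces the hard constraint with a clipped surrogate objective that can be optimized with ordinary stochastic gradient ascent.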
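To make the contrast with TRPO concrete, here is a minimal sketch of the clipped surrogate objective from the PPO paper, L_CLIP = E[min(r * A, clip(r, 1 - eps, 1 + eps) * A)], where r is the probability ratio between the new and old policy and A is the advantage estimate. The function name, argument names, and the plain-list inputs are illustrative assumptions, not code from the paper; eps = 0.2 is the clipping value the paper uses in its experiments.

```python
import math


def ppo_clip_objective(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Average clipped surrogate objective over a batch (hypothetical helper).

    new_log_probs / old_log_probs: log pi(a|s) under the new and old policy.
    advantages: advantage estimates for the same (state, action) samples.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        # Probability ratio r = pi_new(a|s) / pi_old(a|s), computed in log space.
        ratio = math.exp(new_lp - old_lp)
        # Keep the ratio inside [1 - eps, 1 + eps].
        clipped_ratio = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped_ratio * adv)
    return total / len(advantages)


# Usage: the ratio is 2.0 with a positive advantage, so the clipped term applies.
print(ppo_clip_objective([math.log(2.0)], [0.0], [1.0]))
```

Because the minimum is taken against the clipped term, the objective gives no extra reward for moving the ratio outside the clip range, which plays the role of TRPO's trust region without a constrained solver. In practice this quantity is maximized by gradient ascent in an autodiff framework.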
-
-
arxiv.org
-
Tom Schaul, John Quan, Ioannis Antonoglou and David Silver. "PRIORITIZED EXPERIENCE REPLAY", ICLR, 2016.
-
- Aug 2022
-
psyarxiv.com
-
Teodorescu, K., Plonsky, O., Ayal, S., & Barkan, R. (2021). Enforcement policies: Frequency of inspection is more important than the severity of punishment. PsyArXiv. https://doi.org/10.31234/osf.io/pbvzr
-
- Feb 2022
-
twitter.com
-
Claudia Sahm. (2022, January 5). “We, as experts, have a responsibility to policymakers and everyday people to match the strength of our recommendations to the strength of our data. When I read Oster, I see a tone and conviction that far exceeds the many limitations of her data.” https://t.co/NqWwj0hi28 [Tweet]. @Claudia_Sahm. https://twitter.com/Claudia_Sahm/status/1478532000441151488
-
- Nov 2021
-
socialsciences.nature.com
-
Behavioural and Social Sciences at Nature Portfolio. (2021, November 3). No evidence school closures reduce the spread of COVID-19. http://socialsciences.nature.com/posts/no-evidence-school-closures-reduce-the-spread-of-covid-19
-
- Jun 2021
-
-
Chadi, M.-A., & Mousannif, H. (2021). Reinforcement Learning Based Decision Support Tool For Epidemic Control [Preprint]. PsyArXiv. https://doi.org/10.31234/osf.io/tcr8s
-
- Aug 2020
-
-
Moya, C., Cruz y Celis Peniche, P. D., Kline, M. A., & Smaldino, P. (2020). Dynamics of Behavior Change in the COVID World [Preprint]. SocArXiv. https://doi.org/10.31235/osf.io/kxajh
-
-
www.nber.org
-
Bacher-Hicks, A., Goodman, J., & Mulhern, C. (2020). Inequality in Household Adaptation to Schooling Shocks: Covid-Induced Online Learning Engagement in Real Time (Working Paper No. 27555; Working Paper Series). National Bureau of Economic Research. https://doi.org/10.3386/w27555
-
-
covid-19.iza.org
-
Work That Can Be Done from Home: Evidence on Variation within and across Occupations and Industries. COVID-19 and the Labor Market. (n.d.). IZA – Institute of Labor Economics. Retrieved August 4, 2020, from https://covid-19.iza.org/publications/dp13374/
-
- Jul 2020
-
-
Jena, P. K. (2020). Impact of Covid-19 on Higher Education in India [Preprint]. SocArXiv. https://doi.org/10.31235/osf.io/jg8fr
-
- Jun 2020
-
www.politico.com
-
‘It’s just way too much to take on’: School systems struggle with the politics of reopening. (n.d.). POLITICO. Retrieved June 28, 2020, from https://www.politico.com/news/2020/06/17/reopening-schools-coronavirus-327020
-
-
www.bruno-latour.fr
-
Latour, B. (2020, March 29). A little exercise to make sure that, after the virus crisis, things don't start again as they were before. Bruno-latour.fr. http://www.bruno-latour.fr/node/852.html
-
-
royalsociety.org
-
DELVE group publishes evidence paper on the use of face masks in tackling Coronavirus (COVID-19) pandemic | Royal Society. (2020, May 4). https://royalsociety.org/news/2020/05/delve-group-publishes-evidence-paper-on-use-of-face-masks/
Tags
- physical distancing
- droplet
- learning
- social distancing
- evidence
- DELVE
- Data Evaluation and Learning for Viral Epidemics
- infection
- COVID-19
- lang:en
- face mask
- SAGE
- asymptomatic
- Royal Society
- is:webpage
- behavioral change
- policy
- publication
- public health
- management
- transmission reduction
-
-
psyarxiv.com
-
Rahman, M. (2020, June 1). COVID-19 Public Sentiment Insights and Machine Learning for Tweets Classification. https://doi.org/10.31234/osf.io/sw2dn
-
- May 2020
-
www.thelancet.com
-
Schwalbe, N., & Wahl, B. (2020). Artificial intelligence and the future of global health. The Lancet, 395(10236), 1579–1586. https://doi.org/10.1016/S0140-6736(20)30226-9
-
-
psyarxiv.com
-
Barnby, J. M., Bell, V., Mehta, M., & Moutoussis, M. (2020, April 17). Reduction in social learning and policy uncertainty about intentional social threat underlies paranoia: evidence from modelling a modified serial dictator game. https://doi.org/10.31234/osf.io/jvx5y
-
- Dec 2019
-
wellcomeopenresearch.org
-
Regarding recommended practices in international ethical policy documents, these are not sufficiently disseminated or internalized, hence gaps still exist in relation to best practices and critical aspects of data practices. To address this challenge, it is not only essential to disseminate and promote these policies, but to also adapt them to the contexts and situations where they are applicable through training and capacity building.
Given that the article is framed as being about policy diffusion and using a policy learning framework, I would have expected more details here.
-
- Nov 2019
-
ppsd.smapply.io
-
Private post-secondary institutions that provide educational services in the State of New Mexico are subject to either the New Mexico Post-Secondary Educational Institution Act (Section 21-23-1 et seq. NMSA 1978) or the Interstate Distance Education Act (Section 21-23B-1 et seq. NMSA 1978) and can use this site to apply for State Authorization or submit other required applications to comply with State regulations. Students may request transcripts of closed schools where the New Mexico Higher Education Department is the designated custodian of records or may file complaints against any post-secondary institution that provides educational services in our State.
The NMHE website provides academic, financial, and policy information to New Mexico's public higher education institutions and community.
-
- May 2019
-
policychangeindex.org
-
Policy Change Index: machine learning applied to a corpus of text to identify and predict policy changes in China.
-
- Mar 2019
-
arxiv.org
-
A potential drawback with such pre-training approach is that the model may suffer from the mismatch of dialogue state distributions between supervised training and interactive learning stages. While interacting with users, the agent’s response at each turn has a direct influence on the distribution of dialogue state that the agent will operate on in the upcoming dialogue turns.
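Policy learning is also an important part of the dialogue process. Recent approaches to policy learning use supervised pre-training followed by online reinforcement learning to improve learning. This setup has a potential flaw: the offline data are limited in quantity, so once the agent meets an uncommon situation online, it can easily fail to recover. (Is this problem just an inference? Is there any empirical evidence for it?)
So what this paper is really proposing is a method to narrow the gap between the online and offline stages.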
-