
2017-18 New Year review

2017 progress


FLI / other AI safety:

Rationality / effectiveness:

  • Streamlined self-tracking data analysis and made an IPython notebook for plots. Found that the amount of sleep I get is correlated with tiredness (0.32), but not with mood indicators (anger, anxiety, or distractibility). Anger and anxiety are correlated with each other though (0.36). Distractibility is correlated with tiredness (0.27) and anticorrelated with anger the next day for some reason (-0.31).
  • Ran house check-in sessions on goals and habits 1-2 times a month, two house sessions on Hamming questions, and check-ins with Janos every 1-2 weeks.
  • Did a sleep CBT program with sleep restriction for 2 months. Comparing the 5 months before the program vs the 5 months after the program, evening insomnia rate went down from 16% to 8.2% of the time, and morning insomnia rate didn’t change (9%). Average hours of sleep didn’t change (7 hours), but I now go to sleep around 22 minutes earlier on average. This excludes jetlag days (at most 3 days after a flight with at least 3 hours of time difference).
  • Did around 80 exercise classes (starting in March)
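
The jetlag exclusion used in the sleep comparison above can be sketched in a few lines of Python. This is only an illustration, not the code from my notebook: the function names, the log format (a dict of date to True/False), and the choice to exclude the flight day itself along with the 3 days after it are all assumptions.

```python
from datetime import date, timedelta


def jetlag_dates(flights, window_days=3, min_shift_hours=3):
    """Return the set of dates treated as jetlagged.

    flights: iterable of (flight_date, time_shift_hours) pairs.
    Assumption: the flight day itself and the following `window_days`
    days are excluded; only flights with a time difference of at least
    `min_shift_hours` count.
    """
    excluded = set()
    for flight_date, shift_hours in flights:
        if abs(shift_hours) >= min_shift_hours:
            for offset in range(window_days + 1):
                excluded.add(flight_date + timedelta(days=offset))
    return excluded


def insomnia_rate(log, excluded):
    """Fraction of non-excluded nights with insomnia.

    log: dict mapping date -> bool (True if insomnia was recorded).
    """
    kept = [had_insomnia for day, had_insomnia in log.items()
            if day not in excluded]
    if not kept:
        raise ValueError("no nights left after exclusion")
    return sum(kept) / len(kept)
```

Calling insomnia_rate once on the log for the months before the program and once on the log for the months after, with the same excluded set, gives the two rates to compare.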

Fun stuff:

  • Moved into our new group house (Deep End).
  • Explored the UK (hiking in Wales, Scotland, Lake District).
  • Got back into aerial silks.
  • Got into circling.
  • Got a pixie haircut.
  • Family reunion in France with Russian relatives I haven’t seen in a decade.
  • Went to Burning Man and learned to read Tarot (as part of our camp theme).
  • Did the Stoic Week.
  • Played a spy scavenger hunt game.



2017 prediction outcomes


  1. Our AI safety team will have at least two papers accepted for publication at a major conference, not counting workshops (70%) – 2 papers (human preferences paper at NIPS and reward corruption paper at IJCAI)
  2. I will write at least 9 blog posts (50%) – 6 posts
  3. I will meditate at least 250 days (45%) – 237 days
  4. I will exercise at least 250 days (55%) – 194 days
  5. I will visit at least 2 new countries (80%) – France, Switzerland
  6. I will attend Burning Man (85%) – yes


  • Everything that got at least 70% confidence was correct, everything lower was wrong.
  • Like last year, my low predictions seem overconfident (though too few data points to judge).

2018 goals and predictions


  1. Write at least 2 AI blog posts that are not about conferences (1 last year) (70%)
  2. Avoid processed sugar* at least until end of March (90%)
  3. Do at most 4 non-research talks/panels (7 last year) (50%)
  4. Meditate on at least 250 days (50%)

* not in a super strict way: it’s ok to eat fruit and 90% chocolate and try a really small quantity (< teaspoon) of a dessert.


  1. Our AI safety team will have at least two papers accepted for publication at a major conference, not counting workshops (80%)
  2. I will write at least 6 blog posts (60%)
  3. I will go to at least 100 exercise classes (80 last year) (60%)
  4. 1-2 housemate turnover at the Deep End (3 last year) (70%)
  5. I will visit at least 3 new cities with population over 100,000 (4 last year) (50%)
  6. I will go on at least 2 hikes (4 last year) (90%)

Past new year reviews: 2016-17, 2015-16, 2014-15.


Takeaways from self-tracking data

I’ve been collecting data about myself on a daily basis for the past 3 years. Half a year ago, I switched from using 42goals (which I only remembered to fill out once every few days) to a Google form emailed to me daily (which I fill out consistently because I check email often). Now for the moment of truth – a correlation matrix!

The data consists of “mood variables” (anxiety, tiredness, and “zoneout” – how distracted / spacey I’m feeling), “action variables” (exercise and meditation) and sleep variables (hours of sleep, sleep start/end time, insomnia). There are 5 binary variables (meditation, exercise, evening/morning insomnia, headache) and the rest are ordinal or continuous. Almost all the variables have 6 months of data, except that I started tracking anxiety 5 months ago and zoneout 2 months ago.

The matrix shows correlations between mood and action variables for day X, sleep variables for the night after day X, and mood variables for day X+1 (marked by ‘next’):

[Figure: correlation heatmap]
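
For readers who want to see how a single "next day" entry of such a matrix can be computed, here is a minimal sketch in plain Python. It is not the notebook code: the record format (one dict per consecutive day, with None for missing values) and the names pearson, lagged_pairs and lagged_corr are assumptions made for illustration.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    # Note: a constant column gives zero variance and raises ZeroDivisionError.
    return cov / sqrt(var_x * var_y)


def lagged_pairs(days, var_today, var_next):
    """Pair var_today on day X with var_next on day X+1.

    days: list of dicts, one per consecutive day (assumed to have no gaps).
    Pairs where either value is missing are skipped.
    """
    pairs = []
    for today, tomorrow in zip(days, days[1:]):
        a = today.get(var_today)
        b = tomorrow.get(var_next)
        if a is not None and b is not None:
            pairs.append((a, b))
    return pairs


def lagged_corr(days, var_today, var_next):
    """Correlation between var_today (day X) and var_next (day X+1)."""
    xs, ys = zip(*lagged_pairs(days, var_today, var_next))
    return pearson(xs, ys)
```

For example, lagged_corr(days, "sleep_hours", "tiredness") would correspond to the "hours of sleep vs tiredness next" cell, provided the record for day X stores the sleep for the night after day X, as in the matrix described above.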

The most surprising thing about this data is how many things are uncorrelated that I would expect to be correlated:

  • evening insomnia and tiredness the next day (or the same day)
  • anxiety and sleep variables the following night
  • exercise and sleep variables the following night
  • tiredness and hours of sleep the following night
  • average hours of sleep (over the past week) is only weakly correlated with tiredness the next day (-0.15)
  • hours of sleep (average or otherwise) and anxiety or zoneout the next day (so my mood is less affected by sleep than I expected)
  • action variables and mood variables the next day
  • meditation and feeling zoned out

Some things that were correlated after all:

  • hours of sleep and tiredness the next day (-0.3) – unsurprising but lower than expected
  • tiredness and zoneout (0.33)
  • tiredness and insomnia the following morning (0.29) (weird)
  • anxiety and zoneout were anticorrelated (-0.25) on adjacent days (weird)
  • exercise and anxiety (-0.18)
  • meditation and anxiety (-0.15)
  • meditating and exercising (0.17) – both depend on how agenty / busy I am that day
  • meditation and insomnia (0.24), probably because I usually try to meditate if I’m having insomnia to make it easier to fall asleep
  • headache and evening insomnia (0.14)

Some falsified hypotheses:

  • Exercise and meditation affect mood variables the following day
  • My tiredness level depends on the average amount of sleep the preceding week
  • Anxiety affects sleep the following night
  • Exercise helps me sleep the following night
  • I sleep more when I’m more tired
  • Sleep deprivation affects my mood

The overall conclusion is that my sleep is weird and also matters less than I thought for my well-being (at least in terms of quantity).

Addendum: For those who would like to try this kind of self-tracking, here is a Google Drive folder with the survey form and the IPython notebook. You need to download the spreadsheet of form responses as a CSV file before running the notebook code. You can use the Send button in the form to email it to yourself, and then bounce it back every day using Google Inbox, or a similar service.

2016-17 New Year review

2016 progress

Research / career:

  • Got a job at DeepMind as a research scientist in AI safety.
  • Presented MiniSPN paper at ICLR workshop.
  • Finished RNN interpretability paper and presented at ICML and NIPS workshops.
  • Attended the Deep Learning Summer School.
  • Finished and defended PhD thesis.
  • Moved to London and started working at DeepMind.


  • Talk and panel (moderator) at Effective Altruism Global X Boston
  • Talk and panel at the Governance of Emerging Technologies conference at ASU
  • Talk and panel at Brain Bar Budapest
  • AI safety session at OpenAI unconference
  • Talk and panel at Effective Altruism Global X Oxford
  • Talk and panel at Cambridge Catastrophic Risk Conference run by CSER

Rationality / effectiveness:

  • Went to a 5-day Zentensive meditation retreat with Janos, in between grad school and moving to London. This was very helpful for practicing connecting with my direct emotional experience, and a good way to reset during a life transition.
  • Stopped using 42goals (too glitchy) and started recording data in a Google form emailed to myself daily. Now I am actually entering accurate data every day instead of doing it retroactively whenever I remember. I tried a number of goal tracking apps, but all of them seemed too inflexible (I was surprised not to find anything that provides correlation charts between different goals, e.g. meditation vs. hours of sleep).

Random cool things:

  • Hiked in the Andes to an altitude of 17,000 feet.
  • Visited the Grand Canyon.
  • New countries visited: UK, Bolivia, Spain.
  • Started a group house in London (moving there in a few weeks).
  • Started contributing to the new blog Approximately Correct on societal impacts of machine learning.


2016 prediction outcomes


  1. Finish PhD thesis (70%) – done
  2. Write at least 12 blog posts (40%) – 9
  3. Meditate at least 200 days (50%) – 245
  4. Exercise at least 200 days (50%) – 282
  5. Do at least 5 pullups in a row (40%) – still only 2-3
  6. Record at least 50 new thoughts (50%) – 29
  7. Stay up past 1:30am at most 20% of the nights (40%) – 26.8%
  8. Do at least 10 pomodoros per week on average (50%) – 13


  1. At least one paper accepted for publication (70%) – two papers accepted to workshops
  2. I will get at least one fellowship (40%)
  3. Insomnia at most 20% of nights (20%) – 18.3%
  4. FLI will co-organize at least 3 AI safety workshops (50%) – AAAI, ICML, NIPS


  • Low predictions (20-40%): 1/5 = 20% (overconfident)
  • Medium predictions (50-70%): 6/7 = 86% (underconfident)
  • It’s interesting that my 40% predictions were all wrong, and my 50% predictions were almost all correct. I seem to be translating system 1 labels of ‘not that likely’ and ‘reasonably likely’ to 40% and 50% respectively, while they should translate to something more like 25% and 70%. After the overconfident predictions last year, I tried to tone down the predictions for this year, but the lower ones didn’t get toned down enough.
  • I seem to be more accurate on predictions than resolutions, probably due to wishful thinking. Experimenting with no resolutions for next year.
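
The bucket tallies above can be reproduced with a short Python sketch. The function name and the data layout are assumptions for illustration; the outcomes are transcribed from the list of 2016 prediction outcomes, with the fellowship item counted as wrong and the workshop papers counted as correct, which is what the stated totals of 1/5 and 6/7 imply.

```python
def calibration(predictions, buckets):
    """Count correct predictions per confidence bucket.

    predictions: list of (confidence_percent, came_true) pairs.
    buckets: dict mapping a label to an inclusive (low, high) range.
    Returns a dict mapping label -> (number correct, number of predictions).
    """
    results = {}
    for label, (low, high) in buckets.items():
        outcomes = [ok for conf, ok in predictions if low <= conf <= high]
        if outcomes:
            results[label] = (sum(outcomes), len(outcomes))
    return results


# 2016 outcomes as read from the list above (two are inferred, see lead-in).
PREDICTIONS_2016 = [
    (70, True),   # finish PhD thesis
    (40, False),  # 12 blog posts
    (50, True),   # meditate 200 days
    (50, True),   # exercise 200 days
    (40, False),  # 5 pullups in a row
    (50, False),  # 50 new thoughts
    (40, False),  # stay up past 1:30am at most 20% of nights
    (50, True),   # 10 pomodoros per week
    (70, True),   # paper accepted (workshops; inferred as correct)
    (40, False),  # fellowship (inferred as wrong)
    (20, True),   # insomnia at most 20% of nights
    (50, True),   # FLI co-organizes 3 workshops
]

BUCKETS = {"low": (20, 40), "medium": (50, 70)}

for label, (hits, total) in calibration(PREDICTIONS_2016, BUCKETS).items():
    print(f"{label}: {hits}/{total} = {hits / total:.0%}")
```

A well-calibrated set of predictions would have a hit rate inside each bucket's confidence range; a rate below the range suggests overconfidence and a rate above it suggests underconfidence.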

2017 predictions

  1. Our AI safety team will have at least two papers accepted for publication at a major conference, not counting workshops (70%).
  2. I will write at least 9 blog posts (50%).
  3. I will meditate at least 250 days (45%).
  4. I will exercise at least 250 days (55%).
  5. I will visit at least 2 new countries (80%).
  6. I will attend Burning Man (85%).

Using humility to counteract shame

“Pride is not the opposite of shame, but its source. True humility is the only antidote to shame.”

Uncle Iroh, “Avatar: The Last Airbender”


Shame is one of the trickiest emotions to deal with. It is difficult to think about, not to mention discuss with others, and gives rise to insidious ugh fields and negative spirals. Shame often underlies other negative emotions without making itself apparent – anxiety or anger at yourself can be caused by unacknowledged shame about the possibility of failure. It can stack on top of other emotions – e.g. you start out feeling upset with someone, and end up being ashamed of yourself for feeling upset, and maybe even ashamed of feeling ashamed if meta-shame is your cup of tea. The most useful approach I have found against shame is invoking humility.

What is humility, anyway? It is often defined as a low view of your own importance, and tends to be conflated with modesty. Another common definition that I find more useful is acceptance of your own flaws and shortcomings. This is more compatible with confidence, and helpful irrespective of your level of importance or comparison to other people. What humility feels like to me on a system 1 level is a sense of compassion and warmth towards yourself while being fully aware of your imperfections (focusing on imperfections without compassion, by contrast, can lead to beating yourself up). According to LessWrong, “to be humble is to take specific actions in anticipation of your own errors”, which seems more like a possible consequence of being humble than a definition.

Humility is a powerful tool for psychological well-being and instrumental rationality that is more broadly applicable than just the ability to anticipate errors by seeing your limitations more clearly. I can summon humility when I feel anxious about too many upcoming deadlines, or angry at myself for being stuck on a rock climbing route, or embarrassed about forgetting some basic fact in my field that I am surely expected to know by the 5th year of grad school.

While humility comes naturally to some people, others might find it useful to explicitly build an identity as a humble person. How can you invoke this mindset? One way is through negative visualization or pre-hindsight, considering how your plans could fail, which can be time-consuming and usually requires system 2. A faster and less effortful way is to imagine a person, real or fictional, who you consider to be humble. I often bring to mind my grandfather, or Uncle Iroh from the Avatar series, sometimes literally repeating the above quote in my head, sort of like an affirmation. I don’t actually agree that humility is the only antidote to shame, but it does seem to be one of the most effective.

(Cross-posted to LessWrong. Thanks to Janos Kramar for his feedback on this post.)

2015-16 New Year review

2015 progress


  • Finished paper on the Selective Bayesian Forest Classifier algorithm
  • Made an R package for SBFC (beta)
  • Worked at Google on unsupervised learning for the Knowledge Graph with Moshe Looks during the summer (paper)
  • Joined the HIPS research group at Harvard CS and started working with the awesome Finale Doshi-Velez
  • Ratio of coding time to writing time was too high overall


  • Co-organized two meetings to brainstorm biotechnology risks
  • Co-organized two Machine Learning Safety meetings
  • Gave a talk at the Shaping Humanity’s Trajectory workshop at EA Global
  • Helped organize NIPS symposium on societal impacts of AI

Rationality / effectiveness:

  • Extensive use of FollowUpThen for sending reminders to future selves
  • Mapped out my personal bottlenecks
  • Sleep:
    • Tracked insomnia (26% of nights) and sleep time (average 1:30am, stayed up past 1am on 31% of nights)
    • Started working on sleep hygiene
    • Stopped using melatonin (found it ineffective)

Random cool things I did:

  • Improv class
  • Aerial silks class
  • Climbed out of a glacial abyss (moulin)
  • Placed second at Toastmasters area speech contest

2015 prediction outcomes

Out of the 17 predictions I made a year ago, 5 were true, and the rest were false.

  1. Submit the SBFC paper for publication (95%)
  2. Submit another paper besides SBFC (40%)
  3. Present SBFC results at a conference (JSM, ICML or NIPS) (40%) – presented at a workshop (NESS)
  4. Get a new external fellowship to replace my expiring NSERC fellowship (50%)
  5. Skim at least 20 research papers in machine learning (70%) – probably a lot more
  6. Write at least 12 blog posts (70%) – wrote 9 posts
  7. Climb a 5.12 without rope cheating (50%) – no longer endorsed at this level
  8. Lead climb a 5.11a (50%) – no longer endorsed at this level
  9. Do 10 pullups in a row (60%) – no longer endorsed at this level
  10. Meditate at least 150 times (80%) – 206 times
  11. Record at least 150 new thoughts (70%) – recorded 62, no longer endorsed at this level
  12. Make at least 100 Anki cards by the end of the year (70%)
  13. Read at least 10 books (60%) – read 4 books, no longer endorsed at this level
  14. Attend Burning Man (90%)
  15. Boston will have a second rationalist house by the end of the year (30%)
  16. FLI will hire a full-time project manager or administrator (80%) – no, but we now have a full time website editor…
  17. FLI will start a project on biotech safety (70%) – had some meetings, but no concrete action plan yet


  • low predictions, 30-60%: 0/8 = 0% (super overconfident)
  • high predictions, 70-95%: 5/9 = 56% (overconfident)

(Yikes! Worse than last year…)


  • I forgot about most of these goals after a few months – will need a recurring reminder for next year.
  • All 3 physical goals ended up disendorsed – I think I set those way too high. My climbing habits got disrupted by moving to California in summer and a hand injury, so I’m still trying to return to my spring 2014 skill level.

2016 goals and predictions

Given the overconfidence of last year’s predictions, toning it down for next year.


  1. Finish PhD thesis (70%)
  2. Write at least 12 blog posts (40%)
  3. Meditate at least 200 days (50%)
  4. Exercise at least 200 days (50%)
  5. Do at least 5 pullups in a row (40%)
  6. Record at least 50 new thoughts (50%)
  7. Stay up past 1:30am at most 20% of the nights (40%)
  8. Do at least 10 pomodoros per week on average (50%)


  1. At least one paper accepted for publication (70%)
  2. I will get at least one fellowship (40%)
  3. Insomnia at most 20% of nights (20%)
  4. FLI will co-organize at least 3 AI safety workshops (50%)

Systems I have tried: an overview

I have used various organization and productivity systems in the past few years – this is an overview of what worked and what didn’t.

Main systems I currently use:

  1. Follow Up Then: Sends an email to a future self, with the date and time specified in the email address. I use it for delaying tasks, recurring reminders, and following up on email threads. This reduces clutter in my todo list, calendar and inbox, and frees my working memory. Lately, I noticed myself remembering a thing shortly before receiving a follow up about it – probably due to the same mechanism that sometimes wakes me up a few minutes before the morning alarm.
  2. Complice: Daily to-do list organized according to goals, with archives and regular reviews. Helpful for specifying the next action to take at a given time, and for tracking progress on individual goals. Downside: I sometimes hesitate to enter tasks into the list, because entered tasks cannot be erased, and leaving a task unfinished is aversive, so I often end up entering tasks after they are done instead.
  3. Workflowy: Nested list structure – searchable, with collapsible and sharable sublists. I keep my ongoing todo list (in GTD form) and most of my notes here. Downside: doesn’t work for goal factoring, since it only supports tree structures.
  4. Google Calendar: Self-explanatory. I have recently started adding tentative meeting slots, indicated by a question mark, e.g. “dinner with Janos?”. This has been helpful for keeping track of which time slots I’ve offered to someone. I also added a calendar that shows Facebook events that I’ve been invited to, which is handy.
  5. 42 Goals: Goal tracking with summary graphs and cute symbols. I use this for tracking habits (like exercise and meditation) and other random things (like insomnia occurrences). The graphs are useful – this is how I know that I have the most insomnia on Mondays! Downsides: doesn’t allow non-binary categories, and the phone app is so unreliable that I never use it – if you know good alternative tracking systems, let me know!

Systems I no longer use:

  • Beeminder: Goal tracking with nice graphs, and goal setting with reminders and financial penalties in case of failure. I liked the graphs and reminders, but the penalties made me feel even more overwhelmed than usual, and sometimes induced suboptimal short-term priorities. I decided to obtain the different benefits separately, setting recurring reminders for habits on Follow Up Then, and using 42 Goals for tracking.
  • Toggl: Time tracking for activities and tasks, organized by project or goal, with an option for retroactive time entries. I started out using it to track all my time, and though I stopped after about a month due to the excessive overhead of tracking and categorizing short activities, I learned a lot about where my time was going. I used it for about a year after that to track work hours, and eventually stopped because of overhead and redundancy with Complice.
  • Paper checklist: Checklist for daily habits. Worked well in terms of catching my eye in the morning, but was often forgotten when traveling. It was redundant with 42 Goals, and required double data entry, so I eventually gave up on the paper version.
  • Habit tracking with reminders, with a pretty good phone app. I found it particularly useful for several-times-a-week habits. It also has built-in habit programs like building up to a certain number of chinups. I mostly stopped using it because I had too many other systems that were redundant with it.
  • Pomodoros: Setting a timer to focus on a specific task for 25-40 minutes, followed by a break of 5 minutes. I found it unpleasant to be forced to take breaks, developed a habit of ignoring the break signal, and gave up on using pomodoros altogether.

Over the past couple of years, I have become less willing to force myself to do things or overwhelm myself with instructions or data entry overhead, which has led me to reduce the number of systems I use, and to prefer gently guiding systems to strict ones.

Hamming questions and bottlenecks

The CFAR alumni workshop on the first weekend of May was focused on the Hamming question. Mathematician Richard Hamming was known to approach experts from other fields and ask “what are the important problems in your field, and why aren’t you working on them?”. The same question can be applied to personal life: “what are the important problems in your life, and what is stopping you from working on them?”.

Over the course of the weekend, the twelve of us asked this question of ourselves and each other, in many forms and guises: “if Vika isn’t making a major impact on the world in 5 years, what would have stopped her?”, “what are your greatest bottlenecks?”, “how can we actually try?”, etc. The intense focus on mental pain points was interspersed with naps and silly games to let off steam. On the last day, we did a group brainstorm, where everyone who wanted to receive feedback took a turn in the center of the circle, and everyone else speculated on what they thought were the biggest bottlenecks of the person in the center. By this time, we had mostly gotten to know each other, and even the impressions from those who knew me less well were surprisingly accurate. I am very grateful to everyone at the workshop for being so insightful and supportive of each other (and actually caring).

Most of the issues that came up were things I was aware of on some level, but over the course of the workshop it became particularly salient to me how interconnected my problems are and how the gears in the system affect each other. Working memory overload leads to confusion, which reduces confidence. Sleep deprivation reduces working memory and increases anxiety. Anxiety reduces the affordance for exploration and creativity, and increases the frequency of insomnia. Ignoring or neglecting signals from system 1 takes up working memory slots with looping messages from system 1, and increases anxiety. After a few of these circular explanations, I gave up on writing them all down, and made a diagram instead.

[Diagram: bottlenecks]

A few things jump out at me about this diagram. The highest degree nodes are anxiety and working memory, both of which are difficult to affect directly. The two nodes I have the most influence over are the amount of sleep I get and the degree to which I listen to system 1 signals. I have started experimenting with sleep interventions that I haven’t yet tried, like taking melatonin 4 hours before bedtime, using a weighted blanket, etc. Attunement to system 1 can be improved through meditation, Focusing, belief reporting and such. While I have sporadically meditated for years, I could use more practice at the other techniques, which involve more explicit internal querying than meditation.

Curiously, the graph also appears to have a source and a sink. The source node is my overdeveloped sense of duty: a tendency to assume I should do things or be able to do things, which causes a lot of downstream issues. It would be impactful to directly hack this and become more selfish, but it appears to be a bit trickier than doing a find-and-replace on my source code, replacing “I have to do X” with “my goals require X”. The sink node has to do with my capacity to allow myself time and mental space for exploration and creativity, which would among other things enable me to do my high-level goals better (e.g. research and organization strategy).
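
To make "highest degree", "source" and "sink" concrete, here is a small Python sketch over a directed graph. It encodes only the causal links spelled out in the prose earlier in this post, not the full diagram, so its sources and sinks differ from the ones in the diagram (the sense-of-duty node, for instance, is not included). The node labels and the function name are assumptions for illustration.

```python
from collections import Counter

# Edges (cause, effect) taken from the prose description only;
# the full diagram contains more nodes and edges than this.
EDGES = [
    ("working memory overload", "confusion"),
    ("confusion", "low confidence"),
    ("sleep deprivation", "working memory overload"),
    ("sleep deprivation", "anxiety"),
    ("anxiety", "less exploration"),
    ("anxiety", "insomnia"),
    ("ignoring system 1", "working memory overload"),
    ("ignoring system 1", "anxiety"),
]


def degree_summary(edges):
    """Return (total degree per node, source nodes, sink nodes).

    A source has no incoming edges; a sink has no outgoing edges.
    """
    out_deg = Counter(src for src, _ in edges)
    in_deg = Counter(dst for _, dst in edges)
    nodes = set(out_deg) | set(in_deg)
    # Counter returns 0 for nodes it has not seen.
    total = {node: in_deg[node] + out_deg[node] for node in nodes}
    sources = {node for node in nodes if in_deg[node] == 0}
    sinks = {node for node in nodes if out_deg[node] == 0}
    return total, sources, sinks


total, sources, sinks = degree_summary(EDGES)
for node in sorted(total, key=total.get, reverse=True)[:2]:
    print(node, total[node])
```

Even on this partial graph, anxiety and working memory come out as the highest-degree nodes, and the nodes with no incoming edges are sleep and attention to system 1 signals, which are the ones I have the most direct influence over.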

A week after the workshop, I moved to California for my summer internship at Google. The context shift and my new location a few blocks down from the CFAR office will allow me to work on my bottlenecks more systematically. I have wrestled with these for a long time, but now I feel that I have better tools and resources than ever before.