Showing 356 to 359

When is a model good?

Parsimony

statmodeling.stat.columbia.edu/2009/05/07/bayes_jeffreys/

There is a body of work behind why model parsimony is good, but the reasoning doesn't apply everywhere. When does it not?

Components of a trustworthy data analysis

From simplystatistics.org/2018/06/04/trustworthy-data-analysis/

  • How was the data gathered?
  • How were the data processed?
  • Sampling frame?
  • Is there a reason why the variables might be causally related?

Rubin's basic questions

Donald Rubin has two questions he likes to ask any researcher:

  • What would you do if you had all the data?
  • What were you doing before you had any data?
Created (3 years ago)

Frequentist "probability" means frequency

#statistics

In the "classical" (frequentist) approach, the concept of probability is the limit (the stable value converged-on) of a long-run frequency of a thing relative to another thing. For an event A, one's uncertainty about its occurrence is calculated like in elementary school probability math, as the ratio of the number of times the event occurred to the number of trials.

If we roll a die many times, it will come up showing the number two approximately a sixth of the time, thus the probability of showing that number will be node:internal/modules/cjs/loader:1228 throw err; ^ Error: Cannot find module 'katex' Require stack: - /home/kept/private-dotfiles/.config/emacs/texToMathML.js at Module._resolveFilename (node:internal/modules/cjs/loader:1225:15) at Module._load (node:internal/modules/cjs/loader:1051:27) at Module.require (node:internal/modules/cjs/loader:1311:19) at require (node:internal/modules/helpers:179:18) at Object. (/home/kept/private-dotfiles/.config/emacs/texToMathML.js:1:15) at Module._compile (node:internal/modules/cjs/loader:1469:14) at Module._extensions..js (node:internal/modules/cjs/loader:1548:10) at Module.load (node:internal/modules/cjs/loader:1288:32) at Module._load (node:internal/modules/cjs/loader:1104:12) at Function.executeUserEntryPoint [as runMain] (node:internal/modules/run_main:173:12) { code: 'MODULE_NOT_FOUND', requireStack: [ '/home/kept/private-dotfiles/.config/emacs/texToMathML.js' ] } Node.js v20.18.1 . The probability is objective, it is a characteristic of objects (e.g. of the dice) and cannot differ for different subjects.

Straightforward in the case of dice, but…

There are some concerns to this definition of probability. First, considering the probability of event A as a frequency means that we are only able to calculate it if we know the entire sample space Ω. Second, this definition is based on the concept of repeatability, which is not necessarily a characteristic of the event of interest: for instance the events "Caesar crossed the Rubicon" or "The next US president will be a woman" do not satisfy this assumption as they can only happen once. You see why, if Bayesian probability theory gives you the tools to quantify your guess about such events, it can be used to fuel decisions in your life where frequentist cannot.

I think it may be a good idea whenever you write papers and articles to use the term "probability" and notation "Pr[]" only when using the Bayesian definition thereof. Since objective probability does not exist to a Bayesian, it is confusing for a Bayesian to be posited to calculate Pr[Pr[a] > 0.10], a probability of a probability. Better to write Pr[Freq[a] > 0.10], keeping track of what we are talking about.

Technically, you can call limits of long-run frequencies a probability, since a Bayesian can produce the same number in special cases such as throwing dice, when he has no other information to go on and uses something called an uniform prior. Thus the notation Freq[a] is a renamed Pr[a] that meets specific conditions. Perhaps you could write that Freq[a] == Pr[a | uniform prior ∩ repeatable event ∩ trust in sources (like the provider of your dice) ∩ no knowledge ∩ whatever else], making it clear that Freq[a] is a shorthand if anything and that it is an abuse on the reader to merely say Pr[a] when you mean the aforementioned things. Though there may be no way to rigorously define Freq[a] as any edition of Pr[a], look up Lindley's paradox to be sure.

Created (3 years ago)

Ledger CLI

Cheatsheets/TILs

How to track physical cash in Ledger?

Should you track cash as "Assets:Cash"? Actually no.

I used to do this, try to track the difference between what I knew I'd paid with cash and the amount that had apparently left my wallet, chalking it up to "Expenses:Lost cash", but you get this for free if you track all cash as "Expenses:Cash" instead.

That is, you take out cash at the ATM, and view that as a purchase of cash bills. So it's already an expense. Then, if you want, you write down what you use the cash for, which just shuffles money from one expense account to another, like this:

2019-12-01 * "Willys"           ;
    Expenses:Groceries                           145 SEK
    Expenses:Cash

That way, the "cash" expense shrinks and the "groceries" expense grows. The amount still remaining in Expenses:Cash after you've written down all purchases you remember is the unaccounted-for portion, or what I called "Lost cash".

Maybe it's still in your wallet and you haven't used it yet, maybe you've dropped it, maybe you bought something you forgot.

Maybe it doesn't matter. If you decide ahead of time that most of your cash should go to groceries, you can just… leave it in "Expenses:Cash" since you know what it means anyway.

Created (3 years ago)
Showing 356 to 359