Simon McNamara’s journal round-up for 6th August 2018

Every Monday our authors provide a round-up of some of the most recently published peer reviewed articles from the field. We don’t cover everything, or even what’s most important – just a few papers that have interested the author. Visit our Resources page for links to more journals or follow the HealthEconBot. If you’d like to write one of our weekly journal round-ups, get in touch.

Euthanasia, religiosity and the valuation of health states: results from an Irish EQ5D5L valuation study and their implications for anchor values. Health and Quality of Life Outcomes [PubMed] Published 31st July 2018

Do you support euthanasia? Do you think there are health states worse than death? Are you religious? Don’t worry – I am not commandeering this week’s AHE journal round-up just to bombard you with a series of difficult questions. These three questions form the foundation of the first article selected for this week’s round-up.

The paper is based upon the hypothesis that your religiosity (“adherence to religious beliefs”) is likely to impact your support for euthanasia and, subsequently, the likelihood of you valuing severe health states as worse than death. This seems like a logical hypothesis. Religions tend to be anti-euthanasia, and so it appears likely that religious people will have lower levels of support for euthanasia than non-religious people. Equally, if you don’t support the principle of euthanasia, it stands to reason that you are likely to be less willing to choose immediate death over living in a severe health state – something you would need to do for a health state to be considered as being worse than death in a time trade-off (TTO) study.

The authors test this hypothesis using a sub-sample of data (n=160) collected as part of the Irish EQ-5D-5L TTO valuation study. Perhaps unsurprisingly, the authors find evidence in support of the above hypotheses. Those that attend a religious service weekly were more likely to oppose euthanasia than those who attend a few times a year or less, and those who oppose euthanasia were less likely to give “worse than death” responses in the TTO than those that support it.

I found this paper really interesting, as it raises a number of challenging questions. If a society is made up of people with heterogeneous beliefs regarding religion, how should we balance these in the valuation of health? If a society is primarily non-religious is it fair to apply this valuation tariff to the lives of the religious, and vice versa? These certainly aren’t easy questions to answer, but may be worth reflecting on.

E-learning and health inequality aversion: A questionnaire experiment. Health Economics [PubMed] [RePEc] Published 22nd July 2018

Moving on from the cheery topic of euthanasia, what do you think about socioeconomic inequalities in health? In my home country, England, if you are from the poorest quintile of society, you can expect to experience 62 years in full health in your lifetime, whilst if you are from the richest quintile, you can expect to experience 74 years – a gap of 12 years.

In the second paper to be featured in this round-up, Cookson et al. explore the public’s willingness to sacrifice incremental population health gains in order to reduce these inequalities in health – their level of “health inequality aversion”. This is a potentially important area of research, as the vast majority of economic evaluation in health is distributionally-naïve and effectively assumes that members of the public aren’t at all concerned with inequalities in health.

The paper builds on prior work conducted by the authors in this area, in which they noted a high proportion of respondents in health inequality aversion elicitation studies appear to be so averse to inequalities that they violate monotonicity – they choose scenarios that reduce inequalities in health even if these scenarios reduce the health of the rich at no gain to the poor, or they reduce the health of the poor, or they may reduce the health of both groups. The authors hypothesise that these monotonicity violations may be due to incomplete thinking from participants, and suggest that the quality of their thinking could be improved by two e-learning educational interventions. The primary aim of the paper is to test the impact of these interventions in a sample of the UK public (n=60).

The first e-learning intervention was an animated video that described a range of potential positions that a respondent could take (e.g. health maximisation, or maximising the health of the worst off). The second was an interactive spreadsheet-based questionnaire that presented the consequences of the participant’s choices, prior to them confirming their selection. Both interventions are available online.

The authors found that the interactive tool significantly reduced the amount of extreme egalitarian (monotonicity-violating) responses, compared to a non-interactive, paper-based version of the study. Similarly, when the video was watched before completing the paper-based exercise, the number of extreme egalitarian responses reduced. However, when the video was watched before the interactive tool there was no further decrease in extreme egalitarianism. Despite this reduction in extreme egalitarianism, the median levels of inequality aversion remained high, with implied weights of 2.6 and 7.0 for QALY gains granted to someone from the poorest fifth of society, compared to the richest fifth of society for the interactive questionnaire and video groups respectively.

This is an interesting study that provides further evidence of inequality aversion, and raises further concern about the practical dominance of distributionally-naïve approaches to economic evaluation. The public does seem to care about distribution. Furthermore, the paper demonstrates that participant responses to inequality aversion exercises are shaped by the information given to them, and the way that information is presented. I look forward to seeing more studies like this in the future.

A new method for valuing health: directly eliciting personal utility functions. The European Journal of Health Economics [PubMed] [RePEc] Published 20th July 2018

Last, but not least, for this round-up, is a paper by Devlin et al. on a new method for valuing health.

The relative valuation of health states is a pretty important topic for health economists. If we are to quantify the effectiveness, and subsequently cost-effectiveness, of an intervention, we need to understand which health states are better than others, and how much better they are. Traditionally, this is done by asking members of the public to choose between different health profiles featuring differing levels of fulfilment of a range of domains of health, in order to ‘uncover’ the relative importance the respondent places on these domains, and levels. These can then be used in order to generate social tariffs that assign a utility value to a given health state for use in economic evaluation.

The authors point out that, in the modern day, valuation studies can be conducted rapidly, and at scale, online, but at the potential cost of deliberation from participants, and the resultant risk of heuristic dominated decision making. In response to this, the authors propose a new method – direct elicitation of personal utility functions, and pilot its use for the valuation of EQ-5D in a sample of the English public (n=76).

The proposed approach differs from traditional approaches in three key ways. Firstly, instead of simply attempting to infer the relative importance that participants place on differing domains based upon choices between health profiles, the respondents are asked directly about the relative importance they place on differing domains of health, prior to validating these with profile choices. Secondly, the authors place a heavy emphasis on deliberation, and the construction, rather than uncovering, of preferences during the elicitation exercises. Thirdly, a “personal utility function” for each individual is constructed (in effect a personal EQ-5D tariff), and these individual utility functions are subsequently aggregated into a social utility function.

In the pilot, the authors find that the method appears feasible for wider use, albeit with some teething troubles associated with the computer-based tool developed to implement it, and the skills of the interviewers.

This direct method raises an interesting question for health economics – should we be inferring preferences based upon choices that differ in terms of certain attributes, or should we just ask directly about the attributes? This is a tricky question. It is possible that the preferences elicited via these different approaches could result in different preferences – if they do, on what grounds should we choose one or other? This requires a normative judgment, and at present, it appears both are (potentially) as legitimate as each other.

Whilst the authors apply this direct method to the valuation of health, I don’t see why similar approaches couldn’t be applied to any multi-attribute choice experiment. Keep your eyes out for future uses of it in valuation, and perhaps beyond? It will be interesting to see how it develops.

Credits

Chris Sampson’s journal round-up for 23rd July 2018

Every Monday our authors provide a round-up of some of the most recently published peer reviewed articles from the field. We don’t cover everything, or even what’s most important – just a few papers that have interested the author. Visit our Resources page for links to more journals or follow the HealthEconBot. If you’d like to write one of our weekly journal round-ups, get in touch.

Quantifying life: understanding the history of quality-adjusted life-years (QALYs). Social Science & Medicine [PubMed] Published 3rd July 2018

We’ve had some fun talking about the history of the QALY here on this blog. The story of how the QALY came to be important in health policy has been obscured. This paper seeks to address that. The research adopts a method called ‘multiple streams analysis’ (MSA) in order to explain how QALYs caught on. The MSA framework identifies three streams – policy, politics, and problems – and considers the ‘policy entrepreneurs’ involved. For this study, archival material was collected from the National Archives, Department of Health files, and the University of York. The researchers also conducted 44 semi-structured interviews with academics and civil servants.

The problem stream highlights shocks to the UK economy in the late 1960s, coupled with growth in health care costs due to innovations and changing expectations. Cost-effectiveness began to be studied and, increasingly, policymaking was meant to be research-based and accountable. By the 80s, the likes of Williams and Maynard were drawing attention to apparent inequities and inefficiencies in the health service. The policy stream gets going in the 40s and 50s when health researchers started measuring quality of life. By the early 60s, the idea of standardising these measures to try and rank health states was on the table. Through the late 60s and early 70s, government economists proliferated and proved themselves useful in health policy. The meeting of Rachel Rosser and Alan Williams in the mid-70s led to the creation of QALYs as we know them, combining quantity and quality of life on a 0-1 scale. Having acknowledged inefficiencies and inequities in the health service, UK politicians and medics were open to new ideas, but remained unconvinced by the QALY. Yet it was a willingness to consider the need for rationing that put the wheels in motion for NICE, and the politics stream – like the problem and policy stream – characterises favourable conditions for the use of the QALY.

The MSA framework also considers ‘policy entrepreneurs’ who broker the transition from idea to implementation. The authors focus on the role of Alan Williams and of the Economic Advisers’ Office. Williams was key in translating economic ideas into forms that policymakers could understand. Meanwhile, the Economic Advisers’ Office encouraged government economists to engage with academics at HESG and later the QoL Measurement Group (which led to the creation of EuroQol).

The main takeaway from the paper is that good ideas only prevail in the right conditions and with the right people. It’s important to maintain multi-disciplinary and multi-stakeholder networks. In the case of the QALY, the two-way movement of economists between government and academia was crucial.

I don’t completely understand or appreciate the MSA framework, but this paper is an enjoyable read. My only reservation is with the way the authors describe the QALY as being a dominant aspect of health policy in the UK. I don’t think that’s right. It’s dominant within a niche of a niche of a niche – that is, health technology assessment for new pharmaceuticals. An alternative view is that the QALY has in fact languished in a quiet corner of British policymaking, and been completely excluded in some other countries.

Accuracy of patient recall for self‐reported doctor visits: is shorter recall better? Health Economics [PubMed] Published 2nd July 2018

In designing observational studies, such as clinical trials, I have always recommended that self-reported resource use be collected no less frequently than every 3 months. This is partly based on something I once read somewhere that I can’t remember, but partly also on some logic that the accuracy of people’s recall decays over time. This paper has come to tell me how wrong I’ve been.

The authors start by highlighting that recall can be subject to omission, whereby respondents forget relevant information, or commission, whereby respondents include events that did not occur. A key manifestation of the latter is ‘telescoping’, whereby events are included from outside the recall period. We might expect commission to be more likely in short recalls and omission to be more common for long recalls. But there’s very little research on this regarding health service use.

This study uses data from a large trial in diabetes care in Australia, in which 5,305 participants were randomised to receive either 2-week, 3-month, or 12-month recall for how many times they had seen a doctor. Then, the trial data were matched with Medicare data to identify the true levels of resource use.

Over 92% of 12-month recall participants made an error, 76% of the 3-month recall, and 46% of the 2-week recall. The patterns of errors were different. There was very little under-reporting in the 2-week recall sample, with 3-month giving the most over-reporting and 12-month giving the most under-reporting. 12-month recall was associated with the largest number of days reported in error. However, when the authors account for the longer period being considered, and estimate a relative error, the impact of misreporting is smallest for the 12-month recall and greatest for the 2-week recall. This translates into a smaller overall bias for the longest recall period. The authors also find that older, less educated, unemployed, and low‐income patients exhibit higher measurement errors.

Health surveys and comparative studies that estimate resource use over a long period of time should use 12-month recall unless they can find a reason to do otherwise. The authors provide some examples from economic evaluations to demonstrate how selecting shorter recall periods could result in recommending the wrong decisions. It’s worth trying to understand the reasons why people can more accurately recall service use over 12 months. That way, data collection methods could be designed to optimise recall accuracy.

Who should receive treatment? An empirical enquiry into the relationship between societal views and preferences concerning healthcare priority setting. PLoS One [PubMed] Published 27th June 2018

Part of the reason the QALY faces opposition is that it has been used in a way that might not reflect societal preferences for resource allocation. In particular, the idea that ‘a QALY is a QALY is a QALY’ may conflict with notions of desert, severity, or process. We’re starting to see more evidence for groups of people holding different views, which makes it difficult to come up with decision rules to maximise welfare. This study considers some of the perspectives that people adopt, which have been identified in previous research – ‘equal right to healthcare’, ‘limits to healthcare’, and ‘effective and efficient healthcare’ – and looks at how they are distributed in the Netherlands. Using four willingness to trade-off (WTT) exercises, the authors explore the relationship between these views and people’s preferences about resource allocation. Trade-offs are between quality vs quantity of life, health maximisation vs equality, children vs the elderly, and lifestyle-related risk vs adversity. The authors sought to test several hypotheses: i) that ‘equal right’ respondents have a lower WTT; ii) ‘limits to healthcare’ people express a preference for health gains, health maximisation, and treating people with adversity; and iii) ‘effective and efficient’ people support health maximisation, treating children, and treating people with adversity.

A representative online sample of adults in the Netherlands (n=261) was recruited. The first part of the questionnaire collected socio-demographic information. The second part asked questions necessary to allocate people to one of the three perspectives using Likert scales based on a previous study. The third part of the questionnaire consisted of the four reimbursement scenarios. Participants were asked to identify the point (in terms of the relevant quantities) at which they would be indifferent between two options.

The distribution of the viewpoints was 65% ‘equal right’, 23% ‘limits to healthcare’, and 7% ‘effective and efficient’. 6% couldn’t be matched to one of the three viewpoints. In each scenario, people had the option to opt out of trading. 24% of respondents were non-traders for all scenarios and, of these, 78% were of the ‘equal right’ viewpoint. Unfortunately, a lot of people opted out of at least one of the trades, and for a wide variety of reasons. Decisionmakers can’t opt out, so I’m not sure how useful this is.

The authors describe many associations between individual characteristics, viewpoints, and WTT results. But the tested hypotheses were broadly supported. While the findings showed that different groups were more or less willing to trade, the points of indifference for traders within the groups did not vary. So while you can’t please everyone in health care priority setting, this study shows how policies might be designed to satisfy the preferences of people with different perspectives.

Credits

Chris Sampson’s journal round-up for 2nd July 2018

Every Monday our authors provide a round-up of some of the most recently published peer reviewed articles from the field. We don’t cover everything, or even what’s most important – just a few papers that have interested the author. Visit our Resources page for links to more journals or follow the HealthEconBot. If you’d like to write one of our weekly journal round-ups, get in touch.

Choice in the presence of experts: the role of general practitioners in patients’ hospital choice. Journal of Health Economics [PubMed] [RePEc] Published 26th June 2018

In the UK, patients are in principle free to choose which hospital they use for elective procedures. However, as these choices operate through a GP referral, the extent to which the choice is ‘free’ is limited. The choice set is provided by the GP and thus there are two decision-makers. It’s a classic example of the principal-agent relationship. What’s best for the patient and what’s best for the local health care budget might not align. The focus of this study is on the applied importance of this dynamic and the idea that econometric studies that ignore it – by looking only at patient decision-making or only at GP decision-making – may give bias estimates. The author outlines a two-stage model for the choice process that takes place. Hospital characteristics can affect choices in three ways: i) by only influencing the choice set that the GP presents to the patient, e.g. hospital quality, ii) by only influencing the patient’s choice from the set, e.g. hospital amenities, and iii) by influencing both, e.g. waiting times. The study uses Hospital Episode Statistics for 30,000 hip replacements that took place in 2011/12, referred by 4,721 GPs to 168 hospitals, to examine revealed preferences. The choice set for each patient is not observed, so a key assumption is that all hospitals to which a GP made referrals in the period are included in the choice set presented to patients. The main findings are that both GPs and patients are influenced primarily by distance. GPs are influenced by hospital quality and the budget impact of referrals, while distance and waiting times explain patient choices. For patients, parking spaces seem to be more important than mortality ratios. The results support the notion that patients defer to GPs in assessing quality. In places, it’s difficult to follow what the author did and why they did it. But in essence, the author is looking for (and in most cases finding) reasons not to ignore GPs’ preselection of choice sets when conducting econometric analyses involving patient choice. Econometricians should take note. And policymakers should be asking whether freedom of choice is sensible when patients prioritise parking and when variable GP incentives could give rise to heterogeneous standards of care.

Using evidence from randomised controlled trials in economic models: what information is relevant and is there a minimum amount of sample data required to make decisions? PharmacoEconomics [PubMed] Published 20th June 2018

You’re probably aware of the classic ‘irrelevance of inference’ argument. Statistical significance is irrelevant in deciding whether or not to fund a health technology, because we ought to do whatever we expect to be best on average. This new paper argues the case for irrelevance in other domains, namely multiplicity (e.g. multiple testing) and sample size. With a primer on hypothesis testing, the author sets out the regulatory perspective. Multiplicity inflates the chance of a type I error, so regulators worry about it. That’s why triallists often obsess over primary outcomes (and avoiding multiplicity). But when we build decision models, we rely on all sorts of outcomes from all sorts of studies, and QALYs are never the primary outcome. So what does this mean for reimbursement decision-making? Reimbursement is based on expected net benefit as derived using decision models, which are Bayesian by definition. Within a Bayesian framework of probabilistic sensitivity analysis, data for relevant parameters should never be disregarded on the basis of the status of their collection in a trial, and it is up to the analyst to properly specify a model that properly accounts for the effects of multiplicity and other sources of uncertainty. The author outlines how this operates in three settings: i) estimating treatment effects for rare events, ii) the number of trials available for a meta-analysis, and iii) the estimation of population mean overall survival. It isn’t so much that multiplicity and sample size are irrelevant, as they could inform the analysis, but rather that no data is too weak for a Bayesian analyst.

Life satisfaction, QALYs, and the monetary value of health. Social Science & Medicine [PubMed] Published 18th June 2018

One of this blog’s first ever posts was on the subject of ‘the well-being valuation approach‘ but, to date, I don’t think we’ve ever covered a study in the round-up that uses this method. In essence, the method is about estimating trade-offs between (for example) income and some measure of subjective well-being, or some health condition, in order to estimate the income equivalence for that state. This study attempts to estimate the (Australian) dollar value of QALYs, as measured using the SF-6D. Thus, the study is a rival cousin to the Claxton-esque opportunity cost approach, and a rival sibling to stated preference ‘social value of a QALY’ approaches. The authors are trying to identify a threshold value on the basis of revealed preferences. The analysis is conducted using 14 waves of the Australian HILDA panel, with more than 200,000 person-year responses. A regression model estimates the impact on life satisfaction of income, SF-6D index scores, and the presence of long-term conditions. The authors adopt an instrumental variable approach to try and address the endogeneity of life satisfaction and income, using an indicator of ‘financial worsening’ to approximate an income shock. The estimated value of a QALY is found to be around A$42,000 (~£23,500) over a 2-year period. Over the long-term, it’s higher, at around A$67,000 (~£37,500), because individuals are found to discount money differently to health. The results also demonstrate that individuals are willing to pay around A$2,000 to avoid a long-term condition on top of the value of a QALY. The authors apply their approach to a few examples from the literature to demonstrate the implications of using well-being valuation in the economic evaluation of health care. As with all uses of experienced utility in the health domain, adaptation is a big concern. But a key advantage is that this approach can be easily applied to large sets of survey data, giving powerful results. However, I haven’t quite got my head around how meaningful the results are. SF-6D index values – as used in this study – are generated on the basis of stated preferences. So to what extent are we measuring revealed preferences? And if it’s some combination of stated and revealed preference, how should we interpret willingness to pay values?

Credits