Quality Statement
It is recommend viewing this Rainbow/LGBTIQ+ indicator alongside the information by concepts for gender, variations of sex characteristics, cisgender and transgender status, and sexual identity for important context about how these concepts interrelate and the recommended use of the data.
The Rainbow/LGBTIQ+ indicator is derived based on responses to gender, sexual identity, and variations of sex characteristics as well as cisgender and transgender status
The LGBTIQ+ category is derived where responses have been received to one or more of the following:
- another gender
- transgender (derived)
- if a person has given a response that is coded to be homosexual, bisexual, or sexual identity not elsewhere classified
- persons who know they were born with a variation of sex characteristics.
Poor quality
Data quality processes section below has more detail on the rating.
Priority level 3
A priority level is assigned to all census concepts: priority 1, 2, or 3 (with 1 being highest and 3 being the lowest priority).
Rainbow/LGBTIQ+ indicator is a priority 3 concept. Priority 3 concepts are given third priority in terms of quality, time, and resources across all phases of the census.
Priority 3 concepts are those that are:
- data that census would not be solely run for, and information about population groups that could not be captured without being in a census
- data that is important to certain groups
- data that can be used to create sampling frames for other surveys.
Rainbow/LGBTIQ+ is a new concept in the 2023 Census.
The 2023 Census: Final content report has more information on priority ratings for census concepts.
Census usually resident population count aged 15 years and over
‘Subject population’ means the people, families, households, or dwellings that the variable applies to.
Rainbow/LGBTIQ+ indicator is classified into the following categories:
Census Rainbow LGBTIQ+ indicator V1.0.0 – level 1 of 2
Code | Category |
---|---|
1 | LGBTIQ+ |
2 | Not LGBTIQ+ |
3 | LGBTIQ+ status unidentifiable |
Rainbow/LGBTIQ+ indicator uses a 2-level hierarchical classification with the level 1 categories presented in the table above.
LGBTIQ+ status unidentifiable is a residual category.
At level 2 of the classification, the category ‘LGBTIQ+’ is broken down to distinguish between,
- another gender
- transgender
- homosexual, bisexual, or sexual identity not elsewhere classified
- persons who know they were born with a variation of sex characteristics.
While level 1 of the classification for rainbow/LGBTIQ+ indicator is single response, at level 2 of the classification it is a multiple response variable, so the number of responses will be greater than the number of respondents.
Follow the link above the table for more detail on the classifications.
Standards and classifications has information on what classifications are, how they are reviewed, where they are stored, and how to provide feedback on them.
The rainbow/LQBTIQ+ indicator is derived and produced from the individual form questions on gender (question 3 paper form), sexual identity (question 29 paper form), variations of sex characteristics (question 30 paper form), and the cisgender and transgender status derivation.
Stats NZ Store House has samples for both the individual and dwelling paper forms.
Data-use by and outside of Stats NZ:
- to give a representative picture of the diversity of New Zealand
- to provide data to assist people in advocating for the needs of their communities
- by central and local government to inform planning, service provision, and policy development
- to understand how wellbeing outcomes differ for LGBTIQ+ and non-LGBTIQ+ people in New Zealand.
The table below shows the distribution of data sources for rainbow/LGBTIQ+ indicator data. All data was from census forms as no alternative data sources were available.
The rainbow/LGBTIQ+ indicator was only produced for records where individual census form responses provide the necessary information; alternatively sourced data or imputed data was not used to complete the variable.
For records where that was not possible, the value would show ‘LGBTIQ+ status unidentifiable’, and the data source indicator was coded to ‘No information’.
Methodologies for filling gaps in gender and sex at birth concepts for the 2023 Census has further information on the data quality concerns that arise from using alternatively sourced or imputed data in this derived variable.
Data sources for rainbow/LGBTIQ+ indicator data, as a percentage of census usually resident population count aged 15 years and over, 2023 Census | |
---|---|
Source of rainbow/LGBTIQ+ indicator data | Percent |
2023 Census response | 87.0 |
Historical census | 0.0 |
Admin data | 0.0 |
Deterministic derivation | 0.0 |
Statistical imputation | 0.0 |
No information | 13.0 |
Total | 100.0 |
Note: Due to rounding, individual figures may not always sum to the stated total(s) or score contributions. |
Editing, data sources, and imputation in the 2023 Census has more information around how data sources are improved by editing.
The percentage of ‘LGBTIQ+ status unidentifiable’ in the 2023 Census is 13.0 percent.
Overall quality rating: Poor
Data has been evaluated to assess whether it meets quality standards and is suitable for use.
Three quality metrics contributed to the overall quality rating:
- data sources and coverage
- consistency and coherence
- accuracy of response.
The lowest rated metric determines the overall quality rating.
Data quality assurance in the 2023 Census provides more information on the quality rating scale.
Data sources and coverage: Poor quality
The quality of all the data sources that contribute to the output for the variable were assessed. To calculate the data sources and coverage quality score for a variable, each data source is rated and multiplied by the proportion it contributes to the total output.
The rating for a valid census response is defined as 1.00. Ratings for other sources are the best estimates available of their quality relative to a census response. Each source that contributes to the output for that variable is then multiplied by the proportion it contributes to the total output. The total score then determines the metric rating according to the following range:
- 0.98–1.00 = very high
- 0.95–<0.98 = high
- 0.90–<0.95 = moderate
- 0.75–<0.90 = poor
- <0.75 = very poor.
Only census responses are used in the derivation for the rainbow/LGBTIQ+ indicator, which results in a high level of ‘No information’, and a score of 0.87. This leads to a quality rating of poor.
Data sources and coverage rating calculation for rainbow/LGBTIQ+ indicator data, census usually resident population count aged 15 years and over, 2023 Census | |||
---|---|---|---|
Source for rainbow/LGBTIQ+ data | Rating | Percent | Score contribution |
2023 Census response | 1.00 | 87.05 | 0.87 |
No information | 0.00 | 12.95 | 0.00 |
Total | 100.00 | 0.87 | |
Note: Due to rounding, individual figures may not always sum to the stated total(s) or score contributions. |
Consistency and coherence: Moderate quality
Rainbow/LGBTIQ+ indicator data is mostly consistent overall with expectations across consistency checks. The data generally aligns with expectations for regional council, territorial authority and local board, and statistical area 2 geographies, with any differences likely resulting from real-world impacts. The census results show higher levels of LGBTIQ+ individuals in areas with tertiary institutions and younger age groups.
2023 Census is the first census that has collected information on the input variables that produce rainbow/LGBTIQ+ data, so this variable cannot be compared with historical data. The quality ratings of the input variables for this indicator also contribute to the quality rating of moderate. Rainbow/LGBTIQ+ indicator data has no data quality issues that have an observable effect on the data. This first collection of data will establish a baseline for future comparison.
Accuracy of response: High quality
Data has only minor data quality issues. The quality of coding and responses within classification categories is high. Any issues with the variable appear in a low number of cases (typically in the low hundreds).
The quality rating of the input variables for this indicator also contributed to the quality rating of high.
When using the data, users should be aware of the following:
- At level 1 of the classification, rainbow/LGBTIQ+ indicator data is appropriate for use at the national, regional council, territorial authority and local board, statistical area 2 and statistical area 1 level, and can be cross tabulated with other census data.
- At all geographic levels there is a high proportion of missing data, with some areas, such as those impacted by severe weather events, having higher proportions of missing data.
- When conducting analysis with specific level 2 categories of the rainbow/LGBTIQ+ classification, users should be aware of the observed quality issues with the input variables and consult the information by concept and recommendations for the input variables. For example, there are some observed quality issues with the ‘Another gender’ category from the input variable gender, and ‘Persons who know they were born with a variation of sex characteristics’ from the input variable variations of sex characteristics.
Comparisons to other data sources
Comparing 2023 Census rainbow/LGBTIQ+ indicator data with other data sources should be done with care. Users should familiarise themselves with the strengths and limitations of sources before use.
Rainbow/LGBTIQ+ indicator is a new concept in the 2023 Census.
Contact our Information centre for further information about using this concept.