11 Expressive Vocabulary – Technical Manual

11.1 Task Description

Children are shown pictures and are asked to name them.

11.2 Construct

The Expressive Vocabulary task measures the construct of semantic knowledge. Students are prompted to name images, thereby measuring the ability to accurately label nouns and verbs.

11.3 Item Development

The Multitudes development process involved developing parallel measures in English and Spanish when possible. To be able to develop a linguistically and culturally appropriate measure scored conceptually, we needed to select a list of words that were relevant for the population being studied, with comparable levels of difficulty, and easily represented with an image.

For the selection of the words being targeted with our measure, we first reviewed reading curricula commonly used in California, specifically, and in the United States more broadly, in addition to other literacy materials sourced from Chile, Mexico, and Panama. To extend the list of selected words, we used Age of Acquisition (AoA) databases for English (Brysbaert and Biemiller 2017) and Spanish (Alonso, Fernández, and Díez 2015; Alonso, Díez, and Fernández 2016) to compile a comprehensive list with somewhat comparable AoA in both languages. The AoA was used as a proxy of difficulty, hypothesizing that words acquired earlier in development were more likely to yield easier items. In addition to AoA information, we used Clearpond (Marian et al. 2012) to retrieve information on the word’s frequency, orthographic and phonological length, and neighborhood frequency.

We used iStock to choose the real pictures of the selected words. The chosen pictures underwent a rigorous selection process to meet specific criteria:

Easily Recognizable. Emphasis was placed on selecting images that could be easily identified.
Minimal Construct Irrelevant Features. Preference was given to pictures with a clean and unobtrusive background, and a white background was opted for whenever possible.
Removal of Irrelevant Information. Unnecessary elements, such as a leaf in a pear or surrounding mountains, were eliminated through cropping, focusing solely on the essential components of the image.
Culturally Representative. Images that were representative of a culturally diverse population. The final pool of images was reviewed by the Justice, Equity, Diversity, and Inclusion committee, and any images that were identified as not representative of the target word or that had potential concerns were deliberately excluded from the final pool.

11.3.1 Dialectal considerations.

English. The researchers listed all acceptable responses, and an additional online search was conducted to find other regional terms for specific items in English.
Spanish. A group of six Spanish-English bilingual research assistants was shown the images and requested to label them. To avoid penalizing the use of regional terms, we asked research assistants from different countries or regions (including Chile, Colombia, three different states in Mexico, and Venezuela) to list all the different terms known to them to label said pictures. An additional search was conducted to find other regional terms for both English and Spanish.

Additionally, a blank box was available for examiners to annotate other possible acceptable responses given by the participating children during piloting. We then reviewed all of those responses and decided which ones should be added to the correct response list for each item.

11.4 Scoring

11.4.1 English

A list of accepted terms for English was provided. The assessment uses a dichotomous fixed response format of 0 points for incorrect responses or non-responses and 1 point for correct ones.

11.4.2 Spanish-English Bilingual

A list of accepted words for Spanish and English is provided. The assessment was calibrated using conceptual scoring meaning a child can respond in either English or Spanish. The assessment uses a dichotomous fixed response format of 0 points for incorrect responses or non-responses and 1 point for correct ones.

11.5 Calibration Samples

Table 11.1: Demographic Characteristics of Calibration Samples for the English and Spanish Expressive Vocabulary Tasks

Characteristic	English			Spanish
Characteristic	K N = 2,127	G1 N = 2,805	G2 N = 2,887	K N = 1,164	G1 N = 1,270	G2 N = 1,013
Timepoint
Spring 2023	0 (0%)	0 (0%)	0 (0%)	621 (53%)	630 (50%)	0 (0%)
Fall 2023	611 (29%)	668 (24%)	703 (24%)	0 (0%)	0 (0%)	345 (34%)
Winter 2024	0 (0%)	0 (0%)	0 (0%)	0 (0%)	0 (0%)	264 (26%)
Fall 2024	1,516 (71%)	2,137 (76%)	2,184 (76%)	543 (47%)	640 (50%)	404 (40%)
Administration Format
CAT	1,516 (71%)	2,137 (76%)	2,184 (76%)
Forms	611 (29%)	668 (24%)	703 (24%)	621 (100%)	630 (100%)	609 (100%)
Race
American/Alaskan Native	71 (3.6%)	97 (3.7%)	64 (2.4%)	40 (3.5%)	45 (3.6%)	13 (1.3%)
Asian	155 (7.8%)	223 (8.6%)	197 (7.4%)	20 (1.8%)	32 (2.6%)	24 (2.4%)
Black/African American	208 (10%)	283 (11%)	305 (11%)	11 (1.0%)	19 (1.5%)	11 (1.1%)
Not reported	196 (9.9%)	263 (10%)	246 (9.2%)	484 (42%)	552 (45%)	465 (47%)
Other	442 (22%)	343 (13%)	350 (13%)	230 (20%)	151 (12%)	67 (6.7%)
White	917 (46%)	1,395 (54%)	1,503 (56%)	355 (31%)	438 (35%)	419 (42%)
Unknown	138	201	222	24	33	14
Ethnicity
Hispanic/Latin(o/a)	1,245 (68%)	1,806 (69%)	1,851 (70%)	979 (96%)	1,150 (97%)	926 (94%)
Intentional nonreport	16 (0.9%)	8 (0.3%)	5 (0.2%)	3 (0.3%)	3 (0.3%)	2 (0.2%)
Not Hispanic/Latin(o/a)	573 (31%)	787 (30%)	783 (30%)	34 (3.3%)	37 (3.1%)	55 (5.6%)
Unknown	293	204	248	148	80	30
Gender
Female	946 (51%)	1,290 (50%)	1,303 (50%)	537 (53%)	662 (57%)	508 (51%)
Male	920 (49%)	1,307 (50%)	1,322 (50%)	477 (47%)	501 (43%)	481 (49%)
Non-binary	0 (0%)	0 (0%)	0 (0%)	0 (0%)	0 (0%)	1 (0.1%)
Unknown	261	208	262	150	107	23
Home Language
English	1,295 (70%)	1,633 (68%)	1,633 (66%)	114 (10%)	126 (10%)	101 (10%)
Spanish	471 (25%)	672 (28%)	740 (30%)	1,009 (89%)	1,100 (89%)	858 (89%)
Other	82 (4.4%)	87 (3.6%)	110 (4.4%)	16 (1.4%)	5 (0.4%)	9 (0.9%)
Unknown	279	413	404	25	39	45
English Proficiency Label
(Re-)Classified Proficient	60 (3.5%)	111 (4.8%)	258 (11%)	74 (7.6%)	96 (8.6%)	121 (13%)
English Learner	462 (27%)	651 (28%)	593 (24%)	813 (84%)	912 (82%)	718 (77%)
English-only	1,179 (69%)	1,574 (67%)	1,606 (65%)	84 (8.7%)	107 (9.6%)	93 (10.0%)
Unknown	426	469	430	193	155	81
Ever IEP/504	118 (8.1%)	201 (9.8%)	212 (10%)	74 (10%)	70 (9.3%)	76 (11%)
Unknown	665	752	818	452	515	330
Unknown				543	640	404

11.6 Psychometric Analysis

11.6.1 Basic Item Statistics

We excluded 0 items from the English task and 2 items from the Spanish task based on low response counts (n < 90). 4 items were excluded because they had no variance in the Spanish task, and 0 items in the English task. Additionally, we excluded 10 items from the English task and 5 items from the Spanish task based on low point-biserial correlations (r < 0.2). Table 11.2 summarizes the basic item characteristics, Figure 11.1 shows the relationship between point-biserial correlations and the proportion of correct responses for each item.

Table 11.2: Basic Item Statistics Before and After Application of Exclusion Criteria, for the English and Spanish Expressive Vocabulary Tasks

	English		Spanish
Characteristic	Before Excl.	After Excl.	Before Excl.	After Excl.
Characteristic	N = 151	N = 141	N = 156	N = 147
No. of Responses	704 (480)	739 (476)	280 (253)	292 (255)
Proportion Correct	0.54 (0.23)	0.55 (0.21)	0.49 (0.25)	0.50 (0.23)
Point-biserial Correlation	0.44 (0.15)	0.46 (0.13)	0.45 (0.13)	0.46 (0.11)
Excluded (n < 90)	0 (0%)	0 (0%)	2 (1.3%)	0 (0%)
Excluded (pbis < .2)	10 (6.6%)	0 (0%)	5 (3.3%)	0 (0%)
Excluded (no variation)	0 (0%)	0 (0%)	4 (2.6%)	0 (0%)

Figure 11.1: Scatterplot Showing Point-biserial (Item-total) Correlations and Proportion of Correct Responses for the English (Panel A) and Spanish (Panel B) Expressive Vocabulary Tasks

Figure 11.2: Network Graphs Showing Connections Between Items on the English (Panel A) and Spanish (Panel B) Expressive Vocabulary Tasks

11.6.2 Rasch Analysis

11.6.2.1 Item Location Estimates

Figure 11.3: Scatterplot Showing Item Location and Proportion of Correct Response for the English (Panel A) and Spanish (Panel B) Expressive Vocabulary Tasks

11.6.2.2 Item Fit Statistics

Table 11.3: Frequencies of Item Misfit Categories Based on Infit/Outfit MSE Values for the English and Spanish Expressive Vocabulary Tasks

	English					Spanish
	Infit MSE
	A	B	C	D	Total	A	B	C	D	Total
Outfit MSE
A	131	0	0	0	131	139	0	0	0	139
B	3	0	0	0	3	4	0	0	0	4
C	5	0	0	0	5	4	0	0	0	4
D	2	0	0	0	2	0	0	0	0	0
Total	141	0	0	0	141	147	0	0	0	147

11.6.2.3 Person Location Estimates

Figure 11.4: Scatterplot Showing Person Location Estimates (Obtained using the MLE method) and the Proportion of Correct Responses for English and Spanish Expressive Vocabulary Tasks

11.6.2.4 Person Fit Statistics

Table 11.4: Frequencies of Person Misfit Categories Based on Infit/Outfit MSE Values for the English and Spanish Expressive Vocabulary Tasks

	English					Spanish
	Infit MSE
	A	B	C	D	Total	A	B	C	D	Total
Outfit MSE
A	5,932	0	41	0	5,973	2,434	0	13	1	2,448
B	921	319	0	0	1,240	375	233	0	0	608
C	205	0	86	3	294	107	0	40	3	150
D	141	0	83	28	252	58	0	48	6	112
Total	7,199	319	210	31	7,759	2,974	233	101	10	3,318

11.6.2.5 Distribution of Theta Estimates

Figure 11.5: Distribution of Theta Estimates for the English and Spanish Expressive Vocabulary Tasks

11.6.2.6 Wright Maps

Figure 11.6: Wright Maps Showing the Relationship Between Item and Person Location Estimates for the English Expressive Vocabulary Task

Figure 11.7: Wright Maps Showing the Relationship Between Item and Person Location Estimates for the Spanish Expressive Vocabulary Task

11.6.2.7 Model Summary

Table 11.5: Summary of Rasch Model Statistics for the English and Spanish Expressive Vocabulary Tasks

	English		Spanish
	Item	Person	Item	Person
Characteristic	N = 141	N = 7,759	N = 147	N = 3,318
Logit Scale Location	-0.47 (1.88)	0.03 (-0.98, 1.10)	0.01 (1.84)	-0.01 (-0.89, 0.99)
Outfit	1.02 (0.30)	0.71 (0.56, 0.93)	1.01 (0.22)	0.77 (0.59, 0.96)
Infit	0.98 (0.11)	0.85 (0.71, 1.02)	0.99 (0.08)	0.87 (0.74, 1.02)
Reliability of Separation	0.8292	0.8206	0.7708	0.7491

11.6.2.7.1 Final Number of Items

Following the exclusion of items with point-biserial correlations < .20 and items with poor fit statistics, the final versions of the task contain 141 and 147 for the English and Spanish task, respectively.

11.7 Criterion Validity Evidence

11.7.1 Sample

Table 11.6: Demographic Characteristics of the Concurrent Criterion Validity Evidence Samples for the English and Spanish Expressive Vocabulary Tasks

Characteristic	English			Spanish
Characteristic	K N = 262	G1 N = 231	G2 N = 204	K N = 240	G1 N = 225	G2 N = 265
Timepoint
Winter 2024	262 (100%)	231 (100%)	204 (100%)	240 (100%)	225 (100%)	265 (100%)
Race
American/Alaskan Native	5 (1.9%)	3 (1.3%)	1 (0.5%)	2 (0.8%)	4 (1.8%)	4 (1.5%)
Asian	36 (14%)	37 (16%)	8 (4.3%)	8 (3.4%)	2 (0.9%)	0 (0%)
Black/African American	28 (11%)	30 (13%)	34 (18%)	1 (0.4%)	0 (0%)	0 (0%)
Not reported	30 (12%)	32 (14%)	13 (7.0%)	132 (55%)	153 (69%)	172 (65%)
Other	74 (29%)	44 (19%)	3 (1.6%)	41 (17%)	8 (3.6%)	18 (6.8%)
White	86 (33%)	85 (37%)	127 (68%)	54 (23%)	55 (25%)	69 (26%)
Unknown	3	0	18	2	3	2
Ethnicity
Hispanic/Latin(o/a)	106 (40%)	96 (42%)	121 (60%)	217 (92%)	209 (93%)	248 (98%)
Intentional nonreport	8 (3.1%)	2 (0.9%)	0 (0%)	1 (0.4%)	0 (0%)	2 (0.8%)
Not Hispanic/Latin(o/a)	148 (56%)	133 (58%)	82 (40%)	19 (8.0%)	16 (7.1%)	2 (0.8%)
Unknown	0	0	1	3	0	13
Gender
Female	132 (50%)	106 (46%)	98 (48%)	126 (53%)	110 (49%)	137 (52%)
Male	130 (50%)	125 (54%)	106 (52%)	114 (48%)	115 (51%)	128 (48%)
Home Language
English	190 (74%)	172 (75%)	128 (82%)	29 (12%)	23 (10%)	23 (8.7%)
Spanish	33 (13%)	25 (11%)	23 (15%)	206 (87%)	198 (89%)	239 (91%)
Other	34 (13%)	33 (14%)	5 (3.2%)	2 (0.8%)	1 (0.5%)	1 (0.4%)
Unknown	5	1	48	3	3	2
English Proficiency Label
(Re-)Classified Proficient	12 (5.5%)	18 (8.0%)	11 (7.1%)	31 (14%)	24 (11%)	42 (17%)
English Learner	50 (23%)	41 (18%)	17 (11%)	184 (81%)	176 (80%)	178 (73%)
English-only	155 (71%)	167 (74%)	128 (82%)	11 (4.9%)	19 (8.7%)	23 (9.5%)
Unknown	45	5	48	14	6	22
Ever IEP/504	19 (9.4%)	22 (12%)	18 (12%)	20 (9.5%)	23 (11%)	16 (12%)
Unknown	59	50	48	30	14	127

English Expressive Vocabulary was correlated with the Expressive One-Word Picture Vocabulary Test, 4th Edition (Martin and Brownell 2011). Spanish Expressive Vocabulary was correlated with the Expressive One-Word Picture Vocabulary Test, 4th Edition, Bilingual Edition (Martin 2013).

Table 11.7: Concurrent Criterion Validity Correlations for the English and Spanish Expressive Vocabulary Tasks

	English				Spanish
	All		EL		All
Grade	n	r [CI]	n	r [CI]	n	r [CI]
K	262	0.75 [0.70, 0.80]	50	0.70 [0.52, 0.82]	240	0.59 [0.50, 0.67]
G1	231	0.76 [0.70, 0.81]	41	0.80 [0.66, 0.89]	225	0.51 [0.41, 0.60]
G2	204	0.72 [0.65, 0.78]	NA	NA	265	0.48 [0.39, 0.57]