Broadcast police communications may pose privacy risks, especially to Black men
by Mary FetzerThis article has been reviewed according to Science X's editorial process and policies. Editors have highlighted the following attributes while ensuring the content's credibility:
fact-checked
proofread
Police radio transmissions contain personally identifiable information that could pose privacy risks for members of the public, especially Black males, according to a new study by researchers at Penn State and the University of Chicago.
"This study provides a window into police activity as events unfold," said Shomir Wilson, associate professor of information sciences and technology at Penn State and study co-author. "We found that because police radio transmissions disproportionately involve Black suspects, there's a proportionally higher privacy risk for Black people in these communications."
The researchers studied a total of 24 hours of human-transcribed and annotated broadcast police communications transmitted on a single day in three Chicago dispatch zones, or regions used to coordinate police activity. According to U.S. census data, one zone was majority non-Hispanic white, one majority Hispanic and one majority non-Hispanic Black. The team found that broadcast police communications mentioned males nine times more frequently than females and that Black males were most often mentioned of all groups, even in the majority white zone.
The researchers presented their findings at the 27th Association for Computing Machinery Conference on Computer-Supported Cooperative Work and Social Computing on Nov. 9-13 in Costa Rica. The team received a diversity, equity and inclusion recognition from the conference's awards committee.
"The typical police radio transmission is short and serves a coordinating purpose, something like "Car 54, where are you?'" said Chris Graziul, research assistant professor at the University of Chicago, study co-author and one of two principal investigators leading the project. "These transmissions try to communicate what's happening and describe who's involved. In the process, sensitive information is often disclosed."
The researchers obtained 9,115 transmissions—what they called "utterances"—that occurred when police or dispatch communicated via radio broadcast. They manually transcribed the transmissions and then randomly chose 2,000 utterances from across the three zones to analyze further. They developed a qualitative annotation scheme to label the text. They divided the annotated data into six categories, ranging from event information, such as "residential alarm break in" or "traffic stop," and procedural transmissions, such as the "Car 54" example, to casual transmissions like "Morning, squad."
The researchers found that event utterances contained the most references—about 60%—to gender, race/ethnicity, age and protected health information, which can be used to identify individuals. Nearly 68% of utterances that included a sociodemographic indicator used male gendered terms, and approximately 69% of those utterances referred to Black people, according to the researchers.
"Our findings contribute to a larger body of evidence about racial disparities in policing. What is novel here is the data source: radio transmissions," Graziul said. "Despite prolific use by police systems around the world, few have explored what this means of communication can tell us about how policing operates in practice.
"Disproportionate mentions of Black people reflect a novel way to observe how officers' attention is unevenly distributed across racial/ethnic groups, and identifying this disparity helps us understand challenges to the ethical use of this data source for research, like preventing leakage of sensitive personal information, which would impact Black communities substantially more than other communities."
Discover the latest in science, tech, and space with over 100,000 subscribers who rely on Phys.org for daily insights. Sign up for our free newsletter and get updates on breakthroughs, innovations, and research that matter—daily or weekly.
Subscribe
After examining the utterances, the team tested a large language model (LLM), a widely used artificial intelligence tool, to determine its capacity to find personal information in the transcripts. Despite the unique nature of the language involved with broadcast police communications, the LLM detected personally identifiable information with high accuracy, highlighting the risk of privacy vulnerability. Bad actors, such as identity thieves, could use AI technology to quickly find and misuse the personal information in transcripts of police radio activity, according to the researchers.
"This work reveals a concerning trend of racial inequality in terms of the exposure of sensitive information during police radio transmissions," said Pranav Narayanan Venkit, graduate student pursuing a doctoral degree in informatics in the College of Information Sciences and Technology and first author on the paper. "This study may help researchers and developers give more thought to interactions between LLM and different segments of society—the policing community, minority populations and various other populations—to identify biases and protect personal information."
Miranda Goodman, who graduated with her bachelor's degree from Penn State this past summer, and fourth-year Penn State student Samantha Kenny also contributed to this work.
More information: Pranav Narayanan Venkit et al, Race and Privacy in Broadcast Police Communications, Proceedings of the ACM on Human-Computer Interaction (2024). DOI: 10.1145/3686921
Provided by Pennsylvania State University