Featured

Entity Extraction: Not As Simple As Some Vendors Say

dino orange_thumb_thumb_thumb_thumb_thumbNo smart software. Just a dumb dinobaby. Oh, the art? Yeah, MidJourney.

Most of the systems incorporating entity extraction have been trained to recognize the names of simple entities and mostly based on the use of capitalization. An “entity” can be a person’s name, the name of an organization, or a location like Niagara Falls, near Buffalo, New York. The river “Niagara” when bound to “Falls” means a geologic feature. The “Buffalo” is not a Bubalina; it is a delightful city with even more pleasing weather.

The same entity extraction process has to work for specialized software used by law enforcement, intelligence agencies, and legal professionals. Compared to entity extraction for consumer-facing applications like Google’s Web search or Apple Maps, the specialized software vendors have to contend with:

  • Gang slang in English and other languages; for example, “bumble bee.” This is not an insect; it is a nickname for the Latin Kings.
  • Organizations operating in Lao PDR and converted to English words like Zhao Wei’s Kings Romans Casino. Mr. Wei has been allegedly involved in gambling activities in a poorly-regulated region in the Golden Triangle.
  • Individuals who use aliases like maestrolive, james44123, or ahmed2004. There are either “real” people behind the handles or they are sock puppets (fake identities).

Why do these variations create a challenge? In order to locate a business, the content processing system has to identify the entity the user seeks. For an investigator, chopping through a thicket of language and idiosyncratic personas is the difference between making progress or hitting a dead end. Automated entity extraction systems can work using smart software, carefully-crafted and constantly updated controlled vocabulary list, or a hybrid system.

Automated entity extraction systems can work using smart software, carefully-crafted and constantly updated controlled vocabulary list, or a hybrid system.

Let’s take an example which confronts a person looking for information about the Ku Group. This is a financial services firm responsible for the Kucoin. The Ku Group is interesting because it has been found guilty in the US for certain financial activities in the State of New York and by the US Securities & Exchange Commission. 

Read more »

Interviews

DarkCyber, March 29, 2022: An Interview with Chris Westphal, DataWalk

Chris Westphal is the Chief Analytics Officer of DataWalk, a firm providing an investigative and analysis tool to commercial and government organizations. The 12-minute interview covers DataWalk’s unique capabilities, its data and information resources, and the firm’s workflow functionality. The video can be viewed on YouTube at this location.

Stephen E Arnold, March 29, 2022

Latest News

Amazon: Black FridAI for Smart Software Arrives

This write up was created by an actual 80-year-old dinobaby. If there is art, assume that smart software was involved. Just a tip. Five years ago, give or take a... Read more »

December 9, 2024 | Comment

Hiding Messages: The You-Will-Not-Pay-Attention Tactic

This blog post flowed from the sluggish and infertile mind of a real live dinobaby. If there is art, smart software of some type was probably involved. I worked... Read more »

December 9, 2024 | Comment

Smart Software Is Coming for You. Yes, You!

This write up was created by an actual 80-year-old dinobaby. If there is art, assume that smart software was involved. Just a tip. “Those smart software companies... Read more »

December 9, 2024 | Comment

Google and 2025: AI Scurrying and Lawsuits. Lots of Lawsuits

This is the work of a dinobaby. Smart software helps me with art, but the actual writing? Just me and my keyboard. I think there are 193 nations which are members... Read more »

December 6, 2024 | Comment

Grousing about Smart Software: Yeah, That Will Work

This is the work of a dinobaby. Smart software helps me with art, but the actual writing? Just me and my keyboard. I read “Writers Condemn Startup’s Plans... Read more »

December 6, 2024 | Comment

Batting Google and Whiffing the Chance

This is the work of a dinobaby. Smart software helps me with art, but the actual writing? Just me and my keyboard. I read “The AI War Was Never Just about AI.”... Read more »

December 6, 2024 | Comment

Googlers Face Another Ka-Ching Moment in the United Kingdom

This write up is from a real and still-alive dinobaby. If there is art, smart software has been involved. Dinobabies have many skills, but Gen Z art is not one of... Read more »

December 5, 2024 | Comment

China Seeks to Curb Algorithmic Influence and Manipulation

Someone is finally taking decisive action against unhealthy recommendation algorithms, AI-driven price optimization, and exploitative gig-work systems. That someone... Read more »

December 5, 2024 | Comment

Listary: A Chinese Alternative to Windows File Explorer

For anyone frustrated with Windows’ built-in search function, Lifehacker suggests an alternative. “Listary Is a Fast, Powerful Search Tool for Windows,” declares... Read more »

December 5, 2024 | Comment

The Very Expensive AI Horse Race

This write up is from a real and still-alive dinobaby. If there is art, smart software has been involved. Dinobabies have many skills, but Gen Z art is not one of... Read more »

December 4, 2024 | Comment


  • Archives

  • Recent Posts

  • Meta