Featured

Entity Extraction: Not As Simple As Some Vendors Say

dino orange_thumb_thumb_thumb_thumb_thumbNo smart software. Just a dumb dinobaby. Oh, the art? Yeah, MidJourney.

Most of the systems incorporating entity extraction have been trained to recognize the names of simple entities and mostly based on the use of capitalization. An “entity” can be a person’s name, the name of an organization, or a location like Niagara Falls, near Buffalo, New York. The river “Niagara” when bound to “Falls” means a geologic feature. The “Buffalo” is not a Bubalina; it is a delightful city with even more pleasing weather.

The same entity extraction process has to work for specialized software used by law enforcement, intelligence agencies, and legal professionals. Compared to entity extraction for consumer-facing applications like Google’s Web search or Apple Maps, the specialized software vendors have to contend with:

  • Gang slang in English and other languages; for example, “bumble bee.” This is not an insect; it is a nickname for the Latin Kings.
  • Organizations operating in Lao PDR and converted to English words like Zhao Wei’s Kings Romans Casino. Mr. Wei has been allegedly involved in gambling activities in a poorly-regulated region in the Golden Triangle.
  • Individuals who use aliases like maestrolive, james44123, or ahmed2004. There are either “real” people behind the handles or they are sock puppets (fake identities).

Why do these variations create a challenge? In order to locate a business, the content processing system has to identify the entity the user seeks. For an investigator, chopping through a thicket of language and idiosyncratic personas is the difference between making progress or hitting a dead end. Automated entity extraction systems can work using smart software, carefully-crafted and constantly updated controlled vocabulary list, or a hybrid system.

Automated entity extraction systems can work using smart software, carefully-crafted and constantly updated controlled vocabulary list, or a hybrid system.

Let’s take an example which confronts a person looking for information about the Ku Group. This is a financial services firm responsible for the Kucoin. The Ku Group is interesting because it has been found guilty in the US for certain financial activities in the State of New York and by the US Securities & Exchange Commission. 

Read more »

Interviews

DarkCyber, March 29, 2022: An Interview with Chris Westphal, DataWalk

Chris Westphal is the Chief Analytics Officer of DataWalk, a firm providing an investigative and analysis tool to commercial and government organizations. The 12-minute interview covers DataWalk’s unique capabilities, its data and information resources, and the firm’s workflow functionality. The video can be viewed on YouTube at this location.

Stephen E Arnold, March 29, 2022

Latest News

Another Google AI PR Push from a British Googler

This write up is the work of a humanoid who admits he is a dinobaby; that is, deadwood too old to employ. By the way, the “dinobaby” lingo allegedly emerged... Read more »

November 27, 2024 | Comment

A New Frankie Bursts on the Music Scene

So here is a minor but unfortunate thing that just happened to our culture: As the BBC reports, “Zuckerberg Records ‘Romantic’ Cover of Explicit Rap Hit.”... Read more »

November 27, 2024 | Comment

Modern Library Patrons Present Challenging Risky Business Situations

Librarians have one of the most stressful jobs in the world. Why? They do much more than assist people locating books or reading to children. They also are therapists,... Read more »

November 27, 2024 | Comment

FOGINT: Telegram Shifts from Pretending to Promoting Its Casino Play

An online service named “EuropeanGaming.eu” published an interesting story about Telegram. As you may know, the founder of VKontakte.ru and Telegram Messenger... Read more »

November 26, 2024 | Comment

Google Chrome Generating Attention. A Lot of Attention

The US Department of Justice (DOJ) took the first step in breaking up Google’s Big Tech monopoly by forcing Alphabet Inc. to sell its popular Web browser, Chrome.... Read more »

November 26, 2024 | Comment

Marketing Jobs Require More Than AI-Know How

Marketing remains a lucrative industry, but it’s become even more complex with the advent of AI. McKinsey recommends that marketers will find growth through portfolio... Read more »

November 26, 2024 | Comment

Explaining Graykey: Helpful or Harmful for Law Enforcement?

I am not keen on making some “secrets” publicly available. Those keen on channeling Edward Snowden may have glory words to describe their activities. I take... Read more »

November 25, 2024 | Comment

Apple: Another Problem Becoming Evident

Apple is a beast in Big Tech with its cult of loyal devotees, technology advancement (especially in mobile devices), and Apple TV. Apple TV invested big money in... Read more »

November 25, 2024 | Comment

Early AI Adoption: Some Benefits

Is AI good or is it bad? The debate is still raging about, especially in Hollywood where writers, animators, and other creatives are demanding the technology be... Read more »

November 25, 2024 | Comment

FOGINT: Security Tools Over Promise & Under Deliver

While the United States and the rest of the world has been obsessed with the fallout of the former’s presidential election, bad actors planned terrorist plots.... Read more »

November 22, 2024 | Comment


  • Archives

  • Recent Posts

  • Meta