Extract Indicators From File - Generic v2

Extracts indicators from a file. Supported file types:

  • CSV
  • PDF
  • TXT
  • HTM, HTML
  • DOC, DOCX
  • PPT, PPTX
  • RTF
  • XLS, XLSX
  • XML

Dependencies

This playbook uses the following sub-playbooks, integrations, and scripts.

Sub-playbooks

This playbook does not use any sub-playbooks.

Integrations

This playbook does not use any integrations.

Scripts

  • Set
  • ExtractIndicatorsFromTextFile
  • ConvertFile
  • ReadPDFFileV2
  • ExtractIndicatorsFromWordFile

Commands

  • image-ocr-extract-text

Playbook Inputs


| Name | Description | Default Value | Required |
|------|-------------|---------------|----------|
| File | The file from which to extract indicators. | File.None | Optional |

Playbook Outputs


| Path | Description | Type |
|------|-------------|------|
| Domain.Name | Extracted domains. | unknown |
| Account.Email.Address | Extracted email addresses. | unknown |
| File.MD5 | Extracted MD5 hash. | unknown |
| File.SHA1 | Extracted SHA1 hash. | unknown |
| File.SHA256 | Extracted SHA256 hash. | unknown |
| IP.Address | Extracted IP addresses. | unknown |
| File.Text | The text or images extracted from the PDF file. | unknown |
| File.Producer | The PDF file producer. | unknown |
| File.Title | The title of the PDF file. | unknown |
| File.xap | The xap of the PDF file. | unknown |
| File.Author | The author of the file. | unknown |
| File.dc | The dc of the file. | unknown |
| File.xapmm | The xapmm of the file. | unknown |
| File.ModDate | The ModDate of the file. | unknown |
| File.CreationDate | The CreationDate of the file. | unknown |
| File.Pages | The number of pages in the file. | unknown |
| URL.Data | List of URLs that were extracted from the file. | unknown |
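
To give a feel for what the hash, IP address, and email outputs contain, here is a minimal Python sketch of regex-based indicator extraction from plain text. It is an illustration only and not the logic used by the playbook's scripts: the patterns, the PATTERNS dictionary, and the extract_indicators function are all hypothetical, and only the output path names are taken from the outputs listed above.

```python
import re

# Hypothetical patterns for illustration only; the playbook's scripts may
# apply different (and stricter) rules when extracting indicators.
PATTERNS = {
    "File.MD5": re.compile(r"\b[a-fA-F0-9]{32}\b"),
    "File.SHA1": re.compile(r"\b[a-fA-F0-9]{40}\b"),
    "File.SHA256": re.compile(r"\b[a-fA-F0-9]{64}\b"),
    "IP.Address": re.compile(
        r"\b(?:(?:25[0-5]|2[0-4]\d|1?\d?\d)\.){3}"
        r"(?:25[0-5]|2[0-4]\d|1?\d?\d)\b"
    ),
    "Account.Email.Address": re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b"),
}


def extract_indicators(text):
    """Return a dict mapping each output path to a sorted list of unique matches."""
    return {
        path: sorted(set(pattern.findall(text)))
        for path, pattern in PATTERNS.items()
    }
```

Calling extract_indicators on the text of a file returns one list per output path, with an empty list where nothing matched. The word-boundary anchors keep a longer hex string, such as a SHA256 hash, from also being reported as an MD5 or SHA1 hash.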

Playbook Image


[Image: Extract Indicators From File - Generic v2]