Default ROT Rules for Shinydocs Pro
Shinydocs Pro ships with the following rules for defining redundant, obsolete and trivial (ROT) content in your data sources.
ROT category | Rule name | Date condition | Metadata condition | Rule description |
---|---|---|---|---|
Obsolete | log | Created Date older than 6 months ago | File extension: .log | Old log files. |
Obsolete | event_log | Created Date older than 6 months ago | File extension: .evtx | Windows Event Viewer logs are useful for troubleshooting systems but do not provide much value (if any) after resolution. |
Obsolete | packet_capture | Created Date older than 6 months ago | File extension: .pcapng, .pcap, .har, .saz, .bfr, .ncf, .ncfx, .snoop | Packet capture logs are common for IT to keep while troubleshooting network problems. Once the issue has been resolved, these logs do not have value to the business. |
Trivial | calendar | Created Date older than 3 months ago | File extension: .ical, .ics, .ifb, .icalendar | Common calendar event files that can be downloaded. These files are generally used once to import into a calendar (Exchange, Apple Calendar), once they are imported the file no longer has use. |
Trivial | pirated | N/A | Common pirated media format/containers and keywords | Likely pirated content. |
Trivial | windows_thumbnail | N/A | File name: thumbs.db | Windows thumbnail files. |
Trivial | empty | N/A | File size: 0 bytes | Empty files. |
Trivial | temporary_tilde | Last Modified Date older than 1 week | File name starts with the character "~" | Temporary file commonly left behind by Microsoft Office Products. |
Trivial | temporary_file | N/A | File extension: .tmp, .temp, .swp | Temporary file. |
Trivial | recycle_bin | Last Modified Date older than 30 days | Parent: \\$Recycle.Bin | Recycle bins can be left behind on file shares. This rule only applies to file system repositories. |
Trivial | windows_memory_dump | Last Modified Date older than 3 months | File extension: dmp | Windows Memory Dump used for debugging applications/Windows. These can be made automatically by an application that has crashed. |
Trivial | crypto_miner | N/A | File name matches common crypto mining software executables | Executables (.exe) that have a name known to be a crypto miner. |
Trivial | crypto_wallet | N/A | File name: wallet, extension: .dat | While crypto wallets can be virtually any extension, the common file name for crypto wallets is "wallet" with the extension ".dat". |
Obsolete | cache | Last Modified Date older than 30 days | Folder name: "cache" or with an extension '.cache', folders containing "windows" excluded | Cache directories typically hold temporary data for an application. |
Trivial | macos_thumbnail | N/A | File name: ".DS_Store" | Similar to Windows "thumbs.db", left by Mac OS. Contains view options, icon position, and other metadata about displaying folder contents. While these are useful on local drives, they do not function in a file share setting. |
Trivial | email_spam | N/A | Email subject: contains "spam" | Spam and junk email filters typically prefix detected spam/junk with "[SPAM]" (or similar). |
Trivial | email_no_subject | N/A | Email subject: no subject | Emails with no subject are typically not records and/or contain no business value. |
Trivial | email_no_body | N/A | Email body (fullText): no body | Emails with no body are typically not records and/or contain no business value. |
Trivial | email_noreply | N/A | Email fromAddress: contains "noreply", "no_reply", or "no-reply" | Emails from an account called "noreply" are typically notifications from services or newsletters. |
Trivial | entertainment_media | N/A | Extension: .m4v, .m4a, .azw, .azw1, .mkv, .vob | m4v and m4a are common iTunes media file extensions for music and movies purchased via iTunes. azw and azw1 are Amazon Kindle files. Mkv/vob files are common DVD\Blu-Ray copies. |
Trivial | game | N/A | See Game Files tab | Most popular video games and emulation files. Some of these file types are used by other applications and have been filtered out. |
Trivial | temporary_backup | Last Modified Date older than 30 days ago | File extension: .$$$, .old, .bak, wbk | Temporary backup files. |
Obsolete | flash_media | N/A | File extension: .flv, .fla, .swf except if the name is "socketpool" | Files related to Adobe Flash, typically these files are animations or old web videos. Adobe Flash ended in 2020. Does not match "SocketPool" as that is used by applications and should not be modified. |
Trivial | nsfw | N/A | Dictionary of over 500 terms that are generally not suitable for the workplace. Basic Office Files and PDF are excluded from this rule as well as some development file types | Note: The script contains graphic/offensive language due to the nature of the content. Files whose name or folder contains terms that are Not Safe For Work are identified by this script. |
Obsolete | windows_registry | Last Modified Date older than 1 year ago | File extension: .reg excludes paths that contain "System32" and "http://Microsoft.NET " | Windows registry files are used to import or report on registry values in the Windows operating system. These files can be used in the automation of application deployments/IT systems or registry backups. Often forgotten about once their purpose has been served. |
Obsolete | old_draft | Last Modified Date older than 1 year ago | File name or Folder contains "draft" | Drafts are common when creating digital content, rarely deleted once the final version is complete. |
Redundant | email_copy | N/A | File extension: .boe, .box, .eml, .mbox, .mbs, .mbx, .mmdf, .msf, .msg, .nsf, .ost, .pst, .tbb OR Folder named "email archive" | Email Files or folders named "email archive". |
Redundant | java | N/A | File extension: .class, .jad, .jar, .java, .jsp, .idx, .cod, .j | Java files. |
Redundant | smartphone_software | N/A | File extension: .ipa, .apk, .ipsw | Typical iOS and Android application file extensions. |
Obsolete | supersede | N/A | File name: supersede, superseed, supercede, superceded, superseded, superseeded | Files that contain the term "supersed" or "superseded", along with common incorrect spellings. |
Obsolete | copied | N/A | File name matches regular expression ".*- [cC]opy\..*" OR ".*\\([0-9]*\\).*" | Copy and pasted files typically have "- copy" or "(n)" in their name (where 'n' is the number of copies that already exist in that folder. |
Trivial | no_business_value | N/A | File name or Folder name contains terms such as: funny, funnies, wedding, jokes, married, selfie, joke, vacation pictures, dog, dogs, cats, puppy, kitty, cute | No business value. |
Trivial | no_extension | N/A | File with no extension | Typically files of importance will have an extension. Note: Directories with application data should be reviewed as they sometimes use extensionless files for their operation. |
Obsolete | unused | Last Modified & Creation older than 7 Years | Old files. | |
Redundant | large_disk_image | Last Modified Date older than 30 days ago | File extension: | Large software disk images. |
Obsolete | archive | Last Modified older than 1 year | File path name contains "archive", "archived" or "archives", common Windows and Python directories have been omitted | Files and folders that are outdated as indicated by the user including archive keywords in the file name or folder name. |