All features extracted from HTML source and DOM structure. Hover over any feature to see its description.
Document Size & Text
dom_depth
html_length
text_length
num_words
text_to_html_ratio
inline_css_length
num_tags
Metadata & Page Identity
has_title
has_description
has_keywords
has_author
has_copyright
has_viewport
has_favicon
num_meta_tags
DOM Elements & Layout
num_divs
num_spans
num_paragraphs
num_headings
num_lists
num_tables
num_images
num_iframes
num_hidden_iframes
num_data_uri_images
num_css_files
num_scripts
num_inline_scripts
num_inline_styles
num_input_fields
Link & Resource Analysis
num_links
num_internal_links
num_external_links
ratio_external_links
num_unique_external_domains
num_mailto_links
num_javascript_links
num_ip_based_links
num_suspicious_tld_links
num_empty_links
num_anchor_text_mismatch
num_external_css
num_external_images
num_external_scripts
Forms & Inputs
has_form
has_login_form
num_forms
num_email_fields
num_password_fields
num_text_fields
num_submit_buttons
num_hidden_fields
num_forms_without_labels
num_empty_form_actions
num_external_form_actions
password_with_external_action
Scripts & Dynamic Behavior
has_eval
has_escape
has_unescape
has_atob
has_base64
has_fromcharcode
has_document_write
has_window_open
has_location_replace
has_meta_refresh
num_onclick_events
num_onload_events
num_onerror_events
Visibility & Interaction Tricks
has_display_none
has_visibility_hidden
has_right_click_disabled
has_status_bar_customization
Contact & Social Engineering Signals
has_email_address
has_phone_number
num_brand_mentions
num_urgency_keywords
Engineered Features (23 computed)
Ratios, interactions, density metrics and risk scores computed from raw features via the engineer_features() pipeline.
Ratios
empty_to_total_links
external_to_total_links
forms_to_inputs_ratio
hidden_to_visible_inputs
iframes_to_tags_ratio
images_to_tags_ratio
password_to_inputs_ratio
scripts_to_tags_ratio
Interaction Features
brand_with_forms
external_scripts_links
forms_with_passwords
hidden_with_external
iframes_with_scripts
urgency_with_forms
Density Metrics
content_density
form_density
links_per_word
scripts_per_form
Risk Scores
form_risk_score
legitimacy_score
obfuscation_score
phishing_risk_score
has_suspicious_elements