Omar Torres, Bella Mendoza, Zachary Kuo
University of California, Berkeley
Ordinal regression is a classification method for categories on an ordinal scale -- e.g. [1, 2, 3, 4, 5] or [G, PG, PG-13, R]. This notebook implements ordinal regression using the method of Frank and Hal 2001, which transforms a k-multi-class classifier into k-1 binary classifiers (each of which predicts whether a data point is above a threshold in the ordinal scale -- e.g., whether a movie is "higher" than PG). This method can be used with any binary classification method that outputs probabilities; here L2-regularizaed binary logistic regression is used.
This notebook trains a model (on train.txt), optimizes L2 regularization strength on dev.txt, and evaluates performance on test.txt. Reports test accuracy with 95% confidence intervals.
Splitting Annotated Data
Modeling
Analysis
Our model defines classes primarily with the use of exclamation points, usage of capitalization, and negative bigrams. While these are categories are sufficient for a relatively accurate classifier, there are still some systematic mistakes made by the model. For example, rhetorical questions not getting caught as signs of aggression, or short sentences being commonly classified as aggressive while they are actually very neutral, are both common errors made by our model. This could be happening because shorter sentences mean the presence of one of these features can be weighted more heavily and there are fewer chances for the presence of non-aggressive features. Two labels that are often mistaken for each other are 1 and 2. This could be due to both of them being extremely common, making it difficult to differentiate the minor differences between the two.
In terms of biases, there is a lack of data for certain registers of English such as "intimate" or "frozen" due to the nature of complaints. This makes it difficult to identify potential bias in these registers of the language. While not this does not necessarily equate to bias, there is a tendency for casual English to have higher aggression scores and formal English to have lower scores, likely due to the attitude of the speaker that is associated with each respective register. The spread of data labels was also very skewed. Over half (around 60%) of our dataset is labeled as 1, and the next most prevalent labels are 2, 3, then 4 respectively. This could cause our model to place less importance on the 3 and 4 levels, as the prevalence of 1's will be focused on during the training process. This means that our dataset would be a good candidate for changing class weights. By increasing prediction error for misclassifying 3's and 4's, we can work around this imbalance of labels.
________________________
The source of our data is the Consumer Financial Protection Bureau, a federal dataset that contains complaints specifically regarding financial products and services from companies. The data is published after the company responds to the complaint, or after 15 days. This data is listed as, “intended for public access and use”. This means that it is neither private nor under copyright. It contains CSV and JSON versions of data and was processed and prepared by the US government. The dataset was initially created in November 2020 and is stated to be updated daily.
In our dataset of consumer complaints, we are primarily considering annotating the “Consumer complaint narrative” category, as it contains the actual text used by consumers when filing a complaint. Rather than sentiment analysis, the goal of our project is to analyze the level of aggression displayed in each of the complaints by detecting elements such as harsh wording, euphemisms, and the overall tone of the message.
Aggression Level Analysis Annotation Guidelines
1. Introduction
The purpose of this annotation is to utilize identified aspects of aggression and frustration within consumer complaints to gauge the overall level of anger that is expressed. The source of the data used for this task stems from the Consumer Financial Protection Bureau, a federal dataset that contains complaints specifically regarding financial products and services from companies. The data is published after the company responds to the complaint, or after 15 days. For each given complaint, the task of the annotator is to identify elements of frustration and/or aggression within the sentences and use this information to decide upon a designated level of anger.
1a. Notable Definitions
Euphemism Mild or indirect words or expressions substituted for one considered too harsh or blunt when referring to something unpleasant or embarrassing. For example, the sentence “He was let go from his job” uses euphemism by using “let go” rather than “fired” to describe the end of the person’s employment. Frustration The feeling of being upset or annoyed, especially because of inability to change or achieve something. From a psychological perspective, frustration is often correlated with anger and disappointment, making it an important feeling to detect when identifying levels of anger.
2. Levels of Anger
- 1 (no anger expressed): the user is filing for a complaint or request in a respectful manner that has minimal signs of expressed frustration. Although the user is complaining, they are generally only listing facts that could help to solve their case and are opting to uses euphemisms rather than being too harsh. Common factors to notice for level 1 are: o Mentioning failure to take action, or being unsure about a process o Primarily providing facts o Making an inquiry o Expressing concern without being aggressive or threatening
- 2 (slightly angry): the user is communicating their problem, but there are indications of frustration through diction, tone, and/or punctuation that imply a level of anger that is slightly above neutral, but not too extreme. Rather than sticking to providing raw facts, the user expresses some of their feelings that come up when dealing with the problem they are trying to solve. Common factors to notice for level 2 are: o Expressing how the situation being complained about is having a negative impact on their life o Mild frustration, whether directly stated or detected through the tone of the complaint o Mentions of the company refusing to take action or do anything to help the situation o Not too much aggression, but rather more focus on frustration
- 3 (angry): Rather than using a more reserved level of expressing frustration, the user is more blatant with their aggression and/or takes actions that indicate anger. The user may point out the fault of the company or lean more heavily on opinionated statements such as how they feel about the situation or company in question. Making accusations towards the company or claiming that something is illegal is also a good indicator. Common factors to notice for level 3 are: o Threatening to take action against the company o Stating that people were rude, or similar negative experiences o Making accusations against the company
- 4 (very angry): The highest level of anger, with the user very blatantly showing aggression in their complaint. There may be offensive or harsh wording, threats, or demands. The user expresses their thoughts and opinions about the company with the use of negative extremities (“worst”, “slowest”, etc.). Common factors to notice for level 4 are: o Clear expression of anger, frustration, and aggression o Harsh wording o Negative extremities and use of exclamation points to highlight negative points o Threats or demands made against the company