The New Role: Complete Career Change

I am going to start writing more frequently, so I can hopefully encourage others to do the same in their journey to learn more, and also to hopefully get feedback about how to improve my posts or personal knowledge base. I am always open to learning something new.

So why wait till now to start writing? Well, I started my new role, and I simply love it, however, I haven’t been writing much because of how content I am with the team and the things I get to work on and learn. 

Photo by Immo Wegmann on Unsplash

Hold on pause! BACK IT UP!

Okay, here is some context. I was working in the Sales department with a great team of people, however, the work was unrelated to what I wanted to do long term, which is, to work with databases, and eventually, work as a Data Scientist!

Photo by Myriam Jessier on Unsplash

So far things have been going great in my new role, but getting to where I am now was not the easiest thing for me to do. I had to put together a plan because attending classes is not enough if you are going through a complete career change like I was. Let me give you the rundown of the different roles I have held over the years.

career_journey <- c('Paraprofessional at Roberts Academy','English Teacher in Japan','Long Term English Substitute Teacher for Finneytown Highschool','Kroger IT Support Analyst','Sales Underwriting Assistant','Business Intelligence Developer')
summary(career_journey)
career_journey <- data.frame(career_journey)
The Various Jobs I have held on my Arbitrary Path Towards My Career
career_journey
Paraprofessional at Roberts Academy
English Teacher in Japan
Long Term English Substitute Teacher for Finneytown Highschool
Kroger IT Support Analyst
Sales Underwriting Assistant
Business Intelligence Developer

I graduated from school wanting to be a teacher helping students learn the English language through literature, well, that was fun while it lasted, but proved to be not the right career move for me. I was scared of moving into something related to computers, however, when I jumped in and realized how fun and rewarding working with data can be, I felt like I had found my mate for life…data. Hence the blog name and image…

Maybe you find yourself in a similar situation, where your current career goals…

1. seem to not be working out the way you thought

2. have become a lie you tell yourself to get through the day

3. you are finding that the thing you loved the most about your career is getting replaced or overrun by something else (this is the category I fell into).

Don’t give in! Explore your options and never doubt yourself! Your brain is a powerful tool that can be reconfigured to whatever you set your mind to. As cliché as that sounds, it’s true. The only limit is what you put into action. For me, this was a Data Analytics Program, a ton of self-study (SQL phone apps, Quizlet for vocab, R phone apps, and practice on my PC), and a determination to succeed.

If you are thinking about switching careers you probably have a long laundry list of things you think will be holding you back. One of mine was having my first kid! Can you believe that? I actually thought an incredible innocent little creature was going to “hold me back,” but guess what, she became another motivating factor for me to work harder if anything else, because now my daughter was apart of my journey, and I couldn’t let myself, my wife, and my daughter down!!! There were quite a number of mornings at 5 am where my daughter would only fall asleep in the carrier which I would strap on and start rocking her to sleep while also typing SQL queries, and SAS or R code on my computer for my class projects (Rest assured, if I were to put her into her crib, she would be back up in a heartbeat 😉 , so the carrier was the best option for her and myself).

So what if I am just not as amazing as you? This question assumes that I am some degree of “amazing” by myself, but the truth is the most important ingredient to your success is the team of people you have behind you whole love and support you.

I hope you found this piece encouraging! I want to help those who wish to move into a different career but feel trapped. You can do it! It takes a lot of hard work, but you can do it!

I am going to more regularly blog tips and tricks I have picked up that have proven to be helpful for me. I also want to post some different projects I complete.

Thank you for reading!

PART 2: GOT $CAMMED?

In the last post, you were able to see an image of my beloved dog and hear a little bit about fraud and theft in America. This post you will also see a picture of my beloved dog, and I am hoping that with the context from the last blog post you can at least try to picture yourself in Sophia’s shoes who is the person I interviewed for the substance of this blog post. If people can be duped into paying pseudo-government officials money, and phony boyfriends and girlfriends, why couldn’t they be duped into paying a company that was hiring them? And this is where Sophia’s story begins…

The Dataset… This dataset was scrubbed the old fashioned way…manually. Through a series of messages and pictures. Thank you, Sophia, for helping me construct this timeline with your data.

Sophia, whose real name will be kept secret to protect her identity, is a young blonde who really likes to look fashionable and has a cat or, at least that is what Facebook seemed to suggest. Sophia began to talk through some of the things that she had felt leading up to the week where she was robbed, and in reflecting what she learned from the incident, Sophia said, “I learned a lot about their techniques,” and when I asked her for advice for others if they found themselves in a questionably scammy scenario, she said, “[If they ask you for money] have them buy it for you…It’s not your responsibility,” and when they do send you money, “wait till the paycheck goes through.” At the beginning of our conversation, Sophia looked distraught and bewildered, but after making this statement her face reddened with anger, as she thought of the people who did this to her. When we reviewed some of the correspondence, one of the last messages Sophia sent seemed to match her mood now the most which involved some colorful language meant to offend the scammers. The scammers left without a trace and the whole exchange took only a week. If you are meeting someone for the first time and there is even the remotest suggestion of a scammer vibe this person is giving you, see FTC.gov, and look at their information on scammers. Their website has a lot of great resources for how to distinguish whether someone is a scammer or not and they have a phone number you can call to report scams. If you think you are alone, think again and then look at all of the data the FTC provides on other individuals who were also scammed out of their money or had their identity stolen. 

Stepping down from my soapbox, please look at the transaction line chart alongside the timeline of events as they unfolded between Sophia, the Scammer, and the bank. If you don’t want the annoying giant play button in the middle of the viz please go to this link http://public.tableau.com/views/viz_of_scam_timeline_with_balance_v3/TimelineFinal?:display_count=y&:origin=viz_share_link where you can see the best format of this viz.

 

Returning to a bit of advice Sophia gave earlier, “It’s not your responsibility,” and it is not okay to make a transaction without the guarantee of having the money bounce free in your hand or in your bank account. This bit of advice is important because I think it shows how the power should be in the potential employee’s hands or the requestee, as is the case for all other scamming incidents, not the employer’s or potential scammer.

 

References:

  1. https://www.consumer.ftc.gov/features/feature-0037-imposter-scams
  2. https://playfairdata.com/how-to-make-a-timeline-in-tableau/
  3. https://www.ftc.gov/enforcement/data-visualizations/explore-data

 

Part 1: Got $cammed?

First, a note to the reader. This is a two part series. This part we will look at some data reported to the FTC and in the second part, I have a surprise of sorts that will be ruined, if I give too much information. Reddington is back just because he’s ridiculously cute. He has nothing to do with scamming, but if you look into his adorable little eyes, it makes you thankful he is spared from this harm humans inflict on other humans.

The FTC files complaints from individuals who claim to have been scammed by someone posing as a government representative. What do we mean by scammed? Scamming can come in many different forms, but let’s talk about the general concept first and then provide some examples. According to the Merriam Webster online dictionary, a scam is “to obtain (something, such as money) by a scam,” and I believe defining the word with the word is never a good practice, so I am going to combine both definitions provided by Merriam Webster and here is the finalized definition, “to obtain (something, such as money) by” an act of deception. This definition is very broad so let’s look at first the list provided by the FTC of different cases they have received.

  1. Most recently, the FTC stated that these government imposters were posing as companies providing Coronavirus treatment.
  2. People claimed being contacted by someone claiming to be a member of “the government or a well-known business, a romantic interest, or a family member with an emergency” were received by consumers in 2019 and reported to the FTC.  These consumers reported a total of “$667 million” in losses.
  3. Within the scams that occurred that were labeled government imposters for 2019, the top imposter scam reported were Social Security imposters. There were “166,190 reports about Social Security scam, and the median individual loss was $1,500” (consumer.FTC.gov).

So how do these individuals scam people out of their money? “Gift cards” are reported to be the payment of choice (consumer.FTC.gov). Now that you have some context, let’s look at a small sample of this massive collection of data, and let’s focus on the Ohio fraud and theft reports from 2018. This analysis was completed in R and is available on my github repository (github.com/sterlingn/scammerdata).

A note to the reader on formatting: If you read this on a phone or tablet the scatter plots and bar charts should show with little issue, however, if you are viewing on a desktop, the images will appear ginormous. To get a better view of the scatter plots and the bar charts, simply click on the image and you can change the scale yourself.

library(tidyverse)
library(ggplot2)
library(sqldf)

First we will look at the Ohio fraud data. Here is a briefing on the set:

Metropolitan Areas are defined by the Office of Management and Budget, and population estimates are based on 2018 U.S. Census figures. Metropolitan Areas are ranked based on the number of reports per 100,000 population. Reports exclude state-specific data contributor reports.

I did most of my editting in excel to the database.

Ohiofraud <- read.csv("Desktop:/Ohio_normalized.csv")
knitr::kable(Ohiofraud, caption = "Ohio Fraud")
Ohio Fraud
metro_area_f state reports
Cleveland-Elyria OH Metropolitan Statistical Area 714
Columbus OH Metropolitan Statistical Area 705
Dayton OH Metropolitan Statistical Area 621
Weirton-Steubenville WV-OH Metropolitan Statistical Area 614
Akron OH Metropolitan Statistical Area 596
Canton-Massillon OH Metropolitan Statistical Area 588
Youngstown-Warren-Boardman OH-PA Metropolitan Statistical Area 587
Cincinnati OH-KY-IN Metropolitan Statistical Area 586
Springfield OH Metropolitan Statistical Area 561
Toledo OH Metropolitan Statistical Area 545
Huntington-Ashland WV-KY-OH Metropolitan Statistical Area 489
Mansfield OH Metropolitan Statistical Area 489
Salem OH Micropolitan Statistical Area 484
Lima OH Metropolitan Statistical Area 467
Wheeling WV-OH Metropolitan Statistical Area 459
Wooster OH Micropolitan Statistical Area 454

Which cities have the highest number of incidents?

knitr::kable((Ohiofraud[1:3,] %>%
  arrange(desc(reports))), caption ="Cities with the Highest Number of Incidents")
Cities with the Highest Number of Incidents
metro_area_f state reports
Cleveland-Elyria OH Metropolitan Statistical Area 714
Columbus OH Metropolitan Statistical Area 705
Dayton OH Metropolitan Statistical Area 621

Cleveland with 714, Columbus with 705, and Dayton with 621

Where’s the nasty Nati?

Ohiofraud %>%filter(str_detect(metro_area,"Cincinn")

mean(Ohiofraud$reports)# 559.9375

The nasty Nati is just above the average at 586 reports. Let see if Cincinnati falls within the upper interquartile range.

summary(Ohiofraud$reports)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   454.0   487.8   573.5   559.9   600.5   714.0

Cincinnati is just under the 3rd Quartile which is 600. Was the average the best summary statistic to use? Let’s see a visual of the data.

The scatterplot shows several points that are scattered about in what appear to be clusters.

d <- Ohiofraud$reports

View(d)

qplot(Ohiofraud$reports,Ohiofraud$metro_area)

Now let’s also look at add a second column which will contain the data on id theft. I will need to revert to SQL for this query.

Ohioidtheft <- read.csv("Desktop:/Ohio_idtheft_BCNF.csv")
View(Ohioidtheft)


Ohioidtheft %
  rename(metro_area_t=Metropolitan.Area)%>%
  rename(reportsoftheft= X) %>%
  rename(state=Reports.per.100K.Population)

sqldf("SELECT *
      FROM Ohioidtheft")
##                  metro_area_t                                   state
## 1            Cleveland-Elyria        OH Metropolitan Statistical Area
## 2                    Columbus        OH Metropolitan Statistical Area
## 3                      Dayton        OH Metropolitan Statistical Area
## 4                       Akron        OH Metropolitan Statistical Area
## 5                      Toledo        OH Metropolitan Statistical Area
## 6  Youngstown-Warren-Boardman     OH-PA Metropolitan Statistical Area
## 7                  Cincinnati  OH-KY-IN Metropolitan Statistical Area
## 8                     Wooster        OH Micropolitan Statistical Area
## 9            Canton-Massillon        OH Metropolitan Statistical Area
## 10                  Mansfield        OH Metropolitan Statistical Area
## 11                       Lima        OH Metropolitan Statistical Area
## 12                      Salem        OH Micropolitan Statistical Area
## 13                Springfield        OH Metropolitan Statistical Area
## 14       Weirton-Steubenville     WV-OH Metropolitan Statistical Area
## 15         Huntington-Ashland  WV-KY-OH Metropolitan Statistical Area
## 16                   Wheeling     WV-OH Metropolitan Statistical Area
##    reportsoftheft
## 1             226
## 2             121
## 3             116
## 4             115
## 5             104
## 6              96
## 7              95
## 8              88
## 9              77
## 10             73
## 11             72
## 12             68
## 13             64
## 14             59
## 15             51
## 16             43

Do we have a primary key? Is it unique, irreducible, and not null… There should be 16 rows in this next query.

sqldf("SELECT DISTINCT metro_area_f
      FROM Ohiofraud")

So the easiest way I know of as of now, is to sort based on my primary key and add the data that way. If I had more data, I would need to first create a database, but for our purposes, this is fine.

sqldf("SELECT *
      FROM Ohiofraud f
      LEFT JOIN Ohioidtheft t
      ON metro_area.f = metro_area.t")#Error: No such column: metro_area.f?

Since the SQL join statment isn’t working. I will have to resort to the join function within…tidyverse. I am not going to show all of the queries I ran, but I used the sample queiries for the tidyverse package to figure out how to apply the same method to my dataset.

Using left join?

View(Ohiofraud)
View(Ohioidtheft)

Ohio_reports %
left_join( Ohioidtheft, by=c("metro_area_f"="metro_area_t"))

View(Ohio_reports)

I need to remove a column and rename a column. For Simplicities sake we will call our collection of data ‘r’

r %
  rename(metro_area= Ohio_reports.metro_area_f) %>%
  rename(state=Ohio_reports.state.x) %>%
  rename(fraud_reports=Ohio_reports.reports) %>%
  rename(theft_reports=Ohio_reports.reportsoftheft)
)

r %
  rename(metro_area= Ohio_reports.metro_area_f) %>%
  rename(state=Ohio_reports.state.x) %>%
  rename(fraud_reports=Ohio_reports.reports) %>%
  rename(theft_reports=Ohio_reports.reportsoftheft)

View(r)

Now we finally have our dataset to work with. Truly 80% of the work.

rp <- ggplot(r, aes(r$metro_area, r$fraud_reports))

rp+ geom_point(color = "blue")+coord_flip()
rp2 <- ggplot(r, aes(r$metro_area, r$theft_reports))
 
rp2 + geom_point(color ="red")+coord_flip()

What is the correlation between the two variables?

cor(r$theft_reports, r$fraud_reports)
## [1] 0.7238463
# 0.7238 
rp <- ggplot(r, aes(r$theft_reports, r$fraud_reports, colour = r$metro_area))

rp+geom_point(mapping = aes(color=r$metro_area))+
  labs(title = "The Imperfect High Fraud High Theft Relationship",
         subtitle = "The relationship between fraud reports and theft reports.")

As you can see there is somewhat of an upward trend. If the fraud reports increase, so do the theft reports. Although a definite outlier does seem to be Cleveland-Elyria.

theft<- ggplot(r, aes(r$metro_area,r$theft_reports, colour = r$metro_area))

theft+coord_flip()+geom_col()
fraud<- ggplot(r, aes(r$metro_area,r$fraud_reports, colour = r$metro_area))

fraud+coord_flip()+geom_col()

Clevland-ELyria also has the highest number of reports here too with 714 fraud repots.

Take a look at more data at FTC.gov/data. Check out their incredible Tableau vizzes here: https://www.ftc.gov/enforcement/data-visualizations/explore-data.

https://rpubs.com/natester

Thank you for reading and stay tuned for part 2!!!

References:

  1. Merriam Webster Online Dictionary https://www.merriam-webster.com/dictionary/scam
  2. https://www.consumer.ftc.gov/blog/2020/03/ftc-fda-warnings-sent-sellers-scam-coronavirus-treatments
  3. https://www.consumer.ftc.gov/blog/2020/01/top-frauds-2019
  4. https://www.consumer.ftc.gov/features/feature-0037-imposter-scams
  5. Tableau Interactive Viz: https://public.tableau.com/profile/federal.trade.commission#!/vizhome/GovernmentImposter/Infographic
  6. https://www.ftc.gov/enforcement/data-visualizations/explore-data