data science Tag Archives - General Assembly Blog

Applied AI Engineering Workshop: The Latest Skills for Software Development and Data Science

By

The groundwork has been laid for what many consider the dawn of the fourth industrial revolution, propelled by transformative artificial intelligence. GenAI, debuting in 2023, gains momentum in 2024, with a rapidly growing market valued at $22.12 billion, expected to achieve a 46% adoption rate by 2025.

In short? Now’s the time to bring your teams up to speed to stay ahead of the curve.

You don’t have to look far to see how AI is benefitting industry. AI algorithms are already enhancing security with city-wide drone detection, driving efficient recycling, revolutionizing accessibility, skyrocketing creativity, and addressing complex societal concerns.

Ethics, data visualization, and AI are all cornerstones of a data-driven culture. Companies who put data at the heart of their operation outperform competitors in revenues by 16%, operational efficiency by 23%, and customer retention by 32%.

Given the transformative potential, it’s no surprise 25% of global workers deem AI skills crucial. Yet, 66% of senior IT leaders say their employees need more AI skills to harness predictive analytics and machine learning.

There’s no time like the present. Let’s get started.

General Assembly’s Applied AI Engineering Workshop, designed for existing software engineers and data scientists, builds core AI competencies, empowering your team to become enablers of GenAI throughout your organization.

Continue reading

Why You Should Consider a Career in Data Analytics

By

Singapore learner on laptop

Alert: approaching maximum storage capacity. 

The world’s data use increases each year, with a forecast of 147. zettabytes created, consumed, and stored in 2024 – which is enough storage for 55 billion 4K movies.

This is a good thing – right? More data means more innovation, which means more advancements for society. 

Not necessarily. 

Think of data the same way you think about a library. There are so many books in one place (which is awesome) but it’s only useful if you know: 

  • How to find the information you need.
  • And how to apply it. 

Businesses have more data than ever before – about their company, their customers, and the world – but no one to tell them what it means. 

That’s where data analysts come in. 

WHAT DO DATA ANALYSTS DO? 

When there’s a problem, data analysts help solve it. 

The first step to addressing business challenges is gathering information (data) and finding answers and insights to guide companies towards better decisions. That’s the role of the data analyst. 

For example, a company may want to know which segment of customers is driving the most revenue from a marketing campaign. 

The data analyst will gather all the data related to the campaign. This may mean exploring customer demographics, marketing acquisition sources, behavioural data, and purchase data. 

They’ll look for notable statistical findings. They’ll form these into insights and create written and/or visual reports to help stakeholders learn and apply the findings to their future campaigns. 

As a distinction from data scientists, data analysts typically work with structured data from a single source and provide historical analysis as opposed to predictive modelling.

WHY SHOULD YOU CONSIDER A DATA ANALYTICS CAREER? 

The most obvious reasons to work in the field of data analytics include these top three reasons: 

  1. You’re dealing with data, numbers, and statistics, but you still get to creatively work to solve problems. 
  2. You’re paid well for this skill.
  3. Data keeps growing and so will the need for data analysts. 

But there are other benefits that may not be quite so apparent: 

  • Most employers are interested in talent with skills. There is not a big focus on degrees and further education. 
  • Many data analyst jobs are remote. No more commuting!
  • The technical skills you learn are easily transferable to other jobs like coding, data science, and more. 

5 TOP JOB TITLES FOR DATA ANALYSTS

What kind of jobs can you get as a data analyst? There are varying specialities and job titles in the field of data analytics. Here are some job titles you may see in this family of jobs:

DATA ANALYST

Related job titles: Junior Data Analyst, Entry-Level Data Analyst, Associate Data Analyst

You can find a data analyst at nearly every company in the world, in every industry imaginable. The average data analyst needs to know some basic programming languages like Python and SQL, and they should be comfortable running statistical analyses and visualising data. 

OPERATIONS ANALYST 

Related job title: Operations Research Analyst 

An operations analyst focuses on the inner workings of a business, helping it run more efficiently. They typically work for larger companies or they work at consulting firms employed by bigger businesses.

MARKETING ANALYST 

Related job title: Market Research Analyst 

One of the biggest parts of any company’s budget is the money they spend on marketing efforts. A marketing analyst looks at market, campaign, and demographic data to ensure companies are executing marketing efforts in the most cost-effective and impactful way possible.

BUSINESS INTELLIGENCE ANALYST 

You’ll spend your days as a BI analyst looking for patterns in your company’s data. You’ll have to make sure you’re good at communicating and that you enjoy visualising data and modelling future scenarios. 

Is a business analyst the same as a data analyst? While the skill sets are similar, there are some differences. Here’s our take on business analyst vs. data analyst

LOGISTICS ANALYST

Logistics analysts look at every stage of a production process and product lifecycle. They may analyse supply chain flows and find areas of improvement to increase efficiency and profit for a company. 

DATA ANALYST CAREER OUTLOOK 

Companies, including retailers, investment banks, big tech, and professional services (including accounting and insurance), are all ramping up their data analytics workforce. Other industries hiring for data analysts include logistics, healthcare, government, and sports. 

ARE DATA ANALYST JOBS IN DEMAND? 

The Singapore Economic Development Board (EDB) stated that the data science industry in Singapore contributes an estimated $730 million (USD) to the economy annually. Operations research analysts and market research analysts are also high-growth job categories. 

WHAT’S THE AVERAGE SALARY WORKING IN DATA ANALYTICS? 

In Singapore, the median salary for a data analyst is SGD $99,000, with the middle 50% earning between SGD $75,000 and $137,000. Of course, how much you can earn as a data analyst depends on several factors including education, experience, industry, and geography. For example, the median data analyst salary in the United States is $113,250, with the middle 50% earning between $93,000 and $134,000. In Australia, the typical data analyst salary is in the range of AUD $114,500 and $143,500

Experience and industry can also have an impact on your expected salary. An entry-level data analyst in Singapore’s financial services industry, for example, earns a median salary of SGD $60,000, while a senioranalyst in the same industry earns SGD $74,000 and a director in analytics earns SGD $132,000. 

HOW TO BECOME A DATA ANALYST

Most data analytics jobs require a bachelor’s degree. Degree programs in mathematics, statistics, business, or economics are ideal, but college grads can re-skill for data analytics with any major. 

There have never been more options for individuals to skill up for a career switch, and some employers will even pay for it because of fast-changing business needs. Here are twoways to gain the data analytics skills you need to fast-track a new career in this field:

#1: PART-TIME DATA ANALYTICS COURSE

If you have a full-time job or other responsibilities, a part-time course can be a good option and offers accountability for a set curriculum and timeline. However, the part-time model takes longer to finish and longer to reach the job market than a full-time option. 

#2: FULL-TIME DATA ANALYTICS BOOTCAMP

What is a data analytics bootcamp? Bootcamps provide immersive, intensive training for entry-level professionals in a field. Bootcamps can be in-person or online and are instructor-led, often with multiple speakers and mentors for a course and a cohort. 

So, which option is the best for you? It really depends on your background and learning style.  

If you have transferable skills and experience, you may only need to brush up on a programming language like Python to make the leap. If you’re coming from an unrelated field or from a career break, a more immersive, structured program like a bootcamp may be your best bet to get job-ready. 

GO FROM BEGINNER TO DATA ANALYST IN 12 WEEKS

General Assembly’s Data Analytics Bootcamp is designed for complete beginners. Get hands-on training from actual data analysts working in the field, and graduate in just 12 weeks ready for your first data analyst job. It’s the most direct route to your new data analyst career.

Data And AI: Best Friends Or Foes Of The Future?

By

Estimated reading time: 5 minutes


If you work in data, chances are you’re hearing a lot of buzz about AI and how it’s going to automate everything. While the headlines spell doom and gloom for knowledge worker roles like yours, the reality isn’t quite so dark. In fact, there won’t be any future of AI without people like you, who have the skills required to prepare and use data. 

Right now, AI adoption is in a strange phase. The media is telling you that you could be replaced by it, but your day-to-day probably hasn’t changed all that much. You might be thinking that this all seems like a bunch of hype… and you wouldn’t be wrong. That’s because most companies’ use of AI is still in its infancy. 

While 94% of companies say they are using AI today, most aren’t using it to its full potential. They’re struggling with data quality and infrastructure issues that make layering on AI nearly impossible. In the same study, almost three quarters of execs said that data issues would be the most likely reason they fail to achieve their AI goals. As it turns out, even a robot can’t make lemonade out of bad data. 

This is where you, and your skills, step in to save the day. And you’re in higher demand than ever before. There’s been a 2,000% surge in roles requiring AI skills, such as data science and data analytics

Before you breathe a sigh of relief, it’s important to recognize that your skills will need to evolve for an AI-led future. Let’s dig into exactly how. 

Continue reading

Back To Skills: Free Workshops To Prep For Your Tech Future

By

GA student smiling and walking into a coffee shop with their bike

Hot lunch or cold lunch. First day outfit. Which background for your yearbook photo. Those back-to-school choices may be behind you, but you have a new choice right in front of you when it comes to getting Back to Skills

We’re taking you Back to Skills with our popular free workshops from September 19 – October 26. Whether you’re looking to kickstart a career change, prepare for one of our bootcamps, or simply add new skills to your repertoire, our free workshop series will help you make it happen.

Just like our bootcamps, these workshops are led by our team of dedicated instructors, who have real-world experience in today’s most in-demand tech fields like data, coding, and UX design. 

Continue reading

Adapting to the Future: Exploring the Intersection of AI and Career Change

By

Student standing next to wall art in GA NYC campus

Estimated reading time: 8 minutes

In this blog, we demystify the influence of AI on the job market and delve into the transformative potential it brings to various industries. We believe that embracing tech-forward careers like data science, software engineering, and UX design opens up new horizons for professionals seeking to thrive in the future of work.

At GA, we pave the way for rewarding careers in the AI-driven world, offering competitive salaries, and continuous learning opportunities.


Rapid advancements in artificial intelligence (AI) have sparked concerns about its impact on the job market, leading to misconceptions about AI stealing jobs and replacing human skills. 

In this blog, we’ll demystify these notions and explore how recent innovations in AI are not eliminating jobs but instead opening up new opportunities and revolutionizing the way we work. 

Let’s delve into the truth behind AI’s influence on career change and understand how it can augment and enhance various industries and empower individuals with unique human skills. 

Continue reading

How Data Is Revolutionizing Basketball: Q&A with Ballstar Co-Founder Vaughn Caldon

By

Basketball has always had an incredibly passionate fanbase, fueled by the exciting and fast pace of gameplay and high-scoring matches. Some of the most famous athletes globally are professional basketball players, like LeBron James and Stephen Curry. Today, the NBA is broadcast in over 200 countries. 

As basketball has grown into a global phenomenon, technology has played a key role in taking the sport to new levels. Data analytics has transformed everything from how the sport is played and coached, to how fans engage with games and athletes. From tracking players’ performance to predicting game outcomes, data has become an indispensable tool for athletes and fans at every level. 

This surge of data in the industry has created unique opportunities for data scientists and data analysts who have a passion for both numbers and sports, as well as for UX designers and software engineers who bring this data to life via fan engagement apps and websites. 

To learn more about what’s behind this data-driven revolution in sports and the career opportunities it’s unlocked, we sat down with Vaughn Caldon, GA instructor and co-founder of Ballstar, a company at the forefront of data analytics in basketball. 

Continue reading

Top 5 Industries Hiring Data Analysts and Data Scientists

By

Without data, humans make decisions based on intuition. However, we don’t make very good decisions with our gut. That’s because as humans we have our emotions, unconscious biases, and gaps in information to contend with. 

Everyone makes better decisions with data. For a business, poor leadership decisions can be incredibly costly. But companies don’t just need to collect data—they need professionals who can analyze and interpret it. That’s where data analysts and data scientists come in. 

Glassdoor and U.S. News & World Report have both named data scientist among their best jobs based on salary, job satisfaction, and career opportunities. Data analysts earn a median base salary of $66,370 in the U.S., while data scientists earn $103,525 on average. If you’re ready to jump into your first data analyst entry-level job, read on for how to break in and the top industries hiring data analysts and data scientists

Continue reading

Beginner’s Python Cheat Sheet

By

Do you want to be a data scientist? Data Science and machine learning are rapidly becoming a vital discipline for all types of businesses. An ability to extract insight and meaning from a large pile of data is a skill set worth its weight in gold. Due to its versatility and ease of use, Python programming has become the programming language of choice for data scientists.

In this Python crash course, we will walk you through a couple of examples using two of the most-used data types: the list and the Pandas DataFrame. The list is self-explanatory; it’s a collection of values set in a one-dimensional array. A Pandas DataFrame is just like a tabular spreadsheet, it has data laid out in columns and rows.

Let’s take a look at a few neat things we can do with lists and DataFrames in Python!
Get the PDF here.

BEGINNER’S Python Cheat Sheet

Lists

Creating Lists

Let’s start this Python tutorial by creating lists. Create an empty list and use a for loop to append new values. What you need to do is:

#add two to each value
my_list = []
for x in range(1,11):
my_list.append(x+2)

We can also do this in one step using list comprehension:

my_list = [x + 2 for x in range(1,11)]

Creating Lists with Conditionals

As above, we will create a list, but now we will only add 2 to the value if it is even.

#add two, but only if x is even
my_list = []
for x in range(1,11):
if x % 2 == 0:
my_list.append(x+2)
else:
my_list.append(x)

Using a list comp:

my_list = [x+2 if x % 2 == 0 else x \
for x in range(1,11)]

Selecting Elements and Basic Stats

Select elements by index.

#get the first/last element
first_ele = my_list[0]
last_ele = my_list[-1]

Some basic stats on lists:

#get max/min/mean value
biggest_val = max(my_list)
smallest_val = min(my_list)avg_val = sum(my_list) / len(my_list)

DataFrames

Reading in Data to a DataFrame

We first need to import the pandas module.

import pandas as pd

Then we can read in data from csv or xlsx files:

df_from_csv = pd.read_csv(‘path/to/my_file.csv’,
sep=’,’,
nrows=10)
xlsx = pd.ExcelFile(‘path/to/excel_file.xlsx’)
df_from_xlsx = pd.read_excel(xlsx, ‘Sheet1’)

Slicing DataFrames

We can slice our DataFrame using conditionals.

df_filter = df[df[‘population’] > 1000000]
df_france = df[df[‘country’] == ‘France’]

Sorting values by a column:

df.sort_values(by=’population’,
ascending=False)

Filling Missing Values

Let’s fill in any missing values with that column’s average value.

df[‘population’] = df[‘population’].fillna(
value=df[‘population’].mean()
)

Applying Functions to Columns

Apply a custom function to every value in one of the DataFrame’s columns.

def fix_zipcode(x):
”’
make sure that zipcodes all have leading zeros
”’
return str(x).zfill(5)
df[‘clean_zip’] = df[‘zip code’].apply(fix_zipcode)

Ready to take on the world of machine learning and data science? Now that you know what you can do with lists and DataFrames using Python language, check out our other Python beginner tutorials and learn about other important concepts of the Python programming language.

What is Data Science?

By

It’s been anointed “the sexiest job of the 21st century”, companies are rushing to invest billions of dollars into it, and it’s going to change the world — but what do people mean when they mention “data science”? There’s been a lot of hype about data science and deservedly so, but the excitement has helped obfuscate the fundamental identity of the field. Anyone looking to involve themselves in data science needs to understand what it actually is and is not.

In this article, we’ll lay out a deep definition of the field, complete descriptions of the data science workflow, and data science tasks used in the real world. We hope that any would-be entrants into this line of work will come away reading this article with a nuanced understanding of data science that can help them decide to enter and navigate this exciting line of work.

So What Actually is Data Science?

A quick definition of data science might be articulated as an interdisciplinary field that primarily uses statistics and computer programming to derive insights from and base decisions from a collection of information represented as numerical figures. The “science” part in data science is quite apt because data science very much follows a scientific process that involves formulating a hypothesis and using a specific toolset to confirm or dispel that hypothesis. At the end of the day, data science is about turning a problem into a question and a question into an answer and/or solution.

Tackling the meaning of data science also means interrogating the meaning of data. Data can be easily described as “information encoded as numbers” but that doesn’t tell us why it’s important. The value of data stems from the notion that data is a tangible manifestation of the intangible. Data provides solid support to aid our interpretations of the world. For example, a weather app can tell you it’s cold outside but telling you that the temperature is 38 degrees fahrenheit provides you with a stronger and specific understanding of the weather.

Data comes in two forms: qualitative and quantitative.

Qualitative data is categorical data that does not naturally come in the form of numbers, such as demographic labels that you can select on a census form to indicate gender, state, and ethnicity.

Quantitative data is numerical data that can be processed through mathematical functions; for example stock prices, sports stats, and biometric information.

Quantitative can be subdivided into smaller categories such as ordinal, discrete, and continuous.

Ordinal: A sort of qualitative and quantitative hybrid variable in which the values have a hierarchical ranking. Any sort of star rating system of reviews is a perfect example of this; we know that a four-star review is greater than a three-star review, but can’t say for sure that a four- star review is twice as good as a two-star review.

Discrete: These are countable and finite values that often appear in the form of integers. Examples include number of franchises owned by a company and number of votes cast in an election. It’s important to remember discrete variables have a finite range of numbers and can never be negative.

Continuous: Unlike discrete variables, continuous can appear in decimal form and have an infinite range of possibilities. Things like company profit, temperature, and weight can all be described as continuous. 

What Does Data Science Look Like?

Now that we’ve established a base understanding of data science, it’s time to delve into what data science actually looks like. To answer this question, we need to go over the data science workflow, which encapsulates what a data science project looks like from start to finish. We’ll touch on typical questions at the heart of data science projects and then examine an example data science workflow to see how data science was used to achieve success.

The Data Science Checklist

A good data science project is one that satisfies the following criteria:

Specificity: Derive a hypothesis and/or question that’s specific and to the point. Having a vague approach can often lead to a waste of time with no end product.

Attainability: Can your questions be answered? Do you have access to the required data? It’s easy to come up with an interesting question but if it can’t be answered then it has no value. The same goes for data, which is only useful if you can get your hands on it.

Measurability: Can what you’re applying data science to be quantified? Can the problem you’re addressing be represented in numerical form? Are there quantifiable benchmarks for success? 

As previously mentioned, a core aspect of data science is the process of deriving a question, especially one that is specific and achievable. Typical data science questions ask things like, does X predict Y and what are the distinct groups in our data? To get a sense of data science questions, let’s take a look at some business-world-appropriate ones:

  • What is the likelihood that a customer will buy this product?
  • Did we observe an increase in sales after implementing a new policy?
  • Is this a good or bad review?
  • How much demand will there be for my service tomorrow?
  • Is this the cheapest way to deliver our goods?
  • Is there a better way to segment our marketing strategies?
  • What groups of products are customers purchasing together?
  • Can we automate this simple yes/no decision?

All eight of these questions are excellent examples of how businesses use data science to advance themselves. Each question addresses a problem or issue in a way that can be answered using data science.

The Data Science Workflow

Once we’ve established our hypothesis and questions, we can now move onto what I like to call the data science workflow, a step-by-step description of a typical data science project process.

After asking a question, the next steps are:

  1. Get and Understand the Data. We obviously need to acquire data for our project, but sometimes that can be more difficult than expected if you need to scrape for it or if privacy issues are involved. Make sure you understand how the data was sampled and the population it represents. This will be crucial in the interpretation of your results.
  1. Data Cleaning and Exploration. The dirty secret of data science is that data is often quite dirty so you can expect to do significant cleaning which often involves constructing your variables in a way that makes your project doable. Get to know your data through exploratory data analysis. Establish a base understanding of the patterns in your dataset through charts and graphs.
  1. Modeling. This represents the main course of the data science process; it’s where you get to use the fancy powerful tools. In this part, you build a model that can help you answer a question such as can we predict future sales of a product from your dataset.
  1. Presentation. Now it’s time to present the results of your findings. Did you confirm or dispel your hypothesis? What are the answers to the questions you started off with? How do your results advance our understanding of the issue at hand? Articulate your project in a clear and concise manner that makes it digestible for your audience, which could be another team in your company or your company’s executives.

Data Science Workflow Example: Predicting Neonatal Infection

Now let’s parse out an example of how data science can affect meaningful real-world impact, taken from the book Big Data: A Revolution That Will Transform How We Live, Work, and Think.

We start with a problem: Children born prematurely are at high risk of developing infections, many of which are not detected until after a child is sick.

Then we turn that problem into a question: Can we detect patterns in the data that accurately predict infection before it occurs?

Next, we gather relevant data: variables such as heart rate, respiration rate, blood pressure, and more.

Then we decide on the appropriate tool: a machine learning model that uses past data to predict future outcomes.

Finally, what impact do our methods have? The model is able to predict the onset of infection before symptoms appear, thus allowing doctors to administer treatment earlier in the infection process and increasing the chances of survival for patients.

This is a fantastic example of data science in action because every step in the process has a clear and easily understandable function towards a beneficial outcome.

Data Science Tasks

Data scientists are basically Swiss Army knives, in that they possess a wide range of abilities — it’s why they’re so valuable. Let’s go over the specific tasks that data scientists typically perform on the job.

Data acquisition: For data scientists, this usually involves querying databases set up by their companies to provide easy access to reams of data. Data scientists frequently write SQL queries to retrieve data. Outside of querying databases, data scientists can use APIs or web scraping to acquire data.

Data cleaning: We touched on this before, but it can’t be emphasized enough that data cleaning will take up the vast majority of your time. Cleaning oftens means dealing with null values, dropping irrelevant variables, and feature engineering which means transforming data in a way so that it can be processed by a model.

Data visualization: Crafting and presenting visually appealing and understandable charts is a hugely valuable skill. Visualization has an uncanny ability to communicate important bits of information from a mass of data. Good data scientists will use data visualization to help themselves and their audiences better understand what’s going on.

Statistical analysis: Statistical tests are used to confirm and/or dispel a data scientist’s hypothesis. A t-test or chi-square are used to evaluate the existence of certain relationships. A/B testing is a popular use case of statistical analysis; if a team wants to know which of two website designs leads to more clicks, then an A/B test is the right solution.

Machine learning: This is where data scientists use models that make predictions based on past observations. If a bank wants to know which customers are likely to pay back loans, then they can use a machine learning model trained on past loans to answer that question.

Computer science: Data scientists need adequate computer programming skills because many of the tasks they undertake involve writing code. In addition, some data science roles require data scientists to function as software engineers because data scientists have to implement their methodologies into their company’s backend servers.

Communication: You can be a math and computer whiz, but if you can’t explain your work to a novice audience, your talents might as well be useless. A great data scientist can distill digestible insights from complex analyses for a non-technical audience, translating how a p-value or correlation score is relevant to a part of the company’s business. If your company is going to make a potentially costly or lucrative decision based on your data science work, then it’s incumbent on you to make sure they understand your process and results as much as possible.

Conclusion

We hope this article helped to demystify this exciting and increasingly important line of work. It’s pertinent to anyone who’s curious about data science — whether it’s a college student or an executive thinking about hiring a data science team — that they understand what this field is about and what it can and cannot do.

Data at Work: 3 Real-World Problems Solved by Data Science

By

At first glance, data science seems to be just another business buzzword — something abstract and ill-defined. While data can, in fact, be both of these things, it’s anything but a buzzword. Data science and its applications have been steadily changing the way we do business and live our day-to-day lives — and considering that 90% of all of the world’s data has been created in the past few years, there’s a lot of growth ahead of this exciting field.

While traditional statistics and data analysis have always focused on using data to explain and predict, data science takes this further. It uses data to learn — constructing algorithms and programs that collect from various sources and apply hybrids of mathematical and computer science methods to derive deeper actionable insights. Whereas traditional analysis uses structured data sets, data science dares to ask further questions, looking at unstructured “big data” derived from millions of sources and nontraditional mediums such as text, video, and images. This allows companies to make better decisions based on its customer data.

So how is this all manifesting in the market? Here, we look at three real-world examples of how data science drives business innovation across various industries and solves complex problems.

Continue reading