A. Group Assignment
a. Discuss the two data mining methodologies
Data mining is the process of going through massive sets of data in search of unsuspected patterns that can provide us with advantageous information. Data mining can help us predict future events or group populations of people by similar characteristics.
The Cross-Industry Standard Process for Data Mining (CRISP-DM) is a six-phase model of the entire data mining process. It is commonly used across industries for a wide array of data mining projects and provides a structured approach to planning them. The six phases are:
Business Understanding – Focuses on understanding the project objectives, requirements
In this phase, we generated a dashboard in SAS from the data set we already had. From the dashboard, we used the data to create four charts to form our hypothesis: a pie chart, a simple bar chart, a stacked bar chart and a needle plot. While creating each chart, we could select the two corresponding variables so that we obtained the chart that made the most sense to us.
Data Preparation – Decides which data is used and covers all activities needed to construct the final dataset from the initial raw data, in line with the relevant data mining goals, data quality and technical constraints.
In this phase, we had to select which data to input and which to reject. We could reject certain variables because they were not relevant to our analysis and would not help us reach a conclusion on our hypothesis.
Modelling – Various specific modelling techniques are selected and applied. Their parameters are calibrated to obtain the optimal
This bar graph shows the quantity sold of each product. Product 2822 sold the most units and accounts for the largest share of the quantity sold. Hence, most of the units bought during the month of March were product 2822.
The graph shows that product 2816 has a higher total sales value than 2822. Even though more units of 2822 than of 2816 were sold during the outbreak, 2822 still did not bring in the most sales value.
Thus, the hypothesis is wrong.
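The comparison above, checking whether the product with the highest quantity sold is also the one with the highest total sales value, can be sketched in a few lines of Python. This is only a minimal illustration: the transaction rows, the prices and the product 2790 are hypothetical figures made up for the example and are not taken from the pharmaceutical data set or from our SAS charts.

```python
from collections import defaultdict

# Hypothetical transactions: (product_id, units_sold, unit_price).
# These figures are illustrative only and do NOT come from the real data set.
transactions = [
    ("2822", 120, 2.0),
    ("2816", 40, 9.5),
    ("2822", 80, 2.0),
    ("2816", 30, 9.5),
    ("2790", 25, 4.0),
]


def summarise(rows):
    """Return two dicts: total units and total sales value per product."""
    units = defaultdict(int)
    value = defaultdict(float)
    for product, qty, price in rows:
        units[product] += qty
        value[product] += qty * price
    return dict(units), dict(value)


units, value = summarise(transactions)

# Product that sold the most units versus product with the most sales value.
top_by_units = max(units, key=units.get)
top_by_value = max(value, key=value.get)

# The hypothesis holds only if the same product leads on both measures.
hypothesis_holds = top_by_units == top_by_value

print("Most units sold:", top_by_units)
print("Highest sales value:", top_by_value)
print("Hypothesis holds:", hypothesis_holds)
```

With these made-up numbers, the best seller by quantity and the best seller by value are different products, which mirrors the reasoning used to reject the hypothesis.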
Deployment:
The company should not rely on products that cure a specific disease to increase sales during an outbreak. During outbreaks, the company should not purposely stock up on and promote the specific cure for the outbreak. Instead, it should focus on other products that bring in more total sales value, as an outbreak is not a major factor in increasing the company's total sales revenue.
Individual – Jan
After studying all the data, I came up with the hypothesis that more revenue is earned when the employee SalesRepFN130 sells the product Item5 CAP 110 MG 10’s. The clustering technique is used for the pharmaceutical data set. To test the hypothesis, I decided to use the variables Employee ID, Item ID and PNR 8030 to generate the charts.
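One simple way to examine a hypothesis of this kind is to total the revenue for each employee and item pair and see which pair ranks first. The sketch below does only that; it is a plain grouped sum, not the clustering technique itself. The records, the revenue figures, the second employee ID and the shortened item labels are all hypothetical and are not values from the data set.

```python
from collections import defaultdict

# Hypothetical records: (employee_id, item_id, revenue).
# Only the ID "SalesRepFN130" appears in the report; every other value
# here is invented for illustration.
records = [
    ("SalesRepFN130", "Item5", 500.0),
    ("SalesRepFN130", "Item2", 120.0),
    ("SalesRepFN045", "Item5", 310.0),
    ("SalesRepFN130", "Item5", 450.0),
    ("SalesRepFN045", "Item2", 90.0),
]


def revenue_by_pair(rows):
    """Sum revenue for every (employee, item) combination."""
    totals = defaultdict(float)
    for employee, item, revenue in rows:
        totals[(employee, item)] += revenue
    return dict(totals)


totals = revenue_by_pair(records)

# Rank the combinations from highest to lowest total revenue.
ranked = sorted(totals.items(), key=lambda kv: kv[1], reverse=True)
best_pair, best_revenue = ranked[0]

for (employee, item), revenue in ranked:
    print(employee, item, revenue)
```

If the pair named in the hypothesis ranks first on the real data, the charts would be expected to show the same pattern.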