{"id":2811,"date":"2023-02-19T20:47:14","date_gmt":"2023-02-19T20:47:14","guid":{"rendered":"https:\/\/www.goodacademic.com\/blog\/questions\/python-analysis\/"},"modified":"2023-02-19T20:47:14","modified_gmt":"2023-02-19T20:47:14","slug":"python-analysis","status":"publish","type":"questions","link":"https:\/\/www.goodacademic.com\/blog\/questions\/python-analysis\/","title":{"rendered":"[Python] Analysis"},"content":{"rendered":"<div class=\"col-sm-12 messageContent\">\n <b>Learning Goal: <\/b>I&#8217;m working on a python question and need an explanation and answer to help me learn.<\/p>\n<p>Pick <strong>one<\/strong> dataset from <a class=\"external\" href=\"https:\/\/data.sanjoseca.gov\/dataset\" target=\"_blank\" rel=\"noopener\">https:\/\/data.sanjoseca.gov\/dataset <\/a><\/p>\n<p>1. Data Description and Curiosity Questions about the data:<\/p>\n<ul>\n<li>background or the context of data selected &#8211; sources, description of how it was collected, time period it represents, context in it was collected if available,<\/li>\n<li>reason(s) why you selected it?<\/li>\n<li>Description of the data:\n<ol>\n<li>how big is it (number of observations, variables),<\/li>\n<li>how many numeric variables,<\/li>\n<li>how many categorical variables,<\/li>\n<li>description of the variables, if available<\/li>\n<li>Are there any missing values?<\/li>\n<li>Any duplicate rows?<\/li>\n<\/ol>\n<\/li>\n<li>Compute summary statistics on continuous variable(s) (mean, median, mode, standard deviation, variance, range).<\/li>\n<li>Select one categorical variable, compute these statistics on a numeric variable by grouping on a categorical variable<\/li>\n<li>Record your observation. What did you find the most fascinating from your descriptive analysis.<\/li>\n<\/ul>\n<p>2. Descriptive Statistics and Visualization (at least two out of the four listed below)<\/p>\n<ul>\n<li>Relationship between variables<\/li>\n<li>Trend<\/li>\n<li>Distribution of the variable(s)<\/li>\n<li>Spatial data representation<\/li>\n<li>Comparison of summary statistics across categories<\/li>\n<\/ul>\n<p>3. Generate at least one hypothesis and perform hypothesis test.<\/p>\n<p>4. Summarize your observations<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Learning Goal: I&#8217;m working on a python question and need an explanation and answer to help me learn. Pick one dataset from https:\/\/data.sanjoseca.gov\/dataset 1. Data Description and Curiosity Questions about the data: background or the context of data selected &#8211; sources, description of how it was collected, time period it represents, context in it was [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"open","ping_status":"closed","template":"","meta":[],"disciplines":[734],"paper_types":[],"tagged":[],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/questions\/2811"}],"collection":[{"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/questions"}],"about":[{"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/types\/questions"}],"author":[{"embeddable":true,"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/comments?post=2811"}],"version-history":[{"count":0,"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/questions\/2811\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/media?parent=2811"}],"wp:term":[{"taxonomy":"disciplines","embeddable":true,"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/disciplines?post=2811"},{"taxonomy":"paper_types","embeddable":true,"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/paper_types?post=2811"},{"taxonomy":"tagged","embeddable":true,"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/tagged?post=2811"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}