Skip to content

Commit

Permalink
l3 and error bars notes
Browse files Browse the repository at this point in the history
  • Loading branch information
brownsarahm committed Oct 21, 2024
1 parent cc5cc8c commit 5b59bfa
Show file tree
Hide file tree
Showing 2 changed files with 17 additions and 1 deletion.
14 changes: 13 additions & 1 deletion assignments/07-classification.md
Original file line number Diff line number Diff line change
Expand Up @@ -88,12 +88,24 @@ Do an experiment to compare test set size vs performance:
2. Use EDA on this data frame to interpret the results of your experiment. How does training vs test size impact the model's performance? Does it impact training and test accuracy the same way?


```{warning}
Please make sure you complete the above and get feedback before you go on to
the following, because if you attempt the following with an error in the above
you may spend a lot of time on something that might not earn credit
```

:::::{margin}
:::{note}
This also will extend on visualize and summarize.

See the [seaborn error bars tutorial](https://seaborn.pydata.org/tutorial/error_bars.html)
:::
::::

```{admonition} Thinking Ahead
_ideas for level 3 evaluate, not required for A7_
Repeat the problem setup experiment with multiple test/train splits at each size and plot with error bars.
Repeat the problem setup experiment with multiple test/train splits at each size and plot with {term}`error bars`.
- What is the tradeoff to be made in choosing a test/train size?
- What is the best test/train size for this dataset?
Expand Down
4 changes: 4 additions & 0 deletions resources/glossary.md
Original file line number Diff line number Diff line change
Expand Up @@ -33,6 +33,10 @@ discriminative
document
unit of text for analysis (one sample). Could be one sentence, one paragraph, or an article, depending on the goal
error bars
typically vertical, but sometimes also horizontal lines on a point in a line graph or bar in bar chart that indicate the spread of the samples used to create that point or bar height
generative
a model that describes the data and therefore can also be used to generate new data that looks like the training data.
Expand Down

0 comments on commit 5b59bfa

Please sign in to comment.