You must type this and all other homework assignments. Do not e-mail the assignment to me; turn it in early (at 322 Wolf) for a foreseeable absence, or turn it in late after an unexpected absence from class.
1. Collect some data that could be analyzed with correlation and regression. This could be published data; data you found on the web; data from your own research; or something you measure for this assignment. Don't just make up some numbers, it must be real data. The data must be biological, involving living or dead organisms in some way. You must have at least 20 observations.
Write a sentence or two explaining what the data set is. Then say what biological question the data could be used to answer. Turn in the raw data. Look at a graph of the data without any trendlines (you don't need to turn this one in) and consider your biological question, then say which method you think would be most appropriate: linear regression/correlation of the untransformed data, linear regression/correlation of the transformed data, polynomial regression, or Spearman's rank correlation.
Analyze the data using the technique you think is best, and interpret the result. Use both a spreadsheet and SAS. You must show your SAS program and the output from SAS.
Then, just for practice, analyze the data using the other possible techniques (for the regression using transformed data, use the log-transformation unless you have a reason to use something else). Give the equations of the regression lines, the r2, degrees of freedom, and P-value for each technique (except Spearman's rank correlation, for which you won't give an equation of a line).
Print a graph showing the regression line for the technique you think is most appropriate. If you thought Spearman's rank correlation was most appropriate, print the graph for linear regression instead. Compare the results of the four techniques--do you still think your original choice was the best way to analyze your data?
2. McDonald (1989) collected the amphipod crustacean Platorchestia platensis on a beach near Stony Brook, Long Island, in April, 1987, counted the number of eggs each female was carrying, freeze-dried them, weighed them, then used protein electrophoresis to determine the genotype at the locus for mannose-6-phosphate isomerase (Mpi). The data consist of the identification number of the individual, the weight (in mg), the number of eggs, then the genotype. The biological question is whether the Mpi genotypes differ in size-adjusted fecundity. Analyze with ancova, using SAS. You must show your SAS program and the SAS output.
1301 5.42 29 fs 1302 5.38 29 ss 1303 7.36 23 ss 1304 4.47 19 fs 1305 5.75 23 ff 1306 7.27 25 fs 1307 6.64 13 fs 1308 4.67 22 ff 1309 5.55 24 ff 1310 6.13 22 ss 1311 8.94 25 fs 1312 5.85 22 ff 1313 4.15 16 ff 1314 9.91 27 fs 1315 8.57 23 fs 1316 4.75 20 ss 1317 6.89 17 fs 1318 6.64 21 fs 1319 7.12 20 fs 1320 5.92 19 ff 1321 7.06 27 ff 1322 7.48 33 fs 1323 6.14 23 fs 1324 5.85 26 fs 1325 6.69 21 fs 1326 8.10 25 ss 1327 4.97 25 ff 1328 4.15 23 fs 1329 8.62 25 ss 1330 6.30 17 ss 1331 6.68 26 fs 1332 4.46 22 ff 1333 7.44 24 ss 1334 5.38 28 ff 1335 4.64 15 fs 1336 7.93 21 fs 1337 4.18 20 ff 1338 5.69 21 ff 1339 7.04 25 fs 1340 6.41 16 fs 1341 3.70 18 fs 1342 7.26 20 ss 1343 7.17 27 ss 1344 4.78 27 fs 1345 8.24 26 fs 1346 4.68 24 fs 1347 5.25 16 ff 1348 4.64 24 ff 1349 7.78 24 ss 1350 6.23 21 ss 1351 3.96 21 fs 1352 5.42 22 ss 1353 4.86 13 ff 1354 7.87 22 ss 1355 4.56 20 ff 1356 6.36 19 fs 1357 5.25 23 ss 1358 4.38 22 fs 1359 4.58 22 fs 1360 5.74 18 fs 1361 7.37 35 ss 1362 6.14 23 fs 1363 7.25 22 fs 1364 5.21 22 ff 1365 8.56 28 fs 1366 8.01 27 ss 1367 7.38 16 fs 1368 6.71 21 fs 1369 6.85 20 fs 1370 4.92 23 ss 1371 10.16 35 fs 1372 7.02 29 fs 1373 5.71 16 ff 1374 7.03 25 ss 1375 6.45 24 ss 1376 6.52 26 ff 1377 5.50 29 fs 1378 5.53 23 ff 1379 4.91 23 ff 1380 6.81 31 fs 1381 5.51 27 fs 1382 9.58 30 fs 1383 5.06 19 ss 1384 7.77 22 fs 1385 7.11 19 fs 1386 6.72 21 ss 1387 7.24 27 fs 1388 5.51 24 ff 1389 7.47 30 ff 1390 6.72 32 fs 1391 7.05 22 fs 1392 6.92 19 fs 1393 6.07 18 fs 1394 6.63 24 fs 1395 7.00 20 ss 1396 4.85 20 ff 1397 3.97 16 fs 1398 3.95 17 ff 1399 5.49 26 ff 1400 8.49 19 fs 1401 4.54 26 fs 1402 6.51 27 ff 1403 5.71 13 fs 1404 3.44 13 fs 1405 9.39 33 ss 1406 7.42 17 ff 1407 6.49 17 ss 1408 5.66 15 fs 1409 4.75 22 fs 1410 4.82 27 fs 1411 8.20 31 ff 1412 7.27 32 fs 1413 7.09 19 fs 1414 6.34 21 ss 1415 6.16 25 ss 1416 5.74 22 ss 1417 7.63 22 ff 1418 5.89 25 fs 1419 6.86 23 fs 1420 4.22 17 fs
McDonald, J.H. 1989. Selection component analysis of the Mpi locus in the amphipod Platorchestia platensis. Heredity 62: 243-249.
Return to the Biological Data Analysis syllabus
Return to John McDonald's home page
This page was last revised November 15, 2012. Its URL is http://udel.edu/~mcdonald/stathw9.html