Bob van Vliet

Reflections of a first-time ungrader

I just ran my first ungraded course. Obviously, I did it all wrong. Nonetheless, it was wonderful. I’ve never had better conversations with students about what they take away from a course.

In this post, I want to write down some first rough thoughts before the experience fades, also in preparation for the Ungrading as Emancipation track at MYFest.

About the course

KT2700 ‘Designing Medical Technology’ is an introductory design course in the 2nd year of the ‘Clinical Technology’ BSc programme at Delft University of Technology. It’s a group project, with 90 students in 18 groups of 5. There’s clinical and technical experts that come by for one-time meetings with each group, but I mostly teach this course by myself. The course runs for 7 weeks, with three 4-hour afternoons of class time each week.

Students choose a problem to work on, do problem analysis, develop concepts, practice drawing, and conduct a simple empirical validation of their design.

My approach to ungrading

As this was my first real course-wide ungrading attempt, I wanted to play it semi-safe and use a common, tried and tested model. In essence, this meant I replaced my assessment of the final report with a self-assessment by each group, that we discussed in a final 20 minute meeting. I did what Jesse Stommel does and told my students that although I reserve the right to change a grade, I fully intended to follow their proposals. (And in the end, I really did not change anyone’s self-determined grade.)

This course already contained a number of rounds of peer feedback on intermediate results. This year I added a kick-off workshop in which I asked students what they hoped to get out of the course, and what their personal challenges and goals were. I also added a (very light-weight and informal) mid-course reflection that asked students to think about how the course was going and what they were learning.

So, what did I learn?

1. First, nothing changed

I expected revolution from day one. I was super excited to blow my students’ minds in the introductory lecture with how they’d determine their own grades. Education as the practice of freedom! Equity! Rainbows and unicorns!

Yeah, not so much. The first few weeks, nothing changed. The course felt very similar to previous years. At week 5 or so, I was feeling decidedly disappointed.

Looking back, my expectations were pretty naïeve, of course. These students are so used to just doing what they’re told, getting the grade, and moving on. It was foolish to think that students can go from paint-by-numbers to full autonomy in a few short weeks.

I also realized that teacher-assigned grades are not the only thing that determines the overall character of a course. (No shit, Sherlock.) The structure I set up, with frequent peer-review assignments and deadlines, including review questions that gave the impression that there were ‘right’ or ‘correct’ ways of doing things, worked against a culture of freedom, exploration, and self-direction. I also have a lot to learn in how I respond to student questions that assume or look for a single right answer. I’m not yet so good at responding to those in a way that ignores or challenges those assumptions.

In other words, students just did the work as usual. And they still asked a lot of questions in the category ‘How do you want us to do it?’ instead of feeling free to decide for themselves. They asked to be told what to do and they did what they were told. Just like in previous years.

2. Then, everything changed

Ungrading really is transformative. Even though the students were behaving as usual in those first weeks of the course, for me my relationship with them felt completely different from the get-go. And the way the course ended transformed the experience for students as well.

Throughout the course, I felt a lot better about the way I was coaching students. Previously, I was pushing them to take risks, knowing that it might lead to output that I had to rank lower than it could have been ranked if they took the safe approach. That felt massively unfair. As betrayal, almost. And that feeling was completely gone. Even if the students were still a bit too focused on doing it right, and according to my standards, I was much more free and able to direct their attention to what they were actually learning, and to prioritize learning experiences over polished results in how I advised them.

And then I sat down to read the group self-assessments and individual reflections. Oh, boy. So much good stuff. Responding to a very general prompt, many students pointed out the exact things I would list as the core lessons for this course. And many students described learning experiences that started with apprehension, worry, or expecting not to be interested, moving through initial surprises and a growing sense of skill and comprehension, to outright enthusiasm, pride, and consequences for how they saw their future after graduating from the programme.

And then, I sat down to talk with them about all this. Those two afternoons of final meetings were some of the best conversations I’ve ever had with students. A lot of students mentioned how they rarely think about what they’d learned so effectively either. “Usually, when you hand in a report you hope to never have to see it again,” one student said. “But the way we discussed our end-results with other groups this time, and looking back at what we delivered together, was really fun and valuable.”

I’ve had final meetings after the end of a project in other courses. But those were mostly me explaining and justifying a grade. Those did not feel like conversations between equals. But these were real, open discussions.

3. Self-grading is still grading

In those final meetings I had with student groups about their reflection and self-assessment, many students explained their grade in comparative terms. They were still ranking. A few also asked me – after I’d written down their self-determined grade – whether it matched what I would have given them. They wanted to know wheter the grade was ‘correct’. I told them I have no clue what a ‘correct’ grade is anymore.

Most students didn”t really know how to translate their assessment of what they’d learned and accomplished (or not) into a single number. At the time, I felt a bit guilty for not giving them more guidance. Later I realized that this was the whole reason for not wanting to grade them in the first place: it’s impossible to distill a multidimensional set of qualitative judgements into one unidimensional number.

I don’t know yet whether my conclusion from this is that next time I want to go with a pass/fail model, or that I should accept this cognitive dissonance because the attempt to come up with the ‘right’ grade does seem to serve as an effective means of stimulating evaluative reflection and in-depth comparison with others’ work.

Side-note: the Dunning-Kruger effect

In general the level of grades was half a point higher than in previous years (the average rose from 7.8 to 8.3). That effect was less strong than I’d expected. And in any case, I really don’t care. The students did great work and learned lots. Who cares about the numbers?

But I also got the strong sense that there had been a reversal at the top end. Many of the self-given 8/10s would have been 9/10s if I had done the grading, and most 9/10s would have been 8/10s.

The Dunning–Kruger effect is a cognitive bias whereby people with low ability at a task overestimate their ability. Some researchers also include in their definition the opposite effect for high performers: their tendency to underestimate their skills.
Wikipedia

So yeah. That seems to be relevant for self-grading . At least in the context of projects, where it’s not a question of self-assessing correctness on something like exam questions, but an overall, holistic judgement of open-ended work.

4. Self assessment in groups can be problematic

One of the bigger ‘Oh, crap!’-moments in this course was when I realized that I was putting students in a difficult situation with self-grading as a group. I don’t think I ever read anything ungrading in group projects, where students have to come up with a grade proposal together as a team. In groups that didn’t get along so well, this led to issues. And even in groups that worked well together, it could lead to friction.

In one case, this led (almost) to a tragic situation with a student who’d had some issues with the group work because of his autism, but also learned a ton by working through those issues (and taught me something in the process, which I wrote about elsewhere), that still wanted to grade himself lower than the other group members because he had ‘contributed less’. Luckily I didn’t need to step in. Another member of his team would’t let him be that unfair to himself. (Students are awesome.)

5. It’s the reflection, stupid!

Earlier, I wrote:

We want our students to learn to evaluate their own work critically. The best way to do this is by practising evaluating their own work. But it is difficult to have students do this honestly when it is ultimately our evaluation that counts.
Me, in a column on ungrading for the Delta, TU Delft’s ‘journalistic platform’, informed by the work of D Royce Sadler.

When we grade our students, we take one of the most important acts of learning away from our students by doing it for them: practicing judgement. Even in its most basic, mistake-laden form (i.e. what I did in this course), ungrading makes students actually responsible for their own learning by putting the final say in their hands.

If, whatever happens during a course, in the end it’s our decisions that count, then what reason do students have to really think about the quality of their work? Why reflect seriously if the outcome of that reflection doesn’t matter, and it’s someone else’s judgement and only that other’s judgement that matters? If someone else is going to tell you what the conclusion of your own reflection should have been?

Ungrading breaks that power structure. It breaks that adversarial relationship. It emancipates. Education as the practice of freedom. Rainbows and unicorns, after all.

Hi, #MYFest22

This June, July, and August, I’m ‘attending’ the 2022 MYFest ‘conference’ (this thing is so out-of-the-ordinary I could put every description of it in quotes :-P).

There is a blog network to go along with all the events that I plan to contribute to.

This post is just to say hi to all the people there, and to make sure I have at least one post with the #MYFest22 tag, so that I can join the collaborative RSS fun.

To be continued!

Teacher-centered framing of discussions on teaching

Recently I was interviewed – kinda – about what changes I made to my teaching during the whole Covid mess. It was fun (I like to talk), but I left with a nagging feeling that I didn’t really answer properly, that I gave bad advice, and that I had described and discussed my practice dishonestly somehow, but I couldn’t put my finger on it.

The questions were about learning activities, tools, student engagement, and social aspects. For each of these four, I was asked the same things: what did you change? What do you intend to keep going forward? What advice would you give other teachers about this?

Thinking back, I realize these questions framed the conversation completely in terms of what I as the teacher did to shape the course. It appears the assumption – mine and/or the interviewers’ – was that I was the sole designer and actor in the situation. As the teacher, I am assumed to have all the power, to set the tasks, to organize the social aspects around the course, and to engage students.

Especially the conversation about ‘student engagement’ bugs me more and more, because it casts students in such a passive role. OK, they might be ‘engaged’ by questions, activities, and a lively teacher, but fundamentally they are talked about as the object of a teacher’s actions, not as subjects themselves.

Whereas one of the things I think I do to ‘engage’ students is to actually be interested in what they have to say and contribute, and to let that guide the session and shape the course. With a course of any size, that can be super difficult, and it can be as small as changing the background music for breaks based on what they say would be nice, but still. Even putting a student-made sticker on the wall behind you when you’re on-camera during lectures is a way of making the space a little bit theirs, instead of just mine.

That feels more like inclusion than it does ‘engagement’.

Teaching is not something you do to students, or with them in the same sense as you draw things with a pencil, but it’s a situation, a relationship you enter into together.

I feel that this way of framing it would have led me to discuss things differently than questions based on a more hierarchical, directive model.

An incomplete list of lectures’ functions

Since the switch to online education 18 months ago, it’s become clear that on-campus lectures used to serve a whole range of functions – not all of which can be served by online classes. We designed around that for our online and half-half courses.

Now that we’re expecting to be able to get back to fully on-campus for all our courses again, if we want to, people are asking: do we want to?

I’m not so sure. At least not for large scale lectures – as I wrote in my Delta column at the start of this year.

In any case, it seems good to explicitly think about what physical lectures do, so that we can think critically about how we want to achieve those things going forward.

OK, here goes:

One-way, teacher -> students stuff:
- Information delivery. Similar to what books do. The part many feel that Gutenberg made obsolete and the thing that the ‘flipped classroom’ transfers to asynchronous reading/watching by students themselves. Multiple categories:
  - Content
  - Logistics / Practical matters
- Motivation, by an enthusiastic teacher/speaker, by showing cool examples, by physical demonstrations, etc.
Teacher <-> Students (interaction is obviously low in traditional lecture halls)
- Quick checks on understanding (i.e. polls and multiple choice questions)
- Opportunity for asking questions/discussion, both during and before or after the lecture. Becomes more scary and more difficult as student numbers increase.
Students -> Teacher
- Signal perceived value of the lectures by showing up or not.
- Motivation for the teacher. Speaking to a hall of students is something to get out of bed for. It’s why many like ‘hybrid’, because at least they have some students there to see and make them feel like they’re doing it for actual people.
Students <-> Students
- Social gathering, seeing and interacting with peers and friends, during the lecture, before, and after.
- Discussing course content and materials, helping each other out, bot in terms of figuring out difficult topics, and answering each other’s questions about logistics: When is this due again? Wait, what is the assignment, exactly?
For students individually
- Structure to the day, a reason to get out of bed in the morning. Knowing a teacher is coming to a room, and knowing other students are going, is more social pressure/motivation than an online session nobody will see you at and that you can always watch the recording of.
- Being seen, feeling like you exist to others.
- Feedback on how well you’re doing, in the form of seeing other peope either understanding everything or struggling as much as you, or more.
For the teacher individually (I feel this category gets ignored a lot, as is Students -> Teacher, see above)
- Feeling cool. You’re on stage, giving a show, after all. Or at least, you’re delivering profound and valuable knowledge to the unknowing. Good for the ego.
- Provides a ready made size for chunking your content. Far too large, but still. It’s a template. That’s nice.

Critical Pedagogy and Engineering Design

So I’ve been reading a lot of Critical Pedagogy lately – Paulo Freire’s Pedagogy of the Oppressed, bell hooks’ Teaching to Transgress, and, currently, Jesse Stommel and Sean Michael Morris’ An Urgency of Teachers. I find myself both in strong agreement, nodding along and thinking ‘Yes!’, and at the same time with strong doubts about whether and how to translate it to engineering education, especially with first year’s BSc courses.

The question of how Critical Pedagogy applies to STEM fields has been addressed, but still: almost all the examples and proposals from its main proponents and practitioners appear to relate to liberal education humanities classes.

Design education, whether in engineering design, architecture, or other design disciplines, is a little bit in-between outright mechanics and maths classes and full-on humanities education. On the one hand, it is a matter of thinking critically about the world and your values and goals in relation to it, it empowers, it is already sometimes a liberating experience, I believe. But on the other hand – again, especially at the lower levels, in introductory projects – it very much has the feel of ‘training’ and ‘instruction’ as opposed to true education that is interactive and egalitarian from the outset.

As a design teacher, you do act from a position of authority, the authority of expertise. You have a skill, a set of abilities, that your students don’t yet have. They came to your faculty because they want to learn how to do what you do. And for that to happen, they need to submit to your instructions. First they need to do without understanding, before being able to look back critically and understand why it is you had them do certain things (Cf. Donald Schön).

Now that I’m writing this, I realize that student numbers make a big difference here. In a studio of ~25 students, it’s actually not so difficult to be truly responsive, to interrogate students’ ideas and ideals together as a group. With ~750 first year’s students, in 14 clusters of 8 groups of 6 or 7 students, together with a small army of coaches and student assistants, it’s a whole different story. There, you’re practically forced to put up a sort of obstacle course for the students to run through, egged on and managed by strict deadlines, and then to respond only in a much more limited way, only to selected work by a limited subset of students.

Perhaps the main obstacle to transforming my pedagogy, then, is simply the raw numbers? That would be ironic, as it’s exactly that massive quality of more and more classes that contributes to students learning to just do what’s required, to listen, and to adapt to how things are, instead of developing their own critical awareness.

Ranking, Evaluating, and Liking

I’ve been thinking and reading about ‘ungrading’.

I first encountered the argument against numerical grading of students’ performance in Sanjoy Mahajan’s course Teaching College-Level Science and Engineering, which links to Alfie Kohn’s The Case Against Grades. The term ‘ungrading’ was introduced by Jesse Stommel, who comes at the idea informed by the much broader notion of ‘critical pedagogy’. In a recent online presentation, Stommel mentioned the work of Peter Elbow, which led me to Elbow’s essay Ranking, evaluating, and liking: Sorting out three forms of judgment.

Now, I’ve read many arguments against grading — that it decreases intrinsic motivation, that grades are not effective feedback, that they do not express how much was learned, etc. — but Elbow states the case in a way I found striking:

Differences between student work are multi-dimensional.
Grades are one-dimensional.
Therefore, grades are mostly meaningless.
Peter Elbow, paraphrased.

When you put it like that, it’s so obvious! Of course grades feel like bullshit.

In the first part of his essay, Elbow argues that we should rank as little as possible. In the second part, he argues that we should try, instead, to evaluate — to provide feedback on multiple criteria. And he argues that teaching should include ‘evaluation-free-zones’, where students are free to follow their own judgement without worrying about what the teacher wants to see. I think Elbow is correct here, but this second part of the essay was hardly surprising, novel, or uncommonly insightful to me.

But then on to the last part, the part on liking. Elbow writes:

It’s not improvement that leads to liking, but rather liking that leads to improvement.
Elbow, P. (1993). Ranking, evaluating, and liking: Sorting out three forms of judgment. College English, 55(2), 187-206.

I found his discussion of the need for teachers to like their students’ work in order to be able to give good feedback spot-on. He’s talking about teaching and evaluating writing assignments, but the same goes for design projects, in my experience:

If I like a piece, I don’t have to pussyfoot around with my criticism. It’s when I don’t like their writing that I find myself tiptoeing: trying to soften my criticism, trying to find something nice to say–and usually sounding fake, often unclear. I see the same thing with my own writing. If I like it, I can criticize it better. I have faith that there’ll still be something good left, even if I train my full critical guns on it.

This!

I find it easy and natural to be excited and enthusiastic about student work. And I’ve always known that this was a big part of being able to teach well. But I had never quite put it together with my equally great eagerness to critique, to point out problems and possible improvements.

Good teachers see what is only potentially good, they get a kick out of mere possibility–and they encourage it. When I manage to do this, I teach well.

Yes!

Although I think I might need to make this more explicit to students, and to become better at pointing out what exactly I see that is potentially wonderful.

A Feedback Checklist

Using a ‘cover sheet’ is a simple and effective tool to enhance feedback on student work, this study claims.

Giving teachers a form to fill out instead of just having them make notes in the margines of the work, makes sure feedback is not limited to comments on what was handed in (feed-back), but includes an appraisal of how it matches up to the final criteria or end-goals (feed-up) and practical advice on how to proceed (feed-forward).

Sadly, the study didn’t test using only this form against the more traditional way of giving feedback. The experimental condition was adding this form to notes on the work, while the control was just those notes alone.

So yeah, extra feedback is better. No shit, Sherlock.

Nonetheless, a form like this seems like a good idea to use as a form of checklist. Because I do think i often forget one or two out of the three kinds of feedback.

(Via ScienceGuide)

An Online Lecture Platform Actually Designed for Online Lectures

Or: Why and How I Built My Own Online Lecture System

Current options for large scale online lectures are not designed for lectures.

Zoom, Teams, and similar platforms are enterprise software. They’re built for business meetings. Lectures are not business meetings. Especially when they are for large groups, they involve a very different, and very particular relationship between ‘presenter’ and ‘attendees’.

Let Them Make the Thing First

When introducing design methods, perhaps it would be a good idea to focus instruction purely on the product at first. What table, matrix, or whatever should they make? What criteria should that thing adhere to? Explanation as to why those rules apply, and what role the method can and/or cannot play in a design process seem to fall mostly on deaf ears at first contact.

Make the thing. Do it again in a different course. Reflect on the process of making and using it. Learn the theoretical considerations then instead of beforehand. Concrete application before abstract explanation. The other way ’round feels logical to teachers, but may simply not be very effective. You haven’t created a ‘time to tell’ yet (c.f. Daniel Schwartz)

Onderwijsontwikkeling: Programma van Eisen

Mijn eerste jaar als docent bij het ontwerpproject in het eerste kwartaal van het eerste jaar van Werktuigbouwkunde richtte ik me bij het programma van eisen vooral op het uitleggen van toetsbaarheid: operationalisatie en toegankelijkheid. Maar op het tentamen bleek dat studenten dat compleet verkeerd begrepen hadden. In plaats van dat ze keken naar of een criterium objectief meetbaar was (bijv. ‘zo licht mogelijk’), hadden ze vooral opgepikt dat er een harde grens moest zijn (bijv. ‘lichter dan de vorige versie’). Als er wél een grens was, maar géén objectief meetbare grootheid, dan rekenden ze het criterium goed (bijv. ‘makkelijker te gebruiken dan het product van de concurrent’). Vice versa hadden studenten begrepen dat criteria zonder harde grens, maar wel met een goed meetbare variable, niet goed waren (bijv. ‘zo goedkoop mogelijk’).

Het volgende jaar heb ik daarom niet alleen aandacht besteed aan toetsbaarheid, maar ook aan verschillende typen criteria. Het belangrijkst daarbij was het onderscheid tussen functionele eisen (‘Wat MOET het doen?’) en prestatiecriteria (‘Wanneer doet het dat GOED?’). Maar ook randvoorwaarden en specificaties had ik onderdeel van de stof gemaakt. Dit ging beter, maar het verschil tussen die 4 typen criteria kreeg ik maar lastig overgebracht.