October 15, 2012August 22, 2023 The Mitr

Reason and Faith – Misconceptions in Science Education

Reason does not work in matters of faith. But it may have a chance at clearing misconceptions.
via Tehelka

Truly so. In case of my field of study, namely science education research, it may be the other way round. The classic studies in science education aim at identifying the misconceptions that the learners have regarding a particular subject and then finding a mechanism by which they could be addressed.
This was a very simple but very basic presentation of what most studies try to achieve, though the methodology may be different. There are some studies which present us with a conceptual framework so that all the responses and the problems with the learners can be seen in light of a theoretical construct. This they say will enable us to make sense of what we see in the classrooms, and what is present as representation in the learners mind. What I think they are trying to say is that we need to get to the conceptual structures that lead to formation of the misconceptions.
Now mind you that many of these misconceptions in science are very stubborn and people are very reluctant to give them up. The reason may be that many of these misconceptions come from direct factual experience in the real world. And from what I know about Philosophy of Science, we might want to make a case that all science is counter-intuitive to our everyday experience. This would explain why misconceptions in science arise. But would this case explain all the known misconceptions?
Let us do some analysis of how a particular misconception might arise.There can be two different reasons for a misconception to arise, if we adhere to deductive logic. That is to say we assume that we have a set of starting statements that are given, whose authenticity is not questioned. And from these set of statements we make certain deductions regarding the world out there. Now there can be two problems with this scenario, one is that the set of statements that we are taking for granted might be wrong, the other is that in the process of deduction that we have followed we made a mistake. The mistake is learnt only when the end result of our analysis is not consistent with the observations in the real world. Or it might be even the case that the so called misconception will lead to a correct answer, at least in some cases. In these cases we have to resort to more detailed analysis of the thought structure which lead to the answers. Another identifying characteristic of the misconceptions is presence of the inconsistencies across different areas known to the learners. Whereas they might get a particular concept clearly and correctly, in applying same thing for another concept they just might revert to a completely opposite argument and in doing this they do not realise the inconsistency.
We will be clearer on this issue when we talk with a few examples. Suppose that we have a scenario in which we are trying to understand the phenomena of day and night, its causes and consequences. A typical argument in our class goes like this:

How many have seen the Sun set?

Almost all hands would go up, then comes the next question:

How many have seen the Sun rise?

Almost same number of hands go up, excepting a few, who are late risers like me. Some of the more intelligent and the more knowledgeable would say,
“Wait! Sun doesn’t rise and set, it is the Earth that is moving, so it causes the apparent motion of Sun across the sky, the start and end of which we call as day and night. So in conclusion the Sun doesn’t rise and set, it is an illusion created by motion of Earth.”
To this all of the class agrees. This is what they have learned in the text-book, and mind you the text-book represents truth and only truth, nothing else. It is there to dispel your doubts and misconceptions and is made by a committee of experts who are highly knowledgeable about these things. Now let us continue this line of reasoning and ask them the next question in this series.

Does the Moon rise? If so, does it rise everyday?

The responses to this question are mixed. Most of them would say that it does not rise, it is always there, up in the sky. Some would gather courage and say that it does rise.

Does the Moon set?

Again to this the response is mixed, and mostly negative. Most of them are adamant about the ever presence of the moon in the sky. The next question really upsets them

Do the stars rise and set?

Now this question definitely gets a negative response from almost all of them. Even the more knowledgeable ones fall. They have read different parts of the story, but have not connected them. They tell you the following: “No the stars do not move, they are there all the time.” They also tell you that there is something called as the fixed stars and this is in the text-book, which cannot be wrong. And when asked:

Why are we not able to see the stars during the day time?

They tell you “Of course you cannot see the stars during the day time. This is because our Sun, which is also a star, is too bright and the other stars too far away and hence are dim. So our Sun’s brightness, overwhelms the other stars, and hence they are not visible during the day time, but they are there nonetheless. In the night time, since the Sun is no longer visible, the stars become visible. Have you never noticed that during the evening twilight the stars become visible one by one, the brighter ones first. Whereas in the morning the brightest are the last ones to disappear.”
Of course, the things said above and the reasoning given sounds good. So much so that the respondents are convinced that they understand how things work, and have an elaborate reasoning mechanism to explain the observed things, in this case the formation of day and night and appearance / disappearance of stars during night and day respectively.
You ask them:

Don’t you think there is a problem with what you have just said?

“Where is the problem?”, they tell you. “We just explained scientifically how things are in heaven.”
Then you open the Pandora’s box,
“Well you have just said that the Sun doesn’t move really, it is the Earth that moves, and hence we see the apparent Sun rise and Sun set.”
Then they say, “Yes, that is the case. The Sun doesn’t move, but the Earth does.”
You ask, “How do you know this? Do you see that the Earth is moving?”
They say, “The textbook tells us so ” Some of the more knowledgeable ones say that “Galileo proved that the Earth moves and not the Sun. Since we are on Earth, we see only apparent motion of the Sun.”
You say: “But wait, just now you said that the Moon does not move, it is always in the sky. Also you said that the stars do not move, they are there all the time. Now if the Earth moves, then all these bodies should also move, if only, apparently.Then the stars must also move, just like the Sun does, do not forget that Sun is a star too! So other stars should also just set and rise like the Sun, and so should also the Moon!”
Or you can argue just the opposite: “I claim that it is the Sun that moves, Earth does not move. Isn’t it a lot more easier to explain this way, why we do see the Sun moving, because it moves. And we anyway do not see Earth moving! How will disprove me?”
Then the grumbles start. They have never thought about this. They knew the facts, but never connected them. This lead to the misconceptions regarding these things. They were right in parts, but never got a chance to connect the dots, metaphorically speaking.The reason for these misconceptions is the faith in the text-books, but if the text-books fail to perform the job of asking them the right question, where the reasoning alone can get rid of many of the misconceptions.
If we choose the alternative question, of challenging them to disprove that the Earth is stationary, almost most of them are unable to answer the question of disproving that the idea that the Sun moves and not Earth. They would suggest that we can see this from the satellite in the sky (Can we really?).
Most of us take the things for granted and never question many (or as in most cases, any) of them. And many times the facts are something we do not question. We say that “It is a fact.” This statement basically posits that the information which we think is out there can be unquestionable. But there are many flavours of the post-modern philosophy which challenge this position. They think that the facts themselves are relative, that is to say that one culture has different science than another one. But let us leave this, and come back to our problem of the stars and the Sun and Moon.
Lets put out the postulates for the above arguments and try to deduce deductively the results that were obtained.
Claim 1: Sun doesn’t move.
Claim 2: Earth moves.
Observation 1: We see the Sun moving across the sky daily, it rises and it sets.
Explanation 1: Since the Earth moves, and the Sun is stationary, we see that Sun moves apparently. This apparent motion of the Sun is seen as the Sunrise and the Sunset by us. This is what causes the day and night.
But we can have Observation 1 explained by another set of claims, which is exactly opposite, namely, that the Earth doesn’t move but the Sun moves.
Claim 3: The Sun moves.
Claim 4: The Earth does not move.
Explanation 2: Since the Earth does not move, and the Sun does, we just see the Sun passing by in the sky, around the Earth. This causes day and night.
We see that Explanations 1 and 2 are both valid for Observation 1, if the claims 1 and 2, 3 and 4 are true then the respective deductions from them, in this case the Explanations 1 and 2 respectively are also true.So in this case the logical deduction is correct, provided that the Claims or assumptions are correct. But this process does not tell you whether the claims themselves are true or not. But both set of assumptions, cannot be true at the same time. Either the Earth moves or it does not, it cannot be in a state of both. If at all we had an explanation which came from these assumptions which did not correspond with the observations, but was logically deducible, then we can question the assumptions or premises as philosophers call them.
Of course, the things said above and the reasoning given sounds good. So much so that the respondents are convinced that they
understand how things work, and have an elaborate reasoning mechanism
We can have one example of this type.
Assumption 5: Stars do not move, there are so called “fixed stars”.
Assumption 5: During the day time the Sun is too bright, as compared to the other stars.
Now in this case combining Assumption 5 (A5) with Observation 1 (Ob1) we would get the following:
Explanation 3: The stars are too dim as compared to Sun, hence we cannot see them during the day time, but they are present. Hence they do not move.
In Explanation 3 (E3) above the deduction has a problem. The deduction does not follow from the assumption. This is the other problem in which we talked about above.
Most of the people who would suggest these responses have mostly no background in astronomy. Even then the basic facts that Earth goes round the Sun and not the other way round are forced upon them, without any critical emphasis on why it is so. Neither are they presented at point with the cognitive struggle of another view point, namely the geo-centric view. So presenting the learners with opportunities that will make them observe things and make sense of the explanations in light of the assumptions that were made, will enhance the reasoning and help them to overcome some of their misconceptions.
But there is another observation which can be made of the skies. And it can be either done in the classroom with the aid of Free Softwares like Stellarium. After the round of above questions, we usually show the class the rising of the stars from the east. In a darkened room with a projector the effect is quite dramatic for those who have not witnessed such a thing before. So you can show the class, just as the Sun rises, all other celestial bodies like the Moon and the stars also must rise and this is an observed fact.
Observation 2: The stars and planets and the Moon also rise and set everyday.
So how do we make sense of this observation, Ob2 in the light of the assumptions that we have.
Assumption 6: Sun is a star.
Explanation 4: We observe that Sun moves during the day, from East to West. Sun is a star, hence all other stars should also move.
Now why this should be the case will be different for the geo-centric and the helio-centric theories. In case of H-C theory the explantion is simple. The Earth moves hence the stars appear to move in the opposite direction. And this applies to all the objects in the sky.
Since the Earth moves all other celestial objects will appear to move. In case of G-C theory we have to make an assumption that the
stars are “fixed” on some imaginary sphere, and the sphere as a whole rotates.
But coming back to the misconceptions, it is just the ad-hoc belief that the stars do not move (“fixed stars”) in conjunctions with another observation that in presence of too bright objects dim objects cannot be seen leads to belief that the stars are immobile and do not rise and set as the Sun does. There is another disconnection from another fact that they know, or are told in the textbooks, that the apparent movement of the Sun is caused by the actual movement of the Earth. There is no connection between these two facts which is made explicit.
We think that providing opportunities for direct observation aided by software, Stellarium in this case, which help in visualizing the movements of celestial bodies will help in developing the skill of reasoning and explaining an observed phenomena.

September 21, 2012August 22, 2023 The Mitr

On-line Education | RMS

Educators, and all those who wish to contribute to on-line educational works: please do not to let your work be made non-free. Offer your assistance and text to educational works that carry free/libre licenses, preferably copyleft licenses so that all versions of the work must respect teachers’ and students’ freedom. Then invite educational activities to use and redistribute these works on that freedom-respecting basis, if they will. Together we can make education a domain of freedom.

via On-line Education|RMS
Mostly people don’t bother about what they get for gratis on the Internet, but institutions cannot adopt the same approach. Licensing is as much important as much as the actual content. But an archaic system will not go down till it is compelled to, and it will fight till the very end.

July 26, 2012August 22, 2023 The Mitr

What Wikipedia is not… then what it is?

Although anyone can be an editor, there are community processes and standards that make Wikipedia neither an anarchy, democracy, nor bureaucracy.

via What Wikipedia is Not

Disclaimer: Let me make some things clear, I am not against Wikipedia, or its policies. I am (great) admirer and (very heavy) user, and (very little) contributor to the wonderful platform, which aims to provide free knowledge to everyone. In this post I am just trying to collect thoughts that I have about the Wikipedia’s social system and its relation to the society at large.
Then what is wikipedia? Is it a feudal system, which they do not mention in the list above? Although there are people who are called bureaucrats, they say it is not a bureaucracy, I think they mean it in the traditional sense of the wor(l)d (pun intended).
But for a new person, who is trying to edit the first article, there is too much of bureaucracy (read rules), involved, and it may not be a pleasant experience at all, especially for the so called technologically-challenged people. To describe in one word it is intimidating. The trouble is only there till, actually you become used to it, and become part of the system. This is more like the adaptation to smell, after a while in a stinking place, you don’t feel the stink anymore (just an analogy, I do not mean that Wikipedia stinks!). The rules become a part of your editing skills, which you do want to see in other editors. But how many people are able to get over this first major hurdle is not known to me, but I guess (which can be completely wrong) this number can be significant. This will in general reduce the number of producers and just tend to increase the number of consumers in the commercial sense of the word.
Another thing that the above quote says it is not a democracy. Again here I think, Wikipedia is not a democracy in the sense of common usage of the term. In a democracy, by definition the popular aspirations get through, and they may not be even the best for a society, as we many times see in the Indian context. But then it mostly the people who are editing the Wikipedia who decide by consensus that certain thing should be done. Is it not like majority win? So there is in fact a strong democratic element in Wikipedia.
Do we also want a society that is same as above “neither an anarchy, democracy, nor bureaucracy”? What kind of society would you like to live in?

July 18, 2012August 22, 2023 The Mitr

Expected and Unexpected Malware

We have to distinguish the unexpected malware such as viruses from the expected malware such as Windows or Mac OS or Flash Player and so on, which are also malware; they have features that hurt the user but users know what they are installing.
via Richard Stallman on Restricted Boot | Techrights.

June 14, 2012February 13, 2025 The Mitr

Remaking ebooks from existing pdfs, djvu

Update: 13 Feb 2025

I have made a script to combine images to form searchable pdf which also supports Indic languages. The script and some documentation can be accessed at

https://gitlab.com/the-mitr/ocr-indic-pdf/

Suppose you have an ebook or an article in pdf format, which unfortunately is not cleaned. By not cleaned we mean

Single page scan with edge darkening, pages not aligned that is text is rotated differently , page size different, library and use marks marks etc.
2-in-1 scan: Two pages simultaneously scanned together, the central spine dark band, pages not rotated properly, edge and wear marks, library marks etc.

In this case we cannot use the tools like scantailor for cleaning the images directly. For this we first need to extract images from the PDF file and then do a processing on these images. One can do extract the images one by one and process them, but then we can do it in a better way also.
First we split the pdf file into single PDFs by using the most versatile pdftk
For this in the terminal type
$ pdftk file.pdf burst
It will create as many pdf files as there are pages. with names like pg_0000.pdf etc.
Now next task is to convert these pdf to images, for this we use the convert command, but we don’t want to convert files one by one by
convert pg_0000.pdf pg_0000.tiff
But this is not very useful for large number of files, we want to make this in one go. So we do the following
$ for i in $(ls | grep pdf;);
do
convert -density 600 $i $i.tiff;
done
Lets see what these commands do:
ls
will list all the files in that directory
ls | grep pdf
This will filter out the files with pdf in the filename and provide us with a list
On this list we can do a lot of operations as we do in on any other list
for i in $(ls | grep pdf)
is calling each member of this list that we generated and treating it as variable i
and for each memberwe
do
the following
convert -density 600 $i $i.tiff
and after this is over the task is
done
We can set the dpi for the output images by passing the number, above it is set as 600. The output images will be named same as the input pdf files.
Now we can happily run scantailor on these images to clean them up!
PS:
Instead of a PDF if you have a djvu file we have another approach.
Step 1
Convert the djvu file into a multipage tif file, by using ddjvu command.
$ddjvu -format=tiff -verbose -quality=uncompressed input_file.djvu output_file.tif
With this command we will get a tiff format, with same resolution as the original djvu file.
Once the multipage tif file is there, it can be split into its original pages by tiffsplit command.
$tiffsplit input_file.tif
And we are done. Now we can happily run scantailor on these tiff files.

May 10, 2011August 22, 2023 The Mitr

Free Software Tools for scanning and making e-books

How to give a new life to books which are out of copyright!

Here is a short summary of the Free Software tools that I have found useful for converting hard copies into readable/searchable formats in GNU/Linux!

Typically the making a soft-copy from a hard-copy involves following steps:

Step1:
Scan the Hard copy using a scanner / camera. This step generates image files
typically .tiff, .png or .jpeg. Some scanning programs also have option of directly generating to .pdf
Basically at this stage you have all the data, if you compress the folder into a comic book reader format .cbr or .cbz format you are good to go. But for a more professional touch read on. The main step to scan the books properly. Some do’s and dont’s
Align the pages to the sides of the scanner.
If the book is small size scan 2 pages at once.
If the book is too large adjust the scan in the image preview side so that only one page is scanned.
If these steps are done properly there is a little that we have to do in the second step. And we can directly jump to Step 3.
Preferably scan in the ~~binary~~ grayscale form, unless there are colored images in the text. This will help reduce the final size of the file.
Scan at minimum 300 dpi, this is the optimum level that I have come to after trials and errors with different resolutions, their final results and the time taken for each scan. Of course this can differ depending on what is that you are scanning. Many people do the scanning at 600 dpi, but I am happy at 300 dpi. Note: The 300 dpi images can be upscaled in scan-tailor to 600 dpi.
First of all for the scanning itself. Most of the scanners come with an installation disk for M$-Windows or Mac-OSX. But for GNU/Linux there seems to be no ‘installation disk’. The Xsane package allows quite a few scanners which are detected and are ready for use as soon as you plug them in.
The list of the scanners which are supported by Xsane can be found here:
http://www.sane-project.org/sane-mfgs.html
When we bought our scanner we had to search this list to get the compatible scanner.
What is the problem with the manufacturers, why do they not want to sell more, to people who are using Free Software?
If your scanner is not in the list, then you might have to do some R&D before your scanner is up and running like I had to do for my old HP 2400 Scanjet at my home.
Once your scanner is up and running. You scan the images preferably in .tiff format as they can be processed and compressed without much loss of quality. This again I have found by trial and error.
Step2:
Crop the files and rotate them to remove unwanted white spaces or
accidental entries of adjoining pages from the images that were obtained. When the pages are scanned as 2 pages in one image, we may need to separate the pages.
Initially I did it manually, it was the second most boring part after the scanning. But I have found a very wonderful tool for this work.
Imagemagick provides a set of tools which work like magick in images, hence the name I guess 🙂
This is one of the best tools for batch processing image files.
Then I found out the dream tool that I was looking for.
The is called Scan-Tailor, as the name suggests it is meant for processing of scanned images.
Scan Tailor can be found at http://scantailor.sourceforge.net/ or directly from Ubuntu Software Centre.
Step by step scan tailor cleans and creates amazingly good output files from relatively unclean images.
There are a total of 6 steps in scan-tailor which produces the desired output.
You have to choose the folder in which your scanned images are. Scan-tailor produces a directory called out in the same folder by default. The steps are as follows

Change the Orientation: This enables one to change the orientation of all the files in the directory. This is good option in case you have scanned the book in a different orientation.
Split Pages: This step will tell whether the scans that we have made are single page scans, single page with some marginal text from other page or two page scans. Most of the times the auto detection works well with single page and two page scans. But it is a good idea to check manually whether all the pages have been divided correctly, so that it does not create problems later. If you find that a page has been divided incorrectly then we can slide the margin to correct it. In case of two page scans the two pages are shown with a semitransparent blue or red layer on top of them. After looking at all the pages we commit the result.
Deskew: After the pages have been split we need to change the orientation for better alignment of the text. Here in my experience most of the auto-orientation works fine. But still it is a good idea to check manually the pages, in case something is missed.
Select Content: This is the one step that I have found as the most useful one in the scan-tailor. Here you can select the portion of the text that will appear in the final output. So that you can say goodbye to all the dark lines that come inevitably as part of scanning. Also some library marks can be removed easily by this step. The auto option works well when the text is in nice box shape, but it may leave wide areas open also. The box shape can be changed the way we want. If you want a blank page, remove the content box, by right clicking on the box.
Page Layout: Here one can set the dimensions for the output page and how each page content will be on the page.
Output: Produces the final output with all the above changes.

The output is stored in a directory called Out in the same folder. The original images are not changed, so that in case you want some changes or something goes wrong we can always go back to the original files. Also numbering of the images is done.
So we have cleaned pages of same size from the scanned pages.
Update: The latest scantailor has image -de-warping facility. See the amazing thing at work here:

Step 3:

Collate the processed files in Step 2 to one single PDF. For this I have used the convert command.
Typical synatax is like this

convert *.tiff output.pdf

This command will take all the .tiff files in the given directory and collate these files into a pdf named output.pdf

http://www.pdflabs.com/tools/pdftk-the-pdf-toolkit/
Alternative to Step 3
Another alternative is to use gscan2pdf for joining the image files into pdf and doing the OCR which can be tesseract or cunieform. gscan2pdf is also able to scan files and stich them into pdf , but I would recommend that you use scantailor as one of the intermediate steps.
Also using gscan2pdf gives you an option for editing the files, if, for example, you might want to remove some marks from the images. For this it opens the image in GIMP.

Step 4:
OCR the PDF file.
Now this is again tricky, I could not find a good application which would OCR the pdf file and embed the resulting text on the pdf file. But I have found a hack on the following link which seems to work fine 🙂
http://blog.konradvoelkel.de/2010/01/linux-ocr-and-pdf-problem-solved/
The hack is a bash script which does the required work.
Alternate
gscan2pdf can do OCR for you using cunieform or tesseract as backends. The end result is a searchable text, but it does not sit on the image, as it would happen in a vector pdf, but is embedded on the page as “note” at the top-left-hand corner.
Step 5:
Optimize the PDF file generated in Step 4.
Here there is a nautilus shell script which I have found in the link below which does optimization.
http://www.webupd8.org/2010/11/download-compress-pdf-12-nautilus.html
Step 6:
In case you want to convert the .pdf to .djvu there is one step solution for that also

pdf2djvu -o output.djvu input.pdf

The tips and tricks here are by no means complete or the best. But this is what I have found to be useful. Some of the professional and non-free softwares can do all these, but the point of writing this article was to make a list of Free and Open Source Softwares for this purpose.
Comments and suggestions are welcome!

Temet Nosce

Know Thyself Too…

free software