BinaryWorks.it Official Forum
BinaryWorks.it Official Forum
Home | Profile | Active Topics | Members | Search | FAQ
 All Forums
 Plugins
 Information
 IMDB

Note: You must be registered in order to post a reply.

Screensize:
UserName:
Password:
Format Mode:
Format: BoldItalicizedUnderlineStrikethrough Align LeftCenteredAlign Right Horizontal Rule Insert HyperlinkInsert EmailInsert Image Insert CodeInsert QuoteInsert List
   
Message:

* HTML is OFF
* Forum Code is ON
Smilies
Smile [:)] Big Smile [:D] Cool [8D] Blush [:I]
Tongue [:P] Evil [):] Wink [;)] Clown [:o)]
Black Eye [B)] Eight Ball [8] Frown [:(] Shy [8)]
Shocked [:0] Angry [:(!] Dead [xx(] Sleepy [|)]
Kisses [:X] Approve [^] Disapprove [V] Question [?]

 
   

T O P I C    R E V I E W
JDommi Posted - 05 Jan 2025 : 16:50:29
The tt number must be passed as a parameter, including the page to be loaded and the Reference, as a single parameter (without spaces). Example MovieConnections for The Matrix: tt0133093/movieconnections/?ref_=tt_dyk_cnn

// Get the tt-Number
#LOADWEBSLOTS#
    #STARTREADWEB# // Main Page
    #FINDLINE#=<link rel="canonical"#15##ONERROR#=STOP
    #FIND#=/tt#11# #ONERROR#=STOP
    #COMPUTECURSOR#=+2
    #PUTCURSOR#=#11#
    #FIND#=/#12# #ONERROR#=STOP
    #TAKEWORD#=#47#,#11#,#12#
#ENDLOADWEBSLOTS#


Code at the beginning of each respective block for every page that has "see more" buttons. Connections, Awards, etc.

// Block to take MovieConnections
    #PUT#=\Exporter\IMDB.exe#11#
    // The EXE to call
    #STRING#=#12#=#47#/movieconnections/?ref_=tt_dyk_cnn#12#
	// releaseinfo/?ref_=tt_ov_at_dt_rdat 
	// keywords/?ref_=tttg_at_pl_kw 
	// companycredits/?ref_=ttkw_at_dt_cmpy
	// locations/?ref_=ttco_at_dt_flmg
	// goofs/?ref_=ttpg_at_dyk_gf // crazycredits/?ref_=ttg f_at_dyk_cc
	// awards/?ref_=ttalt_at_op_awd
	#EXECUTEFILEPARAMETER#=#12# // Pass Parameter
	#EXECUTEFILEFROMPLUGINS#=#11# // The Initial path is the EXE Folder\Plugins

    #STRING#=#11#=.\tempdir\SavedWebHtml.txt
    // This instruction opens a file from a fixed location, like c:\temp...
    #OPENFILE#=#11# 
    #STARTREADWEB#

    // Parse the saved page to get ALL movie connections
    // the code goes here
    ...
// End of block to take MovieConnections


Input to the script: tt0133093/movieconnections/?ref_=tt_dyk_cnn
Output of the script: .\tempdir\SavedWebHtml.txt

Unfortunately there is still only the imdb_plus script available that was based on the IMDB Scraper by tarzibou. That script has parsed all info and saved it in JSON-format.
Mine is intended to keep most work on the MagicScript-Engine as there are more "scripter" than Delphi-Programmer
Therefore the parsing has to be done completely from scratch.

Download:
https://www.mediafire.com/file/nmbsok8s28t9cuq/IMDB.rar/file
4   L A T E S T    R E P L I E S    (Newest First)
Mawu Posted - 12 Feb 2025 : 14:57:03
I can confirm all of this, but I don't have a real explanation for it either.
JDommi Posted - 08 Feb 2025 : 10:30:48
Would someone take a look at https://www.mediafire.com/file/r5m4yc5l1mbchaq/IMDB.rar/file under release 10, please.
I can't get the goofs managed. In the beginning everything works fine but the last categories are incomplete.
My tests are always with "The Matrix (1999)". As example the last category "Character error" shows only the first goof. Same for the one before...
Mawu Posted - 19 Jan 2025 : 13:56:30
quote:
Well, Connections are now working. But there is a limitation on the length: 65.000 letters. For some movies like The Matrix that's not enough.Does anyone have an idea of how to limit the list?


The 65000 letters limit is the limit of the text field in access. Therefore you can nothing to do against it. The only way would be to cut one of the "connections" subcategorys. The first candidate would be "Featured in".
JDommi Posted - 16 Jan 2025 : 23:16:39
Well, Connections are now working. But there is a limitation on the length: 65.000 letters. For some movies like The Matrix that's not enough.
Does anyone have an idea of how to limit the list?

I only have added following code to the end of the standard IMDB (Complete) script:
// Hier
    #PUT#=\Exporter\IMDB.exe#11#
    // The EXE to call
    #STRING#=#12#=tt#47#/movieconnections/?ref_=tt_dyk_cnn#12#
	// releaseinfo/?ref_=tt_ov_at_dt_rdat 
	// keywords/?ref_=tttg_at_pl_kw 
	// companycredits/?ref_=ttkw_at_dt_cmpy
	// locations/?ref_=ttco_at_dt_flmg
	// goofs/?ref_=ttpg_at_dyk_gf // crazycredits/?ref_=ttg f_at_dyk_cc
	// awards/?ref_=ttalt_at_op_awd        
	#EXECUTEFILEPARAMETER#=#12# // Pass Parameter
	#EXECUTEFILEFROMPLUGINS#=#11# // The Initial path is the EXE Folder\Plugins

    #STRING#=#11#=.\tempdir\SavedWebhtml.txt
    // This instruction opens a file from a fixed location, like c:\temp...
    #OPENFILE#=#11# 
    #STARTREADWEB#
	
#PUT#=#13#
#FINDLINE#=textPrimary"><a href="#11#
#IF# #11#==#00#
	#GOTO#=NOCONNECTIONS
#ENDIF#

:LOOPCAT 
#FIND#=a href="#11#
#IF# #11#==#00#
	#GOTO#=NOMORECONNECTIONS
#ENDIF#
#FIND#=<span id="#11#
#FIND#=>#11#
#FIND#=<#12#
#TAKEWORD#=#14#,#11#,#12#	// Category
#STRING#=#13#=#13##14##NEWLINE#___________#NEWLINE#
#PUTCURSOR#=#08#
#FIND#=textPrimary"><a href="#09#
#LOADCURSOR#=#08#

:LOOPINCAT 
#FIND#=><a href="/title/#11#
#IF# #11#==#00#
     #GOTO#=NOMORECONNECTIONS
#ENDIF#
#FIND#=</div></div></ul><div class="ipc-html-content#12#
#TAKEWORD#=#14#,#11#,#12#
#CHECKVARIABLE#=#14#
#STARTDELETETAGS#
#STARTSUBSTITUTETAGS#
#CLEARTAGS#
#TRIM#
#STRING#=#13#=#13##14##NEWLINE##NEWLINE#
#FIND#=<p><a href="/title/#11#
#IF# #11#<<#09#
	#GOTO#=LOOPINCAT
#ENDIF#
#IF# #09#>>#00#
     #LOADCURSOR#=#09#
     #GOTO#=LOOPCAT
#ENDIF#

:NOMORECONNECTIONS 
#TAKEMOVIECONNECTION#=#13#

:NOCONNECTIONS 
#ENDTRAILER#

BinaryWorks.it Official Forum © Binaryworks.it Go To Top Of Page
Generated in 0.07 sec. Powered By: Snitz Forums 2000 Version 3.4.07